Abstract
For decades, it has been postulated that digital pathology is the future. By now it is safe to say that we are living that future. Digital pathology has expanded into all aspects of pathology, including human diagnostic pathology, veterinary diagnostics, research, drug development, regulatory toxicologic pathology primary reads, and peer review. Digital tissue image analysis has enabled users to extract quantitative and complex data from digitized whole-slide images. The following editorial provides an overview of the content of this special issue of Toxicologic Pathology to highlight the range of key topics that are included in this compilation. In addition, the editors provide a commentary on important current aspects to consider in this space, such as accessibility of publication content to the machine learning-novice pathologist, the importance of adequate test set selection, and allowing for data reproducibility.
For decades, it has been postulated that digital pathology is the future. By now it is safe to say that we are living that future. Digital pathology has expanded into all aspects of pathology, including human diagnostic pathology, veterinary diagnostics, research, drug development, regulatory toxicologic pathology primary reads, and peer review. Digital tissue image analysis has enabled users to extract quantitative and complex data from digitized whole-slide images. With recent advancements in artificial intelligence (AI)-based tools in the field of computer vision, great improvements in both quantitative image analysis and morphometric assessments as decision-making aids are on the near horizon. It is important that toxicologic pathologists gain a common basic understanding of digital pathology, image analysis tools, and machine learning (ML) approaches. This skill is necessary not only to leverage these applications effectively but, more importantly, to shape their development and advancement in a meaningful way. The American College of Veterinary Pathologists (ACVP) highlighted this need for education in the digital pathology space by incorporating it into its current strategic plan. As a direct consequence, the ACVP’s Pathology Informatics Education Committee was formed. 1 This parallels the pathology informatics efforts in the human pathology field. Another overarching effort highlighted in this special issue is the Innovative Medicines Initiative (IMI) project related to digital pathology. 2 This multiyear European public-private consortium effort is the first of its kind. It will involve digitizing, annotating, and linking millions of nonclinical and clinical whole-slide images with metadata and other diagnostic and research end points, ensuring availability of data and algorithms to all legally and ethically entitled stakeholders.
Given the breakthrough-catalyzing effects of similar efforts in general-purpose computer vision, the output of the IMI can be expected to stimulate a novel generation of digital pathology projects.
One challenge for pathologists who want to familiarize themselves with digital pathology, image analysis, and ML is the specialized terminology and its inconsistent use. “WSI” for example: Does it stand for Whole Slide Image, Whole Slide Images, or Whole Slide Imaging? A small inconsistency perhaps, but it can lead to frustrations and possibly errors if not clarified. To aid in establishing a shared vocabulary that gains consistent use, the European Society of Toxicologic Pathology (ESTP) workshop paper contains a table with definitions of commonly used digital pathology terms. 3
The seemingly endless stream of new companies and software offerings joining the market poses a true triaging challenge when assessing the capabilities and interoperability of systems. While initially every brand of whole-slide scanner produced files in its own proprietary format, efforts are now underway to standardize file formats and metadata integration, similar to those pioneered by digital radiology. The Digital Imaging and Communications in Medicine (DICOM) standard and its application to toxicologic pathology are introduced to readers by Dr Clunie. 4
Digital primary reads of toxicologic pathology studies and digital peer review are an active area of discussion that currently engages professional societies, key opinion leaders, and representatives of regulatory agencies alike. The ESTP workshop paper summarizes how far we have come toward regulatory acceptance of digital toxicologic pathology and highlights key challenges. 3 Bradley et al explore digital primary reads and peer review using whole-slide images in a limited scope, proof-of-concept study. 5
In 2019, the Special Interest Group on Digital Pathology and Image Analysis of the Society of Toxicologic Pathology published an opinion piece introducing AI and ML to the toxicologic pathology community. 6 The minireview in this special issue provides an update on this subject, opines on the ongoing challenges and opportunities of ML in toxicologic pathology, and aims at encouraging our peers to learn more about these novel technologies. 7
Transforming any pathology lab workflow is no small undertaking in general, and adopting digital toxicologic pathology requires additional considerations. Workflows should be designed to minimize the impact of preanalytical variables on digital pathology and downstream image analysis. 8 The pathologist is uniquely qualified to play an important role in quality control along key steps of the workflow: not only early on, when it comes to tissue, slide, and staining quality, but also in project planning, monitoring ongoing work, and finalizing image analysis projects. While some quality control concepts apply equally to traditional image analysis and to work using AI-based tools, other aspects are unique to each approach. 9
With the advancement of AI, computer vision, and ML, technological building blocks have become available that enable the use of AI-based approaches as a more global tool, both across an entire study and even across an entire digital database. Hoefling et al describe the development of a deep learning-based model trained on normal histology slides from toxicologic pathology studies. 10 The application of this model to then distinguish normal from abnormal tissue is demonstrated by Freyre et al. 11 We believe that toxicologic pathology will benefit from such foundational models, which can be adapted for specific purposes (for example, turned into general abnormality detectors), rather than from an exploding number of unrelated task-specific models. Kuklyte et al demonstrate the need to consider, and the value of, multimagnification convolutional neural networks for the determination and quantitation of lesions in nonclinical pathology studies. 12
Aside from these examples of applying AI-based morphometric assessments to entire studies, this special issue incorporates specific image analysis use cases relevant to toxicologic pathology, many of which utilized AI-based tools. These include proprietary in-house solutions, such as AI models built to count ovarian follicles, 13 to quantify changes in retinal layer morphology, 14 and to detect endothelial tip cells in the oxygen-induced retinopathy model, 15 as well as the use of commercially available applications for spermatogenic staging, 16 analysis of rodent cardiomyocytes, 17 scoring support in the dextran sulfate sodium-induced colitis mouse model, 18 enumeration of cynomolgus bone marrow histology, 19 quantitative evaluation of hepatocellular hypertrophy in rats, 20 quantitation of cell proliferation via common immunohistochemical biomarkers, 21 and verification of changes observed in the Tg-rasH2 mouse used in carcinogenicity studies. 22 A fluorescence-based image analysis use case (commercial software) is provided by Wilson et al. 23 As novel applications at the periphery of the bread-and-butter imaging work of a toxicologic pathologist continuously emerge, Rousselle et al introduce a digital 3D topographic microscopy technique, scanning optical microscopy, to evaluate re-endothelialization of the vascular lumen after endovascular procedures. 24
Because of a substantial knowledge gap between those who use AI-based tools in pathology and those who do not, it is becoming increasingly challenging to write and publish papers on the subject that contain the technical details and terminology needed for reproducible scientific data generation while remaining accessible to all Toxicologic Pathology readers. In addition, because this technology is still rather new, publishing standards vary. Balance can be achieved by ensuring that the body of an article can be fully understood by the target audience of the journal, while the information in the Materials and Methods section (plus Supplementary Material when applicable) should allow computational pathologists and scientists to reproduce the work as closely as possible. With AI functionality increasingly integrated into the graphical user interfaces of commercial histopathology tools, more common ML tasks can be performed by a pathologist without specialized coding skills. However, the apparent ease of performing such tasks often does not extend to reproducibility or to systematic evaluation of predictive performance. These AI reproducibility issues in the digital pathology space are discussed in detail by Bizzego et al. 25 The serious “reproducibility crisis” of ML in general has been featured prominently, 26 and in response the foremost ML conference, Neural Information Processing Systems (NeurIPS), now requires researchers to fill out a reproducibility checklist. In addition, NeurIPS has launched a “reproducibility challenge” with the sole purpose of reproducing published results. 27 With regard to AI in toxicologic pathology publications, we recommend a pragmatic balance: not fulfilling strict reproducibility criteria should not by itself be grounds for manuscript rejection, as in some cases this is not (yet) feasible.
In particular, it is often impossible to share proprietary data used in publications. Nevertheless, we recommend several key points that should be discussed in articles when they are not fulfilled. Most important is a rigorous approach toward separating training data (“training set”) from the data used to measure predictive performance (“test set”). It is not enough to state that a model performed well on the histology slides on which it was trained; only a separate test set can ensure a meaningful assessment. Moreover, there are subtle pitfalls even in assembling the test set, which can compromise the assumption of its independence from the training set. For example, different regions from a single slide should not be allocated to the training and test sets, respectively. Similar limitations apply to different slides originating from a single animal. Ideally, a truly independent test set should consist of one or more completely separate studies, possibly from different staining laboratories. Such a rigorous approach will inevitably lead to lower metrics of predictive performance, and this, besides avoiding publication bias, is one of the reasons why we recommend at this time not to make high values of predictive performance metrics a strict criterion for acceptance or rejection. However, not making quantitative assessments of predictive performance at all and instead relying solely on anecdotal “visual assessments” will not suffice to draw general conclusions about how well a model is suited for a given task. Among other pitfalls is the fact that the test set should only be used once. It is a serious mistake, for example, to tune the parameters of a classifier, repeatedly assess performance on the test set, and report the outcome of the most successful experiment. For such purposes, a separate “validation” set should be used, and only once a final model has been chosen should the test set be used to generate a definitive estimate of predictive performance.
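The grouping pitfalls described above can be made concrete in a short sketch. The sketch below splits slide records into training and test sets at the level of the grouping unit (here, the animal) so that no animal contributes slides to both sets; the function name, record layout, and animal identifiers are hypothetical illustrations, not part of any published method.

```python
import random

def group_split(slides, group_key, test_fraction=0.2, seed=0):
    """Split slide records into train/test sets so that no group
    (e.g., all slides from one animal) spans both sets."""
    groups = sorted({group_key(s) for s in slides})
    rng = random.Random(seed)
    rng.shuffle(groups)
    n_test = max(1, round(test_fraction * len(groups)))
    test_groups = set(groups[:n_test])
    train = [s for s in slides if group_key(s) not in test_groups]
    test = [s for s in slides if group_key(s) in test_groups]
    return train, test

# Hypothetical records: (slide_id, animal_id), 20 slides from 5 animals.
slides = [(f"slide_{i}", f"animal_{i % 5}") for i in range(20)]
train, test = group_split(slides, group_key=lambda s: s[1])

# No animal appears in both sets, preserving test-set independence.
assert {a for _, a in train}.isdisjoint({a for _, a in test})
```

The same logic extends to splitting by study or by staining laboratory rather than by animal: only the `group_key` changes, which is why stating the grouping unit explicitly in a Materials and Methods section matters.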
Even with the best possible standards, performance estimates reported in different articles should not be compared naively due to the many possible sources of variation originating from both data sets and methodology. Therefore, articles repeating previous works or comparing approaches from several articles should be highly welcomed. Other application fields of AI have already developed publicly available benchmark data sets which can be used to improve comparability of results across different papers—something which would be desirable to have in toxicologic pathology as well, and which can be enabled by a large public repository that is forthcoming as part of the IMI (discussed above). This issue of reproducibility could be made a subject for a forthcoming community challenge in computational (toxicologic) pathology, akin to previous tissue image analysis challenges such as CAMELYON, TUPAC, or ACDC-LungHP. 28–30
The task of finding and selecting the most appropriate data sets for a computational pathology study is complicated by the fact that currently histopathology data are often not fully organized according to FAIR data principles: Findable, Accessible, Interoperable, and Reusable. 31
Along with detailed information about the data, articles should also contain details about the annotations performed by pathologists. For example, were annotations performed on whole slides or only within regions of interest (ROIs)? In the latter case, how were ROIs selected? On which magnification level were the annotations made? Were smaller patches generated from the full slides, and if so, what was the exact procedure?
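The questions above about patch generation can be answered unambiguously when the tiling procedure is stated as an explicit rule. The following sketch enumerates patch coordinates over a region of interest; the function name and the dimensions are hypothetical examples of what such a reported procedure might look like.

```python
def tile_coordinates(roi_width, roi_height, patch_size, stride):
    """Enumerate top-left (x, y) coordinates of square patches tiled over
    an ROI; patches that would extend past the ROI edge are skipped."""
    coords = []
    for y in range(0, roi_height - patch_size + 1, stride):
        for x in range(0, roi_width - patch_size + 1, stride):
            coords.append((x, y))
    return coords

# Hypothetical 1024 x 768 px ROI at a stated magnification level,
# non-overlapping 256 px patches (stride equal to patch size).
patches = tile_coordinates(1024, 768, patch_size=256, stride=256)
assert len(patches) == 4 * 3  # 4 columns x 3 rows
```

Reporting patch size, stride (overlap), magnification level, and edge handling in this explicit form lets a computational pathologist regenerate exactly the same patches from the same whole-slide images.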
The special issue editors (Figure 1) hope that this compilation of articles on digital toxicologic pathology and related topics will be useful to readers of Toxicologic Pathology in learning about this continuously developing field and its applications, as we expect it to impact how we conduct our profession not only now and in the near future, but for generations to come.

Figure 1. The guest editors of this special issue. (A) Oliver Turner BSc (Hons), BVSc, MRCVS, PhD, DACVP, DABT; (B) Famke Aeffner, DVM, PhD, DACVP; (C) Tobias Sing, PhD.
Footnotes
Acknowledgments
The authors would like to thank all colleagues involved in the peer review process of the papers included in this issue, many of whom were willing to provide expert feedback on more than one submission.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article. F.A. is employed by Amgen Inc. and holds shares in the company. O.T. and T.S. are employed by Novartis and hold shares in the company.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
