Abstract
Animal models have provided an important tool to help make the decision to take potential therapies from preclinical studies to humans. In the past several years, the strong reliance of the pharmaceutical discovery and development process on the use of animal models has come under increasing scrutiny for ethical and scientific reasons. Several prominent and widely publicized articles have reported limited concordance of animal experiments with subsequent human clinical trials. Recent assessments of the quality of animal studies have suggested that this translational failure may be due in part to shortcomings in the planning, conduct, and reporting of
Introduction
The use of animals in biomedical research, including those utilized in pharmaceutical discovery and development efforts, has come under increasing public scrutiny. While certain elements in the animal rights community have long questioned the value of preclinical studies for informing human health, more recently serious questioning of the value of animal studies has arisen within the scientific community itself (Perel et al. 2007; Pound et al. 2004; van der Worp 2010). The lack of concordance of some human clinical trial outcomes with preclinical animal efficacy findings has led to a questioning of the methods of how animal studies are conducted (Macleod 2011; van der Worp and Macleod 2011). In addition, the lack of concordance of animal efficacy data with human trial outcomes has led to a questioning of the fundamental role that animal-based studies have played in the preclinical arena for certain therapeutic areas (Pound and Bracken 2014; Sena et al. 2007). There have been several prominent publications questioning the translational value of preclinical animal models (Couzin-Frankel 2013; Seok et al. 2013), as well as a growing number of articles that question the quality of
This article is based on an invited talk presented by the author at the 33rd Annual Symposium of the Society of Toxicologic Pathologists in a plenary session dealing with progress in preclinical testing in translational science. A disclaimer is warranted as many issues of animal-based research are both complex and controversial. The thoughts expressed in this article belong to the author alone and may or may not be shared by present and/or previous employers. They are based on the aggregate career experience of a toxicologic pathologist who has spent many years leading animal care and use programs. This article will review some current trends in the pharmaceutical industry that influence the use of preclinical animal efficacy models. It will also review important areas for improvement in methodology to enhance the planning, conduct, and reporting of preclinical efficacy studies. Perhaps most importantly, it will discuss the role for the toxicologic pathologist in advancing these studies in order to bring about needed change in study conduct.
Background
The primary objective of the pharmaceutical industry is to create new therapies to solve unmet medical needs to advance the human condition. To date, this pursuit has involved the use of numerous animals of various species in a wide variety of models and testing paradigms. Detailed discussion of testing paradigms and models exceed the scope of this article, but it should be noted that the majority of animal use (rodents) takes place early in drug discovery in preclinical studies from target validation to the selection of candidates, early in the drug discovery/development process. The last decade has seen numerous advances in refinements and changes in methods in how animals are used in drug discovery and development. These include (but of course are not limited to) advances in noninvasive imaging modalities, telemetric monitoring of animals, microsampling methods, and genetic manipulation of rodents and other species. In general, there has been a trend to multiplex end points to maximize information from fewer and fewer animals with an ever-increasing complexity to the models and experimental designs. Advances in animal model technology have been used in conjunction with a wide array of technological advancements in end point assays, as well as numerous experimental advances using computer simulation,
There are unique challenges for pharmaceutical company animal care and use programs that operate in a global multisite environment across multiple regulatory and oversight systems. Little international agreement exists for harmonization concerning “best practice” in the animal care and use world. In addition, companies often combine significant internal operations and capability with extensive interactions with academic, industry, and governmental collaborations as well as work at a myriad of contract research organizations. Animals are cared for and used in a highly regulated environment, but what makes this arena somewhat unique within the larger highly regulated pharmaceutical research enterprise is the fact that public thinking varies around the world concerning how animals should be cared for and used. These differing public attitudes are based on a variety of legal and regulatory frameworks, cultural, religious, and political differences.
There are a number of recent trends in the pharmaceutical industry environment that impact preclinical model usage and practices. One of the most impactful has been the significant increase in the globalization and externalization of discovery activities. There have been increasing academic and intercompany collaborations, as well as connectivity of the industry through public–private partnerships. Working with well-conceived and described standard methods and protocols as well as with excellent reporting methods is increasingly recognized as mission critical. The pharmaceutical industry has recognized the value in working together in the precompetitive space to advance the sharing of knowledge with animal models, as well as the opportunity to work across boundaries to advance translational science. Examples of this are the cooperation and support of the industry with the National Centre for the Replacement, Refinement and Reduction of Animals in Research (NC3Rs) in the United Kingdom and with the Institute of Laboratory Animal Research (ILAR) Roundtable in the United States.
The 3Rs
The cornerstone of international policy on animal-based science is embodied in the replacement, reduction, and refinement (3Rs) set out in the now famous treatise of R. Russell and W. M. S. Burch entitled “The Principles of Humane Experimental Technique” (London, 1959). Replacement refers to the preferred use of nonanimal methods over animal methods whenever it is possible to achieve the same scientific aim. Reduction refers to methods that enable researchers to obtain comparable levels of information from fewer animals or to obtain more information from the same number of animals. Refinement is much broader and refers to methods that alleviate or minimize potential pain, suffering, or distress, and enhance animal welfare for the animals still used. Most investigators use the term refinement for the replacement of one animal model with another model of a species lower on the phylogenetic spectrum, whereas others think of this in the category of replacement with nonanimal models. Collectively, these 3Rs have been deemed the so-called alternative approaches, and they are referred to in many of the legal and regulatory frameworks used in animal care and use oversight. There are many instances where 3Rs advancements have been incorporated to enhance preclinical animal model science (Chapman et al. 2013; Emerson 2010; Madden et al. 2012). Despite the connection of the 3Rs to scientific advancement, many North American scientists more readily associate the 3Rs with animal welfare issues rather than with science. A recent survey in Canada showed that many
Some scientists would suggest that framing the issue of animal use around reduction is simplistic and goes against scientific principles, potentially leading to compromise of study quality. These scientists suggest that the conversation should center on optimization of animal numbers rather than reduction in animal use per se. In this scenario, the scientific conversation would revolve around refinement of experimental design and utilization of robust statistical expertise. In many but not all cases, statistical models and experimental design approaches can be incorporated to reduce animal numbers (Festing and Altman 2002). In some instances, optimizing design means that more animals would be needed to obtain the desired scientific aims. The concern with focus on reduction per se was discussed in a recent
In recent years, there has been significant diminution in the numbers of animals used within the pharmaceutical industry relative to R&D expenditure, and there has been significant focus on application of 3Rs principles as exemplified by the collaboration of the industry with the NC3Rs (Chapman et al. 2013). In addition, the creation in 2012 of the 3Rs Leadership Group within the IQ Consortium is another indication that the pharmaceutical industry intends to provide cross-industry cooperation in this area. The IQ Consortium is a pharmaceutical and biotechnology association that aims to advance innovation and quality in the biopharmaceutical industry.
In the view of the author, good animal welfare is inextricably linked to good science with preclinical animal models, but they are not one and the same. Invariably, many animal studies are a balance between competing needs. Since some scientists are most interested in their scientific objectives as opposed to animal welfare issues, it is proposed that they would readily identify with a second set of “3Rs” (those being relevance, robustness, and reproducibility) that are more aligned to scientific data associated with animal-based experiments. Relevance refers to how the model translates to address the scientific question at hand. Often in the pharmaceutical industry, it pertains to the ability of the preclinical model to predict the intended clinical outcome. Robustness refers to the strength of the model to remain unaffected by minor variations in test conditions that may occur over the course of experimentation in a single laboratory. Reproducibility pertains to the ability to utilize the model repetitively under the same conditions and obtain the same outcome. It is likely that by focusing on the scientific data of
The issue of reproducibility in animal studies has been of great concern as evident by an ILAR-sponsored workshop in June 2014 at the U.S. National Academy of Sciences. This workshop was conducted in part to explore the concerns that have arisen because a significant number of peer-reviewed studies have not been able to be reproduced. There is evidence that a variety of factors may contribute to the lack of reproducibility in animal studies, including incomplete reporting of details of study design and conduct, inadequate means to control study bias, and a lack of methodological rigor in study conduct (Kilkenny et al. 2009; Macleod et al. 2009). The finding of lack of animal study reproducibility and in the quality of reporting of animal studies is particularly vexing to the pharmaceutical industry due to the recent trend to greater externalization of discovery research activities, and the trend to increased industry academic collaborations and partnerships. The pharmaceutical industry has experienced a significant disparity between published findings and the ability to validate results internally in both animal and nonanimal preclinical studies (Begley and Ellis 2012; Prinz, Schlange, and Asadullah 2011). One experience that illustrates this concern is a report from the hematology/oncology department at Amgen, where industry scientists chose 53 “landmark” publications, many in top-tier journals, and were only able to confirm the findings completely in 6 of the publications (Begley and Ellis 2012). One of the lessons that one can take away from this lack of reproducibility is the need to critically evaluate studies and not just depend on the journal peer review processes. When evaluating a scientific article, the reader should critically assess the materials and methods section to assure that reporting is robust. Studies should be examined with respect to methods to control bias (randomization, blinding, etc.), whether controls are proper, if all results and repetitions are shown, as well as to verify the use of appropriate reagents and statistical methods (Begley 2013).
Increasing the Methodological Quality of In Vivo Studies
Planning the Animal Study
The methodological quality of an animal study starts with the study planning process and is best performed with the “end in sight.” Many of the methodological details believed to be important for reporting so that a study can be evaluated and reproduced (such as the checklist of factors in the Animal Research Reporting
The generation of a high-quality protocol for an
Experimental Design Considerations in Preclinical Efficacy Studies
Many aspects of study quality are important to consider in the experimental design for preclinical animal studies (Table 1). Preclinical efficacy studies in pharmaceutical discovery generally do not entail specific considerations that are not important in other animal studies, but there is a special emphasis on understanding pharmacodynamic (PD)/pharmacokinetic (PK) relationships and absorption, distribution, metabolism, excretion (ADME) associated with compound administration and target engagement. Inherent to this is a determination of the best methods, form, and timing of dose administration. It is also critically important to determine what methods are to be used to minimize bias on study. Recently, there have been calls for preclinical animal studies to be conducted much more akin to the methodology of randomized clinical trials using similar methods for bias control (randomization, blinding, sample size justification, data transparency, etc.; Muhlhausler, Bloomfield, and Gillman 2013). There are many potential sources of bias to be considered across all study phases ranging from design to interpretation (Lindner 2007). In a number of therapeutic areas, there are best practices that are being formulated to advance the translational value of preclinical studies, and undoubtedly this effort will continue and accelerate (E. Sena et al. 2007; Scott et al. 2008; Shineman et al. 2011).
Aspects of preclinical study to be detailed in protocol to help assure study quality.
It cannot be over emphasized, that while there are excellent default positions for best practice such as the use of appropriate randomization methods, and blinding and allocation concealment techniques, there is no single best practice for all circumstances. Customization of each protocol is required as optimal study design is not a “one-size-fits-all” proposition. The author has on many occasions found utility in walking statisticians through animal use procedures to have meaningful dialog on how to minimize and control for bias and variation on study. What is good in theory and what is good in practice sometimes differs markedly. For example, in chronic dosed feed experiments, some study design personnel might ask to have cages of rodents randomly distributed across racks in an animal room to minimize variation due to environmental influences. Animal husbandry reality would dictate that the likelihood of cross contamination across groups might pose far greater risk and impart greater degree of variation on the study.
Sources of Variation in Preclinical Animal Studies
There are numerous sources of variation to consider and control in the design and conduct as well as in the reporting of
In drug discovery and development studies, the method, dose form and timing of dose administration, types and preparation of excipients and vehicles, blood and tissue sampling sites and methods, handling of subjects, and a myriad of other experimental conditions can all markedly effect study outcomes. Things that may appear to be minor procedural details can prove important. An example is the finding that significant difference in the serum hepatic enzyme, alanine transaminase, can occur if mice are handled by the body instead of the tail (Swaim, Taylor, and Jersey 1985). In addition to effects of animal handling, there have been significant reported differences in research end points, such as cytokine concentrations, depending on method/site of blood removal (Mella et al. 2014). There have been many reviews of factors that are important to consider when conducting and reporting animal studies (Alfaro 2005; Hooijmans, Leenaars, and Ritskes-Hoitinga 2010; Institute for Laboratory Animal Research 2011; Kilkenny et al. 2010). A recent report that caught the eye of the popular press found that even the sex of the researcher could affect the outcome of certain rodent studies through olfactory cues (Sorge et al. 2014).
Reporting Animal Studies
A significant challenge to being able to reproduce preclinical animal studies is the general lack of transparent and accurate reporting in the literature. Analyses of published animal studies have demonstrated numerous deficiencies in the reporting of details in research methods (Kilkenny et al. 2009; Vesterinen et al. 2011). Despite multiple publications over the past 25 years calling attention to the critical factors and information necessary to enhance such reporting (Alfaro 2005; Ellery 1985), there have been continuing deficiencies in the literature. Lack of sufficient experimental procedural detail concerning animal studies has both scientific and ethical implications. It limits the ability to confirm and build on research findings and may lead to the unnecessary use of animals in studies that fail to reproduce the reported results. In addition, it may mask problems in the quality of the design and conduct of animal studies, as well as limit the ability to use systematic reviews (SRs; Hooijmans, Leenaars, and Ritskes-Hoitinga 2010).
Efforts to improve the reporting of biological and biomedical investigations resulted in the Minimum Information for Biological and Biomedical Investigations (MIBBI) project to enhance reporting in biological experiments (Taylor et al. 2008). Authors, reviewers, and journal editors as well as professional societies all have a role in the effort to increase the quality of publication of animal-based research. Recently, there have been a number of significant reports giving guidance for enhancing the description of animal studies including preclinical efficacy studies (Hooijmans et al. 2011; Hooijmans, Leenaars, and Ritskes-Hoitinga 2010; ILAR 2011; Kilkenny et al. 2010). This guidance helps define requirements for accurate descriptions of the research animal as an experimental test system, the critical elements of the research animal environment, the aspects of experimental design, and animal care and use practices that affect research results. The ARRIVE Guidelines from the U.K. National Center for the 3Rs have been endorsed by a variety of journals and funding sources. Unfortunately, despite the fact that the ARRIVE guidelines have been widely endorsed, the implementation has been much less successful to date in the first 2 years since its publication (Baker et al. 2014). Although the ARRIVE guidelines and other guidance listed earlier are excellent tools, the author recommends that each journal customize guidance based on the types of studies reported and the particular information and details needed to ensure reproducibility. There have been some excellent specialized guidance developed for some highly specialized areas of animal studies. Published guidance for preclinical imaging studies and chronobiology experiments are exemplars (Portaluppi, Touitou, and Smolensky 2008; Stout et al. 2013). The importance of reporting quality for preclinical animal studies was noted in a recent meeting of stakeholders by the U.S. National Institute of Neurological Disorders and Stroke in June 2012 that discussed how to improve the methodological reporting of animal studies in grant applications and publications (Landis et al. 2012).
Difficulty in reproducing published studies reflects some of the biases in scientific publications (Prinz, Schlange, and Asadullah 2011). It is generally felt that there is a significant publication bias in the literature that should be realized by scientists. There is relatively little incentive for journals to publish negative studies or to publish nonnovel findings and repeated studies. The extreme pressure to publish in top-tier journals for academic survival is having effects on the quality of the animal literature. It has been noted that for some therapeutic areas there are far too many animal studies with statistically significant positive results, and there is an increasing number of studies demonstrating publication bias (Korevaar, Hooft, and Ter Riet 2011; Sena et al. 2010; Ter Riet et al. 2012; Tsilidis et al. 2013).
Recently, there have been suggestions that preclinical animal studies should be conducted, reported, and analyzed more like clinical trials (Muhlhausler, Bloomfield, and Gillman 2013). The author agrees that methodological rigor, control of bias, and transparent reporting are warranted, but it would be potentially short-sighted to just assume that preclinical studies and clinical trials should be conducted in the same manner. The intended purpose, types of bias, and the collection of end points can be quite different.
SRs
An important recent trend in the analysis of preclinical animal efficacy studies is the increasing use of SRs as well as methods of meta-analysis (van Lujik et al. 2014). In contrast to a narrative review that has no standardized methodology, an SR is a type of review that is structured, thorough, and transparent. The SR aims to identify all relevant studies to answer a particular research question and is comprehensive and evidence based. Data from studies used in SRs are often used in meta-analysis (Vesterinen et al. 2014). The use of SRs is growing substantially in the preclinical efficacy literature, and there have been centers set up such as Collaborative Approach to Meta-analysis and Review of Animal Data from Experimental Studies (CAMARADES) and Systematic Review Centre for Laboratory Animal Experimentation (SYRCLE) to help foster design of preclinical studies and methods of analysis (Hooijmans et al. 2014; Ritskes-Hoitinga et al. 2014; Sena et al. 2014).
The Value of the Toxicologic Pathologist in Preclinical Efficacy Study Conduct
Toxicologic pathologists, especially those with a veterinary medical background, are uniquely qualified by training and experience to help improve the conduct of preclinical animal efficacy studies in the pharmaceutical industry. These individuals usually have an excellent background in laboratory animal science and animal model characterization. A very useful trait of the toxicologic pathology working environment is the use of standard operating procedures, standard lexicons and protocols, and well-established reporting practices, as well as the expectation of peer review input. Pathologists work in a subject area that is both subjective and quantitative at the same time, and one in which there are many sources of variation and bias that need to be controlled. Most pathologists engage with statistics personnel and interact on issues such as whether or not pathology data should be gathered blinded with dose group concealment. The debate in the professional societies concerning blinded reading of slides is but one example (Dodd 1988; Temple 1988).
Conclusion
The concept of the
Footnotes
Author Contribution
Jeffrey I. Everitt drafted the manuscript, gave final approval, and agrees to be accountable for all aspects of work ensuring integrity and accuracy.
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
The author(s) received no financial support for the research, authorship, and/or publication of this article.
