Abstract

The problem of extrapolation
The problem of extrapolation arises because average study results do not always apply to target populations. Three examples illustrate different aspects of the problem.
DECLARATIONS
None of the authors has competing financial interests
JH was the recipient of an MRC/ESRC Postdoctoral Fellowship (G0800055) while developing some of the ideas for this paper. JH is currently the recipient of a National Institutes of Health Research fellowship
This was not a study involving any subjects, we did not require ethical approval
JH
JH wrote the first draft, and the paper was developed in meetings and correspondence between all three authors. PG was responsible for emphasizing how mechanisms could be of help and extensive editing. JKA was responsible for the proposed taxonomy, several key examples and extensive editing
In the European Atrial Fibrillation Trial (EAFT) the risk of intracranial haemorrhage in patients with atrial fibrillation taking warfarin was minimal. 1 However, when warfarin was given to certain patients in clinical practice (those in sinus rhythm) it increased the risk of haemorrhage. 2 Treatments that benefit study patients may be useless or even harmful for some patients in the potential target population.
In another study carotid endarterectomy appeared to increase overall mortality by 0.5%. But in a subgroup analysis of patients with severe carotid stenosis the procedure had a clear benefit.3,4 This example illustrates why the terms ‘external validity’ and ‘generalizability’ can be misleading: the problem of extrapolation also arises when applying average study results to individuals or subpopulations within the study (particularizing).
Study and target contexts can also differ in ways that make extrapolation problematic. For example, the Tamil Nadu Integrated Nutrition Programs improved the nutritional status of Tamil children, by using a package of care that included food supplements, medical care and (importantly) nutrition education for pregnant mothers. Unfortunately, when the same intervention was used in Bangladesh it did not have any measurable benefit. 5 This was because Bangladeshi mothers-in-law and husbands were in charge of shopping so educating mothers in that context had little effect. 5 Even when study and target populations are similar, relevant contextual factors must be shared for extrapolation to be justified.
One strategy for dealing with the problem of extrapolation is simple induction. For instance, the Consolidated Standards of Reporting Trials (CONSORT) checklist 6 includes an item about ‘generalizability’ that involves asking whether target populations would have met the inclusion criteria for a trial. But as we illustrate below, even if a patient would not have met the inclusion criteria it is nevertheless sometimes safe to implement the study results, while in other cases although the patient would have met the inclusion criteria it is unsafe to do so.
Critics and proponents of Evidence-Based Medicine (EBM) suggest that knowledge of patho-physiological mechanisms can solve the problem of extrapolation.7,8 Guyatt et al. 8 suggest ‘a good understanding of pathophysiology is necessary … for appropriate interpretation of evidence (especially in deciding on its generalizability)’. However, no explicit guidelines for implementing this suggestion are provided. Paul Glasziou and Gordon Guyatt (in conversation) offer the following example to illustrate the EBM proposal. If a trial of an intervention excludes everyone over the age of 60, they claim the intervention is likely to work for a 61-year-old but not for a 90-year-old. Presumably they take it that the success of the intervention depends on the operation of pathophysiological mechanisms that change only slowly beyond 60. So the mechanisms would not have changed substantially in most 61-year-olds but would be highly likely to have changed by 90. This example is useful as a starting point, and here we elaborate on how to use mechanistic knowledge to justify extrapolation.
Mechanisms of action and why they do not always help
A mechanism of action is the causal chain or web linking the intervention with the clinical outcome via pathophysiologic mechanisms 9 (see Figure 1, middle). If we know the mechanism of action in the study population, and we know it is shared with the target population, then extrapolation is more likely to be justified. Unfortunately, we rarely have sufficiently complete knowledge of the mechanisms of action in the study to form any basis for comparison (and justifying extrapolation).
Comparative clinical studies, mechanisms of action
For example, widespread use of antiarrhythmic drugs for (supposedly) reducing mortality was based on an incomplete understanding of the mechanism(s) of action of the drugs. Several mechanisms (swallowing, gastric emptying, metabolism, circulatory and binding mechanisms) were involved in getting the drug to its pharmacological targets. These mechanisms are often well understood and are referred to as ADME (mechanisms for absorption, distribution, metabolism and excretion). Having reached their cellular targets, antiarrhythmic drugs were believed to reduce the frequency of ventricular extra beats (VEBs) by modifying the heart's electrochemical mechanism. Finally, a reduction in VEBs should (allegedly) reduce the risk of sudden death, presumably by reducing the risks associated with insufficient blood flow to vital organs. But subsequently a randomized trial of two antiarrhythmic drugs, encainide and flecainide, suggested that they increased mortality (see Figure 1). 10 The evidence for the mechanism of action linking antiarrhythmic drugs with a reduced frequency of VEBs was strong, but the evidence for the mechanism linking reduced VEBs with reduced mortality was weak. It turns out that antiarrhythmic drugs (and indeed most interventions) activate unsuspected mechanisms, making their effects on patient-relevant outcomes difficult to predict. Sometimes the unsuspected pathways can lead to a paradoxical effect; 11 for example antiarrhythmic drugs are proarrhythmic in about 7% of patients (see Figure 1, right-hand side). Many other recent examples show how incomplete knowledge of relevant mechanisms led to adoption of harmful treatments. 9 If our knowledge of mechanisms is flawed or incomplete, then any comparison of such mechanisms in study and target populations will not lead to justifiable extrapolation.
To make things worse, the actions of many mechanisms are discovered in tightly controlled laboratory experiments, eliminating potentially interfering variables, but, in vitro and in vivo mechanisms can behave differently. Knowledge gained in these conditions is inevitably oversimplified when compared with real clinical cases, often rendering inferences from in vitro and in vivo effects unwarranted.
How knowledge of mechanisms of action can be used to mitigate the problem of extrapolation
The National Institute for Health and Clinical Excellence (NICE) in the UK recommends macro-gols for the treatment of chronic constipation in children under two years of age. 12 However, this is an unlicensed indication since the available evidence in children relates to those aged over two years. 13 Appealing to CONSORT would not support such use, because children under two years did not meet the inclusion criteria of published trials. Nevertheless, we know that the mechanism of action for how macrogols reduce constipation (by drawing fluid into the gut osmotically) is independent of age. Hence, the NICE recommendations are acceptable.
Conversely, the clinical evidence that metformin causes lactic acidosis comes from many anecdotal reports, and trial evidence has failed to reveal an increased risk. This is probably because the trials were not sufficiently powered to detect this rare complication. Nevertheless, current guidelines recommend that metformin should not be used in people with renal impairment. 14 Again following the CONSORT strategy would lead us to believe that metformin was acceptable for patients with renal impairment. However, we know that the mechanisms of action in patients with renal impairment differ from other patients; hence, extrapolation between results in these two groups is unjustified. 15
Besides telling us when extrapolation is justified, knowledge of the mechanistic of action is also necessary for guiding adequate descriptions of interventions.16,17 A description of metformin therapy would specify the dose, the dosage interval, the duration of therapy and so on, but not presumed irrelevant factors (such as clothing and planet position). Factors can be deemed irrelevant if they do not affect the mechanisms involved in the intervention's action.
A guide to using mechanisms to help solve the problem of extrapolation
While our understanding of mechanisms of action can be flawed, it would be unwise to ignore the cases in which our understanding is more complete than others. 9 Our proposed method for using mechanistic knowledge to mitigate the problem of extrapolation involves three steps and the use of a mechanistic taxonomy.
Step 1. Establish whether the problems with obtaining and interpreting mechanisms of action have been overcome. Sometimes we have correctly identified the mechanisms, and we have evidence that the links in the mechanistic chain are well established. For example, the proximate causes of stroke have been known for centuries.18,19 A burst artery in the brain causes a haemorrhagic stroke, while an ischaemic stroke is caused by a blockage of an artery, by either thrombosis or embolism. Aspirin benefits patients who have had an ischaemic stroke, but may harm those who have had a haemorrhagic stroke. The cause of the stroke (identification of the mechanism that has been disturbed) can be discovered by a computed tomography scan. In this case identifying the mechanism helps us to identify patients who are likely to benefit or not from aspirin.
Step 2. Establish whether the mechanisms of action in the study and target populations are similar. Having specified the mechanistic chain linking the intervention and the outcome in the study and target populations, we can compare them. If mechanisms are shared (as they were in the macrogol example), extrapolation is more likely to be safe and beneficial; otherwise (as was the case in the metformin example), we should proceed cautiously.
Step 3. Establish that the study and target context are sufficiently similar. Finally, we need to ensure that contextual factors influencing whether relevant mechanisms are activated are shared. Failure to do this (as in the TINP example) can lead to unsuccessful extrapolation (Box 1).
What is known about this topic and what this study adds
What is known about this topic
Average study results are sometimes not applicable to target populations.
What this study adds
The problem of extrapolation is two problems: particularizing and generalizing.
Appealing to inclusion/exclusion criteria is important but insufficient.
Cautious use of mechanistic knowledge can justify extrapolation of study results in target populations.
Table 1 shows a taxonomy on which decisions about shared mechanisms of action can be based in cases of pharmacological interventions. It classifies mechanisms as determined by genetic factors, age- and sex-related factors, physiological variants, co-morbidities, drug–drug interactions and sociological factors. Examples are given in each case. Using this taxonomy, one can compare mechanisms of action in study and target populations.
A taxonomy of the types of mechanisms that can inform the wisdom or otherwise of extrapolating from trial results to another population*
Includes examples of both similar (generalizable) and dissimilar (non-generalizable) mechanisms
Table 2 provides a checklist that can be used to decide whether average study results are applicable to target populations. Judgement will always be required, and the problem of extrapolation will never be completely resolved. At the same time, extrapolation to target populations is sometimes better justified than at other times. If the answer to all the questions in Table 1 is ‘Yes’, then extrapolation is justified, while too many ‘No’ answers suggest reasons to be cautious.
A checklist for using mechanisms to justify extrapolation of study results to target populations
If the answer to all these questions is ‘Yes’, then extrapolation may be justified. Otherwise, extrapolation may be risky
Conclusions
We all hope that study results will benefit target populations. However, experience suggests that extrapolation based on simple induction can be problematic. Here we have argued that examining mechanisms of action in study and target populations can help justify successful extrapolation and also guide adequate descriptions of interventions. Our proposed method involves (i) establishing that common problems with obtaining knowledge of mechanisms of action have been overcome; (ii) verifying that relevant mechanisms in the study and target populations are similar; and (iii) establishing that study and target contexts are relevantly similar.
Footnotes
Acknowledgements
George Davey-Smith, Raffaella Campaner, Maria Carla Galavotti, David Teira, Barbara Osimani, Alfredo Morabia, Miriam Solomon, Jeremy Simon, Adam La Caze, Bill Bechtel, Nancy Cartwright, Lane Desautels and Peter Gill all provided comments on earlier drafts or presentations of the work in earlier drafts of this paper. We are grateful to John Worrall for suggesting the example of benoxaprofen
