Abstract
Background:
Recently emerging results from a few placebo-controlled randomized trials of COVID-19 vaccines revealed estimates of 62%–95% relative reductions in risk of virologically confirmed symptomatic COVID-19 disease, over approximately 2-month average follow-up period. Additional safe and effective COVID-19 vaccines are needed in a timely manner to adequately address the pandemic on an international scale. Such safe and effective vaccines would be especially appealing for international deployment if they also have favorable stability, supply, and potential for implementation in mass vaccination campaigns. Randomized trials provide particularly reliable insights about vaccine efficacy and safety. While enhanced efficiency and interpretability can be obtained from placebo-controlled trials, in settings where their conduct is no longer possible, randomized non-inferiority trials may enable obtaining reliable evaluations of experimental vaccines through direct comparison with active comparator vaccines established to have worthwhile efficacy.
Methods:
The usual objective of non-inferiority trials is to reliably assess whether the efficacy of an experimental vaccine is not unacceptably worse than that of an active control vaccine previously established to be effective, likely in a placebo-controlled trial. This is formally achieved by ruling out a non-inferiority margin identified to be the minimum threshold for what would constitute an unacceptable loss of efficacy. This article not only investigates non-inferiority margins, denoted by
Results:
Using the margin
Conclusion:
Non-inferiority trials using the proposed margins may enable reliable randomized evaluations of efficacy and safety of experimental COVID-19 vaccines. Such trials often require approximately two- to three-fold the person-years follow-up than a placebo-controlled trial. This could be achieved, without substantive increases in sample size, by increasing the average duration of follow-up from 2 months to approximately 4–6 months, assuming efficacy of the active comparator vaccine has been reliably evaluated over that longer duration.
Introduction
Safe and effective vaccines that meaningfully reduce the spread of SARS-CoV-2 virus will have indisputable value in addressing the COVID-19 pandemic, which has disrupted health and taken lives around the world. New vaccines have been developed and testing begun at an unprecedented pace, with at least seven vaccines in ongoing placebo-controlled randomized trials. 1 Additional vaccines are expected to enter placebo-controlled trials soon, including through the imminent initiation of the World Health Organization (WHO) Solidarity Vaccines Trial 2 that follows the principles of a core protocol. 3 This platform trial is designed to evaluate multiple candidate vaccines against a common placebo control, where new candidates can be added to the randomization as soon as they become available, meet local regulatory standards, and meet WHO’s prioritization criteria. 4 These trials are rigorously designed with “virologically confirmed symptomatic COVID-19 symptomatic disease” as the primary endpoint. If “vaccine efficacy” denotes the relative reduction in the rate of such primary endpoint events in a vaccinated group of participants compared to placebo controls, then the WHO- and Food and Drug Administration (FDA)-recommended standard for worthwhile efficacy is having a point estimate of ≥50% vaccine efficacy with a 95% confidence interval (CI) lower bound of ≥30% efficacy, chosen to assure that deployed vaccines do more good than harm.5,6
Recently, initial reports of high efficacy in the short term for several vaccines have been published. Two vaccines manufactured by Pfizer/BioNtech and Moderna, which use novel mRNA technology, are yielding estimated efficacy of 94%–95%7,8 over a median follow-up of about 2 months. Additional reports of two vaccines that use Adenovirus vectors also have been disclosed, from AstraZeneca/Oxford in the United Kingdom and the Gamaleya Research Institute in Russia, with initial estimates of efficacy ranging from 62%–92%.9,10
The mRNA-based vaccines have started to become available, and access is expected to increase in many wealthy nations over the next several months. However, these vaccines have significant challenges in manufacturing and distribution, with requirements for cold temperatures during transport and storage that may make them particularly challenging for worldwide distribution in the short term. The adenovirus vaccines have potential advantages over the mRNA-based vaccines in manufacturability and distribution; with some countries including India building capacity to manufacture their own supply in the near future, the AstraZeneca vaccine may become available to a wide set of countries before the mRNA vaccines can.
However, even with these exciting reports, it is still important that other vaccine regimens be evaluated. Widespread implementation of multiple safe and effective vaccines will be needed given the breadth of the pandemic. Vaccines that can be administered in a single dose would be particularly useful in mass vaccination campaigns, and open questions remain about the durability of any vaccine effects and the potential for emerging concerns about safety.
Methods
Non-inferiority studies can be an important tool in evaluating the efficacy of a new vaccine when there is one established in that population, and when randomization to placebo is not possible. The frequent goal in a non-inferiority trial is to reliably assess whether the efficacy of an experimental vaccine is not unacceptably worse than that of an active control vaccine that previously had been established to be effective, likely in a placebo-controlled trial. This is formally achieved by identifying a minimum threshold for what would constitute an unacceptable loss of efficacy, that is, a non-inferiority margin, and then designing the non-inferiority trial to rule out that margin. An important consideration in the design and conduct of non-inferiority trials is the need to address the inherent uncertainty about whether the effect of the active comparator vaccine, as estimated in its placebo-controlled trial, reliably represents its true effect in the setting of the non-inferiority trial. This is referred to as the constancy assumption.
To illustrate the fundamental importance of the constancy assumption, suppose an active comparator vaccine truly has vaccine efficacy of 95% over a short 2-month duration of follow-up, and true vaccine efficacy of 88% over 6 months of follow-up. Suppose further that, in its placebo-controlled trial, the active control vaccine was evaluated over only 2 months, but the non-inferiority trial will follow for events over 6 months. If it was inaccurately assumed that the active comparator vaccine would have the same 95% vaccine efficacy over 6 months, the resulting violation of the constancy assumption would lead to meaningfully overestimating an experimental vaccine with true 30% vaccine efficacy as being 71% = 100 [1 – (1 − 0.3){(1 − 0.95)/(1 − 0.88)}]%.
Based on these insights, one important consideration in the identification of the margin in the non-inferiority trial is to address the inherent uncertainty about the validity of the constancy assumption, while a second relates to ensuring the experimental vaccine achieves proper preservation of effect. These two are formally stated to be as follows: 11
Consideration A: the non-inferiority margin should be formulated using adjustments to account for bias or lack of reliability in the estimate of the effect of the active comparator regimen in the setting of the non-inferiority trial.
Consideration B: the non-inferiority margin should be formulated to achieve preservation of an appropriate percentage of the effect of the active comparator regimen.
One could take several approaches to properly address Considerations A and B when formulating margins in non-inferiority trials, especially in the specific context of vaccines to stop a pandemic. A widely implemented approach with precedent for regulatory support is the “95–95” method,11,12,13 in which Consideration A is addressed by assuming the true effect of the active comparator vaccine in the non-inferiority trial would be the lower limit of the 95% CI for its estimated vaccine efficacy in the setting of the previously conducted randomized placebo-controlled trial(s); Consideration B is often addressed by preserving at least 50% of the effect of the active comparator vaccine, where, as discussed below, this effect is estimated using the active comparator to placebo hazard ratio (HR) in this time-to-event analysis setting.
Cox regression analyses are used to estimate the HR or relative rates of primary endpoint events on a vaccine versus a comparator regimen. Data from the placebo-controlled randomized trial that established the active comparator vaccine as having worthwhile efficacy are used to estimate the active comparator to placebo HR and, in turn, the active comparator’s vaccine efficacy is estimated as 100 (1 − HR). Working in the context of the estimated HR and thus, using the log-scale when calculating half the estimated effect, we are led to the following formula
12
for the non-inferiority margin
The term, (95% CI upper limit of the HR)½, is the “preservation of effect” adjustment and addresses Consideration B. Note that equation (1) simplifies to
If the trial is conducted in a setting where there would be emerging availability of an effective vaccine and thus that would be a proper control regimen, yet at a time when current availability of safe and effective vaccines would not meet local and worldwide needs, then a non-inferiority margin more lenient than
Hence, in the non-inferiority trial, ruling out that the experimental to active comparator HR is ≥
Finally, there might be reasons to choose a margin between
Results
We will explore the properties of non-inferiority trials using margins
Consider non-inferiority trials designed with primary analysis to rule out the non-inferiority margin,
RCT: randomized controlled trial; CI: confidence interval; HR: hazard ratio; NI: non-inferiority; PLA: placebo.
Calculated assuming 90% power when vaccine efficacy of the experimental (EXP) and active control (AC) vaccines is equal, using a statistic having 2.5% false positive error when
This represents the highest estimated experimental (EXP) to active control (AC) estimated hazard ratio that yields a positive result in the non-inferiority trial.
Consider non-inferiority trials designed with primary analysis to rule out the non-inferiority margin,
RCT: randomized controlled trial; CI: confidence interval; HR: hazard ratio; NI: non-inferiority; PLA: placebo.
Calculated assuming 90% power when the experimental vaccine (EXP) has 60% vaccine efficacy, using a statistic having 2.5% false positive error when
This represents the highest estimated experimental (EXP) to active control (AC) estimated hazard ratio that yields a positive result in the non-inferiority trial.
Consider non-inferiority trials designed with primary analysis to rule out the non-inferiority margin,
HR: hazard ratio; NI: non-inferiority.
The approaches to non-inferiority presented in this article could also be contemplated as part of a hybrid approach, for settings where the placebo control is replaced by an active comparator vaccine, either in the same or a different trial. The hybrid approach would efficiently aggregate evidence about the efficacy of a candidate experimental vaccine, by combining evidence about the efficacy of that experimental vaccine obtained from the placebo-controlled and active comparator settings. Such hybrid approaches are not further considered here.
Scenario 1
Scenario 1 involves the use of an active comparator vaccine having vaccine efficacy that is 95% over 2 months and 90% over 6 months, in non-inferiority trials designed with primary analysis to rule out the non-inferiority margin,
The first scenario we consider is the one where a vaccine with very high efficacy becomes available in a region, such as the United States. For illustration purposes, we assume that this vaccine has been estimated to have 95% vaccine efficacy over 2 months and 90% over 6 months. While we consider both timeframes for a potential non-inferiority trial, we note that the number of participants and time to accrue an adequate number of infections with two highly effective vaccines make a 6-month non-inferiority trial more likely.
Considering first a 2-month trial, we assume an active comparator vaccine has estimated 95% vaccine efficacy, and a lower bound for the 95% CI of 0.9145 from 175 events in the placebo-controlled randomized trial. This level of evidence was achieved by both Moderna and Pfizer at the time of their requests to the FDA to grant an Emergency Use Authorization.7,8
Translating this onto the HR scale (see Table 1), the estimated active comparator to placebo HR is 0.05, with a 95% CI upper limit of 0.0855. Applying equations (1) and (2), we calculate the margins
With 34 events, the least favorable result to rule out the
Consider instead a 6-month trial comparing to an active comparator vaccine with 90% vaccine efficacy based on a randomized trial accruing 350 cases by this 6-month mark. In the HR scale, the estimated active comparator to placebo HR is 0.10, with a 95% CI upper limit of 0.1348. Applying equations (1) and (2), the margins are
With 48 events, the least favorable result to rule out the
In scenario 1, the 34-event trial comparing two vaccines having approximate 95% vaccine efficacy and the 48-event trial comparing two vaccines having approximate 90% efficacy would require approximately two- to three-fold person years of follow-up relative to a frequently used design of a 150-event placebo-controlled trial of a vaccine having 60% vaccine efficacy, assuming these trials were conducted in settings having similar attack rates. For this reason, as noted earlier, the scenario of the 6-month non-inferiority trial seems more likely.
Based on these insights, when the active comparator vaccine has very high efficacy, even when using
Scenario 2
Scenario 2 involves the use of an active comparator vaccine having vaccine efficacy of 60% over 4–6 months, in non-inferiority trials designed with primary analysis to rule out the non-inferiority margin,
Suppose the placebo-controlled evidence for the active comparator vaccine exceeds the threshold for meeting the WHO–FDA criteria for success, by having 60% estimated vaccine efficacy and, with 350 events, a lower limit of the 95% CI that is 50.0%. Then, the estimated active comparator to placebo HR is 0.4 and the 95% CI upper limit of the HR is 0.500. Plugging in the observed upper bound of 0.500 into equations (1) and (2),
Continue to assume the placebo-controlled active comparator trial had 350 events and a new experimental vaccine has true vaccine efficacy of 60%. Then, the alternative hypothesis for the HR for the experimental to active control vaccine is 1.0. Under the hypothesis that the true HR is 1.0, and preserving a 2.5% false positive error rate, a non-inferiority trial based on a margin of
As in scenario 1, the 164-event trial in scenario 2 would require approximately two- to three-fold person years of follow-up relative to a 150-event placebo-controlled trial of a vaccine having 60% vaccine efficacy, assuming these trials were conducted in settings having similar attack rates. However, unlike scenario 1, in scenario 2—where the active comparator vaccine has an estimated true vaccine efficacy in the range of approximately 60%, as detailed Table 2—trials would be well powered to rule out the non-inferiority margin,
Further increases in efficiency could be obtained through interim monitoring. In scenario 2 where the active comparator vaccine would have vaccine efficacy of 60% over 4–6 months, for an experimental vaccine having considerably higher true efficacy, an interim analysis in the non-inferiority trial could be definitively positive. These interim evaluations could be achieved, for example, using standard group sequential monitoring boundaries to assess whether interim data are sufficiently favorable to rule out the non-inferiority margin
Conclusion
Non-inferiority trials using margins proposed in this article may provide the ability to obtain reliable randomized evaluations of efficacy and safety of experimental COVID-19 vaccines. Such trials are well powered to reliably evaluate experimental vaccines that truly are similarly effective to an active comparator vaccine having any level of “worthwhile” efficacy. However, when the active comparator vaccine has efficacy ≥90%, an important limitation of this non-inferiority approach is its low power to confirm, as worthwhile, a safe and effective experimental vaccine having a favorable 60%–70% level of efficacy and a desirable profile such as characteristics readily enabling mass production. Use of the proposed more lenient non-inferiority margin,
Non-inferiority trials, as presented in the scenarios in Tables 1 and 2, often require approximately two- to three-fold the person-years follow-up relative to a placebo-controlled trial of an experimental vaccine having hypothesized 60% vaccine efficacy. Given this, together with the likelihood that attack rates might be reduced by the impact of available vaccines with “worthwhile efficacy” in the regions in which the non-inferiority trial would be conducted, it seems likely that the duration of the non-inferiority trial would be 4–6 months, if not longer. In turn, to properly derive the non-inferiority margin, evidence about the effect of the active comparator regimen would need to be available over a similar duration.
The reliability of non-inferiority trials depends on the validity of the constancy assumption, that is, that the true efficacy of the active comparator vaccine in the setting of the non-inferiority trial will be accurately estimated using evidence about its effect from its placebo-controlled trial. Hence, validity of the non-inferiority trial could be influenced by factors that might meaningfully alter the efficacy of the active comparator regimen, such as whether the non-inferiority trial and the placebo-controlled trial that evaluated the active comparator vaccine are conducted in populations with adequately similar strains of SARS-CoV-2 virus and, as noted above, have similar durations of follow-up. To illustrate how the constancy assumption could be violated in an impactful manner, consider a plausible scenario where an active comparator’s efficacy is very high over the 2-month interval it was evaluated in the placebo-controlled trial, yet meaningfully wanes during the next 4 months. In a non-inferiority trial following participants over 6 months, if its non-inferiority margin was derived under the false assumption that the 2-month level of efficacy of the active comparator was sustained over 6 months, this violation of the constancy assumption would result in a substantial overestimation of the efficacy of the experimental vaccine. Hence, in potential scenarios considered in this article, a fundamentally important assumption is the duration of follow-up in the non-inferiority trial does not exceed the follow-up duration in the placebo-controlled randomized clinical trial that evaluated the active comparator vaccine.
The above scenario also makes it clear that, even in placebo-controlled trials that produce short-term 95% vaccine efficacy,7,8 it is important to continue to follow participants in a blinded manner as long as possible. While recent publications have provided strong motivation to do so based on the importance of obtaining reliable insights about durability of efficacy, long-term safety and effects on severe disease,5,6,15 it is important to recognize that extending the length of blinded follow-up would have the additional positive consequence of improving our ability to use such vaccines as active comparators in non-inferiority trials.
Placebo-controlled trials are particularly efficient in providing reliable and interpretable evidence about efficacy and safety of COVID-19 vaccines. They would be a preferred design in settings where countries have limited or no access to licensed vaccines having worthwhile efficacy. 15 However, in settings where placebo-controlled trials would no longer be possible due to emerging availability of safe and effective vaccines, non-inferiority trials would be ethically and scientifically appealing, given the need for multiple safe and effective vaccines. There is considerable need for new vaccines that not only have a particularly favorable safety profile or improved efficacy but also could be administered in a single dose, without cold chain constraints, and with scalability enhancing the ability to enable mass vaccination campaigns. It is likely that non-inferiority trial designs, such as those discussed in this article, soon will be needed to achieve these objectives and, in turn, to succeed in the battle against the COVID-19 pandemic.
Footnotes
Acknowledgements
The authors thank Colin Begg and his fellow senior editors of the journal for the timely and substantive guidance provided in their review of this article.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The source of financial support for research described in this article, in part, is the National Institutes of Health (NIH)/National Institute of Allergy and Infectious Diseases (NIAID) grant entitled “Statistical Issues in AIDS Research” (R37 AI 29168). The opinions expressed in this article do not necessarily reflect those of the US Food and Drug Administration, the US National Institutes of Health, or the World Health Organization.
