Abstract

Background

Well-designed phase II trials must have acceptable error rates relative to a pre-specified success criterion, usually a statistically significant p-value. Such standard designs may not always suffice from a clinical perspective because clinical relevance may call for more. For example, proof-of-concept in phase II often requires not only statistical significance but also a sufficiently large effect estimate.

Purpose

We propose dual-criterion designs to complement statistical significance with clinical relevance, discuss their methodology, and illustrate their implementation in phase II.

Methods

Clinical relevance requires the effect estimate to pass a clinically motivated threshold, the decision value (DV). In contrast to standard designs, the required effect estimate is an explicit design input, whereas study power is implicit. The sample size for a dual-criterion design requires careful consideration of the study’s operating characteristics (type I error, power).
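The two criteria described above can be sketched for an approximately normal effect estimate. This is a minimal illustration, not the authors' implementation: the function names, the one-sided z-test, the default `alpha`, and the convention that larger effects are better are all assumptions made for the example.

```python
from statistics import NormalDist


def dual_criterion(estimate, std_error, decision_value, null_value=0.0, alpha=0.025):
    """Evaluate both criteria for an approximately normal effect estimate.

    Assumptions (not from the source): one-sided z-test, larger effect = better.
    """
    z = (estimate - null_value) / std_error
    p_value = 1.0 - NormalDist().cdf(z)      # one-sided p-value against the null
    significant = p_value < alpha            # statistical significance
    relevant = estimate >= decision_value    # clinical relevance: estimate passes the DV
    return {
        "p_value": p_value,
        "significant": significant,
        "relevant": relevant,
        "go": significant and relevant,      # both criteria must hold
    }


def go_probability(true_effect, std_error, decision_value, null_value=0.0, alpha=0.025):
    """Probability that both criteria are met for a given true effect.

    Both criteria are thresholds on the same estimate, so the stricter one binds.
    """
    z_crit = NormalDist().inv_cdf(1.0 - alpha)
    threshold = max(decision_value, null_value + z_crit * std_error)
    return 1.0 - NormalDist().cdf((threshold - true_effect) / std_error)
```

Under these assumptions, `go_probability` evaluated at the null value gives the type I error and evaluated at an alternative gives the power. When the DV is the binding threshold and the true effect equals the DV, the result is 0.5, which shows in what sense power is implicit rather than a direct design input.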

Results

Dual-criterion designs are discussed for a randomized controlled and a single-arm phase II trial, including decision criteria, sample size calculations, decisions under various data scenarios, and operating characteristics. The designs facilitate GO/NO-GO decisions due to their complementary statistical–clinical criterion.
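For a single-arm trial with a binary endpoint, operating characteristics of this kind can be computed exactly by enumerating the possible responder counts. The sketch below is a hedged illustration only: the exact one-sided binomial test, the helper names, and any numbers passed in are assumptions for the example and are not taken from the article.

```python
from math import comb


def binom_pmf(x, n, p):
    """Probability of exactly x responders among n patients with response rate p."""
    return comb(n, x) * p**x * (1.0 - p) ** (n - x)


def upper_tail(x, n, p):
    """P(X >= x) for X ~ Binomial(n, p); used as a one-sided exact p-value."""
    return sum(binom_pmf(k, n, p) for k in range(x, n + 1))


def go_counts(n, p0, decision_value, alpha):
    """Responder counts meeting both criteria (assumed rule):
    exact one-sided p-value <= alpha and observed rate >= decision value."""
    return [
        x
        for x in range(n + 1)
        if upper_tail(x, n, p0) <= alpha and x / n >= decision_value
    ]


def prob_go(n, p_true, p0, decision_value, alpha):
    """Probability of meeting both criteria when the true response rate is p_true."""
    return sum(binom_pmf(x, n, p_true) for x in go_counts(n, p0, decision_value, alpha))
```

Calling `prob_go` with `p_true` equal to `p0` gives the type I error of the combined rule, and calling it with a plausible alternative rate gives the power. Repeating this over a range of `n` is one simple way to explore sample size under these assumptions.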

Limitations

While conceptually simple, implementing a dual-criterion design needs care. The clinical DV must be elicited carefully in collaboration with clinicians, and understanding how the design resembles and differs from a standard design is crucial.

Conclusion

To improve evidence-based decision-making, a formal yet transparent quantitative framework is important. Dual-criterion designs offer an appealing statistical–clinical compromise, which may be preferable to standard designs if evidence against the null hypothesis alone does not suffice for an efficacy claim.

Keywords

Clinical relevance, dual-criterion, evidence, GO/NO-GO, operating characteristics, phase II design, proof-of-concept, statistical significance


Beyond p-values: A phase II dual-criterion design with statistical significance and clinical relevance