Sage Journals: Discover world-class research

Abstract

We conducted a systematic comparison of statistical methods used for the analysis of time-to-event outcomes under various proportional and non-proportional hazard (NPH) scenarios. Our study used data from recently published oncology trials to compare the Log-rank test, still by far the most widely used option, against some available alternatives, including the MaxCombo test, the Restricted Mean Survival Time difference test, the Generalized Gamma model and the Generalized F model. Power, type I error rate, and time-dependent bias with respect to the survival probability and median survival time were used to evaluate and compare the performance of these methods. In addition to the real data, we simulated three hypothetical scenarios with crossing hazards chosen so that the early and late effects “cancel out” and used them to evaluate the ability of the aforementioned methods to detect time-specific and overall treatment effects. We implemented novel metrics for assessing the time-dependent bias in treatment effect estimates to provide a more comprehensive evaluation in NPH scenarios. Recommendations under each NPH scenario are provided by examining the type I error rate, power, and time-dependent bias associated with each statistical approach.

Keywords

Clinical trials survival analysis immuno-oncology trials nonproportional hazards

Get full access to this article

View all access options for this article.

References

Schoenfeld

. The asymptotic properties of nonparametric tests for comparing survival distributions. Biometrika 1981; 68(1): 316–319.

Hodi

O’Day

McDermott

, et al. Improved survival with ipilimumab in patients with metastatic melanoma. N Engl J Med 2010; 363(8): 711–723.

Small

Schellhammer

Higano

, et al. Placebo-controlled phase III trial of immunologic therapy with sipuleucel-T (APC8015) in patients with metastatic, asymptomatic hormone refractory prostate cancer. J clin Oncol 2006; 24(19): 3089–3094.

Wolchok

Neyns

Linette

, et al. Ipilimumab monotherapy in patients with pretreated advanced melanoma: a randomised, double-blind, multicentre, phase 2, dose-ranging study. Lancet Oncology 2010; 11(2): 155–164.

Alexander

Schoenfeld

Trippa

. Hazards of hazard ratios-deviations from model assumptions in immunotherapy. N Engl J Med 2018; 378(12): 1158–1159.

Rahman

Ventz

Fell

, et al. Divining responder populations from survival data. Ann Oncol 2019; 30(6): 1005–1013.

Disis

. Mechanism of action of immunotherapy. In: Seminars in oncology, volume 41. Elsevier, 2014. pp.S3–S13.

Mok

Thongprasert

, et al. Gefitinib or carboplatin–paclitaxel in pulmonary adenocarcinoma. N Engl J Med 2009; 361(10): 947–957.

Ananthakrishnan

Green

Previtali

, et al. Critical review of oncology clinical trial design under non-proportional hazards. Crit Rev Oncol Hematol 2021; 162: 103350.

10.

Magirr

. Non-proportional hazards in immuno-oncology: Is an old perspective needed? Pharm Stat 2021; 20(3): 512–527.

11.

Fleming

Harrington

. A class of hypothesis tests for one and two sample censored survival data. Commun Stat-Theory Methods 1981; 10(8): 763–794.

12.

Roychoudhury

Anderson

, et al. Robust design and analysis of clinical trials with nonproportional hazards: a straw man guidance from a cross-pharma working group. Stat Biopharm Res 2023; 15(2): 280–294.

13.

Magirr

Burman

. Modestly weighted logrank tests. Stat Med 2019; 38(20): 3782–3790.

14.

Pepe

Fleming

. Weighted kaplan–meier statistics: Large sample and optimality considerations. J R Stat Soc: Ser B (Methodological) 1991; 53(2): 341–352.

15.

Zhao

Claggett

Tian

, et al. On the restricted mean survival time curve in survival analysis. Biometrics 2016; 72(2): 215–221.

16.

Davis

Xie

. Caution: hazards crossing! using the renyi test statistic in survival analysis. Pharma AUG2011-Paper SP06. 2011.

17.

Cox

Chu

Schneider

, et al. Parametric survival analysis and taxonomy of hazard functions for the generalized gamma distribution. Stat Med 2007; 26(23): 4352–4374.

18.

Shen

Wang

Sirisha

, et al. Nonproportional hazards—an evaluation of the maxcombo test in cancer clinical trials. Stat Biopharm Res 2023; 15(2): 300–309.

19.

Royston

Parmar

. A simulation study comparing the power of nine tests of the treatment effect in randomized controlled trials with a time-to-event outcome. Trials 2020; 21: 1–17.

20.

Cox

. The generalized F distribution: An umbrella for parametric survival analysis. Stat Med 2008; 27(20): 4301–4312.

21.

Grambsch

Therneau

. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 1994; 81(3): 515–526.

22.

Xue

Wang

Yan

, et al. An online updating approach for testing the proportional hazards assumption with streams of survival data. Biometrics 2020; 76(1): 171–182.

23.

Fleming

Harrington

. Counting processes and survival analysis. Hoboken, New Jersey: John Wiley & Sons, 2005.

24.

Andersen

Hansen

Klein

. Regression analysis of restricted mean survival time based on pseudo-observations. Lifetime Data Anal 2004; 10: 335–350.

25.

Royston

Parmar

. Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Med Res Methodol 2013; 13: 1–15.

26.

Royston

Parmar

. The use of restricted mean survival time to estimate the treatment effect in randomized clinical trials when the proportional hazards assumption is in doubt. Stat Med 2011; 30(19): 2409–2421.

27.

Trinquart

Jacot

Conner

, et al. Comparison of treatment effects measured by the hazard ratio and by the ratio of restricted mean survival times in oncology randomized controlled trials. J Clin Oncol 2016; 34(15): 1813–1819.

28.

Tian

Jin

Uno

, et al. On the empirical choice of the time window for restricted mean survival time. Biometrics 2020; 76(4): 1157–1166.

29.

Jackson

Sharples

Thompson

. Survival models in health economic evaluations: balancing fit and parsimony to improve prediction. Int J Biostat 2010; 6(1).

30.

Box-Steffensmeier

Jones

. Event history modeling: A guide for social scientists. New York: Cambridge University Press, 2004.

31.

Jackson

. flexsurv: a platform for parametric survival modeling in R. J Stat Softw 2016; 70(8): 1–33.

32.

Wei

. The accelerated failure time model: a useful alternative to the cox regression model in survival analysis. Stat Med 1992; 11(14-15): 1871–1879.

33.

Prentice

. Discrimination among some parametric models. Biometrika 1975; 62(3): 607–614.

34.

Lin

Roychoudhury

, et al. Alternative analysis methods for time to event endpoints under nonproportional hazards: a comparative analysis. Stat Biopharm Res 2020; 12(2): 187–198.

35.

Guyot

Ades

Ouwens

, et al. Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan–Meier survival curves. BMC Med Res Methodol 2012; 12: 1–13.

36.

Benboubker

Dimopoulos

Dispenzieri

, et al. Lenalidomide and dexamethasone in transplant-ineligible patients with myeloma. N Engl J Med 2014; 371(10): 906–917.

37.

Kantarjian

DeAngelo

Stelljes

, et al. Inotuzumab ozogamicin versus standard therapy for acute lymphoblastic leukemia. N Engl J Med 2016; 375(8): 740–753.

38.

Bevacizumab-ProductLabel. Genentech. https://www.accessdata.fda.gov/drugsatfda_docs/label/2020/125085s337lbl.pdf. 2020.

39.

Burger

Brady

Bookman

, et al. Incorporation of bevacizumab in the primary treatment of ovarian cancer. N Engl J Med 2011; 365(26): 2473–2483.

40.

Lin

Roychoudhury

, et al. Alternative analysis methods for time to event endpoints under nonproportional hazards: A comparative analysis. Stat Biopharm Res 2020; 12(2): 187–198.

41.

Rajkumar

Jacobus

Callander

, et al. Lenalidomide plus high-dose dexamethasone versus lenalidomide plus low-dose dexamethasone as initial therapy for newly diagnosed multiple myeloma: an open-label randomised controlled trial. Lancet Oncology 2010; 11(1): 29–37.

42.

Group NMAC. Chemotherapy in addition to supportive care improves survival in advanced non–small-cell lung cancer: a systematic review and meta-analysis of individual patient data from 16 randomized controlled trials. J Clin Oncol 2008; 26(28): 4617–4625.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.01 MB

Statistical considerations for evaluating treatment effect under various non-proportional hazard scenarios

Abstract

Keywords

Get full access to this article

References

Supplementary Material