With the advancement of medical treatments, many historically incurable diseases have become curable. An accurate estimation of the cure rates is of great interest. When there are no clear biomarker indicators for cure, the estimation of cure rate is intertwined with and influenced by the specification of hazard functions for uncured patients. Consequently, the commonly used proportional hazards (PH) assumption, when violated, may lead to biased cure rate estimation. Meanwhile, longitudinal biomarker measurements for individual patients are usually available. To accommodate non-PH functions and incorporate individual longitudinal biomarker trajectories, we propose a new joint model for cure, survival, and longitudinal data, with hazard ratios between different covariate subgroups flexibly varying over time. The proposed joint model has individual random effects shared between its longitudinal and cure-survival submodels. The regression parameters are estimated by maximization of the non-parametric likelihood via the Monte Carlo expectation-maximization algorithm. The standard error estimation applies a jackknife resampling method. In simulation studies, we consider crossing and non-crossing survival curves, and the proposed model provides unbiased estimates for the cure rates. Our proposed joint cure model is illustrated via a study of chronic myeloid leukemia.
In clinical studies, longitudinal biomarkers are usually available to track patients’ disease progression and evaluate treatment effects on survival. One may attempt to analyze longitudinal and time-to-event data separately. However, there are limitations on this simple approach. One of the concerns is the potential presence of informative censoring when the correlations between longitudinal and survival data are not taken into account. Consequently, joint modeling of longitudinal and survival data is more commonly accepted. Moreover, when they are jointly modeled, more precise parameter estimation and more efficient statistical inference would be obtained.1 Most joint models incorporate individual-level random effects that are shared between and impact both the longitudinal and survival submodels. Papageorgiou et al.2 presented an overview of joint modeling, and Ibrahim et al.3 discussed advantages of joint models for longitudinal and survival data in cancer clinical trials. Furthermore, joint models have the ability to predict patients’ risks at an individual level.
A proportion of patients with some diseases may be cured after treatment and thus never experience the failure event of interest (e.g., disease recurrence); such diseases include but are not limited to colorectal cancer,4–6 leukemia,7 and breast cancer.8,9 Their survival time distribution (cured and uncured combined), when estimated by the Kaplan–Meier (KM) method, presents a stable plateau at its right tail, given a sufficiently long follow-up time. Various cure models are proposed to model survival data with a cure fraction.
The two main groups of cure models are the mixture cure model and the promotion time cure (PTC) model, introduced by Boag10 and Yakovlev and Tsodikov,11 respectively. Amico and Keilegom12 and Peng and Yu13 provided a comprehensive summary of cure models. The mixture cure models could be parametric,10,14–16 semi-parametric,17–20 or non-parametric.21,22 Milienos23 introduced a new reparameterization of a flexible family of cure models including the mixture cure model incorporating data with a small or no cured proportion. Safari et al.24 proposed an estimation method for the latency function (of the susceptible patients) in the mixture cure model when the cure status is partially available. In the mixture cure model for the interval-censored data, Pal et al.25 implemented the support vector machine approach to estimate the cure proportion, and Pan et al.26 proposed a Bayesian estimation method.
Different forms of the PTC model have been proposed, such as the proportional hazards cure (PHC) model,27,28 a parametric cure model,29 and a non-parametric PTC model.30 Liu and Shen31 developed a PTC model for the interval-censored data. Pal and Aselisewine32 proposed to use a support vector machine method to model the incidence part of the PTC model. However, only a limited number of studies focused on cure models’ performance when some covariates violate the proportional hazards (PH) assumption. Unlike the traditional cure models, some models utilize frailty terms to explain observed and unobserved heterogeneity among patients.33–36 Although these proposed models allow for covariates’ non-PH structures, they bring additional assumptions on survival data and contain complex model structures.
A motivating example for our study is a randomized phase III study (CA180-034) for dose optimization.37 It compared different dose schedules of dasatinib in 670 patients with chronic-phase chronic myeloid leukemia (CML) who could not receive imatinib anymore due to resistance or intolerance. CML is caused by an error of a chromosomal translocation, occurring on the Abelson (ABL1) protooncogene in chromosome 9 and the breakpoint cluster region (BCR) gene in chromosome 22.38,39 In this data set, the BCR-ABL fusion gene’s expression level was a longitudinal biomarker varying between 0.03 and 99.99, reflecting how well dasatinib inhibited the growth of the BCR-ABL expressing gene for each patient within the time of the study.40 The event of interest was disease progression. All patients’ times to progression statuses were recorded as survival outcomes. Among all patients, 566 patients had at least one longitudinal value during the follow-up time. Figure 1(a) presents the trajectories of randomly selected 20 patients’ logarithms of BCR-ABL expression gene levels as the black curves and the overall trajectory of all 566 patients’ logarithms of BCR-ABL expression gene levels as the blue curve over the logarithms of the measurement times in years smoothed by the loess function. With the treatment, patients tended to have less severe disease status on average over time reflected by the decreasing overall curve. Figure 1(b) shows KM estimates of progression-free survival (PFS) by groups of baseline BCR-ABL expression levels categorized by the quartiles. The stable plateau at the right tail of each survival curve indicates a proportion of cured patients in the corresponding subgroup. The non-parametric test of Maller and Zhou41 further verified the existence of the cure proportions in all the baseline BCR-ABL expression level groups and among all patients (-values ). Patients with lower baseline BCR-ABL expression levels tended to have larger short- and long-term survival probabilities.
(a) Trajectories (black curves) of randomly selected 20 patients’ and the overall trajectory (blue curve) of 566 patients’ logarithms of BCR-ABL expression gene levels over logarithms of measurement times and (b) Kaplan–Meier estimates of progression-free survival by groups of baseline BCR-ABL expression levels defined by the quartiles (14.68, 39.35, and 73.69) in the chronic myeloid leukemia study for 566 patients with at least one follow-up biomarker measurement after the baseline. BCR: breakpoint cluster region; ABL: Abelson.
In this motivating example, to obtain patient subgroups’ cure rates, we can use the baseline information only, which may result in biased estimated cure rates. Therefore, we must jointly model longitudinal and cure-survival data to incorporate the correlation between longitudinal and survival data and the within-patient correlation of repeatedly measured biomarker values. In the joint modeling framework, we encounter a baseline covariate’s violation of the PH assumption. In detail, the baseline predictors that we are most interested in are the dose level (100 or 140 mg daily) and age (< or years). The Grambsch-Therneau test42 revealed that only the age group did not follow the PH assumption (p-value = 0.026) in a Cox PH model with the following covariates: dose level group, age group, and baseline BCR-ABL expressing gene levels. Therefore, we need to have a cure submodel that can handle covariates’ non-PH structure in the joint model. This motivating example demonstrates the need for cure models that can deal with the violation of the PH assumption while jointly modeling the longitudinal and cure-survival data to improve cure rate estimation for patient subgroups.
Several approaches have been proposed for the joint modeling of longitudinal and cure-survival data. The joint model proposed by Law et al.43 modeled the longitudinal biomarker by a nonlinear hierarchical mixed-effects submodel and the time-to-event data by a time-dependent Cox PH submodel conditional on random effects. Yu et al.44,45 also implemented a nonlinear hierarchical mixed submodel to model the longitudinal data, but the cure-survival data was modeled by a mixture cure submodel with a time-dependent PH model for susceptible patients and a logistic function for the cure fraction. Pan et al.46 proposed a joint model with the longitudinal data modeled by a linear mixed (LM) submodel and the cure-survival data modeled by a mixture cure submodel, and the properly scaled random effects connect these two submodels. The joint model proposed by Yang et al.47 modeled longitudinal measurements by implementing a partly LM submodel with the -spline and fit a semi-parametric mixture cure submodel with shared gamma frailty and baseline hazard formed with -spline for survival data with a cure fraction. Barbieri and Legrand48 applied mixture cure submodels for time-to-event outcomes with cures, but they focused on different latent structures linking the mixture cure submodel and the LM submodel. Ekong et al.49 proposed a joint model with longitudinal outcome modeled using spline function and survival outcome modeled by the PH mixture cure model, where the two models were connected by shared random effects. The proposed joint model was estimated by approximate Bayesian inference using latent Gaussian models.
There are also a few joint models using the PTC submodel for cure-survival data. In the work of Brown and Ibrahim,50 the joint cure-rate model was formulated by modeling the longitudinal measure from a mixture distribution with a point mass at zero and adding time-vary covariates into the PTC submodel. Song et al.51 used the random effects from a LM submodel to link the longitudinal and PTC submodels. Kim et al.52 had a non-parametric baseline distribution in the PTC submodel and integrated the PH and odds cure models into a general form. Chi et al.53 proposed a joint cure model for longitudinal ordinal categorical item response data, whose cure submodel was a PTC model with the PH assumption. A few studies have considered longitudinal and cure-survival data with multiple longitudinal biomarkers.54,55 These joint cure models have not been assessed to have the ability to handle the violation of the PH assumption.
Even though a few studies have incorporated the longitudinal data into the cure models, to the best of our knowledge, no existing cure model has considered longitudinal biomarkers and also emphasized or evaluated the performance under the situation when some covariates follow the PH assumption while others do not. A false PH assumption can lead to biased cure rate estimates for patient subgroups, which may convey misleading messages about patients’ disease status or the effect of clinical treatments. We propose a new model accounting for longitudinal data and flexible patterns of hazard functions to simultaneously handle some baseline covariates with the PH structure and others with the non-PH structure in the longitudinal and cure-survival data.
In our joint model, we extend the PHC model to relax the PH assumption via flexible patterns of hazard functions for the cure-survival data and employ an LM-effects submodel for the longitudinal biomarker. The two submodels from the joint model share a common set of patients’ random effects. The cure submodel is named a flexible-hazards cure (FHC) submodel. Due to the flexible hazard functions, the FHC submodel does not force constant hazard ratios for different covariate levels of the baseline covariates over time. We implement a non-parametric maximum likelihood estimation method incorporating the Monte Carlo expectation-maximization (MCEM) algorithm for parameter estimation. In the FHC submodel, the standardized baseline cumulative hazard function has a non-parametric form and is estimated by the Lagrange multiplier method. The standard error estimation is achieved by the jackknife resampling method. In summary, compared to the existing models, the contributions of our proposed model are as follows: (1) While accounting for the longitudinal data, it possesses a natural way to capture the non-PH structure of baseline covariates, and even allows extreme cases with crossing survival curves; (2) when some covariate groups have a non-PH structure, it estimates their cure rates more accurately than current models in the literature; and (3) it poses no parametric assumptions on the standardized baseline cumulative hazard function.
In the simulation studies, we create scenarios with longitudinal and cure-survival data sets containing crossing and non-crossing survival curves. Our proposed joint model using an FHC submodel for cure-survival data (JMFHC) accurately estimates regression parameters, and its estimated survival curves match well with true survival functions in all scenarios. It also produces unbiased estimated cure rates for subgroups. When the PH assumption is not satisfied, as a comparison to our proposed model, a joint model using a PHC submodel for cure-survival data (JMPHC) cannot obtain unbiased cure estimates due to its incapability to relax the PH assumption. The simulation studies adequately present that JMFHC outperforms JMPHC.
The rest of the paper is organized as follows. Section 2 introduces the proposed model JMFHC and illustrates its property of flexible hazards. Section 3 presents the estimation method. In Section 4, we evaluate JMFHC’s performance and compare it with JMPHC under simulation scenarios with the violation and satisfaction of the PH assumption. In Section 5, we implement JMFHC and JMPHC to analyze the CML study and compute their estimated cure rates for different patient subgroups. Section 6 summarizes the study. The R package for fitting our proposed model can be found at https://github.com/cxie19/jmfhc.
Model
Notation
Denote the sample size as . All subjects are assumed to be independent. For subject , where , let be the total number of a biomarker’s observed measurements. The total number of biomarker measurements for all patients is denoted as . Denote the vector of these observed biomarker values for subject at measurement time points as . Let and be two vectors of baseline covariates of interest with lengths of and , respectively, which may or may not overlap with each other. Denote the time to an event of interest and the censoring time for subject as and , respectively. Let the observed survival time be min and the event indicator be , where is an indicator function. Given a subject’s random effects, we assume that its longitudinal and survival processes are independent.48,56,57
A linear mixed-effects submodel for the longitudinal biomarker
We let subject ’s observed biomarker values at a vector of measurement times be . Composed of its true biomarker values denoted as and measurement errors , is formulated as
where is an matrix for subject ’s baseline covariates, and its rows are the same and contain any covariates of and ; is a -length vector of fixed-effect regression parameters for ; and are an design matrix for fixed effects and an design matrix for random effects, respectively, with the first columns as 1’s and the remaining columns containing subject ’s functions of biomarker measurement time points (e.g., ), and and can have overlapping columns; and are vectors containing intercepts of fixed effects and random effects with lengths and for and , respectively; follows a multivariate normal distribution and the unstructured covariance matrix with elements of , and , for and ; follows a multivariate normal distribution .
Here all and are mutually independent. The conditional distribution of given the random effects is a multivariate normal distribution , and are independent. In summary, subjects’ longitudinal measurements are modeled by an LM submodel with subject-specific intercepts and slopes.
The new joint model with a flexible-hazards cure submodel for cure-survival data
The cure-survival data is modeled by an FHC submodel connecting with an LM submodel via shared random effects. The FHC submodel is an extension of the PHC model. The PHC model’s survival function at time conditional on subject ’s covariates is written as
where is an unknown scalar; is a -length vector of unknown regression parameters; and is a monotone increasing function with and . The PHC model can be considered as a Cox PH model with a bounded baseline cumulative hazard and called a bounded cumulative hazard model. As goes to infinity, tends to 1 and the cure rate can be computed as . Thus, is regarded as the long-term covariates. With subject ’s random effects from an LM submodel, the PHC submodel for the cure-survival data in the joint model can be defined as
where and are two vectors of unknown regression parameters with lengths and for baseline covariates and random effects , respectively.
To relax the PH assumption, we propose a new joint model, with an FHC submodel extending the above PHC submodel by raising a power on , as follows. The survival function in our joint model (i.e., JMFHC) for subject at time conditional on covariates and and random effects is assumed to be
where is a vector of unknown regression parameters with length . Based on our JMFHC, the cure rate conditional on and can be obtained as for subject . Since is not observed, the cure rate at baseline given can be computed as
where is the multivariate normal density function of . Parameters reflect the covariate effects on cure rates, and a positive indicates lower cure rates when the values of increase, for . On the other hand, for finite time where is less than one, the covariates can contribute to the short-term survival via . Since is fixed between 0 and 1, changes of values can shrink (when ) or expand (when ) the values to . When increases, a positive reflects a larger short-term survival probability,
When we take a logarithm of JMFHC’s cumulative hazard function, which is expressed as
it is in the form of a linear model of with as an intercept and as a slope. This slope term allows non-PH (including crossing) cumulative hazard functions. Changes of can only move the curve of up and down without changing its shape. Therefore, covariates following the PH assumption can only belong to . Depending on the effect patterns of and , covariates with the non-PH structure could be included in either only or both and . The term prompts flexibility of hazard functions and feasibility of capturing the non-PH structure of covariates without a complex model. The identifiability of our proposed model is evaluated shown in Section S1 of the Supplemental Material.
Estimation method
The data consists of independent samples, . The observed times are assumed to be in ascending order for the simplicity of notations in the estimation procedure. Denote the parameters as , where and , the observed values as with , for , and the unobserved values or random effects as .
With the unknown random effects, we use the expectation-maximization algorithm to estimate regression parameters . For subject , we let the hazard function at time be , whose expression is shown as
The survival function at time is expressed as
The density function of given is written as
The density function of is expressed as
With equations (4)–(7), the complete-data likelihood function of JMFHC is written as
Suppose is observed. The full likelihood function is written as
With the complex form of equation (9), subject ’s marginal likelihood function is expressed as
The integral in equation (10) does not have a closed form. In such case, the posterior distribution function of subject ’s random effects with equation (10) as the denominator, which is expressed as
does not have a closed form. We derive the expectation of the complete-data log-likelihood function with respect to the posterior distribution of random effects , which is written as
where
Hence, the MCEM algorithm is implemented for parameter estimation. In the E step, the expectation of the complete-data log-likelihood function with respect to the posterior distribution of random effects is numerically obtained through Monte Carlo simulations. More specifically, an adaptive Metropolis (AM) algorithm58 is implemented to generate samples of subject-specific random effects. In the M step, the parameters are updated by maximizing the expected log-likelihood function obtained in the E step.
Let be the number of iterations, where . We set as the initial step (Step 1 below). The E step of the MCEM algorithm is shown in Step 2, and the M step is presented as Steps 3–5. Steps 2–5 are considered as one iteration of estimation. A parameter value at the iteration has a superscript on the corresponding notation. The estimation procedure is outlined as follows:
Initial values of regression parameters in two submodels of JMFHC are obtained.
In the LM submodel, we use the restricted maximum likelihood59 to estimate parameters and , and predict subjects’ random effects denoted as .
In the cure submodel, given , we determine the initial values of parameters and . First, is set as . are estimated via the maximum partial likelihood estimation method under a standard Cox PH model. Denote the Breslow estimator as , where is the baseline hazard function at time . Based on the Breslow estimator, the initial value of is computed as . The initial values of are derived as The corresponding point mass of is computed as for subject , where is an indicator function. Given the rest of initial values of parameters , we obtain by maximizing the logarithm of equation (9).
At iteration , where , the AM algorithm58 is implemented to generate chains of the random effects . The target distribution for the Monte Carlo simulations is shown in equation (11) with the latest estimated parameters . The details of the AM algorithm are shown in Section S2 of the Supplemental Material. Suppose the number of samples in subject ’s final chain is after burn-in and thinning. The expectations of sufficient statistics involving subject ’s random effects can be computed with sampled random effects, such as
In the following steps, all the parameters are updated by maximizing equation (12) with all the latest estimated parameters.
The parameters in the longitudinal submodel , and are estimated by maximizing equation (12). By taking the partial derivatives of with respect to , and with the equations set to zero, we obtain the estimators of , and as follows:
where . Similarly, we solve and obtain
The parameters in the cure submodel , , and are estimated by maximizing equation (12) (using R function optim), which can be simplified as With , , and , the estimator is obtained by solving , which is written as
The function is updated. We implement the Lagrange multiplier method to derive jump sizes first, , under the constraint . Similar methods have been applied in the estimation of other cure models60,61 and a joint model of longitudinal cure-survival data.52 Define
where is the Lagrange multiplier and is equation (12) with each replaced by . By maximizing equation (13), we obtain the estimator of at the iteration, which is expressed as
where
When , . We solve for by using since is a monotone function of . An updated estimator can be obtained with the estimate . We update the function via
Steps 2–5 are iterated until convergence of estimated parameters . Based on our estimation method due to the constraint of ’s jump sizes, =1, censored patients whose censoring times are greater than or equal to the maximum of all event times are considered to be cured. This assumption ensures the cure submodel identifiability, which is also used by other cure models.11,21,22,28,30 The proof of identifiability for our proposed model is presented in Section S1 of the Supplemental Material.
We apply a jackknife resampling method to compute the standard errors of . The non-parametric bootstrap cannot achieve the covariance estimation for our model since the ties from the resampling with replacement would make the cure model’s fitting difficult. The jackknife resampling method removes one subject’s record each time, and the same estimation procedure for point estimation presented above is used on the resampled data. The standard errors of parameter estimators can be computed as where are the estimated parameters from model fitting on different resampled data sets, and . The point estimation and standard error estimation for our comparison model JMPHC are the same as JMFHC except that is set as for JMPHC throughout the estimation.
Simulation studies
We performed simulation studies under four scenarios to assess the performance and estimation method of the proposed model JMFHC and compared it with JMPHC. These two models consisted of an LM submodel as the longitudinal submodel and cure submodels with flexible patterns of hazard functions (i.e., JMFHC) and PH structure (i.e., JMPHC), respectively. Scenarios 1 and 4 had baseline covariates violating the PH assumption with crossing. Under scenario 2, baseline covariates had non-crossing survival functions, but violated the PH assumption. Scenario 3 followed the PH assumption and thus satisfied the assumption of JMPHC. Compared to scenario 4, scenarios 1–3 have smaller censoring rates and larger cure rates.
We conducted 500 runs for each scenario with a sample size of 500. In each simulation scenario, a data set was generated from JMFHC based on equations (1) and (2). We generated a binary baseline variable taking 0 or 1 with equal probability. In the simulations, we considered a linear trajectory of longitudinal biomarker measurements. The function of subject ’s biomarker measurement at time was written as
The fixed population intercept and slope were set as . The subject-specific random intercept and slope were generated from a bivariate normal distribution with mean and a covariance matrix with , and . The standard deviation of the measurement errors was set as . The prespecified measurement time points were set as .
In the underlying JMFHC’s cure submodel with the form of equation (2), we let and , and the FHC submodel was written as
The true was set as . We considered with a non-PH structure under scenarios 1, 2, and 4 and a PH structure under scenario 3. The true values of and were set as for scenarios 1 and 4, for scenario 2, and for scenario 3.
The function of survival times was derived from equation (15), which is expressed as
where was the inverse of , and followed a standard uniform distribution. For subject with the value of less than or equal to 1, its survival time was . Otherwise, this patient’s survival time was set as 99999, which meant that he or she was cured. The censoring times were generated from a continuous uniform distribution with parameters of under scenarios 1–3 and under scenario 4. For each subject, we discarded the generated biomarker values after its observed survival time. The mean number of subjects’ biomarker measurements per person was about 22 under scenarios 1–3 and 18 under scenario 4. Under scenarios 1–4, the means of censoring rates were 38.9%, 35.8%, 34.5%, and 58.3%, respectively, and the percentages of cured ones among these censored subjects were 67.4%, 66.1%, 68.5%, and 45.0%, respectively.
The convergence criteria for except were set to be the relative differences between previous and current estimates (e.g., ) less than 0.002 or the absolute differences (e.g., ) less than 0.001. The convergence criterion for was the average of the relative differences between previous and current estimates less than 0.005 or the average of absolute differences less than 0.002. The maximum number of estimation iterations was set as 200. In the step of the AM algorithm, the final chain for each subject was obtained by thinning 10,000 draws by five after 1,000 draws of burn-in. With the MCEM algorithm, the convergence rates for JMFHC were 100% under simulation scenarios 2 and 3 and 98.2% under scenarios 1 and 4. The convergence rates for JMPHC were 100% under all scenarios.
Our proposed model JMFHC showed unbiased regression parameter estimates and estimated standard errors matching the empirical ones. Table 1 presents all regression parameters’ means of estimates, empirical standard errors (ESEs), averages of estimated standard errors (ASEs), and 95% coverage probabilities (CPs) from JMFHC and JMPHC under four scenarios. Across four scenarios, JMFHC performed well with means of estimates close to the true values of the regression parameters from two submodels. Also, these regression parameters’ ASEs agreed with their corresponding ESEs. All empirical CPs were near the nominal level of 95%. On the other hand, when the PH assumption was violated under scenarios 1, 2, and 4, JMPHC had the means of estimated parameters from its cure submodels, especially and , biased from the true values, even though its means of estimated regression parameters from LM submodels were close to the true values. Not all cure-survival parameters’ CPs were near the nominal level of 95%. Under scenario 3, JMPHC had the means of estimated parameters from two submodels close to the true values since the PH assumption was satisfied, and the 95% CPs were around the nominal level. ASEs of regression parameters estimated by JMPHC agreed with the corresponding ESEs under all four scenarios.
Results of simulation studies for 500 data sets with a sample size of 500 by fitting a joint model with a flexible-hazards cure model for survival data (JMFHC) and a joint model with a proportional hazards cure model for survival data (JMPHC) under scenarios 1–4.
JMFHC
JMPHC
Scenario
Parameter
True
Est
ESE
ASE
CP
Est
ESE
ASE
CP
1
Cure submodel
(The PH
−0.1
−0.1103
0.0962
0.0959
0.941
0.0700
0.1016
0.1051
0.650
assumption
1
1.0228
0.1361
0.1388
0.961
0.5713
0.1226
0.1233
0.062
is violated;
0.5
0.5102
0.0888
0.0949
0.963
0.4706
0.0834
0.0882
0.946
crossing
−0.7
−0.6978
0.1267
0.1274
0.969
−0.6530
0.1221
0.1212
0.914
survival
0.8
0.8206
0.1091
0.1073
0.927
–
–
–
–
functions)
Longitudinal submodel
5
4.9975
0.0406
0.0619
0.996
4.9929
0.0404
0.0654
0.998
−1
−0.9978
0.0219
0.0223
0.957
−0.9979
0.0221
0.0225
0.956
0.8
0.8000
0.0329
0.0351
0.957
0.8006
0.0327
0.0351
0.958
0.5
0.4988
0.0150
0.0160
0.949
0.4988
0.0150
0.0160
0.948
0
−0.0020
0.0510
0.0534
0.949
−0.0019
0.0509
0.0533
0.950
1
1.0000
0.0073
0.0071
0.949
0.9999
0.0072
0.0071
0.946
2
Cure submodel
(The PH
1
0.9970
0.1029
0.0988
0.932
1.3157
0.1179
0.1235
0.252
assumption
−1
−0.9980
0.1411
0.1356
0.942
−1.4776
0.1334
0.1328
0.034
is violated;
0.5
0.5115
0.0897
0.0910
0.954
0.5695
0.0997
0.1003
0.894
non-crossing
−0.7
−0.7027
0.1249
0.1241
0.944
−0.7549
0.1378
0.1334
0.934
survival
0.8
0.8134
0.1260
0.1146
0.926
–
–
–
–
functions)
Longitudinal submodel
5
4.9978
0.0410
0.0625
0.994
4.9952
0.0410
0.0734
1.000
−1
−0.9979
0.0223
0.0225
0.954
−0.9978
0.0223
0.0242
0.956
0.8
0.7994
0.0331
0.0354
0.960
0.7985
0.0331
0.0356
0.966
0.5
0.4988
0.0150
0.0160
0.954
0.4987
0.0150
0.0160
0.954
0
−0.0018
0.0514
0.0537
0.958
−0.0012
0.0515
0.0538
0.958
1
0.9998
0.0075
0.0073
0.948
1.0000
0.0075
0.0073
0.950
3
Cure submodel
(The PH
1
0.9888
0.0971
0.1004
0.952
0.9922
0.0890
0.0955
0.956
assumption
−1
−0.9890
0.1335
0.1301
0.938
−0.9956
0.1183
0.1195
0.956
is valid)
0.5
0.5095
0.0893
0.0902
0.966
0.5101
0.0899
0.0904
0.962
−0.7
−0.7013
0.1233
0.1227
0.934
−0.7017
0.1239
0.1228
0.942
0
0.0127
0.0991
0.0910
0.918
–
–
–
–
Longitudinal submodel
5
4.9938
0.0412
0.0711
1.000
4.9932
0.0411
0.0739
0.998
−1
−0.9974
0.0223
0.0225
0.954
−0.9974
0.0223
0.0235
0.954
0.8
0.7995
0.0332
0.0357
0.966
0.7995
0.0332
0.0357
0.964
0.5
0.4987
0.0150
0.0161
0.956
0.4987
0.0151
0.0160
0.954
0
−0.0016
0.0517
0.0539
0.960
−0.0015
0.0517
0.0540
0.960
1
0.9998
0.0077
0.0074
0.954
0.9998
0.0077
0.0074
0.954
4
Cure submodel
(The PH
−0.1
−0.1221
0.1263
0.1275
0.943
0.1388
0.1247
0.1293
0.548
assumption
1
1.0407
0.1934
0.1838
0.943
0.4294
0.1539
0.1513
0.042
is violated;
0.5
0.5134
0.1132
0.1160
0.953
0.4773
0.1071
0.1104
0.956
crossing
−0.7
−0.6993
0.1509
0.1553
0.967
−0.6574
0.1450
0.1494
0.946
survival
0.8
0.8322
0.1334
0.1267
0.941
–
–
–
–
functions;
Longitudinal submodel
higher
5
4.9987
0.0414
0.0502
0.986
4.9976
0.0411
0.0502
0.986
censoring rate;
−1
−0.9988
0.0219
0.0226
0.957
−0.9989
0.0221
0.0226
0.954
lower
0.8
0.8001
0.0346
0.0366
0.959
0.8004
0.0344
0.0366
0.958
cure rate)
0.5
0.4988
0.0151
0.0160
0.955
0.4987
0.0151
0.0160
0.956
0
−0.0020
0.0518
0.0547
0.963
−0.0015
0.0518
0.0546
0.962
1
0.9999
0.0084
0.0081
0.959
0.9998
0.0085
0.0081
0.956
Est: mean of estimates; ESE: empirical standard error; ASE: average of estimated standard errors; CP: 95% coverage probability.
In addition to the good estimation results of JMFHC’s regression parameters shown in Table 1, the estimation of non-parametric performed well and is presented in Figure 2. Under each of four scenarios, we can see that the mean curve of estimated overlaps with the true curve of , and the estimated values of from all runs are well clustered around the true curve.
Comparison of true and estimated functions from the joint model with a flexible-hazards cure model for survival data (JMFHC) under simulation scenarios 1–4. The true was set as , which is presented by the black curve. Each red shaded curve represents the mean of estimated functions from 500 runs in each scenario. Pink dots are the estimated values of from 500 runs.
JMFHC easily captured covariates with non-PH structures. In Figure 3, we compared the means of estimated survival functions by from JMFHC and JMPHC with the true survival functions under four scenarios to evaluate the performance of the two models. The estimated survival functions by in each run were the means of survival functions by computed with 1000 randomly generated pairs of random effects given the estimated parameter estimates and from that run. Under scenario 1, the true survival functions by had a clear crossing shown in Figures 3(a) and (b). Subjects with had larger survival probabilities or lower hazard rates in an early period than those with , but they had smaller survival probabilities in the later period, indicating ’s opposite survival directions for short- and long-term effects. Figure 3(a) shows that JMFHC’s estimated survival functions overlap with the true functions under scenario 1, but JMPHC’s estimated survival functions in Figure 3(b) cannot capture the crossing. Also, JMPHC forced a constant relative hazard rate between two subgroups, and the two stable plateaus at the right tails of JMPHC’s estimated survival functions were closer than the ones from the true survival functions, revealing that JMPHC underestimated the long-term survival effect of . Two models’ performances under scenario 4 shown in Figures 3(g) and (h) are similar to those under scenario 1. Under scenario 2, the true survival functions by with a non-PH structure do not cross as shown in Figures 3(c) and (d). In Figure 3(c), the survival functions estimated by JMFHC overlap with the true survival functions. The violation of the PH assumption caused JMPHC to have estimated survival functions not close to the true survival functions in Figure 3(d). In this case, JMPHC underestimated ’s short-term effect and overestimated its long-term effect. Under Scenario 3, Figures 3(e) and (f) show that the survival functions estimated by JMFHC and JMPHC match the true survival functions.
True survival functions (solid curves) and means of estimated survival functions (dashed curves) from (a, c, e, g) the joint model with a flexible-hazards cure model for survival data (JMFHC) and (b, d, f, h) the joint model with a proportional hazards cure model for survival data (JMPHC) by (0 vs 1) under simulation scenarios 1–4.
Lastly, JMFHC produced more accurate cure rates than JMPHC. Part (a) of Table 2 shows the true cure rates and the means of estimated cure rates by from JMFHC and JMPHC. The true and estimated cure rates were computed from equation (3) with true and estimated parameter regressions, respectively. JMFHC’s estimated cure rates by were close to the true values in all scenarios. Under scenarios 1, 2, and 4 with a non-PH structure of , the estimated cure rates from JMPHC were biased. JMPHC’s cure rate estimates were close to the true values under scenario 3 satisfying the PH assumption.
Comparison of estimated cure rates (%) for subgroups from the joint model with a flexible-hazards cure model for survival data (JMFHC) and the joint model with a proportional hazards cure model for survival data (JMPHC): (a) in the simulation studies with the true cure rates (%) as the reference, and (b) in the real data analysis with the estimated cure rates (%) from Kaplan–Meier estimates’ stable plateaus at the right tails as the reference.
JMFHC
JMPHC
(a) Simulation studies
Subgroup
True
Est (SD)
Diff
Est (SD)
Diff
Scenario 1
40.28
40.61 (0.0310)
0.33
34.74 (0.0328)
5.54
12.07
12.05 (0.0222)
0.02
17.79 (0.0226)
5.72
Scenario 2
10.15
10.45 (0.0195)
0.35
6.20 (0.0148)
3.95
37.03
37.09 (0.0320)
0.06
42.24 (0.0295)
5.21
Scenario 3
10.15
10.57 (0.0189)
0.42
10.49 (0.0175)
0.34
37.03
37.07 (0.0318)
0.04
37.17 (0.0309)
0.14
Scenario 4
40.28
40.98 (0.0404)
0.70
32.57 (0.0397)
7.71
12.07
12.11 (0.0307)
0.04
19.82 (0.0314)
7.75
(b) Real data analysis
Dose level, age
KM
Est (SE)
Diff
Est (SE)
Diff
140 mg daily, 60 years
34.04
33.38 (0.0550)
0.66
35.95 (0.0539)
1.91
100 mg daily, 60 years
32.37
40.36 (0.0643)
7.99
43.20 (0.0609)
10.83
140 mg daily, < 60 years
49.40
50.45 (0.0592)
1.05
48.23 (0.0604)
1.17
100 mg daily, < 60 years
63.25
57.45 (0.0653)
5.80
55.46 (0.0641)
7.79
Diff: absolute difference of (Est True) in part (a) and (Est KM) in part (b); KM: Kaplan–Meier estimate; Est: estimated cure rate; SD: standard deviation; SE: standard error; JMPHC: joint model proportional hazards cure; JMFHC: joint model flexible-hazards cure.
In summary, JMFHC and JMPHC work well when the PH assumption holds. When the PH assumption is violated, JMFHC can easily accommodate crossing or non-crossing survival functions by allowing different relative hazard rates in both short and long terms, but JMPHC forces the PH assumption on covariates and produces biased estimated cure rates.
To illustrate our R program’s running time of fitting JMFHC and JMPHC, we took one data set under simulation scenario 1 with crossing survival functions as an example. On a laptop computer with an Intel i7-10610U CPU and 16.0GB RAM, it took JMFHC and JMPHC about 3.9 and 2.5 hours, respectively, to obtain the point estimation. The 500 replicates of simulations with four scenarios were run on the high-performance computing cluster. The running time of jackknife for standard error estimation in one dataset was around 7 hours by using a job array on the high-performance computing cluster.
Real data analysis
We compared the results of JMFHC and JMPHC applied to the CML study with 566 patients. This dose optimization study aimed to investigate dasatinib’s optimization of dose level (100 or 140 mg in total per day) and schedule (once or twice daily) for patients with chronic-phase CML who had experienced imatinib resistance or intolerance. Since the schedule of dose taking did not affect the efficacy of dasatinib significantly,37 we focused our analysis on the effect of dose level [100 mg daily (), 140 mg daily ()] with the disease progression as the event of interest. Besides dose level, the data contains age group (< or years) as another clinically significant baseline variable. The censoring rate in this study was 58.0%.
In an LM submodel, the logarithms of the BCR-ABL expression levels were assumed to be associated with the logarithms of measurement time points as fixed and random effects, and dose group and age group as the fixed effects. Figure 1(a) shows the overall trajectory of all patients’ logarithms of BCR-ABL expression gene levels over logarithms of measurement times in years, which presents a roughly linear relationship. The relationship between the logarithms of BCR-ABL expression gene levels and the measurement times in the original form is less linear shown in Figure S1 of the Supplemental Material. In an LM model with the logarithms of the measurement time points as the fixed and random effects, the conditional coefficient of determination interpreting as a variance explained by both fixed and random effects is , which is higher than that of the model with the original measurement time points (). Patient ’s logarithm of observed BCR-ABL expression level at the logarithm of measurement time point , for and was modeled as
where the vector of random effects with ’s components of , and , and .
In the Cox model with dose level, age group, and baseline BCR-ABL expression as the covariates, dose level and age group were significant. The age group violated the PH assumption significantly (-value=0.026) from the Grambsch-Therneau test42 and was set as a short-term covariate in JMFHC’s cure submodel. The long-term covariates in JMFHC and JMPHC were dose [Dose140 (mg daily), 1: 140, 0: 100], age [Age60 (years), 1: 60, 0: < 60], and a shared set of individuals’ random intercepts and slopes obtained from the LM submodel. With the shared set of random effects , the cure submodel of JMFHC for patient at survival time was expressed as
In JMPHC’s cure submodel, . The estimation for JMFHC and JMPHC used the same convergence criteria, maximum number of iterations, and scenario of chains in the AM algorithm mentioned in Section 4.
Table 3 presents the estimated regression parameters from JMFHC and JMPHC. In the cure submodels of these two models, dose, age, and individual-specific random effects had the same directions of long-term survival effects and significant effects on PFS at the significance level of 0.05. In JMFHC, age’s short- and long-term parameter estimates indicated that the directions of its short- and long-term effects differed, and patients aged 60 years had significantly better short-term survival (-value = 0.023) and worse long-term survival (-value ) than those < 60 years old. Regression parameters in the two models’ longitudinal submodels have similar results.
Estimation results for the chronic myeloid leukemia (CML) study by fitting the joint model with a flexible-hazards cure model for survival data (JMFHC) and the joint model with a proportional hazards cure model for survival data (JMPHC) .
JMFHC
JMPHC
Parameter
Est
SE
value
Est
SE
value
Long-term survival effect parameter
Intercept
−0.7790
0.2804
0.005
−0.6937
0.2739
0.011
Dose 140 mg daily
0.2942
0.1570
0.061
0.3007
0.1553
0.053
Age years
0.7089
0.1829
<0.001
0.5068
0.1552
0.001
Random intercept
0.2906
0.0870
0.001
0.2758
0.0870
0.002
Random slope for
1.1661
0.1685
<0.001
1.1728
0.1735
<0.001
Short-term survival effect parameter
Age years
0.2682
0.1179
0.023
–
–
–
Longitudinal submodel
Intercept
2.7745
0.1999
<0.001
2.7883
0.1921
<0.001
Slope for
−1.0351
0.1556
<0.001
−1.0195
0.1586
<0.001
Dose 140 mg daily
0.0462
0.1540
0.764
0.0441
0.1489
0.767
Age years
0.2145
0.1612
0.183
0.2059
0.1544
0.182
SD of random intercepts
1.2763
0.0702
<0.001
1.2752
0.0703
<0.001
SD of random slopes
0.8869
0.0332
<0.001
0.8869
0.0331
<0.001
Correlation coefficient
0.1731
0.0593
0.003
0.1732
0.0577
0.003
SD of measurement errors
1.1078
0.0332
<0.001
1.1082
0.0332
<0.001
Est: estimate; SE: standard error; SD: standard deviation.
Cure rates estimated by JMFHC were less affected by a covariate’s non-PH structure than those estimated by JMPHC. Part (b) of Table 2 compares the two models’ estimated cure rates for patient subgroups of different dose levels and age groups with those obtained from stable plateaus of the KM estimates at the right tails. Since we did not know the true parameter values for real data analysis and could not obtain the true cure rates of subgroups, we used the estimated cure rates from the KM estimates as surrogate values. For JMFHC and JMPHC, estimated cure rates were computed as shown in equation (3) with estimated regression parameters for subgroups of dose level and age groups. For each subgroup, cure rates estimated by JMPHC deviated from the corresponding KM estimates more than those estimated by JMFHC. In summary, JMFHC outperforms JMPHC when the PH assumption is violated.
Discussion
Our proposed joint model, JMFHC, not only incorporates the longitudinal biomarker but also relaxes the PH assumption on covariates. It possesses improved performance compared to a joint model with a widely used cure submodel requiring the PH assumption. An LM-effects submodel is used to model the longitudinal biomarker, and its random effects are shared with the cure submodel. We extend the PHC model into a cure submodel that can flexibly handle covariates’ effect patterns, which is achieved by adding short-term covariates via raising a power on . In our model, estimated by the Lagrange multiplier method, is a non-parametric function with a constraint. Due to the unknown random effects on patients’ longitudinal biomarker measurements, a non-parametric maximum likelihood estimation method incorporating the MCEM algorithm is applied for regression estimation. The jackknife resampling scheme is applied for standard error estimation. Good performance of the proposed model and its estimation method has been demonstrated by the simulation studies. In simulations, the results of our comparison model, JMPHC, show that if the PH assumption is subject to violation, estimated cure rates are biased. Our proposed model handles this situation well and avoids the bias of estimated cure rates. The real data analysis further illustrates that JMFHC can handle covariates’ non-PH structures and estimate the cure rates more accurately.
Our model assumes the same for different covariate groups. This constraint may bias the estimation of and regression parameters in the submodels, which affects the estimation of cure rates. Stratified baseline cumulative hazard functions in JMFHC merit future research to obtain more accurate estimated cure rates. In addition, we could incorporate time-varying variables as fixed variables in the LM-effects submodel. Last, instead of the shared random effects connecting the submodels for longitudinal biomarker values and survival outcomes, the current value of the longitudinal process can be a time-varying covariate in the survival submodel.48,57 The joint model using an FHC submodel for cure-survival data with the current value of the longitudinal process as a covariate merits future research.
Supplemental Material
sj-pdf-1-smm-10.1177_09622802251320793 - Supplemental material for A new cure model accounting for longitudinal data and flexible patterns of hazard ratios over time
Supplemental material, sj-pdf-1-smm-10.1177_09622802251320793 for A new cure model accounting for longitudinal data and flexible patterns of hazard ratios over time by Can Xie, Xuelin Huang, Ruosha Li, Yu Shen, Nicholas J Short and Kapil N Bhalla in Statistical Methods in Medical Research
Footnotes
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship and/or publication of this article. The research of Huang was partially supported by the US National Institutes of Health grants R01CA272806, U54CA096300, U01CA253911, and 5P50CA100632. Both Huang and Xie were partially supported by the Dr. Mien-Chie Hung and Mrs. Kinglan Hung Endowed Professorship. The research of Li was partially supported by the National Institutes of Health grants R01DK117209 and R03NS108136. The Research of Xie was also partially supported by the MD Anderson Cancer Center Multidisciplinary Research Program award for the Myeloproliferative Neoplasms SPORE application (PI: Bhalla).
ORCID iDs
Can Xie
Xuelin Huang
Ruosha Li
Supplemental material
Supplemental material is available for this article online.
References
1.
HickeyGLPhilipsonPJorgensenA, et al. Joint modelling of time-to-event and multivariate longitudinal outcomes: recent developments and issues. BMC Med Res Methodol2016; 16: 1–15.
2.
PapageorgiouGMauffKTomerA, et al. An overview of joint modeling of time-to-event and longitudinal outcomes. Annu Rev Stat Appl2019; 6: 223–240.
3.
IbrahimJGChuHChenLM. Basic concepts and methods for joint models of longitudinal and survival data. J Clin Oncol2010; 28: 2796–2801.
4.
AmanpourFAkbariSAzizmohammad LoohaM, et al. Mixture cure model for estimating short-term and long-term colorectal cancer survival. Gastroenterol Hepatol Bed Bench2019; 12: S37–S43.
5.
EngelsEAMandalSCorleyDA, et al. Cure models, survival probabilities, and solid organ transplantation for patients with colorectal cancer. Am J Transplant2024; S1600-6135: 00527-6.
6.
WangYWangWTangY. A Bayesian semiparametric accelerate failure time mixture cure model. Int J Biostat2021; 18: 473–485.
7.
PengYXuJ. An extended cure model and model selection. Lifetime Data Anal2012; 18: 215–233.
8.
HollandJF. Breaking the cure barrier 25 years later. J Clin Oncol2008; 26: 1575–1575.
9.
YilmazYELawlessJFAndrulisIL, et al. Insights from mixture cure modeling of molecular markers for prognosis in breast cancer. J Clin Oncol2013; 31: 2047–2054.
10.
BoagJW. Maximum likelihood estimates of the proportion of patients cured by cancer therapy. J R Stat Soc: Ser B (Methodological)1949; 11: 15–53.
11.
YakovlevAYTsodikovAD. Stochastic Models of Tumor Latency and Their Biostatistical Applications. Series in Mathematical Biology and Medicine. Singapore: World Scientific, 1996.
12.
AmicoMKeilegomIV. Cure models in survival analysis. Annu Rev Stat Appl2018; 5: 311–342.
13.
PengYYuB. Cure Models: Methods, Applications, and Implementation. New York: Chapman and Hall/CRC, 2021.
14.
FarewellVT. The use of mixture models for the analysis of survival data with long-term survivors. Biometrics1982; 38: 1041–1046.
15.
McLachlanGMcGiffinD. On the role of finite mixture models in survival analysis. Stat Methods Med Res1994; 3: 211–226.
16.
PengYDearKBGDenhamJW. A generalized F mixture model for cure rate estimation. Stat Med1998; 17: 813–830.
17.
KukAYCChenCH. A mixture model combining logistic regression with proportional hazards regression. Biometrika1992; 79: 531–541.
18.
LiCSTaylorJMG. A semi-parametric accelerated failure time cure model. Stat Med2002; 21: 3235–3247.
19.
MaoMWangJL. Semiparametric efficient estimation for a class of generalized proportional odds cure models. J Am Stat Assoc2010; 105: 302–311.
PengYDearKBG. A nonparametric mixture model for cure rate estimation. Biometrics2000; 56: 237–243.
22.
WangLDuPLiangH. Two-component mixture cure rate model with spline estimated nonparametric components. Biometrics2012; 68: 726–735.
23.
MilienosFS. On a reparameterization of a flexible family of cure models. Stat Med2022; 41: 4091–4111.
24.
SafariWCLópez-de UllibarriIJácomeMA. Latency function estimation under the mixture cure model when the cure status is available. Lifetime Data Anal2023; 29: 608–627.
25.
PalSPengYAselisewineW. A new approach to modeling the cure rate in the presence of interval censored data. Comput Stat2024; 39: 2743–2769.
26.
PanCCaiBSuiX. A Bayesian proportional hazards mixture cure model for interval-censored data. Lifetime Data Anal2024; 30: 327–344.
27.
BroëtPRyckeYTubert-BitterP, et al. A semiparametric approach for the two-sample comparison of survival times with long-term survivors. Biometrics2001; 57: 844–852.
28.
TsodikovA. A proportional hazards model taking account of long-term survivors. Biometrics1998; 54: 1508–1516.
29.
SpostoR. Cure model analysis in cancer: an application to data from the children’s cancer group. Stat Med2002; 21: 293–312.
30.
ChenTDuP. Promotion time cure rate model with nonparametric form of covariate effects. Stat Med2018; 37: 1625–1635.
31.
LiuHShenY. A semiparametric regression cure model for interval-censored data. J Am Stat Assoc2009; 104: 1168–1178.
32.
PalSAselisewineW. A semiparametric promotion time cure model with support vector machine. Ann Appl Stat2023; 17: 2680–2699.
33.
CalsavaraVFMilaniEABertolliE, et al. Long-term frailty modeling using a non-proportional hazards model: Application with a melanoma dataset. Stat Methods Med Res2020; 29: 2100–2118.
34.
CalsavaraVFRodriguesASTomazellaVLD, et al. Frailty models power variance function with cure fraction and latent risk factors negative binomial. Commun in Stat - Theory Method2017; 46: 9763–9776.
35.
TawiahRMcLachlanGJNgSK. Mixture cure models with time-varying and multilevel frailties for recurrent event data. Stat Methods Med Res2020; 29: 1368–1385.
36.
XuYCheungYB. Frailty models and frailty-mixture models for recurrent event times. Stata J: Promot Commun stat Stata2015; 15: 135–154.
37.
ShahNPKantarjianHMKimDW, et al. Intermittent target inhibition with dasatinib 100 mg once daily preserves efficacy and improves tolerability in imatinib-resistant and -intolerant chronic-phase chronic myeloid leukemia. J Clin Oncol2008; 26: 3204–3212.
38.
BartramCRde KleinAHagemeijerA, et al. Translocation of c-ab1 oncogene correlates with the presence of a philadelphia chromosome in chronic myelocytic leukaemia. Nature1983; 306: 277–280.
39.
GroffenJStephensonJRHeisterkampN, et al. Philadelphia chromosomal breakpoints are clustered within a limited region, bcr, on chromosome 22. Cell1984; 36: 93–99.
40.
Quintás-CardamaAChoiSKantarjianH, et al. Predicting outcomes in patients with chronic myeloid leukemia at any time during tyrosine kinase inhibitor therapy. Clin Lymphoma Myeloma Leuk2014; 14: 327–334.e8.
41.
MallerRAZhouS. Estimating the proportion of immunes in a censored sample. Biometrika1992; 79: 731–739.
42.
GrambschPMTherneauTM. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika1994; 81: 515–526.
43.
LawNJTaylorJMSandlerH. The joint modeling of a longitudinal disease progression marker and the failure time process in the presence of cure. Biostatistics2002; 3: 547–563.
44.
YuMLawNJTaylorJMG, et al. Joint longitudinal-survival-cure models and their application to prostate cancer. Stat Sin2004; 14: 835–862.
45.
YuMTaylorJMGSandlerHM. Individual prediction in prostate cancer studies using a joint longitudinal survival–cure model. J Am Stat Assoc2008; 103: 178–187.
46.
PanJBaoYDaiH, et al. Joint longitudinal and survival-cure models in tumour xenograft experiments. Stat Med2014; 33: 3229–3240.
47.
YangLSongHPengY, et al. Joint analysis of longitudinal measurements and survival times with a cure fraction based on partly linear mixed and semiparametric cure models. Pharm Stat2020; 20: 362–374.
48.
BarbieriALegrandC. Joint longitudinal and time-to-event cure models for the assessment of being cured. Stat Methods Med Res2020; 29: 1256–1270.
49.
EkongAHOlayiwolaMODawoduAG, et al. Latent gaussian approach to joint modelling of longitudinal and mixture cure outcomes. Comput J Math Stat Sci2025; 4: 72–95.
50.
BrownERIbrahimJG. Bayesian approaches to joint cure-rate and longitudinal models with applications to cancer vaccine trials. Biometrics2003; 59: 686–693.
51.
SongHPengYTuD. A new approach for joint modelling of longitudinal measurements and survival times with a cure fraction. Cana J Stat2012; 40: 207–224.
52.
KimSZengDLiY, et al. Joint modeling of longitudinal and cure-survival data. J Stat Theory Pract2013; 7: 324–344.
53.
ChiMWangXSongH, et al. Joint analysis of longitudinal ordinal categorical item response data and survival times with cure fraction. Stat Biopharm Res2023; 1–11.
54.
ChenMHIbrahimJGSinhaD. A new joint model for longitudinal and survival data with a cure fraction. J Multivar Anal2004; 91: 18–34.
55.
ChiYYIbrahimJG. Joint models for multivariate longitudinal and multivariate survival data. Biometrics2006; 62: 432–445.
56.
RizopoulosD. JM: An R package for the joint modelling of longitudinal and time-to-event data. J Stat Softw2010; 35: 1–33.
57.
RizopoulosD. The R package JMbayes for fitting joint models for longitudinal and time-to-event data using MCMC. J Stat Softw2016; 72: 1–46.
58.
HaarioHSaksmanETamminenJ. An adaptive metropolis algorithm. Bernoulli2001; 7: 223–242.
59.
HarvkilleDA. Bayesian inference for variance components using only error contrasts. Biometrika1974; 61: 383–385.
60.
TsodikovADIbrahimJGYakovlevAY. Estimating cure rates from survival data: an alternative to two-component mixture models. J Am Stat Assoc2003; 98: 1063–1078.
61.
ZengDYinGIbrahimJG. Semiparametric transformation models for survival data with a cure fraction. J Am Stat Assoc2006; 101: 670–684.
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.