Sage Journals: Discover world-class research

Abstract

Longitudinal biomarkers such as patient-reported outcomes (PROs) and quality of life (QOL) are routinely collected in cancer clinical trials or other studies. Joint modelling of PRO/QOL and survival data can provide a comparative assessment of patient-reported changes in specific symptoms or global measures that correspond to changes in survival. Motivated by a head and neck cancer clinical trial, we develop a class of trajectory-based models for longitudinal and survival data with disease progression. Specifically, we propose a class of mixed effects regression models for longitudinal measures, a cure rate model for the disease progression time ( $T_{P}$ ) and a Cox proportional hazards model with time-varying covariates for the overall survival time ( $T_{D}$ ) to account for $T_{P}$ and treatment switching. Under the semi-competing risks framework, the disease progression is the non-terminal event, the occurrence of which is subject to a terminal event of death. The properties of the proposed models are examined in detail. Within the Bayesian paradigm, we derive the decompositions of the deviance information criterion (DIC) and the logarithm of the pseudo-marginal likelihood (LPML) to assess the fit of the longitudinal component of the model and the fit of each survival component, separately. We further develop $Δ$ DIC as well as $Δ$ LPML to determine the importance and contribution of the longitudinal data to the model fit of the $T_{P}$ and $T_{D}$ data.

Keywords

cure rate model DIC decomposition Markov chain Monte Carlo Patient-reported outcome shared parameter model time-varying covariates

1 Introduction

A patient-reported outcome (PRO) is any report of the status of a patient's health condition that comes directly from the patient, without interpretation of the patient's response by a clinician or anyone else (U.S. Food and Drug Administration Guidance for Industry, 2009). PROs can help identify symptoms or problems that may be missed during clinical queries, guide policy and health care delivery, and help patients choose between two equally efficacious therapies. Joint modelling of PRO and survival data can provide a comparative assessment of patient-reported changes in specific symptoms or global measures that correspond to changes in survival.

In joint modelling literature, there are usually a longitudinal component, a survival component and a dependence structure connecting those two components. There has been much work on joint modelling of one longitudinal measure and a survival measure, including Schluchter (1992), Hogan and Laird (1997), Law et al. (2002), Brown and Ibrahim (2003), Chen et al. (2004), Ibrahim et al. (2004), Brown et al. (2005), Chi and Ibrahim (2006), Chi and Ibrahim (2007), Ibrahim et al. (2010), Ye et al. (2008), Brown (2009), Rizopoulos (2012)), Pawitan and Self (1993), De Gruttola and Tu (1994), Lavalley and Degruttola (1996), De Gruttola and Tu (1994), Xu and Zeger (2001b), Xu and Zeger (2001a), Jacqmin-Gadda et al. (2010)) and (Proust-Lima et al. (2014)). (Tudur-Smith et al. (2016)) provided a recent review on joint modelling. In the recent development, multiple longitudinal processes and spline-based methods have been used in the joint modelling (Rizopoulos and Ghosh (2011)). Joint modelling has also been applied to more complex data structures such as competing risks data (Williamson et al., 2008; Elashoff et al., 2008; Huang et al., 2011; Li et al., 2012; Proust-Lima et al., 2016) and semi-competing risks data (Li and Su, 2017). In this article, we propose a mixed effects regression model for longitudinal measures, a cure rate model for the disease progression time ( $T_{P}$ ) and a Cox proportional hazards model with time-varying covariates for the overall survival time ( $T_{D}$ ) to account for $T_{P}$ and treatment switching. Note that Chen et al. (2020) provide a detailed elaboration of treatment switching, and they also develop a method for estimating the intended treatment effect accounting for treatment switching. In addition, the trajectory function from the longitudinal model is substituted into the hazard function of each survival submodel, thereby serving as another time-varying covariate in the survival submodel. Thus, our model is the trajectory model (TM). Although Li and Su (2017) address a very similar context as semi-competing risks within the joint modelling framework, they assume a probit model for the discrete time hazard of dropout and another probit model for the discrete time hazard of death. Li and Su (2017) further assume the shared parameter model (SPM), where the longitudinal model and the survival submodels share common random effects, which then induce correlation between the longitudinal and survival components.

Zhang et al. (2017)) developed a novel decomposition of DIC and LPML to assess the fit of the longitudinal and survival components of the joint model, separately. Based on this decomposition, they then proposed new Bayesian model assessment criteria, namely $Δ$ DIC and $Δ$ LPML, to determine the importance and contribution of the longitudinal data to the model fit of the survival data. In this article, we extend the decomposition of Zhang et al. (2017) to our joint model with semi-competing risks. We derive the decompositions of DIC and LPML (i.e., DIC $=$ DIC $_{Long}$ $+$ DIC $_{Pg ∣ Long}$ $+$ DIC $_{Surv ∣ Pg, Long}$ and LPML $=$ LPML $_{Long}$ $+$ LPML $_{Pg ∣ Long}$ $+$ LPML $_{Surv ∣ Pg, Long}$ , respectively) to assess the fit of the longitudinal component of the model and the fit of each survival component, separately. Let $Δ {DIC}_{Surv}$ be the difference of ${DIC}_{Surv | Pg, Long}$ and DIC of the overall survival model alone without joint modelling. $Δ {DIC}_{Surv}$ measures the additional information gained by adding disease progression time and longitudinal components into modelling of overall survival data. Similarly, $Δ {DIC}_{Pg}$ measures the additional information gained by adding longitudinal components into modelling of progression data. Similar ideas apply to $Δ {LPML}_{Surv}$ and $Δ {LPML}_{Pg}$ . We also develop a simple but more attractive prior for the covariance matrix based on its Cholesky decomposition borrowing the idea of the Lewandowski, Kurowicka, and Joe (LKJ) prior for the correlation matrix $C$ which has a density function of $f (C | τ) ⋑ | C |^{τ - 1}$ (Lewandowski et al., 2009).

The rest of the article is organized as follows. Section 2 gives a detailed description of the motivating longitudinal and survival data from a head and neck clinical cancer trial. In Section 3, we propose a mixed effects regression model for the longitudinal measure, a cure rate model for $T_{P}$ and a time-varying covariates model for $T_{D}$ . The properties of the proposed models are discussed. The likelihood functions under the proposed models are derived in Section 4. In Section 4, we also present the prior and posterior distributions, develop an efficient Markov chain Monte Carlo sampling algorithm for carrying out the posterior computations, and provide the formulation of the decomposition of the LPML and $Δ$ LPML's for model comparison. A comprehensive analysis of the longitudinal and survival data from a head and neck cancer trial is given in Section 5. We conclude the article with a discussion including some extensions in Section 6.

2 Motivating longitudinal and survival data from a head and neck cancer clinical trial

The motivating study was an open-label, randomized, active-controlled phase III trial comparing the efficacy and safety of afatinib versus methotrexate in patients with recurrent/metastatic squamous carcinoma cancer of head and neck (HNSCC Machiels et al., 2015). Patients were treated on either afatinib or methotrexate, according to randomization, until the occurrence of disease progression or intolerable adverse events. Upon discontinuation of randomized treatment, a patient was free to take other anti-cancer therapies per protocol (except that patients discontinued methotrexate cannot start afatinib). It was reported that more than half of the patients who discontinued study medication started another anti-cancer treatment.

Figure 1:

Graphical representation of the disease progression and survival data from the head and neck cancer clinical trial

Of the total 483 patients, 332 patients were randomized to the treatment arm (methotrexate) and 161 patients were randomized to the control arm (afatinib). Furthermore, 246 patients switched the treatments, 358 died and 125 were censored. The median follow-up of 6.7 months (interquartile range (IQR): 3.1 –9.0). Figure 1 shows the counts of disease progression, treatment switch and death in each treatment arm. Three patient-reported measures: global health status (GH), head and neck pain scale (HNPA), and head and neck swallowing condition (HNSW) were collected during this phase III trial. The number of patients with at least one post-baseline values of the longitudinal measures were 481 for GH, 480 for HNPS and 473 for HNSW as shown in Table 1. In this article, we investigate the associations between changes in the longitudinal measures and risks of disease progression (

T_{P}

) and death (

T_{D}

), in which

T_{D}

was the key secondary endpoint of the trial.

Table 1:

A summary of the observed counts of longitudinal measures GH, HNPA and HNSW

Number of observed longitudinal measures	Count (percentage)
	GH	HNPA	HNSW
0	6 (1.24%)	7 (1.45%)	14 (2.90%)
1	78 (16.15%)	77 (15.94%)	82 (16.98%)
2	86 (17.81%)	87 (18.01%)	94 (19.46%)
3	111 (22.98%)	113 (23.40%)	104 (21.53%)
4	100 (20.70%)	96 (19.88%)	91 (18.84%)
5	52 (10.77%)	52 (10.77%)	50 (10.35%)
$\geq 6$	50 (10.35%)	51 (10.56%)	48 (9.94%)

We consider the following baseline covariates: patients’ prior treatment with an epidermal growth factor receptor (EGFR)’targeted antibody, cetuximab (PEGFRA: 1 if and 0 if no), age more than 65 years (AGE: 1 if yes and 0 if no), gender (SEX: male/female), prior treatment with plantinum-based chemotherapy and radiotherapy (PCRTDC: 1 if yes and 0 if no), primary tumour site (TMSITED01: 1 if oral cavity, oropharynx, hypopharynx, larynx and 0 if others), metastatic sites of lymph node (METLOC2: 1 if yes and 0 if no), metastatic sites of lung (METLOC3: 1 if yes and 0 if no), liver (METLOC4: 1 if yes and 0 if no), or bone (METLOC5: 1 if yes and 0 if no), Eastern Cooperative Oncology Group (ECOG) performance status at baseline (ECOG: 0/1) and BSDTARG (Baseline target lesion: mm). We exclude two patients from the analysis because of missing BSDTARG measures.

Median survival time in days and its 95% confidence interval are 434 and (386, 470), median switch time, and its 95% confidence interval are 166 and (147, 190), and median progression time and its 95% confidence interval are 84 and (81, 86).

3 The proposed models

Let $Y (t)$ denote the longitudinal measure observed at time $t$ . Also, let $T_{P}$ denote the time from the study entry until disease progression and let $T_{D}$ denote the time from the study entry until death. In addition, we let $x$ be the $p$ -dimensional vector of baseline covariates including the randomized treatment but not the intercept.

3.1 Longitudinal component of the joint model

We first assume a mixed effects regression model for the longitudinal measure $Y (t)$ , which is given by

Y (t) = (θ_{0}^{*} + θ)^{'} g (t) + γ^{'} x + ε (t),

(3.1)

where $g (t)$ is a $(q + 1)$ -dimensional vector of functions of $t$ including the intercept, $θ_{0}^{*}$ is a $(q + 1)$ -dimensional vector of subject-dependent random effects, $θ$ is the $(q + 1)$ -dimensional vector of the overall effects, $γ$ is a $p$ -dimensional vector of regression coefficients, and $ε (t)$ is a measurement error. We further assume that $θ_{0}^{*} = (θ_{00}^{*}, \dots, θ_{0 q}^{*})^{'}$ and $ε (t)$ for $t \geq 0$ are independent;

θ_{0}^{*} \sim N (0, Ω),

(3.2)

where $Ω$ is a $(q + 1) \times (q + 1)$ positive definite covariance matrix; and for $t \geq 0$ ,

ε (t) \sim N (0, σ^{2}),

(3.3)

where $σ^{2} > 0$ is a variance parameter.

We note that in (3.1), if $q = 1$ , $g (t) = (1, t)^{'}$ and $(θ_{0}^{*} + θ)^{'} g (t)$ represents a linear trajectory, and if $q = 2$ , $g (t) = (1, t, t^{2})^{'}$ and $(θ_{0}^{*} + θ)^{'} g (t)$ leads to a quadratic trajectory. A more general form of $g (t)$ is given by $g (t) = (1, b_{1} (t), \dots, b_{q} (t))^{'}$ , where $b_{1} (t)$ , $\dots$ , $b_{q} (t)$ are a set of base functions. We also note that in (3.1), $θ_{0}^{*}$ captures the patient-level dependence of the $Y (t)$ over time $t$ . However, the covariance matrix $Ω$ depends on the data only via random effects $θ_{0}^{*}$ , which can lead to slow convergence of Gibbs sampling. To overcome this issue, we consider the reparametrization: $θ_{0}^{*} = Γ θ^{*}$ , where $Γ$ is the Cholesky decomposition of $Ω$ with all diagonal elements being positive such that $Ω = Γ Γ^{'}$ . Under this reparametrization, we have

θ^{*} \sim N (0, I_{q + 1})

(3.4)

and a reparametrized version of (3.1) is written as

Y (t) = (Γ θ^{*} + θ)^{'} g (t) + γ^{'} x + ε (t) .

(3.5)

Remark 3.1: Write

Γ = [\begin{matrix} exp (b_{1}) & 0 & 0 & \dots & 0 \\ a_{21} & exp (b_{2}) & 0 & \dots & 0 \\ 5 \\ a_{q + 1, 1} & a_{q + 1, 2} & a_{q + 1, 3} & \dots & exp (b_{q + 1}) \end{matrix}] .

Then for any values of $b_{1}, \dots, b_{q + 1}$ and $a_{ij}$ , $i = 2, \dots, q + 1$ , $j = 2, \dots, i - 1$ , $Γ$ is a full-rank matrix. Thus, there are no constraints on $b_{1}, \dots, b_{q + 1}$ and $a_{ij}$ , $i = 2, \dots, q + 1$ , $j = 2, \dots, i - 1$ . Consequently, a multivariate normal prior can be specified for these parameters. Therefore, in this sense, our parametrization approach may be more attractive than the one discussed in Lewandowski et al. (2009). In addition, we have $g (t)^{'} Γ θ^{*} = \sum_{j = 1}^{q + 1} \sum_{i = j}^{q + 1} g_{i} (t) a_{ij} θ_{j}^{*}$ , where $a_{jj} = exp (b_{j})$ for $j = 1, 2, \dots, q + 1$ . We may view $θ_{j}^{*}$ ’s as ‘known data points’ and $a_{ij}$ ’s as the ‘regression coefficients’. Under some mild conditions of the longitudinal data, the parameters $a_{ij}$ ’s are stochastically identifiable.

3.2 Survival component of the joint model

Let $δ$ denote the disease progression status of subjects such that $δ = 1$ if the subject has disease progression and $δ = 0$ if the subject never has disease progression. We assume a logistic regression model for $δ$ as

logit (P (δ = 1 | ϕ, x)) = log {\frac{P (δ = 1 | ϕ, x)}{1 - P (δ = 1 | ϕ, x)}} = ϕ_{0} + ϕ_{1}^{'} x,

(3.6)

where $ϕ = (ϕ_{0}, ϕ_{1}^{'})^{'}$ is a ( $p + 1$ )’dimensional vector of regression coefficients. Following Berkson and Gage (1952) and Chen et al. (1999)), we assume a cure rate model for $T_{p}$ with

P (T_{P} = \infty | x, θ^{*}, δ = 0) = 1

(3.7)

and a Cox-type model when $δ = 1$ with

\begin{matrix} P (T_{P} > t | λ_{10}, β_{1}, α_{1}, x, θ^{*}, θ, δ = 1) \\ = & exp [- \int_{0}^{t} λ_{10} (u) exp {β_{1} (Γ θ^{*} + θ)^{'} g (u) + α_{1}^{'} x} du], \end{matrix}

(3.8)

where $λ_{10} (t)$ is a proper baseline hazard function such that $\int_{0}^{\infty} λ_{10} (t) dt = \infty$ , $β_{1}$ is a regression coefficient corresponding to the longitudinal trajectory function $(Γ θ^{*} + θ)^{'} g (t)$ and $α_{1}$ is a $p$ -dimensional vector of regression coefficients corresponding to the baseline covariates. From (3.7) and (3.8), we see that $T_{P}$ can be written as

T_{P} = \infty \times 1 {δ = 0} + T_{P}^{*} \times 1 {δ = 1},

(3.9)

where $\infty \times 0 = 0$ , $T_{P}^{*}$ is a proper random survival time that follows the distribution specified in (3.8) and $1 {A}$ denotes the indicator function, which takes a value of 1 if $A$ is true and 0 otherwise.

We assume a time-varying covariates Cox model for $T_{D}$ with hazard function defined as

\begin{matrix} λ_{2} (t | λ_{20}, β_{2}, θ^{*}, θ, α_{2}, η, x, z (t)) = & λ_{20} (t) exp {β_{2} (Γ θ^{*} + θ)^{'} g (t) + α_{2}^{'} x + η^{'} z (t)}, \end{matrix}

(3.10)

where $β_{2}$ is a regression coefficient corresponding to the longitudinal trajectory function $(Γ θ^{*} + θ)^{'} g (t)$ , $α_{2}$ is a $p$ -dimensional vector of regression coefficients and $z (t)$ is the history of time-dependent covariate up to $t$ with $η$ being a vector of the corresponding regression coefficients.

Remark 3.2: We note that although we use $x$ to denote a vector of baseline covariates, different subsets of baseline covariates may be used in (3.1, 3.6, 3.8) and (3.10), respectively. Also $x$ can be different in the longitudinal and survival models. For the notational simplicity, we use the same $x$ for baseline covariates throughout the article.

Remark 3.3: Let $V_{T_{P}} (t) = 1 {T_{P} \leq t}$ . The variable $V_{T_{P}} (t)$ essentially tracks the status of disease progression so that $V_{T_{P}} (t) = 0$ if the patient has not had disease progression at time $t$ and $V_{T_{P}} (t) = 1$ if the patient has had disease progression before time $t$ . Similarly, we denote $V_{T_{SW}} (t) = 1$ if the subject has switched treatment at time $t$ , and $V_{T_{SW}} (t) = 0$ if the subject has switched treatment at time $t$ . Suppose that $R$ denotes the indicator for the randomized treatment such that $R = 1$ if the subject is on the treatment arm and $R = 0$ if the subject is on the control arm. In (3.10), we take $z (t) = (V_{T_{P}} (t), V_{T_{SW}} (t), R \times V_{T_{SW}} (t))$ .

Remark 3.4: From the proposed joint model for the longitudinal data and survival data, we see that $θ^{*}$ captures the dependence between the $Y (t)$ and $(T_{P}, T_{D})$ , while the joint distribution of $T_{P}$ and $T_{D}$ is specified by the product of the marginal distribution $[T_{P}]$ and the conditional distribution $[T_{D} | T_{P}]$ . Since the event ‘ $δ = 0$ ’ is not observed, $ϕ$ is only partially identifiable as shown in Chen et al. (1999). Thus, in our analysis of the head and neck cancer data and the simulation study, we specify a moderate informative prior for $ϕ$ .

Remark 3.5: The regression coefficient $β_{1}$ in (3.8) ( $β_{2}$ in (3.10)) captures the association between the longitudinal measure $Y (t)$ and $T_{P}$ ( $T_{D}$ ). If $β_{1} = 0$ and $β_{2} = 0$ , $Y (t)$ is independent of $T_{P}$ and $T_{D}$ . If $β_{1} > 0$ , an increase in the longitudinal measure $Y (t)$ is associated with a shorter disease programme time $T_{P}$ . Similarly, if $β_{2} > 0$ , an increase in $Y (t)$ is associated with a worse overall survival mortality. In addition, the longitudinal trajectory function $(Γ θ^{*} + θ)^{'} g (t)$ is directly included in the hazard function for $T_{P}$ in (3.8) and the hazard function for $T_{D}$ in (3.10) as a time-dependent covariate. Therefore, the proposed joint model is the TM.

3.3 The likelihood function

Suppose that there are $n$ subjects. For the $i th$ subject, let $y_{i} (t)$ denote the longitudinal measure, which is observed at time $t \in {a_{i 1}, a_{i 2}, \dots, a_{i m_{i}}}$ , where $0 \leq a_{i 1} < a_{i 2} < \dots < a_{{im}_{i}}$ and $m_{i} \geq 0$ . Note that $y_{i} (0)$ corresponds to the baseline value. Also let $x_{i}$ denote a $p$ -dimensional vector of covariates, which may include the randomized treatment. Let $t_{i}$ and $ν_{i}$ denote the overall survival time and the right-censored indicator, respectively, where $ν_{i} = 1$ if $t_{i}$ is a death time and 0 if $t_{i}$ is right-censored for the $i th$ subject. Also let $t_{Pi}$ be the disease progression time and $δ_{i}$ be the status of disease progression. If disease progression has been observed, then $t_{Pi} < t_{i}$ and $δ_{i} = 1$ . However, if disease progression has not yet been observed at time $t_{i}$ , $t_{Pi}$ and $δ_{i}$ are not observed. Let $t_{SWi}$ denote treatment switch time, and if treatment switch has not been observed yet at time $t_{i}$ , $t_{SWi}$ is not observed. Let $z_{i} (t) = (V_{t_{Pi}} (t), V_{t_{SWi}} (t), R \times V_{t_{SWi}} (t))$ , where $t \leq t_{i}$ .

Write $y_{i} = (y_{i} (a_{i 1}), \dots, y_{i} (a_{{im}_{i}}))^{'}$ and $W_{i} = (W_{i 1}, W_{i 2})$ , which is an $m_{i} \times (q + 1 + p)$ matrix with $m_{i} \times (q + 1)$ matrix $W_{i 1} = (g (a_{ij})^{'}, j = 1, \dots, m_{i})$ and $m_{i} \times p$ matrix $W_{i 2} = (x_{i}^{'}, j = 1, \dots, m_{i})$ . Then, the complete data likelihood function for the longitudinal data $y_{i}$ is given by

\begin{matrix} f (y_{i} | γ, σ^{2}, Γ, θ_{i}^{*}, θ, x_{i}) = & \frac{1}{(2 π σ^{2})^{\frac{m_{i}}{2}}} exp {- \frac{1}{2 σ^{2}} (y_{i} - W_{i 1}^{*} θ_{i}^{*} - W_{i} (θ^{'}, γ^{'})^{'})^{'} \\ (y_{i} - W_{i 1}^{*} θ_{i}^{*} - W_{i} (θ^{'}, γ^{'})^{'})}, \end{matrix}

(3.11)

where $θ_{i}^{*}$ is a $(q + 1)$ -dimensional vector of random effects, $W_{i 1}^{*} = W_{i 1} Γ$ and, $W_{i} (θ^{'}, γ^{'})^{'} = W_{i 1} θ + W_{i 2} γ$ .

Using (3.2), the density of $θ_{i}^{*}$ is given by

f (θ_{i}^{*}) = \frac{1}{(2 π)^{\frac{q + 1}{2}}} exp {- \frac{1}{2} (θ_{i}^{*})^{'} (θ_{i}^{*})},

(3.12)

for $i = 1, \dots, n$ .

Based on the definition of $z (t)$ , we see that $z (t)$ depends on $t_{Pi}$ and $t$ only through $V_{t_{Pi}} (t) = 1 {t_{Pi} \leq t}$ . Write $p (ϕ, x_{i}) = \frac{exp (ϕ_{0} + ϕ_{1}^{'} x_{i})}{1 + exp (ϕ_{0} + ϕ_{1}^{'} x_{i})}$ . We also let $ζ_{i}$ denote the group indicator.

Define the survival function for $T_{D}$ $P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) = exp [- \int_{0}^{t_{i}} λ_{20} (u)$ $exp {β_{2} ((Γ θ_{i}^{*} + θ)^{'} g (u)) + α_{2}^{'} x_{i} + η^{'} z_{i} (u)} du]$ , and the density function $f_{2} (t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) = λ_{20} (t_{i}) exp {β_{2} ((Γ θ_{i}^{*} + θ)^{'} g (t_{i})) + α_{2}^{'} x_{i} + η^{'} z_{i} (t_{i})} P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ)$ . Define the survival function for $T_{P}$ $P (T_{P} > t_{i} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1) = exp [- \int_{0}^{t_{i}} λ_{10} (u) exp {β_{1} ((Γ θ_{i}^{*} + θ)^{'} g (u)) + α_{1}^{'} x_{i}} du]$ , and the density function $f_{1} (t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1) = λ_{10} (t_{Pi}) exp {β_{1} ((Γ θ_{i}^{*} + θ)^{'} g (t_{Pi})) + α_{1}^{'} x_{i}} P (T_{P} > t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)$ .

Under the proposed cure rate and time-varying covariates model, the observed data can be divided into four groups of observations. We derive the likelihood function for each of the four groups as follows.

Group 1. Subjects are observed to die at time $t_{i}$ and no disease progressions have been incurred. The survival data consists of $D_{i 1} = {t_{i}, ν_{i} = 1, x_{i}, z_{i} (t)}$ , group indicator $ζ_{i} = 1$ and $δ_{i}$ is not observed. Since $t_{Pi} = \infty$ when $δ_{i} = 0$ and $t_{Pi} \geq t_{i}$ when $δ_{i} = 1$ . The likelihood function is given by

\begin{matrix} L (ϕ, λ_{10}, & β_{1}, α_{1}, λ_{20}, β_{2}, α_{2}, η | D_{i 1}, θ_{i}^{*}) = f_{2} (t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) \\ \times (1 - p (ϕ, x_{i}) + p (ϕ, x_{i}) P (T_{P} > t_{i} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)) . \end{matrix}

(3.13)

Group 2. Subjects are observed to have disease progression at time $t_{Pi}$ and then die at $t_{i}$ . The survival data consists of $D_{i 2} = {t_{i}, ν_{i} = 1, t_{Pi}, δ_{i} = 1, x_{i}, z_{i} (t)}$ , $t_{i} > t_{Pi}$ , and $ζ_{i} = 2$ . The likelihood function is given by

\begin{matrix} L (ϕ, λ_{10}, & β_{1}, α_{1}, λ_{20}, β_{2}, α_{2}, η | D_{i 2}, θ_{i}^{*}) = f_{2} (t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) p (ϕ, x_{i}) \\ \times f_{1} (t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1) \end{matrix}

(3.14)

Group 3. Subjects are observed to have disease progression at time $t_{Pi}$ and then censor at $t_{i}$ . The survival data consists of $D_{i 3} = {t_{i}, ν_{i} = 0, t_{Pi}, δ_{i} = 1, x_{i}, z_{i} (t)}$ , $t_{i} > t_{Pi}$ , and $ζ_{i} = 3$ . The likelihood function takes the form

\begin{matrix} L (ϕ, λ_{10}, & β_{1}, α_{1}, λ_{20}, β_{2}, α_{2}, η | D_{i 3}, θ_{i}^{*}) = P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) \\ \times p (ϕ, x_{i}) f_{1} (t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1) . \end{matrix}

(3.15)

Group 4. Subjects are censored at $t_{i}$ and no disease progression occurs before $t_{i}$ . The survival data consists of $D_{i 4} = {t_{i}, ν_{i} = 0, x_{i}, z_{i} (t)}$ , $ζ_{i} = 4$ and $δ_{i}$ is not observed. The likelihood function is written as

\begin{matrix} L (ϕ, λ_{10}, & β_{1}, α_{1}, λ_{20}, β_{2}, α_{2}, η | D_{i 4}, θ_{i}^{*}) = P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) \\ \times (1 - p (ϕ, x_{i}) + p (ϕ, x_{i}) P (T_{P} > t_{i} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)) . \end{matrix}

(3.16)

Remark 3.6: The disease progression is a non-terminal event while the death is a terminal event. The overall survival time $T_{D}$ censors the disease progression time $T_{P}$ , but not vice versa. Thus, the disease progress and the death constitute the semicompeting risks. Under the proposed model, these semi-competing risks are factored into the constructions of the likelihood functions in Groups 1 and 4.

4 Bayesian inference

4.1 Prior and posterior distributions

In (3.8) and (3.10), we assume a piecewise constant hazard function for $λ_{j 0} (t)$ defined as

λ_{j 0} (t) = λ_{jk}, t \in (s_{j, k - 1}, s_{jk}] for k = 1, \dots, K_{j},

(4.1)

where $0 = s_{j 0} < s_{j 1} < s_{j 2} < \dots < s_{j, K_{j} - 1} < s_{{jK}_{j}} = \infty$ is a finite partition of the time axis, where $K_{j} \geq 1$ for $j = 1, 2$ . Write $λ_{j} = (λ_{j 1}, \dots, λ_{{jK}_{j}})^{'}$ for $j = 1, 2$ . We assume independent normal priors for $γ$ , $θ$ , $ϕ$ , $β_{1}$ , $α_{1}$ , $β_{2}$ , $α_{2}$ and $η$ with $γ \sim N (μ_{01}, V_{01})$ , $θ \sim N (μ_{02}, V_{02})$ $ϕ \sim N (μ_{03}, V_{03})$ $(β_{1}^{'}, α_{1}^{'})^{'} \sim N (μ_{04}, V_{04})$ , and $(β_{2}^{'}, α_{2}^{'}, η^{'})^{'} \sim N (μ_{05}, V_{05})$ , where $μ_{0 j}$ and $V_{0 j}$ (a positive definite matrix) for $j = 1, \dots, 5$ are pre-specified hyperparameters. An inverse gamma (IG) prior is specified for $σ^{2}$ , that is, $σ^{2} \sim IG (a_{0}, b_{0})$ with mean $b_{0} / (a_{0} - 1)$ , where $a_{0} \geq 0$ and $b_{0} \geq 0$ are pre-specified. When $q = 1$ , we write $Γ = [\begin{matrix} a & 0 \\ c & b \end{matrix}]$ where $a = exp (a^{*})$ and $b = exp (b^{*})$ . Then, we assume normal prior $N (μ_{06}, V_{06})$ for $(a^{*}, b^{*}, c)$ . We further assume independent gamma priors for $λ_{1}$ and $λ_{2}$ with $λ_{jk} \sim Gamma (a_{j}, b_{j})$ , where $a_{j} \geq 0$ and $b_{j} \geq 0$ for $k = 1, \dots, K_{j}$ and $j = 1, 2$ . In Section 5 (the real data analysis), the pre-specified hyperparameters are given by $μ_{01} = μ_{02} = μ_{03} = μ_{04} = μ_{05} = μ_{06} = 0$ , $V_{01} = 1000 I_{p}$ , $V_{02} = 1000 I_{q + 1}$ , $V_{03} = 10 I_{p + 1}$ , $V_{04} = 10000 I_{p_{3}}$ with $p_{3} = \dim ((β_{1}^{'}, α_{1}^{'})^{'})$ , $V_{05} = 10000 I_{p_{4}}$ with $p_{4} = \dim ((β_{2}^{'}, α_{2}^{'}, η^{'})^{'})$ , $V_{06} = 10 I_{2}$ $a_{0} = 0.001$ , $b_{0} = 0.001$ , $d_{0} = q + 2$ , $Ω_{0} = 0.001 I_{q + 1}$ , and $a_{j} = 0.001$ and $b_{j} = 0.001$ for $j = 1, 2$ , where $I_{p + q + 1}$ , $I_{p + 1}$ , $I_{p_{3}}$ , $I_{p_{4}}$ and $I_{2}$ are the identity matrices. These choices of the hyperparameters yield noninformative priors for all the model parameters except for $ϕ$ .

Let $ϕ_{1} = (γ, σ^{2}, θ, Γ)$ , $ϕ_{2} = (ϕ, ϕ_{21})$ , where $ϕ_{21} = (λ_{1}, β_{1}, α_{1})$ , and $ϕ_{3} = (λ_{2}, β_{2}, α_{2}, η)$ denote the collections of the model parameters for the longitudinal component, the survival component for the disease progression time and the survival component for the overall survival time, respectively, under the joint model. We further let $ϕ = (ϕ_{1}, ϕ_{2}, ϕ_{3})$ be the collection of all the model parameters. Then, the joint prior of $ϕ$ is given by

\begin{matrix} π (ϕ) = & π (γ) π (θ) π (σ^{2}) π (a^{*}, b^{*}, c) π (ϕ) π (β_{1}, α_{1}) π (β_{2}, α_{2}, η) π (λ_{1}) π (λ_{2}) . \end{matrix}

(4.2)

Let $D_{obs} = (y_{i}, D_{i 1} 1 {ζ_{i} = 1}, D_{i 2} 1 {ζ_{i} = 2}, D_{i 3} 1 {ζ_{i} = 3}, D_{i 3} 1 {ζ_{i} = 4}, i = 1, \dots, n)$ denote the observed data, where the indictor function $1 {ζ_{i} = j} = 1$ if $ζ_{i} = j$ and 0 if $ζ_{i} \neq j$ for $j = 1, \dots, 4$ . The observed likelihood function is

\begin{matrix} L (ϕ | D_{obs}) = \prod_{i = 1}^{n} \int [f (y_{i} | γ, σ^{2}, θ, Γ, θ_{i}^{*}, x_{i}) \prod_{j = 1}^{4} {L (ϕ_{1}, ϕ_{2} | D_{ij}, θ_{i}^{*})}^{1 {ζ_{i} = j}}] f (θ_{i}^{*}) d θ_{i}^{*}, \end{matrix}

(4.3)

where $f (y_{i} | γ, σ^{2}, θ, Γ, θ_{i}^{*}, x_{i})$ , $f (θ_{i}^{*})$ , and $L (ϕ_{1}, ϕ_{2} | D_{ij}, θ_{i}^{*})$ , $j = 1, \dots, 4$ , are given in (3.11, 3.12, 3.13) to (3.15) and (3.16), respectively, and $π (ϕ)$ is defined by (4.2). The joint posterior distribution of $ϕ$ given the observed data is written as

\begin{matrix} π (ϕ | D_{obs}) = \frac{L (ϕ | D_{obs}) π (ϕ)}{c (D_{obs})}, \end{matrix}

(4.4)

where

\begin{matrix} c (D_{obs}) = \int L (ϕ | D_{obs}) π (ϕ) d ϕ . \end{matrix}

(4.5)

4.2 Computational development

To ease exposition, we present the computational algorithm only for $β_{2} (Γ θ_{i}^{*} + θ)^{'} g (t)$ and $β_{1} (Γ θ_{i}^{*} + θ)^{'} g (t)$ in the survival functions. We can derive a similar function when assuming the term $β_{1} ((Γ θ_{i}^{*} + θ)^{'} g ({Tmax}_{i}))$ and $β_{2} (Γ θ_{i}^{*} + θ)^{'} g ({Tmax}_{i})$ after certain time point ${Tmax}_{i}$ . Models as a similar algorithm can be developed for the SPM. We introduce a latent vector $δ^{*} = (δ_{i}^{*} : ζ_{i} = 1 or ζ_{i} = 4, 1 \leq i \leq n)^{'}$ . $P (δ_{i}^{*} = 1) = \frac{p (ϕ, x_{i}) * P (T_{P} > t_{i} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)}{1 - p (ϕ, x_{i}) + p (ϕ, x_{i}) * P (T_{P} > t_{i} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)}$ and $P (δ_{i}^{*} = 0) = 1 - P (δ_{i}^{*} = 1)$ . Let ${θ^{*}}^{R} = ({θ_{1}^{*}}^{'}, \dots, {θ_{n}^{*}}^{'})^{'}$ denote the vector of the random effects. Then, the augmented posterior distribution of $π (ϕ | D_{obs})$ in (4.4) can be written as

\begin{matrix} π (ϕ, δ^{*}, {θ^{*}}^{R} | D_{obs}) \\ \propto \prod_{i = 1}^{n} ({f_{2} (t_{i} ∣ φ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) {[1 - p (ϕ, x_{i})]}^{(1 - δ_{i}^{*})} [p (ϕ, x_{i}) \\ \times {f_{2} (t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) p (ϕ, x_{i}) f_{1} (t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)}^{1 {ζ_{i} = 2}} \\ \times {P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) p (ϕ, x_{i}) f_{1} (t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)}^{1 {ζ_{i} = 3}} \\ \times {P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ) [1 - p (ϕ, x_{i})]^{(1 - δ_{i}^{*})} \\ p (ϕ, x_{i}) P (T_{P} > t_{i} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)]^{δ_{i}^{*}}}^{1 {ζ_{i} = 4}} \\ \times f (y_{i} | γ, σ^{2}, θ, Γ, θ_{i}^{*}, x_{i}) f (θ_{i}^{*})) \times π (ϕ), \end{matrix}

(4.6)

where $f_{2} (t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ)$ , $P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ)$ , $f_{1} (t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)$ and $P (T_{P} > t_{i} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)$ are defined in Section 3.3. $f (y_{i} | γ, σ^{2}, θ, Γ, θ_{i}^{*}, x_{i})$ , $f (θ_{i}^{*})$ , $π (ϕ)$ are given in (3.11, 3.12) and (4.2). We note that $\int \sum_{δ^{*}} π (ϕ, δ^{*}, θ^{R} | D_{obs}) d θ^{R} = π (ϕ | D_{obs})$ .

Although an analytical evaluation of the posterior distribution in (4.4) is not possible, a convenient Gibbs sampling algorithm can be developed to sample from the augmented posterior distribution in (4.6). Specifically, we sample from the following full conditional distributions in turn: (i) $[γ | σ^{2}, Γ, θ^{* R}, θ, D_{obs}]$ ; (ii) $[σ^{2} | θ^{* R}, θ, Γ, D_{obs}, γ]$ ; (iii) $[(a^{*}, b^{*}, c) | γ, σ^{2}, θ, θ^{* R}, D_{obs}]$ ; (iv) $[δ^{*} | ϕ_{2}, Γ, θ_{i}^{*}, θ, D_{obs}]$ ; (v) $[ϕ | δ^{*}, D_{obs}]$ ; (vi) $[β_{1}, α_{1} | λ_{1}, Γ, θ^{* R}, θ, δ^{*}, D_{obs}]$ ; (vii) $[λ_{1} | β_{1}, α_{1}, Γ, θ^{* R}, θ, δ^{*}, D_{obs}]$ ; (viii) $[β_{2}, α_{2}, η | λ_{2}, Γ, θ^{* R}, θ, D_{obs}]$ ; (ix) $[λ_{2} | α_{2}, β_{2}, Γ, θ^{* R}, θ, D_{obs}]$ ; (x) $[θ^{* R} | ϕ_{1}$ , $ϕ_{21}, ϕ_{3}, δ^{*}, D_{obs}]$ ; and (xi) $[θ | ϕ_{1}$ , $ϕ_{21}, ϕ_{3}, δ^{*}, θ^{* R}, D_{obs}]$ . We briefly discuss the full conditional distributions in Section S.1 in the supplementary materials.

4.3 Bayesian model assessment

Let $f (y_{i} | ϕ_{1}, x_{i}) = \int f (y_{i} | ϕ_{1}, θ_{i}^{*}, x_{i}) f (θ_{i}^{*}) d θ_{i}^{*}$ , where marginal distribution of $y_{i}$ follows $y_{i} | ϕ_{1}^{*}, x_{i} \sim N (W_{i} (θ^{'}, γ^{'})^{'}, σ^{2} I_{m_{i}} + W_{i 1} Γ Γ^{'} W_{i 1}^{'})$ . Let $f (t_{Pi}, δ_{i} | ϕ_{1}, ϕ_{2}, θ_{i}^{*}, x_{i}) = {1 - p (ϕ, x_{i}) + p (ϕ, x_{i}) P (T_{P} > t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)}^{1 {ζ_{i} = 1} + 1 {ζ_{i} = 4}} {p (ϕ, x_{i}) f_{1} (t_{Pi} | ϕ_{21}, x_{i}, Γ, θ_{i}^{*}, θ, δ = 1)}^{1 {ζ_{i} = 2} + 1 {ζ_{i} = 3}}$ , $f (t_{Pi}, δ_{i}, y_{i}, θ_{i}^{*} | ϕ_{2}, ϕ_{1}, x_{i}) = f (t_{Pi}, δ_{i} | ϕ_{2}, Γ, θ_{i}^{*}, θ, x_{i}) f (y_{i} | ϕ_{1}, θ_{i}^{*}, x_{i}) f (θ_{i}^{*})$ , and $f (t_{Pi}, δ_{i}, y_{i} | ϕ_{1}, ϕ_{2}, x_{i}) = \int f (t_{Pi}, δ_{i}, y_{i}, θ_{i}^{*} | ϕ_{2}, ϕ_{1}, x_{i}) d θ_{i}^{*}$ , $f (t_{i} | ϕ_{1}, ϕ_{3}, θ_{i}^{*}, x_{i}, z_{i} (t), δ_{i}, ν_{i}) = {f_{2} (t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ)}^{1 {ζ_{i} = 1} + 1 {ζ_{i} = 2}} {P (T_{D} > t_{i} | ϕ_{3}, x_{i}, z_{i} (t), Γ, θ_{i}^{*}, θ)}^{1 {ζ_{i} = 3} + 1 {ζ_{i} = 4}}$ .

Define the joint distribution of of $(y_{i}, t_{Pi}, δ_{i}, t_{i}, θ_{i}^{*})$ to be

\begin{matrix} f (y_{i}, t_{Pi}, & δ_{i}, t_{i}, θ_{i}^{*} | ϕ, x_{i}, z_{i} (t), ν_{i}) = f (t_{i} | ϕ_{1}, ϕ_{3}, θ_{i}^{*}, x_{i}, z_{i} (t), δ_{i}, ν_{i}) \\ f (t_{Pi}, δ_{i} | ϕ_{2}, Γ, θ_{i}^{*}, θ, x_{i}) f (y_{i} | ϕ_{1}, θ_{i}^{*}, x_{i}) f (θ_{i}^{*}), \end{matrix}

(4.7)

and the marginal distribution $(y_{i}, t_{Pi}, δ_{i}, t_{i})$

\begin{matrix} f (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ, x_{i}, z_{i} (t), ν_{i}) = \int f (y_{i}, t_{Pi}, δ_{i}, t_{i}, θ_{i}^{*} | ϕ, x_{i}, z_{i} (t), ν_{i}) d θ_{i}^{*} . \end{matrix}

(4.8)

Based on the joint distributions, define the following conditional distributions: $f (t_{Pi}, δ_{i} | ϕ_{2}$ , $ϕ_{1}, x_{i}, y_{i}) = f (t_{Pi}, δ_{i}, y_{i} | ϕ_{1}, ϕ_{2}, x_{i}) / f (y_{i} | ϕ_{1}, x_{i})$ and $f (t_{i} | ϕ, x_{i}, z_{i} (t), y_{i}, δ_{i}, ν_{i}) = f (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ, x_{i}, z_{i} (t), ν_{i}) / f (t_{Pi}, δ_{i}, y_{i} | ϕ_{1}, ϕ_{2}, x_{i})$ .

4.3.1 DIC decomposition

The deviance information criterion (DIC; Spiegelhalter et al. (2002)) for the model is defined as

DIC = Dev (\overset{̅}{ϕ}) + 2 p_{D},

(4.9)

where $Dev (ϕ) = - 2 log L (ϕ | D_{obs})$ , $p_{D} = \overset{̅}{Dev} (ϕ) - Dev (\overset{̅}{ϕ})$ , where $\overset{̅}{ϕ}$ and $\overset{̅}{Dev} (ϕ)$ are the posterior means of $ϕ$ and $Dev (ϕ)$ , respectively, with respect to the posterior distribution in (4.4).

To assess the fit of the longitudinal component of the model and the fit of each survival component, we decompose DIC into three parts:

\begin{matrix} DIC = {DIC}_{Long} + {DIC}_{Pg | Long} + {DIC}_{Surv | Pg, Long}, \end{matrix}

where ${DIC}_{Long} = {Dev}_{Long} (\overset{̅}{ϕ}) + 2 p_{D [Long]}$ , ${DIC}_{Pg | Long} = {Dev}_{Pg | Long} (\overset{̅}{ϕ}) + 2 p_{D [Pg | Long]}$ , and ${DIC}_{Surv | Pg, Long} = {Dev}_{Surv | Pg, Long} (\overset{̅}{ϕ}) + 2 p_{D [Surv | Pg, Long]}$ .

Write ${Dev}_{Long} (\overset{̅}{ϕ}) = - 2 \sum_{i = 1}^{n} log f (y_{i} | \overset{̅}{ϕ_{1}}, x_{i})$ , $p_{D [Long]} = E [- 2 \sum_{i = 1}^{n} log f (y_{i} | ϕ_{1}, x_{i}) | D_{obs}] + 2 \sum_{i = 1}^{n} log f (y_{i} | \overset{̅}{ϕ_{1}}, x_{i})$ , ${Dev}_{Long} (\overset{̅}{ϕ}) + {Dev}_{Pg | Long} (\overset{̅}{ϕ}) = - 2 \sum_{i = 1}^{n} log \int [f (t_{Pi}, δ_{i} | \overset{̅}{ϕ_{1}}, \overset{̅}{ϕ_{2}}, θ_{i}^{*}, x_{i})$ $\times f (y_{i} | \overset{̅}{ϕ_{1}}, θ_{i}^{*}, x_{i}) f (θ_{i}^{*})] d θ_{i}^{*}$ , and $p_{D [Long]} + p_{D [Pg | Long]} = E [- 2 \sum_{i = 1}^{n} log \int f (t_{Pi}, δ_{i} | ϕ_{1}, ϕ_{2}, θ_{i}^{*}$ , $x_{i}) f (y_{i} | ϕ_{1}, θ_{i}^{*}, x_{i}) f (θ_{i}^{*}) d θ_{i}^{*} | D_{obs}] + 2 \sum_{i = 1}^{n} log \int f (t_{Pi}, δ_{i} | \overset{̅}{ϕ_{1}}, \overset{̅}{ϕ_{2}}, θ_{i}^{*}, x_{i}) f (y_{i} | \overset{̅}{ϕ_{1}}, θ_{i}^{*}, x_{i}) f (θ_{i}^{*}) d θ_{i}^{*}$ . Then, we have ${Dev}_{Long} (\overset{̅}{ϕ}) + {Dev}_{Pg | Long} (\overset{̅}{ϕ}) + {Dev}_{Surv | Pg, Long} (\overset{̅}{ϕ}) = \overset{̅}{Dev} (ϕ)$ and $p_{D [Long]} + p_{D [Pg | Long]} + p_{D [Surv | Pg, Long]} = p_{D}$ .

When we fit the survival data alone, assuming $β_{2} = 0$ and $η = 0$ . Define $λ_{2}^{*} (t | λ_{20}, α_{2}, x) = λ_{20} (t) exp {α_{2}^{'} x}$ , and $f_{0} (t_{i} | λ_{2}, α_{2}, x_{i}, ν_{i}) = [λ_{2}^{*} (t_{i} | λ_{20}, α_{2}, x)]^{ν_{i}} exp [- \int_{0}^{t_{i}} λ_{2}^{*} (u | λ_{20}, α_{2}, x) du]$ .

\begin{matrix} {DIC}_{Surv 0} = {Dev}_{Surv 0} (\overset{̅}{λ_{2}}, \overset{̅}{α_{2}}) + 2 p_{D [Surv 0]}, \end{matrix}

where ${Dev}_{Surv 0} (\overset{̅}{λ_{2}}, \overset{̅}{α_{2}}) = - 2 \sum_{i = 1}^{n} log f_{0} (t_{i} | \overset{̅}{λ_{2}}, \overset{̅}{α_{2}}, x_{i}, ν_{i})$ and $p_{D [Surv 0]} = E [- 2 \sum_{i = 1}^{n} log {$ $f_{0} (t_{i} | λ_{2}, α_{2}, x_{i}, ν_{i}) | D_{Surv, obs}}] + 2 \sum_{i = 1}^{n} log f_{0} (t_{i} | \overset{̅}{λ_{2}}, \overset{̅}{α_{2}}, x_{i}, ν_{i})$ . Define model assessment criterion:

\begin{matrix} Δ {DIC}_{Surv} = {DIC}_{Surv 0} - {DIC}_{Surv | Pg, Long} . \end{matrix}

$Δ {DIC}_{Surv}$ measures the additional information gained by adding time to disease progression and longitudinal components into modelling of overall survival data. Large values of $Δ {DIC}_{Surv}$ correspond to more gain in the fit of the overall survival data. We can also fit survival data alone with time-dependent covariates, assuming $β_{2} = 0$ .

Similarly, denote

\begin{matrix} Δ {DIC}_{Pg} = {DIC}_{Pg 0} - {DIC}_{Pg | Long}, \end{matrix}

where ${DIC}_{Pg 0}$ is obtained by assuming $β_{1} = 0$ (fitting the time to progression model alone). Define $λ_{1}^{*} (t | λ_{10}, α_{1}, x) = λ_{10} (t) exp {α_{1}^{'} x}$ , $P^{*} (T_{P} > t_{Pi} | λ_{10}, α_{1}, x_{i}, δ = 1) = exp [- \int_{0}^{t_{Pi}} λ_{1}^{*} (u | λ_{10}, α_{1}, x) du]$ , and $f_{0} (t_{Pi}, δ_{i} | λ_{1}, α_{1}, ϕ, x_{i}) = {1 - p (ϕ, x_{i}) + p (ϕ, x_{i}) P^{*} (T_{P} > t_{Pi} | λ_{10}, α_{1}, x_{i}, δ = 1)}^{1 {ζ_{i} = 1} + 1 {ζ_{i} = 4}} {p (ϕ, x_{i}) λ_{1}^{*} (t | λ_{10}, α_{1}, x) P^{*} (T_{P} > t_{Pi} | λ_{10}, α_{1}, x_{i}, δ = 1)}^{1 {ζ_{i} = 2} + 1 {ζ_{i} = 3}}$ . We have

{DIC}_{Pg 0} = {Dev}_{Pg 0} (\overset{̅}{λ_{1}}, \overset{̅}{α_{1}}, \overset{̅}{ϕ}) + 2 p_{D [Pg 0]},

where ${Dev}_{Pg 0} ({\overset{̅}{λ}}_{1}, {\overset{̅}{α}}_{1}, \overset{̅}{ϕ}) = - 2 \sum_{i = 1}^{n} log f_{0} (t_{Pi}, δ_{i} | \overset{̅}{λ_{1}}, \overset{̅}{α_{1}}, \overset{̅}{ϕ}, x_{i})$ and

p_{D [Pg 0]} = E [- 2 \sum_{i = 1}^{n} log f_{0} (t_{Pi}, δ_{i} | λ_{1}, α_{1}, ϕ, x_{i}) | D_{Pg, obs}] + 2 \sum_{i = 1}^{n} log f_{0} (t_{Pi}, δ_{i} | {\overset{̅}{λ}}_{1}, {\overset{̅}{α}}_{1}, \overset{̅}{ϕ}, x_{i}),

and $D_{Pg, obs} = (t_{Pi}, δ_{i}, x_{i} i = 1, \dots, n)$ . $Δ {DIC}_{Pg}$ measures the additional information gained by longitudinal components into modelling of progression data. Large values of $Δ {DIC}_{Pg}$ correspond to more gain in the fit of progression data.

As pointed out in Zhang et al. (2017), when the penalty for the additional parameters in the overall survival model (3.10) or the time to progression model (3.8) outweighs the gain of the fit in the respective survival component, $Δ {DIC}_{Surv}$ or $Δ {DIC}_{Pg}$ can be negative. When $Δ {DIC}_{Surv}$ ( $Δ {DIC}_{Pg}$ ) is zero or negative, the longitudinal measure is not associated with the overall survival (the time to disease progression). This phenomenon is also applied to the $Δ$ measures developed based on the CPO statistics below.

4.3.2 CPO decomposition and LPML decomposition

In this subsection, we first establish the conditional predictive ordinate (CPO) identity which will lead to the decomposition of CPO. CPO (Geisser and Eddy, 1979; Chen et al., 2000; Zhang et al., 2017) for the $i$ th subject has the following equivalent definition and identities:

\begin{matrix} {CPO}_{i} = \int f (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ, x_{i}, z_{i} (t), ν_{i}) π (ϕ | D_{obs}^{(- i)}) d ϕ, \\ {CPO}_{i} = \frac{c (D_{obs})}{c (D_{obs}^{(- i)})} = \frac{f (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ, x_{i}, z_{i} (t), ν_{i}) π (ϕ | D_{obs}^{(- i)})}{π (ϕ | D_{obs})}, \\ {CPO}_{i} = \frac{1}{\int \frac{1}{f (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ, x_{i}, z_{i} (t), ν_{i})} π (ϕ | D_{obs}) d ϕ}, \end{matrix}

(4.13)

where $c (D_{obs})$ is defined in (4.5), $c (D_{obs}^{(- i)}) = \int Π_{j \neq i}^{n} f (y_{j}, t_{Pj}, δ_{j}, t_{j} | ϕ, x_{j}, ν_{j}) π (ϕ) d ϕ$ , and $f (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ, x_{i}, ν_{i})$ is defined in (4.8).

Let $ϕ_{1}^{*}$ , $ϕ_{2}^{*}$ and $ϕ_{3}^{*}$ denote the posterior means $ϕ_{1}$ , $ϕ_{2}$ and $ϕ_{3}$ , and $ϕ^{*} = (ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}^{*})$ . We have

\begin{matrix} f (y_{i}, t_{Pi}, & δ_{i}, t_{i} | ϕ^{*}, x_{i}, z_{i} (t), ν_{i}) = f (t_{i} | ϕ^{*}, x_{i}, z_{i} (t), δ_{i}, ν_{i}, y_{i}) f (t_{Pi}, δ_{i} | y_{i}, ϕ_{1}^{*}, ϕ_{2}^{*}, x_{i}) \\ \times f (y_{i} | ϕ_{1}^{*}, x_{i}) . \end{matrix}

(4.14)

We also observe that $π (ϕ^{*} | D_{obs}^{(- i)}) = π (ϕ_{1}^{*} | D_{obs}^{(- i)}) π (ϕ_{2}^{*} | ϕ_{1}^{*}, D_{obs}^{(- i)}) π (ϕ_{3}^{*} | ϕ_{2}^{*}, ϕ_{1}^{*}, D_{obs}^{(- i)})$ , $π (ϕ^{*} | D_{obs}) = π (ϕ_{1}^{*} | D_{obs}) π (ϕ_{2}^{*} | ϕ_{1}^{*}, D_{obs}) π (ϕ_{3}^{*} | ϕ_{2}^{*}, ϕ_{1}^{*}, D_{obs})$ . We can propose the CPO decomposition:

\begin{matrix} {CPO}_{i} = {CPO}_{i, Long} * {CPO}_{i, Pg | Long} * {CPO}_{i, Surv | Pg, Long}, \end{matrix}

(4.15)

where ${CPO}_{i, Long} = \frac{f (y_{i} | ϕ_{1}^{*}, x_{i}) π (ϕ_{1}^{*} | D_{obs}^{(- i)})}{π (ϕ_{1}^{*} | D_{obs})}$ , ${CPO}_{i, Pg | Long} = \frac{f (t_{Pi}, δ_{i} | y_{i}, ϕ_{1}^{*}, ϕ_{2}^{*}, x_{i}) π (ϕ_{2}^{*} | ϕ_{1}^{*}, D_{obs}^{(- i)})}{π (ϕ_{2}^{*} | ϕ_{1}^{*}, D_{obs})}$ , ${CPO}_{i, Surv | Pg, Long} = \frac{f (t_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}^{*}, x_{i}, t_{Pi}, δ_{i}, ν_{i}) π (ϕ_{3}^{*} | ϕ_{2}^{*}, ϕ_{1}^{*}, D_{obs}^{(- i)})}{π (ϕ_{3}^{*} | ϕ_{2}^{*}, ϕ_{1}^{*}, D_{obs})}$ .

Let $D_{Long, obs}^{(- i)} = {(y_{j}, x_{j}), j = 1, . . ., i - 1, . . ., n}$ , $D_{Pg, obs}^{(- i)} = {(t_{Pj}, δ_{j}, x_{j}), j = 1, . . ., i - 1, . . ., n}$ , and $D_{Surv, obs}^{(- i)} = {(t_{j}, ν_{j}, x_{j}), j = 1, . . ., i - 1, . . ., n}$ . When we assume that $D_{Long, obs}$ , $D_{Pg, obs}$ and $D_{Surv, obs}$ are mutually independent. Assume $π (ϕ) = π (ϕ_{1}) π (ϕ_{2}) π (ϕ_{3})$ . We have

\begin{matrix} {CPO}_{i, Surv | Pg, Long} = {CPO}_{i, Surv 0} = \int f_{0} (t_{i} | λ_{2}, α_{2}, x_{i}, ν_{i}) π (ϕ_{3} | D_{Surv, obs}^{(- i)}) d ϕ_{3}, \\ {CPO}_{i, Pg | Long} = {CPO}_{i, Pg 0} = \int f_{0} (t_{Pi}, δ_{i} | λ_{1}, α_{1}, ϕ, x_{i}) π (ϕ_{2} | D_{Pg, obs}^{(- i)}) d ϕ_{2} . \end{matrix}

From Section S.2 in the supplementary materials, we have the following identities for ${CPO}_{i}$ , ${CPO}_{i, Long}$ , ${CPO}_{i, Pg | Long}$ and ${CPO}_{i, Surv | Pg, Long}$ :

\begin{matrix} {CPO}_{i, Long} = {CPO}_{i} f (y_{i} | ϕ_{1}^{*}, x_{i}) \\ \times \int f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) π (ϕ_{2}, ϕ_{3} | ϕ_{1}^{*}, D_{obs}) d ϕ_{2} d ϕ_{3}, \\ {CPO}_{i, Pg | Long} = f (t_{Pi}, δ_{i} | y_{i}, ϕ_{1}^{*}, ϕ_{2}^{*}, x_{i}) \\ \times \frac{[\int f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) π (ϕ_{3} | ϕ_{1}^{*}, ϕ_{2}^{*}, D_{obs}) d ϕ_{3}]}{[\int f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) π (ϕ_{2}, ϕ_{3} | ϕ_{1}^{*}, D_{obs}) d ϕ_{2} d ϕ_{3}]}, \\ {CPO}_{i, Surv | Pg, Long} = [\int f^{- 1} (t_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}, x_{i}, t_{Pi}, δ_{i}, ν_{i}, y_{i}) π (ϕ_{3} | ϕ_{1}^{*}, ϕ_{2}^{*}, D_{obs}) d ϕ_{3}]^{- 1} . \end{matrix}

The logarithm of the pseudo-marginal likelihood (LPML; Ibrahim et al., 2001) is defined as

\begin{matrix} LPML = \sum_{i = 1}^{n} log ({CPO}_{i}) . \end{matrix}

Then LPML can be decomposed as

\begin{matrix} LPML = {LPML}_{Long} + {LPML}_{Pg | Long} + {LPML}_{Surv | Pg, Long}, \end{matrix}

where ${LPML}_{Long} = \sum_{i = 1}^{n} log ({CPO}_{i, Long})$ , ${LPML}_{Pg | Long} = \sum_{i = 1}^{n} log ({CPO}_{i, Pg | Long})$ and ${LPML}_{Surv | Pg, Long} = \sum_{i = 1}^{n} log ({CPO}_{i, Surv | Pg, Long})$ .

Under the same assumptions for the calculation of ${DIC}_{Surv 0}$ , define ${LPML}_{Surv 0} = \sum_{i = 1}^{n} log {CPO}_{i, Surv 0}$ . We propose the model assessment criterion:

\begin{matrix} Δ {LPML}_{Surv} = {LPML}_{Surv | Pg, Long} - {LPML}_{Surv 0} . \end{matrix}

$Δ {LPML}_{Surv}$ measures the additional information gained by adding time to disease progression and longitudinal components into modelling of overall survival data. Large values of $Δ {LPML}_{Surv}$ correspond to more gain in the fit of the overall survival data. Similarly, denote $Δ {LPML}_{Pg} = {LPML}_{Pg | Long} - {LPML}_{Pg 0}$ . ${LPML}_{Pg 0}$ is from assuming $β_{1} = 0$ and fit time to progression model alone. $Δ {LPML}_{Pg}$ measures the additional information gained by longitudinal components into modelling of progression data. Large values of $Δ {LPML}_{Pg}$ correspond to more gain in the fit of progression data.

Algorithm I. For $f (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}, ϕ_{2}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i})$ , $f (t_{Pi}, δ_{i}, y_{i} ϕ_{1}, ϕ_{2}, x_{i})$ can be approximated by adaptive Gaussian quadrature (AGQ).

Now, let ${ϕ_{b}, b = 1, \dots, B}$ denote a Gibbs sample of $ϕ$ . An estimate of ${CPO}_{i}^{- 1}$ is given by

{\hat{CPO}}_{i}^{- 1} = \frac{1}{B} \sum_{b = 1}^{B} f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{b}, x_{i}, z_{i} (t), ν_{i}) .

Next, fix $ϕ_{1} = ϕ_{1}^{*}$ , and let ${ϕ_{2 b}, ϕ_{3 b}, b = 1, \dots, B}$ denote a Gibbs sample of $(ϕ_{2}, ϕ_{3})$ .

\begin{matrix} {\hat{CPO}}_{i, Long} & = \hat{{CPO}_{i}} f (y_{i} | ϕ_{1}^{*}, x_{i}) \frac{1}{B} \sum_{b = 1}^{B} f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2 b}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i}) . \end{matrix}

Next, fix $ϕ_{1} = ϕ_{1}^{*}$ and $ϕ_{2} = ϕ_{2}^{*}$ , and let ${ϕ_{3 b}, b = 1, \dots, B}$ denote a Gibbs sample of $(ϕ_{3})$ . We can calculate $\sum_{b = 1}^{B} f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i})$ . Then we get

\begin{matrix} {\hat{CPO}}_{i, Pg | Long} & = \frac{f (t_{Si}, δ_{i}, y_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, x_{i})}{f (y_{i} | ϕ_{1}^{*}, x_{i})} \frac{\sum_{b = 1}^{B} f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i})}{\sum_{b = 1}^{B} f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2 b}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i})} . \end{matrix}

Finally, ${\hat{CPO}}_{i, Surv | Pg, Long} = \frac{{\hat{CPO}}_{i}}{{\hat{CPO}}_{i, Long} {\hat{CPO}}_{i, Pg | Long}}$ .

Algorithm II. Following Section S.2 of the supplementary materials, Let $w_{i} (θ_{i}^{*})$ be a normalized weight function such that $\int w_{i} (θ_{i}^{*}) d θ_{i}^{*} = 1$ . $\hat{{CPO}_{i}^{- 1}} = \int w_{i} (θ_{i}^{*}) f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i}, θ_{i}^{*} | ϕ, x_{i}, z_{i} (t), ν_{i}) π (ϕ, θ^{* R} | D_{obs}) d θ^{* R} d ϕ$ . The opti- mal choice for $w_{i} (θ_{i}^{*})$ in minimizing the variance of the Monte Carlo estimator for ${\hat{CPO}}_{i}$ is $\frac{f (y_{i}, t_{Si}, δ_{i}, t_{i}, θ_{i}^{*} | ϕ, x_{i}, z_{i} (t), ν_{i})}{f (y_{i}, t_{Si}, δ_{i}, t_{i} | ϕ, x_{i}, z_{i} (t), ν_{i})}$ . Now, let ${ϕ_{b}, θ_{b}^{* R}, b = 1, \dots, B}$ denote a Gibbs sample of $(ϕ, θ^{* R})$ . A estimate of ${CPO}_{i}^{- 1}$ is given by

{\hat{CPO}}_{i}^{- 1} = \frac{1}{B} \sum_{b = 1}^{B} w_{i} (θ_{ib}^{*}) f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i}, θ_{ib}^{*} | ϕ_{b}, x_{i}, z_{i} (t), ν_{i}) .

(4.10)

Similarly, $\int f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) π (ϕ_{3} | ϕ_{1}^{*}, ϕ_{2}^{*}, D_{obs}) d ϕ_{3} = \int w_{i} (θ_{i}^{*}) f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i}, θ_{i}^{*} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) π (ϕ_{3}, θ^{* R} | ϕ_{1}^{*}, ϕ_{2}^{*}, D_{obs}) d θ^{* R} d ϕ_{3}$ . The optimal choice for $w_{i} (θ_{i}^{*})$ in minimizing the variance of the Monte Carlo estimator is $f (y_{i}, t_{Pi}, δ_{i}, t_{i}, θ_{i}^{*} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i})$ . $\int f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) π (ϕ_{2}, ϕ_{3} | ϕ_{1}^{*}, D_{obs}) d ϕ_{2} d ϕ_{3} = \int w_{i} (θ_{i}^{*}) f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i}, θ_{i}^{*} | ϕ_{1}^{*}, ϕ_{2}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) π (ϕ_{2}, ϕ_{3}, θ^{* R} | ϕ_{1}^{*}, D_{obs}) d θ^{* R} d ϕ_{2} d ϕ_{3}$ . The optimal choice for $w_{i} (θ_{i}^{*})$ in minimizing the variance of the Monte Carlo estimator is $f (y_{i}, t_{Pi}, δ_{i}, t_{i}, θ_{i}^{*} | ϕ_{1}^{*}, ϕ_{2}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i}) f^{- 1} (y_{i}, t_{Pi}, δ_{i}, t_{i} | ϕ_{1}^{*}, ϕ_{2}, ϕ_{3}, x_{i}, z_{i} (t), ν_{i})$ .

Next, fix $ϕ_{1} = ϕ_{1}^{*}$ , and let ${ϕ_{2 b}, ϕ_{3 b}, θ_{b}^{* R}, b = 1, \dots, B}$ denote a Gibbs sample of $(ϕ_{2}, ϕ_{3}, θ^{* R})$ . Set the weight function to be $w_{i 1} (θ_{i}^{*})$ .

\begin{matrix} {\hat{CPO}}_{i, Long} = & \hat{{CPO}_{i}} f (y_{i} | ϕ_{1}^{*}, x_{i}) \\ \times \frac{1}{B} \sum_{b = 1}^{B} w_{i 1} (θ_{ib}^{*}) f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i}, θ_{ib}^{*} | ϕ_{1}^{*}, ϕ_{2 b}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i}) . \end{matrix}

Next, fix $ϕ_{1} = ϕ_{1}^{*}$ and $ϕ_{2} = ϕ_{2}^{*}$ , and let ${ϕ_{3 b}, θ_{b}^{* R}, b = 1, \dots, B}$ denote a Gibbs sample of $(ϕ_{3}, θ^{* R})$ . Set the weight function to be $w_{i 2} (θ_{i}^{*})$ . We can calculate $\sum_{b = 1}^{B} w_{i 2} (θ_{ib}^{*}) f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i}, θ_{ib}^{*} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i})$ . Then we get

\begin{matrix} {\hat{CPO}}_{i, Pg | Long} = & \frac{f (t_{Si}, δ_{i}, y_{i} | ϕ_{1}^{*}, ϕ_{2}^{*}, x_{i})}{f (y_{i} | ϕ_{1}^{*}, x_{i})} \\ \times \frac{\sum_{b = 1}^{B} w_{i 2} (θ_{ib}^{*}) f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i}, θ_{ib}^{*} | ϕ_{1}^{*}, ϕ_{2}^{*}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i})}{\sum_{b = 1}^{B} w_{i 1} (θ_{ib}^{*}) f^{- 1} (y_{i}, t_{Si}, δ_{i}, t_{i}, θ_{ib}^{*} | ϕ_{1}^{*}, ϕ_{2 b}, ϕ_{3 b}, x_{i}, z_{i} (t), ν_{i})} . \end{matrix}

Finally, ${\hat{CPO}}_{i, Surv | Pg, Long} = \frac{{\hat{CPO}}_{i}}{{\hat{CPO}}_{i, Long} {\hat{CPO}}_{i, Pg | Long}}$ .

Due to the computational difficulty of the optimal weight functions, another possible choice is $w_{i, cond} (θ_{i}^{*}) = f (θ_{i}^{*} | ϕ_{1}, x_{i}, y_{i})$ . The conditional distribution of $θ_{i}^{*} | ϕ_{1}, x_{i}, y_{i}$ is $θ_{i}^{*} | ϕ_{1}, x_{i}, y_{i} \sim N (μ^{*}, Σ^{*})$ , where $μ^{*} = Γ^{'} W_{i 1}^{'} (σ^{2} I_{m_{i}} + W_{i 1} Γ Γ^{'} W_{i 1}^{'})^{- 1} (y_{i} - W_{i} (θ^{'}, γ^{'})^{'})$ and $Σ^{*} = I_{p} - Γ^{'} W_{i 1}^{'} (σ^{2} I_{m_{i}} + W_{i 1} Γ Γ^{'} W_{i 1}^{'})^{- 1} W_{i 1} Γ$ .

5 Analysis of the head and neck cancer data

We apply our proposed method in Section 3 to analyse the head and neck cancer data introduced in Section 2. Out of the total 483 patients, we exclude two patients from the analysis because of missing BSDTARG0 measures. We are interested in identifying which of the three patient-reported measures: GH, HNPA and HNSW contributes most to the model fit of the survival data. Each of GH, HNPA and HNSW was measured on a continuous scale from 0 to 100. Following Ibrahim et al. (2001), we apply transformation to the original longitudinal measures: $\sqrt{100 - GH}$ , $\sqrt{100 - HNPA}$ and $\sqrt{100 - HNSW}$ , respectively, to achieve a better normal approximation. For each transformed longitudinal measure, we fit the joint model for the longitudinal measure, disease progression time and the overall survival, with a time-dependent covariate accounting for the model fit of the survival data. We include 11 covariates described in Section 2, namely ECOG, AGE, PEGFRA, FEMALE PCRTDC, TMSITED01, METLOC2, METLOC3, METLOC4, ETLOC5, BSDTARG, and treatment indicator R in the time to progression model (3.8) and the overall survival model (3.10). We include five covariates, namely ECOG, AGE, PEGFRA, FEMALE, and PCRTDC, and treatment indicator R in the cure rate model (3.6). In the time to progression model the survival model, and the cure rate model, all of the covariates including those binary endpoints, except for the treatment indicator R, are standardized to have mean 0 and standard division 1. We use a linear trajectory in the longitudinal model. No standardization is done for any of these covariates in the longitudinal model. In the overall survival model, we also include time-dependent indicator for treatment switch $V_{T_{SW}}$ , time-dependent indicator for progression status $V_{T_{P}}$ and an interaction term $R * V_{T_{P}}$ .

We used DIC and LPML under the survival model alone for overall survival and the cure rate model alone for time to progression to determine $K_{j}$ , the number of pieces for the piecewise constant hazard function for $j = 1, 2$ . As reported in the footnotes under Tables S.5 and S.6 in the supplementary materials, the ‘best choices’ were $K_{1} = 25$ for time to progression and $K_{2} = 7$ for overall survival according to both DIC and LPML. Under these choices of $K_{1}$ and $K_{2}$ , we fit the models of the survival data jointly with each of GH, HNPA and HNSW longitudinal data and calculated the decompositions of DIC and LPML. The values of DIC, $Δ {DIC}_{Pg}$ , $Δ {DIC}_{Surv, 1}$ $Δ {DIC}_{Surv, 2}$ , LPML, $Δ {LPML}_{Pg}$ , $Δ {LPML}_{Surv, 1}$ , and $Δ {LPML}_{Surv, 2}$ are shown in Figure 2 for GH and reported in Table 2 for all three longitudinal measures.

Figure 2:

Diagram of the decompositions of DIC and LPML for the disease progression and survival data jointly with GH

Larger values of $Δ {DIC}_{Surv}$ and $Δ {LPML}_{Surv}$ correspond to more gain in the fit of the overall survival data by adding time to disease progression and different longitudinal components. All values of $Δ {DIC}_{Surv, 1}$ and $Δ {LPML}_{Surv, 1}$ in Table 2 were positive, implying that all three longitudinal measures together with the time-dependent covariates helped in the fit of the overall survival data. All values of $Δ {DIC}_{Surv, 2}$ and $Δ {LPML}_{Surv, 2}$ were positive, indicating that all three longitudinal measures helped in the fit of the overall survival data in addition to the time-dependent covariates. Comparing the magnitudes of $Δ {DIC}_{Surv, 2}$ and $Δ {LPML}_{Surv, 2}$ in Table 2, in terms of gain in the fit of overall survival, GH is more associated with the overall survival time than HNSW and HNPA, while HNSW has a stronger association with the overall survival time than HNPA. Similarly, larger values of $Δ {DIC}_{Pg}$ and $Δ {LPML}_{Pg}$ correspond to more gain in the fit of disease progression data by adding different longitudinal components. The results in Table 2 show that GH is helpful in the fit of the time to progression data, while there were weak associations between HNPA or HNSW and the time to progression data. After comparing the values of overall DIC for the models jointly with different longitudinal measures in Table 2, we see that HNPA $<$ GH $<$ HNSW in terms of the corresponding DIC values, while based on ${DIC}_{Pg | Long}$ and ${DIC}_{Pg | Long}$ , GH has the smallest values. The reason for this is that there were different numbers of observations (1 589 for GH, 1 587 for HNPA and 1 525 for HNSW) for the three longitudinal measures. Thus, the values of the overall DIC were not comparable. The similar results of LPML and LPML decompositions were observed.

Table 2:

The Decompositions of DIC and LPML for three longitudinal measures

	GH	HNPA	HNSW		GH	HNPA	HNSW
DIC	9 348.07	9 226.44	9 430.50	LPML	$-$ 4 687.24	$-$ 4 631.66	$-$ 4 805.36
${DIC}_{Long}$	6 111.25	5 964.54	6 336.95	${LPML}_{Long}$	$-$ 3 060.59	$-$ 2 991.56	$-$ 3 173.07
${DIC}_{Pg \| Long}$	1 090.54	1 096.82	1 098.59	${LPML}_{Pg \| Long}$	$-$ 552.44	$-$ 556.68	$-$ 555.80
${DIC}_{Surv \| Pg, Long}$	2 146.27	2 165.09	2 151.79	${LPML}_{Surv \| Pg, Long}$	$-$ 1 074.21	$-$ 1 083.42	$-$ 1 076.50
$Δ {DIC}_{Surv, 1}$	78.53	59.71	73.00	$Δ {LPML}_{Surv, 1}$	38.93	29.72	36.64
$Δ {DIC}_{Surv, 2}$	34.75	15.93	29.23	$Δ {LPML}_{Surv, 2}$	17.17	7.96	14.88
$Δ {DIC}_{Pg}$	7.48	1.21	$-$ 0.56	$Δ {LPML}_{Pg}$	2.57	$-$ 1.66	-0.78

Source: ${DIC}_{Surv 0, 1} = 2 224.80$ , ${DIC}_{Surv 0, 2} = 2 181.02$ , ${DIC}_{Pg 0} = 1 098.03$ , ${LPML}_{Surv 0, 1} = - 1 113.14$ , ${LPML}_{Surv 0, 2} = - 1 091.38$ , ${LPML}_{Pg 0} = - 555.01$

Table 3 shows the posterior estimates and 95% HPD intervals of the hazard ratios of variables/parameters for the overall survival model (3.10) and the time to progression model (3.8), and the odds ratios of variables for the cure rate logistic regression model (3.6) and the posterior estimates and the 95% HPD intervals of parameters for the longitudinal model (3.1) under the joint model with GH. Posterior means, posterior standard deviations, and 95% HPD intervals of the parameters under the joint models with GH, HNPA and HNSW are reported in Tables S.2 to S.4 in the supplementary materials. We see from Table S.2 that both

β_{1}

and

β_{2}

with 95% HPD intervals (0.052, 0.246) and (0.195, 0.393), respectively, were significant, in the sense that their corresponding 95% HPD intervals do not contain zero. From Table 3, we also see that the 95% HPD intervals of the hazard ratios corresponding to

β_{1}

and

β_{2}

are (1.053, 1.279) and (1.215, 1.482), respectively, which do not contain 1. Thus, a covariate is significant if its 95% HPD interval of the hazard ratio does not contain 1. These results imply that an increase in the time-dependent longitudinal trajectory function is associated with a shorter disease progression time as well as a worse overall survival mortality, which is clinically appealing since a large value of the longitudinal trajectory function corresponds to a small GH value. Under the proposed joint model,

β_{1}

constitutes relationship between the longitudinal measure and the time to progression, while

β_{2}

induces correlation between the longitudinal measure and the overall survival time. In particular,

β_{1} = 0

β_{2} = 0

implies that the longitudinal measure does not influence the risk of disease progression or overall survival. From Tables S.3 and S.4, we see that only

β_{2}

was significant for both HNPA with respective 95% HPD intervals (

-

0.237,

-

0.087) and (

-

0.193,

-

0.095). These results were consistent with our decompositions of DIC and LPML as the values of

Δ {DIC}_{Surv, 2}

and

Δ {LPML}_{Surv, 2}

were 34.75 and 17.17 for GH, 15.93 and 7.96 for HNPA, and 29.23 and 14.88 for HNSW, while the values of

Δ {DIC}_{Pg}

and

Δ {LPML}_{Pg}

were 7.48 and 2.57 for GH, 1.21 and

-

1.66 for HNPA, and

-

0.56 and

-

0.78 for HNSW. Similarly, from Tables S.2 to S.4, we see that the 95%HPD for the coefficient of time-dependent covariate

V_{T_{P}}

was significant for all of GH, HNPA and HNSW, which is consistent with the results shown in Table 2, in which

Δ {DIC}_{Surv, 1} > Δ {DIC}_{Surv, 2}

and

Δ {LPML}_{Surv, 1} > Δ {LPML}_{Surv, 2}

under all of GH, HNPA and HNSW. Interestingly, the longitudinal measures could have a minor impact on the significance of covariates under the model for the overall survival. Specifically, METLOC5 was not significant under the joint model with GH, while METLOC5 was significant under the joint models with HNPA and HNSW. Also, the treatment effect (R) without treatment switch was significant under the joint models with GH and HNSW, while it was not significant under the joint model with HNPA. Finally, we note that (a) the treatment effect

R

was not significant under the cure rate model alone for time to progression (Table S.5); (b) the survival model alone for overall survival without time-dependent covariates (Table S.6); and (c) the treatment effect (R) without treatment switch was significant under the survival model alone for overall survival with time-dependent covariates (Table S.7).

Table 3:

Parameter estimates under the joint model with GH

Parameter	EST	95%HPD	Parameter	EST	95%HPD
Survival (HR)			Progression (HR)
ECOG	2.221	(1.669, 2.949)	ECOG	1.062	(0.780, 1.442)
AGE	0.937	(0.707, 1.207)	AGE	0.833	(0.577, 1.190)
PEGFRAD	1.174	(0.916, 1.500)	PEGFRAD	1.548	(1.170, 2.043)
Female	0.992	(0.707, 1.375)	Female	0.964	(0.642, 1.446)
PCRTDC	1.242	(0.978, 1.566)	PCRTDC	1.028	(0.750, 1.349)
TMSITED01	1.507	(1.150, 1.947)	TMSITED01	0.853	(0.624, 1.123)
METLOC2	1.072	(0.797, 1.408)	METLOC2	0.942	(0.686, 1.229)
METLOC3	0.755	(0.575, 0.949)	METLOC3	1.046	(0.800, 1.389)
METLOC4	1.699	(1.137, 2.499)	METLOC4	1.983	(1.225, 3.007)
METLOC5	1.397	(0.910, 2.055)	METLOC5	1.127	(0.696, 1.758)
BSDTARG	1.009	(1.006, 1.012)	BSDTARG	1.003	(1.000, 1.006)
R	0.705	(0.501, 0.945)	R	0.973	(0.723, 1.288)
$V_{T_{P}}$	2.490	(1.904, 3.283)	$β_{1}$	1.161	(1.053, 1.279)
$V_{T_{SW}}$	0.782	(0.515, 1.129)	Longitudinal
$R * V_{T_{SW}}$	1.410	(0.861, 2.118)	$θ_{1}$	4.863	(4.367, 5.397)
$β_{2}$	1.346	(1.215, 1.482)	$θ_{2}$	0.126	(0.080, 0.173)
Logistic (OR)			ECOG	1.285	(0.940, 1.626)
ECOG	0.893	(0.133, 7.473)	AGE	0.187	(-0.144, 0.539)
AGE	0.569	(0.104, 16.586)	PEGFRAD	0.263	(-0.021, 0.578)
PEGFRAD	0.679	(0.056, 4.709)	Female	0.113	(-0.326, 0.534)
Female	1.270	(0.114, 95.950)	PCRTDC	0.102	(-0.189, 0.415)
PCRTDC	2.715	(0.447, 24.536)	TMSITED01	0.204	(-0.131, 0.520)
R	1.796	(0.457, 9.381)	METLOC2	0.169	(-0.196, 0.537)
			METLOC3	-0.028	(-0.342, 0.267)
			METLOC4	-0.604	(-1.182, -0.093)
			METLOC5	0.339	(-0.197, 0.898)
			BSDTARG	0.002	(-0.002, 0.006)
			R	-0.029	(-0.331, 0.295)
			$Ω_{11}$	2.036	(1.644, 2.463)
			$Ω_{21}$	-0.021	(-0.097, 0.059)
			$Ω_{22}$	0.028	(0.000, 0.057)

In all the Bayesian computations, we used 5 000 MCMC samples, which were taken from every third iteration, after a burn-in of 1 500 iterations for each model to compute all posterior estimates, including posterior means, posterior SDs, 95% HPD intervals, DIC and LPML. The convergence of the MCMC sampling algorithm was checked using several diagnostic procedures discussed in (Chen et al., 2000). The trace and autocorrelation plots of MCMC chains for the model parameters are shown in Figures S.1 to S.18. We see from these figures that the autocorrelations disappear even after lag 2 for some parameters and after lag 20 for most of the parameters. These trace and autocorrelation plots show good convergence and mixing of these MCMC chains. The HPD intervals were computed via the Monte Carlo method developed by (Chen and Shao, 1999). Computer code was written for the FORTRAN 95 compiler, and we used IMSL subroutines with double precision accuracy. The FORTRAN code is available from the authors upon request. In addition, an R interface of our Fortran code is developed, which can be downloaded from the journal website (http://www.statmod.org/smij/archive.html).

6 Discussion

In this article, we proposed a class of mixed effects regression models for longitudinal measures, a cure rate model for the disease progression time and a time-varying covariates model for the overall survival. In Section 5, we carried out a Bayesian analysis of the head and neck cancer data using the computational algorithm discussed in Section 4.2. To examine the empirical performance of the posterior estimates under the proposed joint model, we conducted a simulation study. The design and results of the simulation study are presented in Section S.3 of the supplementary materials. As shown in Table S.1, the posterior estimates are close to the true values for almost all of the parameters except for $ϕ$ , and the CP's are close to 95% for most of the parameters. The simulation results empirically confirm convergence of the proposed Markov chain Monte Carlo sampling algorithm and identifiability of the model parameters except for $ϕ$ under the proposed joint model.

Within the semi-competing risk setting, we developed the decompositions of DIC and LPML to determine which PRO is most associated with the survival outcome. As discussed in Section 4.3, the DIC and DIC decompositions require a numerical approximation of an integral over random effects $θ^{*}$ , while the LPML and LPML decompositions can be computed using the Monte Carlo methods. Thus, the LPML and LPML decompositions are more advantageous when the dimension of random effects becomes large. The proposed joint models and the corresponding decompositions of Bayesian model selection criteria can be extended to discrete longitudinal biomarkers such as Eastern Cooperative Oncology Group (ECOG) performance status and other non-normal longitudinal biomarkers. Other extensions include the joint models of the multiple longitudinal measures and the survival data with semi-competing risks. These extensions are currently under investigation.

Footnotes

Acknowledgments

This work was done when Dr Zhang was at University of Connecticut and when Dr Cong was at Boehringer Ingelheim (China) Investment Co. Ltd, Shanghai, China. We would like to thank the guest editors and two anonymous reviewers for their very helpful and constructive comments, which have led to a much improved version of the article.

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

Funding

Dr M.-H. Chen's research was partially supported by NIH grants #GM 70335 and #CA 74015.

Supplementary materials

The supplementary materials, which include Section S.1 The Full Conditional Posterior Distributions, Section S.2 Results and Formulas, Section S.3 A Simulation Study, Tables S.1 to S.7 and Figures S.1 to S.18, can be found through the link: http://www.statmod.org/smij/archive.html.

References

Berkson

Gage

(1952) Survival curve for cancer patients following treatment. Journal of the American Statistical Association , 47, 501–515.

Brown

(2009) Assessing the association between trends in a biomarker and risk of event with an application in pediatric HIV/AIDS. The Annals of Applied Statistics , 3, 1163–1182.

Brown

Ibrahim

(2003) Bayesian approaches to joint cure-rate and longitudinal models with applications to cancer vaccine trials. Biometrics , 59, 686–693.

Brown

Ibrahim

DeGruttola

(2005) A flexible B-spline model for multiple longitudinal biomarkers and survival. Biometrics , 61, 64–73.

Chen

M-H

Ibrahim

Sinha

(1999) A new Bayesian model for survival data with a surviving fraction. Journal of the American Statistical Association , 94, 909–919.

Chen

M-H

Ibrahim

Sinha

(2004) A new joint model for longitudinal and survival data with a cure fraction. Journal of Multivariate Analysis , 91, 18–34.

Chen

M-H

Shao

Q-M

(1999) Monte Carlo estimation of Bayesian credible and HPD intervals. Journal of Computational and Graphical Statistics , 8, 69–92.

Chen

M-H

Shao

Q-M

Ibrahim

(2000) Monte Carlo Methods in Bayesian Computation . New York, NY: Springer-Verlag.

Chen

Zhang

Chen

Cong

(2020) Estimation of treatment effects and model diagnostics with two-way time-varying treatment switching: An application to a head and neck study. Lifetime Data Analysis . doi: 10.1007/s10985-020-09495-0.

10.

Chi

Y-Y

Ibrahim

(2006) Joint models for multivariate longitudinal and multivariate survival data. Biometrics , 62, 432–445.

11.

Chi

Y-Y

Ibrahim

(2007) Bayesian approaches to joint longitudinal and survival models accommodating both zero and nonzero cure fractions. Statistica Sinica , 17, 445–462.

12.

De Gruttola

(1994) Modelling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics , 50, 1003–1014.

13.

Elasho

(2008) A joint model for longitudinal measurements and survival data in the presence of multiple failure types. Biometrics , 64, 762–771.

14.

Geisser

Eddy

(1979) A predictive approach to model selection. Journal of the American Statistical Association , 74, 153–160.

15.

Hogan

Laird

(1997) Mixture models for the joint distribution of repeated measures and event times. Statistics in Medicine , 16, 239–257.

16.

Huang

Elasho

Pan

(2011) A general joint model for longitudinal measurements and competing risks survival data with heterogeneous random effects. Lifetime Data Analysis , 17, 80–100.

17.

Ibrahim

Chen

Lipsitz

(2001) Trustmissing responses in generalised linear mixed models when the missing data mechanism is nonignorable. Biometrika , 88, 551–564.

18.

Ibrahim

Chen

M-H

Sinha

(2004) Bayesian methods for joint modeling of longitudinal and survival data with applications to cancer vaccine trials. Statistica Sinica , 14, 863–883.

19.

Ibrahim

Chu

Chen

(2010) Basic concepts and methods for joint models of longitudinal and survival data. Journal of Clinical Oncology , 28, 2796–2801.

20.

Jacqmin-Gadda

Proust-Lima

Taylor

Commenges

(2010) Score test for conditional independence between longitudinal outcome and time to event given the classes in the joint latent class model. Biometrics , 66, 11–19.

21.

Lavalley

Degruttola

(1996) Models for empirical Bayes estimators of longitudinal CD4 counts. Statistics in Medicine , 15, 2289–2305.

22.

Law

Taylor

Sandler

(2002) The joint modeling of a longitudinal disease progression marker and the failure time process in the presence of cure. Biostatistics , 3, 547–563.

23.

Lewandowski

Kurowicka

Joe

(2009) Generating random correlation matrices based on vines and extended onion method. Journal of Multivariate Analysis , 100, 1989–2001.

24.

Elasho

Tseng

C-H

(2012) Joint analysis of bivariate longitudinal ordinal outcomes and competing risks survival times with nonparametric distributions for random effects. Statistics in Medicine , 31, 1707–1721.

25.

(2017). Accommodating informative dropout and death: A joint modelling approach for longitudinal and semicompeting risks data. Journal of the Royal Statistical Society: Series C (Applied Statistics) , 67, 145–163.

26.

Machiels

J-PH

Haddad

Fayette

Licitra

Tahara

Vermorken

Clement

Gauler

Cupissol

Grau

Guigay

Caponigro

de Castro

Jr de Souza Viana

Keilholz

Del Campo

Cong

Ehrnrooth

Cohen

EEW

LUX-H&N 1 investigators (2015) Afatinib versus methotrexate as second-line treatment in patients with recurrent or metastatic squamous-cell carcinoma of the head and neck progressing on or after platinum-based therapy (lux-head & neck 1): An open-label, randomised phase 3 trial. The Lancet Oncology , 16, 583–594.

27.

Pawitan

Self

(1993) Modeling disease marker processes in AIDS. Journal of the American Statistical Association , 88, 719–26.

28.

Proust-Lima

Sene

Taylor

Jacqmin-Gadda

(2014) Joint latent class models for longitudinal and time-to-event data: A review. Statistical Methods in Medical Research , 23, 74–90.

29.

Proust-Lima

Dartigues

J-F

Jacqmin-Gadda

(2016) Joint modeling of repeated multivariate cognitive measures and competing risks of dementia and death: A latent process and latent class approach. Statistics in Medicine , 35, 382–398.

30.

Rizopoulos

(2012) Joint Models for Longitudinal and Time-to-Event Data: With Applications in R . Boca Raton, FL: CRC Press.

31.

Rizopoulos

Ghosh

(2011) A Bayesian semiparametric multivariate joint model for multiple longitudinal outcomes and a time-to-event. Statistics in Medicine , 30, 1366–1380.

32.

Schluchter

(1992) Methods for the analysis of informatively censored longitudinal data. Statistics in Medicine , 11, 1861–1870.

33.

Spiegelhalter

Best

Carlin

Van Der Linde

(2002) Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology) , 64, 583–639.

34.

Tudur-Smith

Sudell

Kolamunnage-Dona

(2016) Joint models for longitudinal and time-to-event data: A review of reporting quality with a view to meta-analysis. BMC Medical Research Methodology , 16, 168.

35.

U.S. Department of Health and Human Services (2009) Guidance for industry. Patient-reported outcome measures: Use in medical product development to support labeling claims . URL: http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM193282.pdf

36.

Williamson

Kolamunnage-Dona

Philipson

Marson

(2008) Joint modelling of longitudinal and competing risks data. Statistics in Medicine , 27, 6426–6438.

37.

Zeger

(2001a) The evaluation of multiple surrogate endpoints. Biometrics , 57, 81–87.

38.

Zeger

(2001b) Joint analysis of longitudinal data comprising repeated measures and times to events. Journal of the Royal Statistical Society: Series C (Applied Statistics) , 50, 375–387.

39.

Lin

Taylor

(2008) Semiparametric modeling of longitudinal measurements and time-to-event data: A two-stage regression calibration approach. Biometrics , 64, 1238–1246.

40.

Zhang

Chen

M-H

Ibrahim

Boye

Shen

(2017) Bayesian model assessment in joint modeling of longitudinal and survival data with applications to cancer clinical trials. Journal of Computational and Graphical Statistics , 26, 121–133.

Assessing importance of biomarkers: A Bayesian joint modelling approach of longitudinal and survival data with semi-competing risks

Abstract

Keywords

1 Introduction

2 Motivating longitudinal and survival data from a head and neck cancer clinical trial

Figure 1:

Graphical representation of the disease progression and survival data from the head and neck cancer clinical trial

3.1 Longitudinal component of the joint model

4.1 Prior and posterior distributions

Figure 2:

Diagram of the decompositions of DIC and LPML for the disease progression and survival data jointly with GH

Footnotes

Acknowledgments

Declaration of conflicting interests

Funding

Supplementary materials

References