
Abstract

We develop a new age-period-cohort model for cancer surveillance research; the theory and methods are broadly applicable. In the new model, cohort deviations are weighted to account for the variable number of periods that each cohort is observed. Weighting ensures that the fitted rates can be naturally expressed as a function of age × a function of period × a function of cohort. Furthermore, the age, period, and cohort deviations are split into orthogonal quadratic components plus higher-order terms. These decompositions enable powerful combination significance tests of first- and second-order age, period, and cohort effects. The regression parameters of the orthogonal quadratic polynomials (global curvatures) quantify how fast on average the trends in the rates are changing. Importantly, the global curvature for cohort determines the least squares slope of the expected annual percentage changes by age group versus age (local drifts), thereby providing a powerful one-degree-of-freedom test of age-period interactions. We introduce new estimable functions, including age gradients that quantify the rate of change of the longitudinal and cross-sectional age curves at each attained age, and gradient shifts that quantify how the cross-sectional age trend varies by period. We illustrate the new model using nationally representative multiple myeloma incidence. Comprehensive proofs are given in technical appendices. We provide an R package.

Keywords

Models, statistical; trends; age factors; cohort effects; SEER Program

1 Introduction

Cancer surveillance research is an observational science of cancer-related rates and risks ascertained in population-based cohorts, notably cancer registries.¹ In this field, a standard age-period-cohort (APC) model is now well accepted.² APC models are used to track cancer burden, reveal disparity, provide etiological clues, gauge real-world effectiveness of screening and therapy, quantify natural history and its evolution, and forecast cancer incidence and burden.³

Our experience analyzing cancer registry data motivated us to develop a next-generation APC model that retains all the good features of the standard model, yet provides new and improved estimable parameters and functions; a natural decomposition of fitted rates into age, period, and cohort effects; and more powerful statistical tests.

In the standard categorical APC model, the age, period, and cohort “deviations” are ensembles of parameters that together measure departures from the overall log-linear temporal trends.⁴ Our new model incorporates two key innovations. First, cohort deviations are weighted to account for the variable number of periods that each cohort is observed. Because of this modification, the fitted rates can be decomposed into a function of age × a function of period × a function of cohort, without invoking any additional assumptions or constraints. Second, in the new model, the age, period, and cohort deviations are partitioned into orthogonal quadratic components plus higher-order terms. As we will show, the three new “global curvature” parameters corresponding to the quadratic terms for age, period, and cohort, respectively, are intimately connected with fundamental rate patterns and signals that may not be clearly seen when the data are analyzed with the standard model.

We illustrate the new model using cancer incidence data for multiple myeloma (MM) among Black men ascertained through the National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) program (https://seer.cancer.gov/). SEER is the authoritative source for cancer statistics in the United States (https://seer.cancer.gov/about/). Comprehensive derivations and proofs are presented in technical appendices. We provide an R package.

2 Event rates on a Lexis diagram

APC models parameterize age-specific event rates over time. Formally, the event rates are assumed to be a realization of a Poisson point process over a Lexis diagram, which is a rectangular field with attained age along one axis and calendar period along the other.⁵ Our convention is that attained age runs along the y-axis and calendar period along the x-axis, to be consistent with outputs from the popular SEERSTAT programs (https://seer.cancer.gov/seerstat/). Individual events and corresponding person-years at risk are summed within a grid of cells defined by equal-width age and period intervals. For any given dataset, we denote the corresponding Lexis diagram as $L (a_{0}, p_{0}, Δ, A, P),$ where $a_{0}$ is the lower limit of the youngest included age group, $p_{0}$ is the lower limit of the earliest included calendar period, Δ is the common age and period binning interval, A is the total number of age groups, and P is the total number of calendar periods. A Lexis diagram $L (47, 1992, 2, 19, 11)$ for the myeloma data is shown in Figure 1(a).

Figure 1.

Lexis diagram for the myeloma data, displayed as a heat map of observed rates binned into two-year age groups and two-year calendar periods (a). Canonical plot of age-specific rates by birth cohort (b); for each age group, the first point in the series corresponds to the earliest period of observation (1992–1993) and the last point to the latest period (2012–2013).

For purposes of modeling, cells along diagonals with slope $- 1$ are considered to represent the experience over time of individuals in the same birth-year cohorts. There is always a total of $C = A + P - 1$ birth cohorts, from the oldest in the lower left cell through the youngest in the upper right cell.

Each cell can be referenced by values or by indices. Values equal the midpoints of the cell's defining age and period intervals, while indices equal the corresponding sequence number within the Lexis diagram's A consecutive age groups and P consecutive calendar periods.

Thus, values, denoted with subscripted *'s, and indices are related as follows

\begin{matrix} a_{*} (a) = (a_{0} - \frac{Δ}{2}) + Δ a, & a = 1, \dots, A \\ p_{*} (p) = (p_{0} - \frac{Δ}{2}) + Δ p, & p = 1, \dots, P \\ c_{*} (c) = (c_{0} - A Δ) + Δ c, & c = 1, \dots, A + P - 1 \equiv C \end{matrix}

where $c_{0} \equiv p_{0} - a_{0}$. From these definitions, it follows that values $c_{*} (p_{*} (p), a_{*} (a)) = p_{*} (p) - a_{*} (a)$ and indices $c = p - a + A$.

Different cohorts are followed at variable ages and in different periods. The number of cells per cohort is

n_{c_{*} (c)} = \begin{cases} c & \text{for } 1 \leq c \leq \min (A, P) - 1 \\ \min (A, P) & \text{for } \min (A, P) \leq c \leq \max (A, P) \\ A + P - c & \text{for } \max (A, P) + 1 \leq c \leq A + P - 1 \end{cases}

and cohort $c_{*} (c)$ is followed from index age $a_{0}^{c} = \max (1, A - c + 1)$ through index age $a_{1}^{c} = \min (A - c + P, A)$. When fitting the new model, we use canonical reference values corresponding to the centroid of the Lexis diagram

\begin{matrix} {\bar{a}}_{*} = (a_{0} - \frac{Δ}{2}) + Δ \frac{(A + 1)}{2} \\ {\bar{p}}_{*} = (p_{0} - \frac{Δ}{2}) + Δ \frac{(P + 1)}{2} \\ {\bar{c}}_{*} = (c_{0} - A Δ) + Δ \frac{(A + P)}{2} \\ \Rightarrow {\bar{c}}_{*} = {\bar{p}}_{*} - {\bar{a}}_{*} \end{matrix}
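The index-to-value bookkeeping above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' R package; the class name `Lexis` and its method names are hypothetical and were chosen only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Lexis:
    """Lexis diagram L(a0, p0, delta, A, P); all indices are 1-based."""
    a0: float     # lower limit of the youngest age group
    p0: float     # lower limit of the earliest calendar period
    delta: float  # common width of the age and period intervals
    A: int        # number of age groups
    P: int        # number of calendar periods

    @property
    def C(self) -> int:
        # Number of birth cohorts (diagonals of the diagram).
        return self.A + self.P - 1

    def age_value(self, a: float) -> float:
        return (self.a0 - self.delta / 2) + self.delta * a

    def period_value(self, p: float) -> float:
        return (self.p0 - self.delta / 2) + self.delta * p

    def cohort_value(self, c: float) -> float:
        c0 = self.p0 - self.a0
        return (c0 - self.A * self.delta) + self.delta * c

    def cohort_index(self, p: int, a: int) -> int:
        return p - a + self.A

    def cells_per_cohort(self, c: int) -> int:
        lo, hi = min(self.A, self.P), max(self.A, self.P)
        if c < lo:
            return c
        if c <= hi:
            return lo
        return self.A + self.P - c

    def age_index_range(self, c: int) -> tuple:
        # First and last age index at which cohort c is observed.
        return max(1, self.A - c + 1), min(self.A - c + self.P, self.A)

    def centroid(self) -> tuple:
        a_bar = self.age_value((self.A + 1) / 2)
        p_bar = self.period_value((self.P + 1) / 2)
        return a_bar, p_bar, p_bar - a_bar


# The myeloma diagram L(47, 1992, 2, 19, 11) from the text.
myeloma = Lexis(a0=47, p0=1992, delta=2, A=19, P=11)
print(myeloma.centroid())  # (66.0, 2003.0, 1937.0)
```

For the myeloma diagram the centroid is age 66, period 2003, and cohort 1937.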

The rate in each cell per $10^{5}$ person-years is $λ_{p_{*} a_{*}} = 10^{5} \frac{Y_{p_{*} a_{*}}}{O_{p_{*} a_{*}}}$, where $Y_{p_{*} a_{*}}$ is the observed number of events and $O_{p_{*} a_{*}}$ is the corresponding number of person-years. After the model is fitted we can use arbitrary reference values in estimable functions (Table 1).

Table 1.

Estimable functions of age-period-cohort model parameters.

Estimable function	Equation^a
Fitted cohort pattern	$FCP (c_{*} | a_{*} = a_{*} (a^{ref})) = \exp ((π_{L} + γ_{L}) (c_{*} - {\bar{c}}_{*}) + θ_{γ} q_{c_{*}}^{2} (c_{*}) + {\tilde{\tilde{γ}}}_{c_{*}} + μ + (α_{L} + π_{L}) (a_{*} (a^{ref}) - {\bar{a}}_{*}) + {\tilde{α}}_{a_{*} (a^{ref})})$
Cohort rate ratio curve	$CRR_{c_{*} : c_{*} (c^{ref})} = \frac{FCP (c_{*} | a_{*} = a_{*} (a^{ref}))}{FCP (c_{*} (c^{ref}) | a_{*} = a_{*} (a^{ref}))} = \exp ((π_{L} + γ_{L}) (c_{*} - c_{*} (c^{ref})) + θ_{γ} \{q_{c_{*}}^{2} (c_{*}) - q_{c_{*}}^{2} (c_{*} (c^{ref}))\} + \{{\tilde{\tilde{γ}}}_{c_{*}} - {\tilde{\tilde{γ}}}_{c_{*} (c^{ref})}\})$
Fitted temporal trends	$FTT (p_{*} | a_{*} = a_{*} (a^{ref})) = \exp ((π_{L} + γ_{L}) (p_{*} - {\bar{p}}_{*}) + θ_{π} q_{p_{*}}^{2} (p_{*}) + {\tilde{\tilde{π}}}_{p_{*}} + μ + (α_{L} - γ_{L}) (a_{*} (a^{ref}) - {\bar{a}}_{*}) + {\tilde{α}}_{a_{*} (a^{ref})})$
Period rate ratio curve	$PRR_{p_{*} : p_{*} (p^{ref})} = \frac{FTT (p_{*} | a_{*} = a_{*} (a^{ref}))}{FTT (p_{*} (p^{ref}) | a_{*} = a_{*} (a^{ref}))} = \exp ((π_{L} + γ_{L}) (p_{*} - p_{*} (p^{ref})) + θ_{π} \{q_{p_{*}}^{2} (p_{*}) - q_{p_{*}}^{2} (p_{*} (p^{ref}))\} + \{{\tilde{\tilde{π}}}_{p_{*}} - {\tilde{\tilde{π}}}_{p_{*} (p^{ref})}\})$
Longitudinal age curve	$LongAge (a_{*} | c_{*} = c_{*} (c^{ref})) = \exp ((α_{L} + π_{L}) (a_{*} - {\bar{a}}_{*}) + θ_{α} q_{a_{*}}^{2} (a_{*}) + {\tilde{\tilde{α}}}_{a_{*}} + μ + (π_{L} + γ_{L}) (c_{*} (c^{ref}) - {\bar{c}}_{*}) + {\tilde{γ}}_{c_{*} (c^{ref})})$
Longitudinal age rate ratio curve	$LARR_{a_{*} : a_{*} (a^{ref})} = \frac{LongAge (a_{*} | c_{*} = c_{*} (c^{ref}))}{LongAge (a_{*} (a^{ref}) | c_{*} = c_{*} (c^{ref}))} = \exp ((α_{L} + π_{L}) (a_{*} - a_{*} (a^{ref})) + θ_{α} \{q_{a_{*}}^{2} (a_{*}) - q_{a_{*}}^{2} (a_{*} (a^{ref}))\} + \{{\tilde{\tilde{α}}}_{a_{*}} - {\tilde{\tilde{α}}}_{a_{*} (a^{ref})}\})$
Cross-sectional age curve	$CrossAge (a_{*} | p_{*} = p_{*} (p^{ref})) = \exp ((α_{L} - γ_{L}) (a_{*} - {\bar{a}}_{*}) + θ_{α} q_{a_{*}}^{2} (a_{*}) + {\tilde{\tilde{α}}}_{a_{*}} + μ + (π_{L} + γ_{L}) (p_{*} (p^{ref}) - {\bar{p}}_{*}) + {\tilde{π}}_{p_{*} (p^{ref})})$
Cross-sectional age rate ratio curve	$CARR_{a_{*} : a_{*} (a^{ref})} = \frac{CrossAge (a_{*} | p_{*} = p_{*} (p^{ref}))}{CrossAge (a_{*} (a^{ref}) | p_{*} = p_{*} (p^{ref}))} = \exp ((α_{L} - γ_{L}) (a_{*} - a_{*} (a^{ref})) + θ_{α} \{q_{a_{*}}^{2} (a_{*}) - q_{a_{*}}^{2} (a_{*} (a^{ref}))\} + \{{\tilde{\tilde{α}}}_{a_{*}} - {\tilde{\tilde{α}}}_{a_{*} (a^{ref})}\})$

Estimable functions can be conditioned on any reference values of age, period, or cohort that occur in the Lexis diagram. To define all estimable functions with respect to the same reference cell, set the cohort index to $c^{ref} = p^{ref} - a^{ref} + A$ .

3 The standard APC model

For each cell in the Lexis diagram, the expected log rate per 1 person-year is⁶

ρ_{c_{*} a_{*}} = \ln (λ_{c_{*}, a_{*}}) = \ln E (\frac{Y_{p_{*} - a_{*}, a_{*}}}{O_{p_{*} - a_{*}, a_{*}}}) = μ + (α_{L} + π_{L}) (a_{*} - {\bar{a}}_{*}) + (π_{L} + γ_{L}) (c_{*} - {\bar{c}}_{*}) + {\tilde{α}}_{a_{*}} + {\tilde{π}}_{c_{*} + a_{*}} + {\tilde{γ}}_{c_{*}}

This is a longitudinal representation when we hold $c_{*}$ fixed and allow $a_{*}$ to vary over the $n_{c_{*} (c)}$ observed age values from $a_{*} (a_{0}^{c})$ through $a_{*} (a_{1}^{c})$ . The equation

ρ_{p_{*} a_{*}} = \ln (λ_{p_{*}, a_{*}}) = \ln E (\frac{Y_{p_{*}, a_{*}}}{O_{p_{*}, a_{*}}}) = μ + (α_{L} - γ_{L}) (a_{*} - {\bar{a}}_{*}) + (π_{L} + γ_{L}) (p_{*} - {\bar{p}}_{*}) + {\tilde{α}}_{a_{*}} + {\tilde{π}}_{p_{*}} + {\tilde{γ}}_{p_{*} - a_{*}}

provides a cross-sectional representation when we define $(α_{L} - γ_{L}) \equiv (α_{L} + π_{L}) - (π_{L} + γ_{L})$, hold $a_{*}$ fixed, and allow $p_{*}$ to vary over the P observed periods from $p_{*} (1)$ through $p_{*} (P)$. The fitted values for the two equations are identical. The parameters ${\tilde{α}}_{a_{*}}$, ${\tilde{π}}_{p_{*}}$, and ${\tilde{γ}}_{c_{*}}$ are the deviations.

To fit the standard model, we impose six identifiability constraints, two for age, two for period, and two for cohort, in order to make the deviations orthogonal to the intercept and the corresponding log-linear trends. Importantly, in our set-up we impose the constraints over all cells in the Lexis diagram, and not simply over the sequence of A age deviations, P period deviations, and C cohort deviations, which implies that the constraints on the C unique cohort deviations ${\tilde{γ}}_{c_{*} (1)}, \dots, {\tilde{γ}}_{c_{*} (C)}$ are weighted by $n_{c_{*} (c)}$ .

As described subsequently, this approach ensures that the fitted rates can be consistently decomposed into a function of age × a function of period × a function of cohort. We call the quantity $100 \times (e^{(π_{L} + γ_{L})} - 1)$ the net drift⁷ (units are percentage change per calendar year), which is the model-based analogue of the estimated annual percentage change⁸ (commonly referred to as the EAPC). When there is no possibility of confusion we also refer to the log-linear parameter $(π_{L} + γ_{L})$ as the net drift. We refer to the quantity $100 \times (e^{(α_{L} + π_{L})} - 1)$ as the longitudinal age trend or LAT, and to $100 \times (e^{(α_{L} - γ_{L})} - 1)$ as the cross-sectional age trend or CAT⁹ (units of LAT and CAT are percentage change per year of age). Considering the relationship between parameters in the standard versus new model, described subsequently, we now refer to ${\tilde{α}}_{a_{*}}$ , ${\tilde{π}}_{p_{*}}$ , and ${\tilde{γ}}_{c_{*}}$ as the “complete” age, period, and cohort deviations, respectively.
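As a small worked illustration of these definitions, the sketch below converts between log-linear slopes and percentage changes, and derives a CAT from an LAT and a net drift through the identity $(α_{L} - γ_{L}) = (α_{L} + π_{L}) - (π_{L} + γ_{L})$. The function names are hypothetical, and the input values (8% and 1%) are round illustrative numbers rather than estimates from any fitted model.

```python
import math


def to_percent_change(log_slope: float) -> float:
    """Percentage change per unit time implied by a log-linear slope."""
    return 100.0 * math.expm1(log_slope)


def to_log_slope(percent_change: float) -> float:
    """Inverse of to_percent_change."""
    return math.log1p(percent_change / 100.0)


def cat_from_lat_and_drift(lat_percent: float, drift_percent: float) -> float:
    """CAT (%) from LAT (%) and net drift (%).

    The subtraction is done on the log scale:
    (alpha_L - gamma_L) = (alpha_L + pi_L) - (pi_L + gamma_L).
    """
    log_cat = to_log_slope(lat_percent) - to_log_slope(drift_percent)
    return to_percent_change(log_cat)


# Illustrative inputs only: LAT of 8% per year of age, net drift of 1% per year.
example_cat = cat_from_lat_and_drift(8.0, 1.0)
print(round(example_cat, 3))
```

Because the trends combine additively on the log scale, the resulting CAT is slightly less than the naive difference of the two percentages.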

4 Example: MM in Black men

MM is the most common hematological malignancy among African-American (Black) men, with an estimated lifetime risk of 1.25%.¹⁰ We extracted MM incidence from the SEER 13 Registries Database (2015 release)¹¹ for Black men aged 47–84 during calendar years 1992–2013. We tabulated events and man-years within 19 two-year age groups (47–48 through 83–84) and 11 two-year calendar periods (1992–1993 through 2012–2013), which cover 29 nominal birth cohorts centered on birth years 1909 through 1965. In all, there were 3747 incident cases in 10,602,515 man-years.

In a “canonical” plot of age-specific incidence rates by birth cohort¹² (Figure 1(b)), there is a suggestion that incidence is rising in older age groups but has remained comparatively stable in younger age groups. However, the observed rates are “noisy”, and simple inspection does not, to us, support any firm conclusions. Outputs from the standard APC model⁶ (Figure 2) are also quite variable. The net drift of 0.89% per year (95% Confidence Interval [CI]: 0.36–1.42%/year) is statistically significant (p = 0.001), hence, on average, the rates in Figure 1(b) and the Period and Cohort Rate Ratios in Figure 2(e) and (f) are increasing significantly over birth cohorts and calendar periods, respectively. The LAT of 8.0% per year (95% CI: 7.3–8.7%/year) is highly significant (p ≈ 0). Also, there appears to be some “deceleration” in the cohort deviations circa 1943 (Figure 2(c)) such that incidence increases in successive cohorts through circa 1943 and then decreases (Figure 2(e)), a pattern which is consistent with Figure 1(b). Formally, however, the cohort deviations are not statistically significant (p = 0.499). Overall, our impression is that results from the standard model are too noisy to support firm conclusions about temporal effects; it would be appealing to incorporate additional smoothness in the model parameters.

Figure 2.

Estimable functions from the standard APC model fitted to the myeloma data. The reference values are 66 years for age, 2003 for period, and 1937 for birth cohort.

5 The new APC model

The basic idea is to split the complete deviations ${\tilde{α}}_{a_{*}}$ , ${\tilde{π}}_{p_{*}}$ , and ${\tilde{γ}}_{c_{*}}$ into an orthogonal quadratic component plus higher-order terms,

\begin{matrix} {\tilde{α}}_{a_{*}} = θ_{α} q_{a_{*}}^{2} (a_{*}) + {\tilde{\tilde{α}}}_{a_{*}} \\ {\tilde{π}}_{p_{*}} = θ_{π} q_{p_{*}}^{2} (p_{*}) + {\tilde{\tilde{π}}}_{p_{*}} \\ {\tilde{γ}}_{c_{*}} = θ_{γ} q_{c_{*}}^{2} (c_{*}) + {\tilde{\tilde{γ}}}_{c_{*}} \end{matrix}

so that

ρ_{c_{*} a_{*}} = μ + (α_{L} + π_{L}) (a_{*} - {\bar{a}}_{*}) + (π_{L} + γ_{L}) (c_{*} - {\bar{c}}_{*}) + θ_{α} q_{a_{*}}^{2} (a_{*}) + θ_{π} q_{p_{*}}^{2} (c_{*} + a_{*}) + θ_{γ} q_{c_{*}}^{2} (c_{*}) + {\tilde{\tilde{α}}}_{a_{*}} + {\tilde{\tilde{π}}}_{c_{*} + a_{*}} + {\tilde{\tilde{γ}}}_{c_{*}} .

The functions $q_{a_{*}}^{2} (a_{*})$ , $q_{p_{*}}^{2} (p_{*})$ , and $q_{c_{*}}^{2} (c_{*})$ are quadratic polynomials that are orthogonal to the intercept and corresponding log-linear trend, and the higher-order terms ${\tilde{\tilde{α}}}_{a_{*}}, {\tilde{\tilde{π}}}_{p_{*}}$ and ${\tilde{\tilde{γ}}}_{c_{*}}$ are each orthogonal to the intercept and the corresponding log-linear and quadratic terms. In our model, orthogonality is defined¹³ with respect to the usual inner product $〈 x | y 〉 = \sum x_{i} y_{i} .$ The new model has nine constraints in all (Appendix A.1). We will call the parameters $θ_{α},$ $θ_{π}$ , and $θ_{γ}$ the “global curvature” for age, period, and cohort, respectively.

In Appendix A.2 we show that the orthogonal quadratic polynomial for age equals

q_{a_{*}}^{2} (a_{*}) = a_{*}^{2} - [Δ A + 2 a_{0}] a_{*} + [(a_{0} + Δ \frac{A}{2})^{2} - \frac{Δ^{2}}{12} (A^{2} - 1)],

for period equals

q_{p_{*}}^{2} (p_{*}) = p_{*}^{2} - [Δ P + 2 p_{0}] p_{*} + [(p_{0} + Δ \frac{P}{2})^{2} - \frac{Δ^{2}}{12} (P^{2} - 1)],

and for cohort equals

q_{c_{*}}^{2} (c_{*}) = c_{*}^{2} - [Δ (A + P) + 2 (c_{0} - Δ A)] c_{*} + [(c_{0} - Δ A)^{2} + (A + P) Δ {\bar{c}}_{*} - \frac{Δ^{2}}{6} \{2 A^{2} + 3 AP + 2 P^{2} - 1\}] .

Each function is completely determined by characteristics of the corresponding Lexis diagram $L (a_{0}, p_{0}, Δ, A, P)$ . A design matrix for the new model is presented in Appendix A.3, and fitting the model is described in Appendix A.4.
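The orthogonality claims can be checked numerically. The sketch below evaluates the three polynomials exactly as written above for the myeloma diagram $L (47, 1992, 2, 19, 11)$ and sums them over all $A \times P$ cells using exact rational arithmetic. The helper names are hypothetical; this is an illustrative check under the stated formulas, not a reproduction of the derivation in Appendix A.2.

```python
from fractions import Fraction as F

# Myeloma Lexis diagram; integer inputs keep the arithmetic exact.
A0, P0, DELTA, A, P = 47, 1992, 2, 19, 11
C0 = P0 - A0


def q_age(x):
    const = (A0 + F(DELTA * A, 2)) ** 2 - F(DELTA ** 2, 12) * (A ** 2 - 1)
    return x ** 2 - (DELTA * A + 2 * A0) * x + const


def q_period(x):
    const = (P0 + F(DELTA * P, 2)) ** 2 - F(DELTA ** 2, 12) * (P ** 2 - 1)
    return x ** 2 - (DELTA * P + 2 * P0) * x + const


def q_cohort(x):
    k = C0 - DELTA * A
    c_bar = k + F(DELTA * (A + P), 2)
    const = (k ** 2 + (A + P) * DELTA * c_bar
             - F(DELTA ** 2, 6) * (2 * A ** 2 + 3 * A * P + 2 * P ** 2 - 1))
    return x ** 2 - (DELTA * (A + P) + 2 * k) * x + const


# Midpoint values (a_*, p_*) of every cell in the diagram.
cells = [(A0 - F(DELTA, 2) + DELTA * a, P0 - F(DELTA, 2) + DELTA * p)
         for a in range(1, A + 1) for p in range(1, P + 1)]
A_BAR = A0 + F(DELTA * A, 2)
P_BAR = P0 + F(DELTA * P, 2)
C_BAR = P_BAR - A_BAR


def inner_products(q, coord, center):
    """Return (<q | 1>, <q | centered linear trend>) summed over all cells."""
    pairs = [(q(coord(cell)), coord(cell) - center) for cell in cells]
    return sum(v for v, _ in pairs), sum(v * t for v, t in pairs)


age_check = inner_products(q_age, lambda cell: cell[0], A_BAR)
period_check = inner_products(q_period, lambda cell: cell[1], P_BAR)
cohort_check = inner_products(q_cohort, lambda cell: cell[1] - cell[0], C_BAR)

# Summing the cohort polynomial once per cohort (unweighted) is NOT zero.
unweighted_cohort_sum = sum(q_cohort(C0 - DELTA * A + DELTA * c)
                            for c in range(1, A + P))
print(age_check, period_check, cohort_check, unweighted_cohort_sum)
```

All six inner products over cells are exactly zero, whereas the unweighted sum over the 29 unique cohorts is not. The cohort polynomial is therefore orthogonal only when each cohort is weighted by its number of cells $n_{c_{*} (c)}$.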

For the myeloma data, Figure 3 shows the decomposition of the complete deviations for age, period, and cohort into constituent quadratic components plus higher-order terms. Global curvature for age (Figure 3(b)) and cohort (Figure 3(h)) but not period (Figure 3(e)) is statistically significant; the higher-order deviations for age (Figure 3(c)), period (Figure 3(f)), and cohort (Figure 3(i)) are not statistically significant. Hence, most of the variation in myeloma incidence around the intercept μ can be explained by just four of the model's 56 parameters, the two slope parameters $(α_{L} + π_{L})$ and $(π_{L} + γ_{L}),$ and the two curvature parameters $θ_{α}$ and $θ_{γ}$ .

Figure 3.

Decomposition of complete age, period, and cohort deviations into orthogonal quadratic components plus higher-order terms.

As shown in Appendix A.2, the locations of the roots of the orthogonal quadratic terms are a fixed property of the Lexis diagram. Hence, as shown in Figure 3(b), (e), and (h), the corresponding confidence bands must shrink to 0 at the roots.

6 Estimable functions and the fundamental decomposition principle

Estimable functions are log-linear combinations of identifiable APC coefficients. Estimable functions have been prominently deployed in cancer surveillance studies. In Table 1, we present popular estimable functions of age, period, and cohort, here defined using parameters of the new APC model.

Table 2 shows how estimable functions in Table 1 can be combined to produce equivalent sets of fitted rates. Two of these fundamental decompositions are shown in Figure 4 for the myeloma data, the longitudinal age representation (Figure 4(a) to (c)) and the cross-sectional age representation (Figure 4(d) to (f)). These decompositions yield fitted rates that are identical to the fitted rates obtained directly from the Poisson regression or weighted least squares fit, i.e. to the values of

\hat{λ} = 10^{5} \times exp (X \hat{β})

(Appendix A.4). These decompositions will not hold if one substitutes unweighted cohort deviations for the weighted values (Appendix A.7).

Table 2.

Fundamental decompositions of age-period-cohort fitted rates.

Temporality	Time scale	Equation^a
Longitudinal	Age	$λ (a_{*} | c_{*}) = LongAge (a_{*} | c_{*} = c_{*} (c^{ref})) \times CRR_{c_{*} : c_{*} (c^{ref})} \times e^{{\tilde{π}}_{c_{*} + a_{*}}}$
Longitudinal	Cohort	$λ (c_{*} | a_{*}) = FCP (c_{*} | a_{*} = a_{*} (a^{ref})) \times LARR_{a_{*} : a_{*} (a^{ref})} \times e^{{\tilde{π}}_{c_{*} + a_{*}}}$
Cross-sectional	Age	$λ (a_{*} | p_{*}) = CrossAge (a_{*} | p_{*} = p_{*} (p^{ref})) \times PRR_{p_{*} : p_{*} (p^{ref})} \times e^{{\tilde{γ}}_{p_{*} - a_{*}}}$
Cross-sectional	Period	$λ (p_{*} | a_{*}) = FTT (p_{*} | a_{*} = a_{*} (a^{ref})) \times CARR_{a_{*} : a_{*} (a^{ref})} \times e^{{\tilde{γ}}_{p_{*} - a_{*}}}$

The expected rates can be expressed in terms of any reference value of age, period, or cohort that occurs in the Lexis diagram. To define all of the equations in terms of the same reference cell, set the cohort index to $c^{ref} = p^{ref} - a^{ref} + A$, so that $c_{*} (c^{ref}) = p_{*} (p^{ref}) - a_{*} (a^{ref})$.
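The algebra behind two of the decompositions in Table 2 can be illustrated numerically. The sketch below uses arbitrary made-up parameter values (not estimates from the myeloma fit) on the myeloma diagram's dimensions. It checks that the longitudinal product (LongAge × CRR × period term) and the cross-sectional product (CrossAge × PRR × cohort term) both reproduce the rate implied by the linear predictor. All names are hypothetical. Because a single consistent parameter set is used throughout, the sketch does not test the role of cohort weighting discussed in Appendix A.7.

```python
import math
import random

A0, P0, DELTA, A, P = 47, 1992, 2, 19, 11
C = A + P - 1
rng = random.Random(0)


def a_val(a): return A0 - DELTA / 2 + DELTA * a
def p_val(p): return P0 - DELTA / 2 + DELTA * p
def c_val(c): return (P0 - A0 - A * DELTA) + DELTA * c


A_BAR, P_BAR, C_BAR = a_val((A + 1) / 2), p_val((P + 1) / 2), c_val((A + P) / 2)

# Arbitrary illustrative values: intercept, (alpha_L + pi_L), (pi_L + gamma_L).
MU, LAT, DRIFT = -8.0, 0.077, 0.009
CAT = LAT - DRIFT  # (alpha_L - gamma_L)
# Complete deviations (quadratic plus higher-order parts), also arbitrary.
dev_a = {a: rng.gauss(0, 0.05) for a in range(1, A + 1)}
dev_p = {p: rng.gauss(0, 0.05) for p in range(1, P + 1)}
dev_c = {c: rng.gauss(0, 0.05) for c in range(1, C + 1)}


def rate(a, p):
    """Rate from the longitudinal form of the linear predictor."""
    c = p - a + A
    return math.exp(MU + LAT * (a_val(a) - A_BAR) + DRIFT * (c_val(c) - C_BAR)
                    + dev_a[a] + dev_p[p] + dev_c[c])


def long_age(a, c_ref):
    return math.exp(LAT * (a_val(a) - A_BAR) + dev_a[a] + MU
                    + DRIFT * (c_val(c_ref) - C_BAR) + dev_c[c_ref])


def crr(c, c_ref):
    return math.exp(DRIFT * (c_val(c) - c_val(c_ref)) + dev_c[c] - dev_c[c_ref])


def cross_age(a, p_ref):
    return math.exp(CAT * (a_val(a) - A_BAR) + dev_a[a] + MU
                    + DRIFT * (p_val(p_ref) - P_BAR) + dev_p[p_ref])


def prr(p, p_ref):
    return math.exp(DRIFT * (p_val(p) - p_val(p_ref)) + dev_p[p] - dev_p[p_ref])


def longitudinal(a, p, c_ref):
    return long_age(a, c_ref) * crr(p - a + A, c_ref) * math.exp(dev_p[p])


def cross_sectional(a, p, p_ref):
    return cross_age(a, p_ref) * prr(p, p_ref) * math.exp(dev_c[p - a + A])


# Reference cell: age index 10, period index 6, hence cohort index 6 - 10 + 19 = 15.
A_REF, P_REF = 10, 6
C_REF = P_REF - A_REF + A
```

With these indices the reference cell corresponds to age 66, period 2003, and cohort 1937, the reference values used in the figure captions.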

Figure 4.

Estimable functions of the myeloma data based on new APC model. The bold curve in each panel shows the contributions of the linear and quadratic terms. The reference values are 66 years for age, 2003 for period, and 1937 for birth cohort.

Other useful estimable functions involve finite first and second derivatives of the deviations and estimable functions shown in Table 1. Let

S_{P} = \left[ \begin{matrix} 1 \\ 2 \\ \vdots \\ P \end{matrix} \right] \quad \text{and} \quad 1_{P} \equiv \left[ \begin{matrix} 1 \\ \vdots \\ 1 \end{matrix} \right]_{P \times 1}

As described in Appendix B, the gradient of the cross-sectional age curve on the logarithmic scale equals

\begin{matrix} {\overset{\cdot}{ρ}}_{CROSS} (a_{*} | p_{*} = p_{*} (p^{ref})) = G_{(A, Δ)} [ρ_{CROSS} (a_{*} (a) | p_{*} = p_{*} (p^{ref}))]_{a_{*} (1)}^{a_{*} (A)} \\ = (α_{L} - γ_{L}) 1_{A - 1} + 2 θ_{α} Δ \{S_{A - 1} - \frac{A}{2} 1_{A - 1}\} + G_{(A, Δ)} {\tilde{\tilde{α}}}_{a_{*}}, \end{matrix}

where $G_{(A, Δ)}$ is a finite difference operator. The gradient of the longitudinal age curve has the same form with $(α_{L} - γ_{L})$ replaced by $(α_{L} + π_{L})$.
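The quadratic term of the gradient can be checked directly. The sketch below assumes that $G_{(A, Δ)}$ is a forward difference divided by Δ; the operator is defined in Appendix B, which is not reproduced here, so this reading is an assumption. The sketch confirms that applying it to the orthogonal quadratic for age yields $2 Δ \{S_{A - 1} - \frac{A}{2} 1_{A - 1}\}$. The quadratic is written in its centered form, which is algebraically equal to the expanded polynomial. Function names are hypothetical.

```python
from fractions import Fraction as F

A0, DELTA, A = 47, 2, 19                 # age axis of the myeloma diagram
A_BAR = A0 + F(DELTA * A, 2)             # centroid age


def a_val(a):
    """Midpoint value of age group with index a."""
    return A0 - F(DELTA, 2) + DELTA * a


def q_age(x):
    # Centered form: (x - a_bar)^2 - delta^2 (A^2 - 1) / 12.
    return (x - A_BAR) ** 2 - F(DELTA ** 2, 12) * (A ** 2 - 1)


def forward_gradient(values, delta):
    """Assumed form of G: successive differences divided by the spacing."""
    return [(nxt - cur) / delta for cur, nxt in zip(values, values[1:])]


q_values = [q_age(a_val(a)) for a in range(1, A + 1)]
gradient = forward_gradient(q_values, DELTA)
expected = [2 * DELTA * (a - F(A, 2)) for a in range(1, A)]
```

The gradient has $A - 1$ components because each difference spans two adjacent age groups.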

In the literature, there has been considerable interest in second differences of cohort deviations.¹⁴,¹⁵ These can be extracted from ${\tilde{γ}}_{c_{*}}$ using a finite second difference operator $G_{C}^{Δ^{2}}$. It is worth noting (Appendix A.6) that second differences of the traditional unweighted cohort deviations are identical to second differences of the weighted values.
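One way to see why second differences can be insensitive to the choice of constraints is that they annihilate any intercept and linear term. The sketch below uses a standard second-difference stencil, assumed here to be what the operator above computes. It shows that adding an arbitrary linear function of the cohort index leaves the second differences unchanged, and that a pure quadratic with coefficient θ has constant second difference 2θ. The sequences are made-up numbers, and the premise that weighted and unweighted deviations differ only by a linear function of cohort is an assumption of this sketch, not a statement taken from Appendix A.6.

```python
from fractions import Fraction as F


def second_differences(values, delta):
    """Standard finite second differences for ordinates spaced delta apart."""
    return [(values[i + 2] - 2 * values[i + 1] + values[i]) / F(delta) ** 2
            for i in range(len(values) - 2)]


# Hypothetical deviations for seven consecutive cohorts.
deviations = [F(3), F(-1), F(4), F(1), F(-5), F(9), F(2)]
# The same deviations after adding an intercept and a linear trend in the index.
shifted = [g + F(7, 10) - F(1, 4) * c
           for c, g in enumerate(deviations, start=1)]

original_sd = second_differences(deviations, 2)
shifted_sd = second_differences(shifted, 2)

# A pure quadratic theta * x^2 with theta = 5 on the grid x = 2, 4, ..., 14.
quadratic_sd = second_differences([5 * (2 * c) ** 2 for c in range(1, 8)], 2)
```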

7 Combination hypothesis tests of period and cohort deviations

A fundamental question is whether period or cohort effects influence the rates over and above log-linear influences captured through the net drift parameter $(π_{L} + γ_{L})$ . The standard model's approach is to calculate omnibus Chi-square tests with $(P - 2)$ degrees of freedom ( $df$ ) and $(C - 2) df,$ respectively. A problem is that these tests can be under-powered for rare outcomes or granular data. We seek a test that is sensitive to signals in the quadratic or higher-order components alone, or simultaneously. Using the new model, we propose to use Tippett's Method¹⁶ to construct a new p-value combination test of significance of the complete period deviations, i.e. quadratic and/or higher-order period signals. Tippett's method is used to combine k independent statistical tests into an overall test of the global null hypothesis $H_{0}$ . Tippett's method rejects $H_{0}$ at overall significance level α if any of $P_{1}, \dots, P_{k} \leq α' = 1 - (1 - α)^{1 / k}$ . In practice, we use the conservative approximation $\frac{α}{k} < 1 - (1 - α)^{1 / k}$ , so that the combination test is

T_{α}^{\tilde{π}} = min (2 P^{θ_{π}}, 2 P^{\tilde{\tilde{π}}}, 1),

where $P^{θ_{π}}$ is the p value for the 1-df Wald Chi-square test of global curvature for period, and $P^{\tilde{\tilde{π}}}$ is the p value for the $(P - 3)$ df Wald Chi-square test of higher-order period deviations.

Similarly, we propose a new p-value combination test for the complete cohort deviations

T_{α}^{\tilde{γ}} = min (2 P^{θ_{γ}}, 2 P^{\tilde{\tilde{γ}}}, 1),

where $P^{θ_{γ}}$ is the p value for the 1-df Wald Chi-square test of global curvature for cohort, and $P^{\tilde{\tilde{γ}}}$ is the p value for the $(C - 3)$ df Wald Chi-square test of higher-order cohort deviations.

For the myeloma data, the standard Wald test for period deviations is $χ_{9}^{2} = 6.75, P = 0.664$ , and the new composite test is $min (2 \times 0.0987, 2 \times 0.8157, 1) = 0.1973$ , so neither test is significant. The standard Wald test for cohort deviations is $χ_{27}^{2} = 26.36, P = 0.499$ , and the new composite test is $min (2 \times 0.0042, 2 \times 0.8291, 1) = 0.0084$ , so the new test clearly identifies a signal that is missed by the old test.
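The combination rule is simple enough to sketch directly. The functions below (hypothetical names) compute the exact Tippett threshold and the conservative combined p value $min (k \times min_{i} P_{i}, 1)$ used in the text. The cohort example reuses the two component p values quoted above.

```python
def tippett_threshold(alpha: float, k: int) -> float:
    """Exact per-test threshold 1 - (1 - alpha)^(1/k) for k independent tests."""
    return 1.0 - (1.0 - alpha) ** (1.0 / k)


def combination_p(p_values) -> float:
    """Conservative combined p value: k times the smallest p, capped at 1."""
    k = len(p_values)
    return min(k * min(p_values), 1.0)


# Component p values quoted in the text for the cohort deviations:
# global curvature (1 df) and higher-order deviations (C - 3 df).
cohort_p = combination_p([0.0042, 0.8291])
exact_threshold = tippett_threshold(0.05, 2)
print(round(cohort_p, 4), exact_threshold > 0.05 / 2)  # 0.0084 True
```

Rejecting when the combined value is at most α is equivalent to rejecting when the smallest component p value is at most α/k. That rule is slightly stricter than the exact Tippett threshold.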

8 Combination hypothesis tests of period and cohort rate ratios

A related question is whether there is any association whatsoever of the rates with calendar period. Using the estimable functions defined in Table 1, under $H_{0} :$ constant period rate ratios

\begin{matrix} FTT (p_{*} (1)) = \dots = FTT (p_{*} (P)) = constant = e^{μ + (α_{L} - γ_{L}) (a_{*} (a^{ref}) - {\bar{a}}_{*}) + {\tilde{α}}_{a_{*} (a^{ref})}} \\ \Leftrightarrow PRR (p_{*} (1)) = \dots = PRR (p_{*} (P)) = 1 . \end{matrix}

A p-value combination test of this hypothesis based on Tippett's method is

T_{α}^{FTT (p_{*})} \equiv T_{α}^{PRR (p_{*})} = min (3 P^{(π_{L} + γ_{L})}, 3 P^{θ_{π}}, 3 P^{\tilde{\tilde{π}}}, 1) .

Similarly, under $H_{0} :$ constant cohort rate ratios

\begin{matrix} FCP (c_{*} (1)) = \dots = FCP (c_{*} (C)) = constant = e^{μ + (α_{L} + π_{L}) (a_{*} (a^{ref}) - {\bar{a}}_{*}) + {\tilde{α}}_{a_{*} (a^{ref})}} \\ \Leftrightarrow CRR (c_{*} (1)) = \dots = CRR (c_{*} (C)) = 1 . \end{matrix}

A p-value combination test for this hypothesis is

T_{α}^{FCP (c_{*})} \equiv T_{α}^{CRR (c_{*})} = min (3 P^{(π_{L} + γ_{L})}, 3 P^{θ_{γ}}, 3 P^{\tilde{\tilde{γ}}}, 1) .

For the myeloma data, the standard Wald test for period rate ratios is $χ_{10}^{2} = 19.82, P = 0.0310$ , and the new combination test is $min (3 \times 0.0001, 3 \times 0.0987, 3 \times 0.8157, 1) = 0.0003$ , thus both tests are significant but the new test much more so. The standard Wald test for cohort rate ratios is $χ_{28}^{2} = 37.67, P = 0.1049$ , and the new combination test is $min (3 \times 0.0001, 3 \times 0.0042, 3 \times 0.8291, 1) = 0.003$ , so the new test clearly identifies a signal that is missed by the old test.

9 Power of Wald tests versus combination tests: simulation study

We carried out a simulation study to confirm the level-correctness of our tests and to gauge the potential real-world power gains (if any) obtainable using the new combination tests. We used point estimates from the myeloma data's APC model to define a baseline scenario, and simulated Poisson counts under the baseline and alternative scenarios.

We set $α = 0.05$ for all tests. To assess the performance of tests of cohort effects, we multiplied both $θ_{γ}$ and ${\tilde{\tilde{γ}}}_{c}$ by 0, 0.5, 0.75, or 1, for two total sample sizes, $1 y_{+ +}$ or $2 y_{+ +}$, where $y_{+ +} \equiv 3747$ is the total observed number of cases. We used the same design to assess the performance of tests of period effects, except we scaled $θ_{π}$ and ${\tilde{\tilde{π}}}_{p}$ rather than $θ_{γ}$ and ${\tilde{\tilde{γ}}}_{c}$ .

For cohort effects (Figure 5(a) and (b)), both the Wald tests and the combination tests are level correct for both ${\tilde{γ}}_{c}$ (cohort deviations; Figure 5(a)) and cohort rate ratios (CRR; Figure 5(b)). The combination tests are appreciably more powerful; at the observed sample size the power gains are around 5 to 10 percentage points. For period effects (Figure 5(c) and (d)), as expected, both the Wald tests and the combination tests are level correct for both ${\tilde{π}}_{p}$ (period deviations; Figure 5(c)) and period rate ratios (PRR; Figure 5(d)). The combination tests have slightly higher power or are on par.

Figure 5.

Simulation results assessing Type-I error rates and power of Wald versus combination tests of complete cohort deviations (a); cohort rate ratios (CRR) (b); complete period deviations (c); period rate ratios (PRR) (d). See text for details.

10 Local drifts

Local drifts are model-based estimates of annual percentage changes by age group.⁶ Departures of local drifts from the overall net drift are a consequence of cohort deviations, hence local drifts quantify the heuristic that cohort effects represent an age interaction over time. Local drifts can be constructed by sliding a window of bandwidth P through the cohort deviations and extracting the least squares slopes, and then adding these “deflection” terms to the overall net drift.

Formally, the cross-sectional age-specific rates over time equal

ρ (p_{*} | a_{*} = a_{*} (a)) = (π_{L} + γ_{L}) (p_{*} - {\bar{p}}_{*}) + {\tilde{γ}}_{p_{*} - a_{*} (a)} + {\tilde{π}}_{p_{*}} + μ + (α_{L} - γ_{L}) (a_{*} (a) - {\bar{a}}_{*}) + {\tilde{α}}_{a_{*} (a)} .

The sub-vector of P complete cohort deviations belonging to age group $a_{*} (a)$ can be regressed on period (vectors denoted by bold-face) yielding the identity

\begin{matrix} {\tilde{γ}}_{p_{*} - a_{*} (a)} = [\overbrace{{\tilde{γ}}_{p_{*} (1) - a_{*} (a)}}^{\text{oldest}}, \dots, \overbrace{{\tilde{γ}}_{p_{*} (P) - a_{*} (a)}}^{\text{youngest}}]' = [{\tilde{γ}}_{c_{*} (1 - a + A)}, \dots, {\tilde{γ}}_{c_{*} (P - a + A)}]' \\ = {\bar{\tilde{γ}}}_{c_{*} (1 - a + A) : c_{*} (P - a + A)} + δ_{a_{*} (a)} (p_{*} - {\bar{p}}_{*}) + ε_{p_{*} | a_{*} (a)} . \end{matrix}

As we move from one age group to the next, we slide along by one birth cohort. We can calculate the ensemble of A deflections $δ_{a_{*} (a)}$ as follows. Vector

L_{1 \times P}^{Δ} = \frac{12}{Δ (P - 1) P (P + 1)} (S_{P} - \frac{P + 1}{2} 1_{P})'

extracts the least squares slope for any vector of ordinates of length P that are evenly spaced by Δ units apart. Hence, matrix

K_{(A, C)}^{Δ} = \left[ \begin{matrix} 0_{(1)} \dots 0_{(A - 1)} & L_{1 \times P}^{Δ} & \\ 0_{(1)} \dots 0_{(A - 2)} & L_{1 \times P}^{Δ} & 0 \\ \vdots & \vdots & \vdots \\ 0 & L_{1 \times P}^{Δ} & 0_{(1)} \dots 0_{(A - 2)} \\ & L_{1 \times P}^{Δ} & 0_{(1)} \dots 0_{(A - 1)} \end{matrix} \right]

extracts from the complete cohort deviations all A deflection terms for the youngest through the oldest age groups,

K_{(A, C)}^{Δ} {\tilde{γ}}_{C \times 1} = δ_{A \times 1} .
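The slope-extraction vector and the sliding-window matrix can be built explicitly. The sketch below (hypothetical function names, exact rational arithmetic, integer Δ assumed) constructs $L_{1 \times P}^{Δ}$ and $K_{(A, C)}^{Δ}$ with rows ordered from the youngest to the oldest age group. It then checks that they recover the slope of a sequence that is exactly linear in cohort.

```python
from fractions import Fraction as F


def slope_vector(P, delta):
    """Weights that return the least squares slope of P ordinates spaced delta apart."""
    scale = F(12, delta * (P - 1) * P * (P + 1))
    return [scale * (i - F(P + 1, 2)) for i in range(1, P + 1)]


def sliding_slope_matrix(A, P, delta):
    """A x C matrix whose row a applies the slope vector to cohorts (1-a+A)..(P-a+A)."""
    C = A + P - 1
    weights = slope_vector(P, delta)
    rows = []
    for a in range(1, A + 1):
        row = [F(0)] * C
        start = A - a                      # 0-based column of cohort index 1 - a + A
        row[start:start + P] = weights
        rows.append(row)
    return rows


def apply(matrix, vector):
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


A, P, DELTA = 19, 11, 2
L = slope_vector(P, DELTA)
K = sliding_slope_matrix(A, P, DELTA)

# A made-up cohort sequence that is exactly linear: slope 1/2 per unit of cohort value.
linear_cohorts = [7 + F(1, 2) * DELTA * c for c in range(1, A + P)]
deflections = apply(K, linear_cohorts)
```

Every window of an exactly linear sequence has the same slope, so all A deflections equal 1/2 in this example. In the model the cohort deviations are constrained to have no linear trend over the whole diagram, so the deflections instead capture local departures.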

As shown in Appendix A.5, under the standard model the local drifts equal

[(π_{L} + γ_{L})_{a_{*} (a)}] = (π_{L} + γ_{L}) \cdot 1_{A} + K_{(A, C)}^{Δ} {\tilde{γ}}_{C \times 1} = (π_{L} + γ_{L}) \cdot 1_{A} + δ_{A \times 1},

while under the new model the local drifts equal

[(π_{L} + γ_{L})_{a_{*} (a)}] = (π_{L} + γ_{L}) \cdot 1_{A} - 2 θ_{γ} Δ (S_{A} - \frac{A + 1}{2} 1_{A}) + K_{(A, C)}^{Δ} {\tilde{\tilde{γ}}}_{C \times 1} .

Thus, the least squares slope of the local drifts with respect to age is determined by the global curvature for cohort. When $θ_{γ} > 0$ and ${\tilde{\tilde{γ}}}_{C \times 1}$ is comparatively small, as we move from older through younger cohorts the deflections change sign from negative values that apply to the older age groups $a_{*} (a) > {\bar{a}}_{*}$ to positive values for the younger age groups $a_{*} (a) < {\bar{a}}_{*}$ . Conversely, when $θ_{γ} < 0$ , the deflections have positive values for older age groups and negative values for younger age groups.
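The link between cohort curvature and the age slope of the local drifts can be verified numerically. The sketch below sets the higher-order cohort deviations to zero and uses a hypothetical curvature value. It regresses the quadratic cohort component on period within each age group, then regresses the resulting deflections on age. The cohort quadratic is written in centered form, which is algebraically equal to the expanded polynomial given earlier. Names and the value of `THETA_GAMMA` are illustrative assumptions.

```python
from fractions import Fraction as F

A0, P0, DELTA, A, P = 47, 1992, 2, 19, 11
THETA_GAMMA = F(-1, 2400)   # hypothetical global curvature for cohort


def a_val(a): return A0 - F(DELTA, 2) + DELTA * a
def p_val(p): return P0 - F(DELTA, 2) + DELTA * p


A_BAR, P_BAR = a_val(F(A + 1, 2)), p_val(F(P + 1, 2))
C_BAR = P_BAR - A_BAR


def q_cohort(x):
    # Centered form: (x - c_bar)^2 - delta^2 (A^2 + P^2 - 2) / 12.
    return (x - C_BAR) ** 2 - F(DELTA ** 2, 12) * (A ** 2 + P ** 2 - 2)


def ls_slope(xs, ys):
    """Ordinary least squares slope of ys on xs."""
    x_bar = sum(xs) / len(xs)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sum((x - x_bar) * y for x, y in zip(xs, ys)) / sxx


ages = [a_val(a) for a in range(1, A + 1)]
periods = [p_val(p) for p in range(1, P + 1)]

# Deflection for each age group: slope over period of the quadratic cohort term.
deflections = [ls_slope(periods, [THETA_GAMMA * q_cohort(p - a) for p in periods])
               for a in ages]
expected = [-2 * THETA_GAMMA * DELTA * (a - F(A + 1, 2)) for a in range(1, A + 1)]
slope_over_age = ls_slope(ages, deflections)
```

With a negative curvature the deflections rise with age, and their slope over age equals $- 2 θ_{γ}$ exactly.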

Figure 6 presents local drifts for the myeloma data. For context, canonical plots of age-specific rates over time (Figures 6(a) and 1(b)) suggest that perhaps the rates are increasing in older age groups and are stable or perhaps decreasing in younger age groups. Local drifts from the standard model support this conclusion (Figure 6(b)), but the estimates are uncertain. The new model (Figure 6(c)) confirms that the local drifts indeed increase consistently with age. On average across ages 47–84, the net drift increases by $100 \times (e^{- 2 θ_{γ}} - 1) = 0.083 \%$ per calendar year per year of age (95% CI: 0.03–0.14% per year²). This relatively small increase in drift per single year of age accumulates over the age range. Incidence is essentially flat among men aged 50 years, but among men aged 80 years, the incidence is significantly increasing by 2% per year.

Figure 6.

Local drifts of the myeloma data. (a) Canonical plot of age-specific rates over time (age binned into eight-year groups); (b) annual percentage change per calendar year by age group (local drifts) estimated using the standard APC model; (c) local drift estimates from the new APC model.

An important point is that local drifts are cross-sectional quantities, and the same cross-sectional values hold whether we condition on age and examine the rates over calendar periods or condition on calendar periods and examine the rates across age groups. In Appendix C, we show that the log-linear slopes of the cross-sectional age-specific rates within calendar periods equal

[(α_{L} - γ_{L})_{p_{*} (p)}] = (α_{L} - γ_{L}) \cdot 1_{P} - 2 θ_{γ} Δ (S_{P} - \frac{P + 1}{2} 1_{P}) - K_{(P, C)} {\tilde{\sim} γ}_{C \times 1},

where matrix

K_{(P, C)}

slides a length P window through a length C sequence of ordinates evenly spaced Δ units apart and extracts the least squares slope. We call these quantities “gradient shifts”. Thus, both the slope of the local drifts over age groups and the slope of the gradient shifts over calendar periods are equal to

- 2 θ_{γ} .
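The equality of the two slopes can be checked numerically on a toy Lexis diagram. The sketch below assumes log rates that consist only of a quadratic cohort term, $θ (p - a - c_{0})^{2}$, with an arbitrary centering constant; the grid, the value of `theta`, and all function names are illustrative and are not taken from the paper or its R package.

```python
def ls_slope(xs, ys):
    """Ordinary least squares slope of ys regressed on xs."""
    n = len(xs)
    xbar = sum(xs) / n
    ybar = sum(ys) / n
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    sxx = sum((x - xbar) ** 2 for x in xs)
    return sxy / sxx


A, P, delta = 6, 4, 5.0
theta = -0.0004  # illustrative curvature on the log scale, per year squared
ages = [40.0 + delta * a for a in range(A)]
periods = [1990.0 + delta * p for p in range(P)]


def log_rate(age, period):
    """Toy log rate: a pure quadratic in birth cohort (period minus age)."""
    cohort = period - age
    return theta * (cohort - 1950.0) ** 2


# Local drifts: slope over calendar periods within each age group.
local_drifts = [ls_slope(periods, [log_rate(a, p) for p in periods]) for a in ages]

# Cross-sectional age slopes within each calendar period.
age_slopes = [ls_slope(ages, [log_rate(a, p) for a in ages]) for p in periods]

# Slope of the local drifts versus age, and of the age slopes versus period.
drift_slope = ls_slope(ages, local_drifts)
shift_slope = ls_slope(periods, age_slopes)
```

In this simplified setting both `drift_slope` and `shift_slope` equal `-2 * theta` up to rounding error. Adding age effects, period effects, or linear cohort trends would shift the individual drifts and age slopes by constants but would leave these two slopes unchanged.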

11 Local drifts tests

When testing for significance of the local drifts using the standard model, we often rely on an omnibus Wald statistic. The null hypothesis is that all local drifts are equal to the overall net drift, which is true if and only if $δ_{A \times 1} = 0_{A \times 1}$ . By the Delta method

V_{δ_{A \times 1}} = K_{(A, C)}^{Δ} V_{{\tilde{γ}}_{C \times 1}} {K_{(A, C)}^{Δ}}' ,

hence the Wald test statistic is

W_{δ} = δ_{A \times 1}' V_{δ_{A \times 1}}^{-} δ_{A \times 1} \sim χ_{df}^{2}, \quad df = rank (V_{δ_{A \times 1}}) .

As shown in Appendix A.5, when the Lexis diagram is rectangular $(A \neq P)$ , $V_{δ_{A \times 1}}$ is full rank and $df = A$ . However, when the Lexis diagram is square $(A = P)$ , $\sum_{a = 1}^{A} δ_{a} = 0$ , hence $df = A - 1$ . In other words, when the Lexis diagram is square we lose one degree of freedom and thereby gain a small increase in power. In practice, to calculate the test when $A = P$ , we leave out one component of $δ_{A \times 1}$ so that the corresponding reduced variance-covariance matrix has full rank.

The new model offers a 1-degree-of-freedom test. Clearly, if $θ_{γ} \neq 0$ then $δ_{A \times 1} \neq 0_{A \times 1}$ . Hence, a 1-degree-of-freedom test evaluates $H_{0} : θ_{γ} = 0$ . The test can be carried out using standard outputs from the model fit, i.e. the coefficient ${\hat{θ}}_{γ}$ and its variance $Var ({\hat{θ}}_{γ})$ ; the 1 degree of freedom test is simply $W_{θ_{γ}} = \frac{{\hat{θ}}_{γ}^{2}}{Var ({\hat{θ}}_{γ})}$ , which is distributed $χ_{1}^{2}$ under $H_{0}$ .
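The 1-degree-of-freedom test needs nothing beyond the estimate and its variance. The following sketch, with hypothetical function names and purely illustrative inputs, computes the Wald statistic and its $χ_{1}^{2}$ P-value using only the Python standard library, relying on the identity $P (χ_{1}^{2} > w) = erfc (\sqrt{w / 2})$.

```python
import math


def chi2_1df_pvalue(w):
    """Upper tail probability of a chi-square variate with 1 degree of freedom."""
    return math.erfc(math.sqrt(w / 2.0))


def wald_1df(estimate, variance):
    """Wald statistic and P-value for H0: parameter = 0."""
    w = estimate ** 2 / variance
    return w, chi2_1df_pvalue(w)


# Illustrative inputs only (not estimates from the myeloma analysis).
w_demo, p_demo = wald_1df(estimate=0.3, variance=0.01)

# P-value implied by the chi-square statistic of 8.19 reported for the
# myeloma data in this section.
p_myeloma = chi2_1df_pvalue(8.19)

print(f"demo: W = {w_demo:.2f}, P = {p_demo:.4f}")
print(f"myeloma: P = {p_myeloma:.4f}")
```

The second calculation agrees, to the reported precision, with the P-value of 0.0042 quoted for the myeloma data.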

In principle, one could construct a combination test $min (2 P^{θ_{γ}}, 2 P^{K_{(A, C)}^{Δ} {\tilde{\sim} γ}_{C \times 1}}, 1),$ but we have not done so because it is difficult to interpret the third-order effects $K_{(A, C)}^{Δ} {\tilde{\sim} γ}_{C \times 1}$ . Instead, we construct the combination test $T_{α}^{LocalDrifts (a_{*})} = min (2 P^{W_{θ_{γ}}}, P^{W_{δ}}, 1)$ . The test may be conservative because the component tests are dependent.

For the myeloma data, the Omnibus Wald Test is $χ_{19}^{2} = 19.79, P = 0.4072$ , and the new 1-degree-of-freedom test is $χ_{1}^{2} = 8.19, P = 0.0042$ , so the new test clearly identifies a signal that is missed by the old test. The combination test statistic is 0.0084, also highly significant.
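For illustration, the combination statistic can be evaluated directly from the two reported P-values. The sketch below follows the formula for $T_{α}^{LocalDrifts (a_{*})}$ exactly as it is written in this section; the function name is hypothetical.

```python
def local_drifts_combination(p_theta, p_omnibus):
    """Combination statistic as written in the text:
    min(2 * P[1-df test], P[omnibus test], 1)."""
    return min(2.0 * p_theta, p_omnibus, 1.0)


# P-values reported for the myeloma data.
p_theta_myeloma = 0.0042
p_omnibus_myeloma = 0.4072

t_combination = local_drifts_combination(p_theta_myeloma, p_omnibus_myeloma)
print(f"combination statistic = {t_combination:.4f}")
```

Here the result is driven entirely by the 1-degree-of-freedom component, which is why the combination statistic is exactly twice the 1-df P-value.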

12 Power of local drifts tests: simulation study

We carried out a simulation study of our new 1-degree-of-freedom (1-df) and combination tests of local drifts versus the standard Omnibus test. Our aim was to assess the level-correctness of our tests and to gauge the potential real-world power gains (if any) obtainable using the new tests when the local drift signals arise from quadratic or higher-order cohort effects, alone or simultaneously. As before, we used point estimates from the myeloma data's APC model to define a baseline scenario.

To assess the performance of the local drifts tests, we considered three situations. First, ${\tilde{\sim} γ}_{c} \equiv 0$ and $θ_{γ}$ is set to 0, 0.5, 1, or 1.5 times its baseline estimate, hence all the signal (if any) comes through the global curvature for cohort. Second, $θ_{γ} \equiv 0$ and ${\tilde{\sim} γ}_{c}$ is set to 0, 0.5, 1, or 1.5 times its baseline estimate, hence all the signal (if any) comes through the higher-order cohort effects. Third, $θ_{γ}$ and ${\tilde{\sim} γ}_{c}$ are both multiplied by 0, 0.5, 1, or 1.5. For each situation, we considered two total sample sizes, $1 y_{+ +}$ or $4 y_{+ +}$, where $y_{+ +} = 3747$ is the total observed number of cases.
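The design just described can be laid out as a small grid. The sketch below only enumerates the scenarios (it does not simulate any data), and the dictionary keys and function names are hypothetical labels chosen for illustration.

```python
from itertools import product

Y_TOTAL = 3747                      # total observed cases, from the text
MULTIPLIERS = (0.0, 0.5, 1.0, 1.5)  # scale factors applied to baseline estimates
SIZE_FACTORS = (1, 4)               # 1x and 4x the observed number of cases


def effect_multipliers(situation, m):
    """Return (curvature multiplier, higher-order multiplier) for a situation."""
    if situation == 1:      # signal through the global curvature only
        return m, 0.0
    if situation == 2:      # signal through higher-order cohort effects only
        return 0.0, m
    if situation == 3:      # both sources scaled together
        return m, m
    raise ValueError("situation must be 1, 2, or 3")


def scenario_grid():
    """Enumerate every situation x multiplier x sample-size combination."""
    grid = []
    for situation, m, size in product((1, 2, 3), MULTIPLIERS, SIZE_FACTORS):
        theta_mult, higher_mult = effect_multipliers(situation, m)
        grid.append({
            "situation": situation,
            "theta_multiplier": theta_mult,
            "higher_order_multiplier": higher_mult,
            "total_cases": size * Y_TOTAL,
        })
    return grid


grid = scenario_grid()
null_rows = [g for g in grid
             if g["theta_multiplier"] == 0.0
             and g["higher_order_multiplier"] == 0.0]
```

Each situation contributes one null setting per sample size (multiplier 0), which is where level-correctness is assessed; the remaining settings are used to gauge power.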

As shown in Figure 7(a), all three tests are level correct. In situation 1, the 1-df test is much more powerful than the Omnibus test: 36% versus 9%, 87% versus 32%, and 99% versus 77% in the three non-null settings shown in Figure 7(a). As one would expect, since in this situation the signal arises from cohort curvature alone, the combination test is slightly less powerful than the 1-df test. In contrast, when the signal is entirely due to higher-order cohort effects (situation 2, Figure 7(b)), the 1-df test remains level correct. Here the Omnibus test is quite sensitive (59% power for the baseline case, 100% when the sample size is increased four-fold). The combination test is somewhat less powerful. In situation 3 (Figure 7(c)), when there is signal from both sources, the 1-df test is at least as powerful as the Omnibus test. In some settings the combination test is more powerful than the 1-df test, and in others less. On balance, the combination test appears to be a useful compromise.

Figure 7.

Simulation results assessing Type-I error rates and power of Omnibus test, 1-df test, and combination test for assessing significance of local drifts. See text for details.

13 Discussion

We developed our new APC model with applications to cancer surveillance research in mind. Our methods are equally appropriate for many other health-related events. For example, using APC analysis, Shiels et al.¹⁷ analyzed all-cause premature mortality in the US, and Best et al.¹⁸ analyzed premature mortality from cancer, heart disease, accidents, suicide, and chronic liver disease/cirrhosis.

Our new APC model partitions age, period, and cohort deviations into orthogonal quadratic components plus higher-order terms. Other investigators have developed spline-based methods which include quadratic terms along with other polynomials.¹³,¹⁹ These approaches provide appealingly smooth estimable functions, but the regression parameters themselves provide little insight. In contrast, the global curvature parameters $θ_{α}$, $θ_{π}$, and $θ_{γ}$ associated with our orthogonal quadratic polynomials for age, period, and cohort, respectively, quantify how fast on average the trends in the rates are changing. By including these additional parametric terms, our new APC model provides novel estimable functions and hypothesis tests.

We recently developed a random effects model for simultaneous analysis of multiple sets of rates ascertained from multiple population strata over the same Lexis diagram.²⁰ In that paper, we also present a quadratic model. A key innovation in this report is that we use orthogonal polynomials that are characteristic of the underlying Lexis diagram and demonstrate the surprisingly large utility of doing so. Going forward, our random effects methods could use the orthogonal parameterization presented here to investigate the heterogeneity of local drifts and local drift trends (i.e. the quantity $- 2 θ_{γ}$) over a potentially large number of strata.

Three theoretical points are worth emphasizing. First, all of the parameters and functions described here are identifiable. This is illustrated, for example, in Figure 3. Second, our new hypothesis tests are level correct and can be more powerful than the usual global tests.²¹ Third, the weighted cohort deviations described here are very useful, notwithstanding that unweighted deviations ${\tilde{γ}}_{c_{*}}^{H}$ have been widely used in the past.²² As we show (Appendix A.6), the absolute difference between the unweighted deviations ${\tilde{γ}}_{c_{*}}^{H}$ and our weighted deviations ${\tilde{γ}}_{c_{*}}$ is a linear function of birth cohort. This result implies that the decompositions in Table 2, which are of great utility in practical applications, do not hold if you substitute ${\tilde{γ}}_{c_{*}}^{H}$ for ${\tilde{γ}}_{c_{*}}$.

Our new model opens avenues of future methodological research, including comparative analysis, power calculations, and forecasting.³ For pairwise comparisons among 2 to around 10 strata (for example, 9-registry SEER analyses), we expect our new model will enable more powerful combination tests of log-parallelism and proportionality.²³ These methods would complement the random effects models that work best when the number of strata is around 10 or more.²⁰ Asymptotically, the Wald tests have non-central chi-square distributions, which permits analytical power calculations.
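As an indication of what such an analytical power calculation might look like for a 1-degree-of-freedom Wald test, the sketch below uses the fact that a non-central $χ_{1}^{2}$ variate with non-centrality $λ$ is distributed as $(Z + \sqrt{λ})^{2}$ for standard normal $Z$. The function name and the example non-centrality values are illustrative assumptions, not quantities derived from the paper.

```python
from statistics import NormalDist


def power_1df(noncentrality, alpha=0.05):
    """Asymptotic power of a 1-df Wald test at level alpha.

    P(chi2_1(lambda) > critical value)
      = Phi(sqrt(lambda) - z) + Phi(-sqrt(lambda) - z),
    where z is the upper alpha/2 quantile of the standard normal.
    """
    z = NormalDist()
    crit = z.inv_cdf(1.0 - alpha / 2.0)
    root = noncentrality ** 0.5
    return z.cdf(root - crit) + z.cdf(-root - crit)


# Illustrative non-centrality values. Under the usual assumption that the
# non-centrality grows in proportion to the number of events, a four-fold
# sample size corresponds to a four-fold non-centrality.
for lam in (0.0, 2.0, 8.0):
    print(f"lambda = {lam:4.1f}: power = {power_1df(lam):.3f}, "
          f"four-fold sample size: {power_1df(4.0 * lam):.3f}")
```

At $λ = 0$ the function returns the nominal level, which gives a quick sanity check on the calculation.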

One of the main limitations of the new model is shared by the old: methods for assessing goodness-of-fit are currently limited. One approach that might prove useful would use a smooth Gaussian process to model any residual departures after fitting the usual age, period, and cohort effects.²⁴ Another important topic is fitting models that incorporate external information, for example, data on established cancer risk factors or screening.

Because our new APC model subsumes a standard APC model yet provides serious advantages, we propose it be considered a new standard. In particular, we believe the value of estimating $θ_{γ}$, the global curvature for cohort, in cancer studies will become increasingly apparent. Whenever $θ_{γ}$ is significantly greater than 0, the trend over time is less favorable in younger versus older persons, which is always a cause for concern. In fact, some cancers do appear to be increasing in younger age groups,²⁵,²⁶ which supports the potential practical utility of our new model. Every statistic introduced in this paper is implemented in our R package (available at https://dceg.cancer.gov/tools/analysis/apc), and we plan to update our web tool (https://analysistools.nci.nih.gov/apc/).

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

Appendix A

References

1. Siegel, Miller, Jemal. Cancer statistics, 2017. CA Cancer J Clin 2017; 67: 7–30.

2. Smith, Wakefield. A review and comparison of age-period-cohort models for cancer incidence. Stat Sci 2016; 31: 591–610.

3. Bray, Moller. Predicting the future burden of cancer. Nat Rev Cancer 2006; 6: 63–74.

4. Holford. Understanding the effects of age, period, and cohort on incidence and mortality rates. Annu Rev Public Health 1991; 12: 425–457.

5. Keiding. Statistical-inference in the Lexis diagram. Philos T Roy Soc A 1990; 332: 487–509.

6. Rosenberg, Check, Anderson. A web tool for age-period-cohort analysis of cancer incidence and mortality rates. Cancer Epidemiol Biomarkers Prev 2014; 23: 2296–2302.

7. Clayton, Schifflers. Models for temporal variation in cancer rates. I: Age-period and age-cohort models. Stat Med 1987; 6: 449–467.

8. Fay, Tiwari, Feuer, et al. Estimating average annual percent change for disease rates without assuming constant change. Biometrics 2006; 62: 847–854.

9. Robertson, Boyle. Age-period-cohort analysis of chronic disease rates. I: Modelling approach. Stat Med 1998; 17: 1305–1323.

10. Cancer Facts and Figures 2017, Atlanta, GA: American Cancer Society, 2017.

11. SEER-13. Surveillance, Epidemiology, and End Results (SEER) Program (www.seer.cancer.gov) SEER*Stat Database: Incidence-SEER 13 Regs Limited-Use, Nov 2015 Sub (1992–2013) <Single Ages to 85+, Katrina/Rita Population Adjustment> - Linked To County Attributes – Total U.S., 1969–2014 Counties, National Cancer Institute, DCCPS, Surveillance Research Program, Cancer Statistics Branch, released April 2016, based on the November 2015 submission. 2015.

12. Robertson, Boyle. Age-period-cohort models of chronic disease rates. II: Graphical approaches. Stat Med 1998; 17: 1325–1339.

13. Carstensen. Age-period-cohort models for the Lexis diagram. Stat Med 2007; 26: 3018–3045.

14. Clayton, Schifflers. Models for temporal variation in cancer rates. II: Age-period-cohort models. Stat Med 1987; 6: 469–481.

15. Tarone, Chu. Evaluation of birth cohort patterns in population disease rates. Am J Epidemiol 1996; 143: 85–91.

16. Westberg. Combining independent statistical tests. J Roy Stat Soc D-Sta 1985; 34: 287–296.

17. Shiels, Chernyavskiy, Anderson, et al. Trends in premature mortality in the USA by sex, race, and ethnicity from 1999 to 2014: an analysis of death certificate data. Lancet 2017; 389: 1043–1054.

18. Best, Haozous, de Gonzalez, et al. Premature mortality projections in the USA through 2030: a modelling study. Lancet Public Health 2018; 3: e374–e384.

19. Heuer. Modeling of time trends and interactions in vital rates using restricted regression splines. Biometrics 1997; 53: 161–177.

20. Chernyavskiy, Little, Rosenberg. A unified approach for assessing heterogeneity in age-period-cohort model parameters using random effects. Stat Meth Med Res 2017, pp. 962280217713033.

21. Anderson, Rabkin, Turner, et al. The changing face of noncardia gastric cancer incidence among US non-Hispanic Whites. J Nat Cancer Inst 2018; 110: 608–615. doi: 10.1093/jnci/djx262.

22. Holford. Age-period-cohort analysis. In: Armitage, Colton (eds). Encyclopedia of biostatistics, Hoboken, New Jersey: John Wiley & Sons, Ltd., 2005, pp. 82–99.

23. Rosenberg, Anderson. Proportional hazards models and age-period-cohort analysis of cancer rates. Stat Med 2010; 29: 1228–1238.

24. Chernyavskiy, Little, Rosenberg. Correlated Poisson models for age-period-cohort analysis. Stat Med 2018; 37: 405–424.

25. Rosenberg, Barker, Anderson. Estrogen receptor status and the future burden of invasive and in situ breast cancers in the United States. J Nat Cancer Inst 2015, pp. 107.

26. Siegel, Fedewa, Anderson, et al. Colorectal cancer incidence patterns in the United States, 1974–2013. J Nat Cancer Inst 2017, pp. 109.

A new age-period-cohort model for cancer surveillance research
