This paper reviews models for incomplete continuous and categorical longitudinal data. In terms of Rubin's classification of missing value processes we are specifically concerned with the problem of nonrandom missingness. A distinction is drawn between the classes of selection and pattern-mixture models and, using several examples, these approaches are compared and contrasted. The central roles of identifiability and sensitivity are emphasized throughout.
Get full access to this article
View all access options for this article.
References
1.
Afifi A , Elashoff R.Missing observations in multivariate statistics I: Review of the literature . Journal of the American Statistical Association1966; 61: 595-604 .
2.
Hartley HO , Hocking R.The analysis of incomplete data . Biometrics1971; 27: 778-808 .
3.
Dempster AP , Laird NM , Rubin DB.Maximum likelihood from incomplete data via the EM algorithm (with discussion) . Journal of the Royal Statistical Society, Series B1977; 39: 1-38 .
4.
Rubin DB.Multiple imputation for nonresponse in surveys. Chichester: John Wiley , 1987.
5.
Tanner M , Wong W.The calculation of posterior distributions by data augmentation (with discussion) . Journal of the American Statistical Association1987; 82: 528-550 .
6.
Goss PE , Winer EP , Tannock IF , Schwartz LH , Kremer AB.A randomized phase III trial comparing the new potent and selective third-generation aromatase inhibitor vorozole with megestrol acetate in post-menopausal advanced breast cancer patients. In press.
7.
Schipper H , Clinch J , McMurray A.Measuring the quality of life of cancer patients: the functional-living index-cancer: development and validation . Journal of Clinical Oncology1984; 2: 472-483 .
8.
Diggle PJ , Kenward MG.Informative dropout in longitudinal data analysis (with discussion) . Applied Statistics1994; 43: 49-94 .
9.
Kenward MG.Selection models for repeated measurements with nonrandom dropout: an illustration of sensitivity . Statistics in Medicine1998; 17: 2723-2732 .
10.
Diggle PJ , Liang K-Y , Zeger SL.Analysis of longitudinal data. Oxford: Oxford University Press , 1994.
Verbeke G , Molenberghs G eds. Linear mixed models in practice. New York: Springer , 1997.
13.
Molenberghs G , Lesaffre E.Marginal modelling of correlated ordinal data using a multivariate Plackett distribution . Journal of the American Statistical Association1994; 89: 633-644 .
14.
Kenward MG , Lesaffre E , Molenberghs G.An application of maximum likelihood and generalized estimating equations to the analysis of ordinal data from a longitudinal study with cases missing at random . Biometrics1994; 50: 945-953 .
15.
Molenberghs G , Kenward MG , Lesaffre E.The analysis of longitudinal ordinal data with non-random dropout . Biometrika1997; 84: 33-44 .
16.
Molenberghs G , Goetghebeur E , Lipsitz SR , Kenward MG , Lesaffre E , Michiels B.Missing data perspectives of the fluvoxamine data set: a review . Statistics in Medicine1999; 18. In press.
17.
Heyting A , Tolboom JTBM , Essers JGA.Statistical handling of drop-outs in longitudinal clinical trials . Statistics in Medicine1992; 11: 2043-2061 .
18.
Shih WJ , Quan H.Testing for treatment differences with dropouts in clinical trials - a composite approach . Statistics in Medicine1997; 16: 1225-1239 .
19.
Little RJA , Yau L.Intent-to-treat analysis for longitudinal studies with drop-outs . Biometrics1996; 52: 1324-1333 .
20.
Wang-Clow F , Lange N , Laird NM , Ware JH.A simulation study of estimators for rate of change in longitudinal studies with attrition . Statistics in Medicine1995; 14: 283-297 .
21.
Little RJA.A class of pattern-mixture models for multivariate incomplete data . Biometrika1994; 81: 471-483 .
22.
Rubin DB.Inference and missing data . Biometrika1976; 63: 581-592 .
23.
Little RJA , Rubin DB.Statistical analysis with missing data. Chichester: John Wiley , 1986.
24.
Murray GD , Findlay JG.Correcting for the bias caused by drop-outs in hypertension trials . Statistics in Medicine1988; 7: 941-946 .
25.
Little RJA.Modeling the dropout mechanism in repeated-measures studies . Journal of the American Statistical Association1995; 90: 112-121 .
26.
Molenberghs G , Kenward MG.Calculating the appropriate information matrix for log-linear models when data are missing at random . In: Gregoire TG , Brillinger DR , Diggle PJ , Russek-Cohen E , Warren WG , Wolfinger RD eds. Lecture notes in statistics 122, Proceedings of the Nantucket conference on modelling longitudinal and spatially correlated data: methods applications and future directions. New York: Springer, 1997: 331-338 .
27.
Kenward MG , Molenberghs G.Likelihood based frequentist inference when data are missing at random . Statistical Science1998; 13: 236-247 .
28.
Robins JM , Rotnitsky A , Zao LP.Analysis of semiparametric regression models for repeated ourcomes in the presence of missing data . Journal of the American Statistical Association1995; 90: 106-121 .
29.
Fitzmaurice GM , Molenberghs G , Lipsitz SR.Regression models for longitudinal binary responses with informative dropout . Journal of the Royal Statistical Society, Series B1995; 57: 691-704 .
30.
Hogan JW , Laird NM.Model-based approaches to analysing incomplete longitudinal and failure time data . Statistics in Medicine1997; 16: 259-272 .
31.
Heckman JJ.The common structure of statistical models of truncation, sample selection and limited dependent variables and a simple estimator for such models . Annals of Economic and Social Measurement. 1976; 5: 475-492 .
32.
Molenberghs G , Michiels B , Kenward MG , Diggle PJ.Missing data mechanisms and pattern-mixture models . Statistica Nederlandica1998; 52: 153-161 .
33.
Amemiya T.Tobit models: a survey . Journal of Econometrics1984; 24: 3-61 .
34.
Little RJA.A note about models for selectivity bias . Econometrika1986; 53: 1469-1474 .
35.
Baker SG.Marginal regression for repeated binary data with outcome subject to nonignorable non-response . Biometrics1995; 51: 1042-1052 .
36.
Fitzmaurice GM , Laird NM , Zahner GEP.Multivariate logistic models for incomplete binary response . Journal of the American Statistical Association1996; 91: 99-108 .
37.
Fitzmaurice GM , Heath G , Clifford P.Logistic regression models for binary panel data with attrition . Journal of the Royal Statistical Society, Series A1996; 159: 249-264 .
38.
Fay RE.Causal models for patterns of nonresponse . Journal of the American Statistical Association1986; 81: 354-365 .
39.
Baker SG , Laird NM.Regression analysis for categorical variables with outcome subject to nonignorable nonresponse . Journal of the American Statistical Association1988; 83: 62-69 .
40.
Baker SG , Rosenberger WF , DerSimonian R.Closed-form estimates for missing counts in two-way contingency tables . Statistics in Medicine199211: 643-657 .
41.
Baker SG.Composite linear models for incomplete data . Statistics in Medicine1994; 13: 609-622 .
42.
Park T , Brown MB.Models for categorical data with nonignorable nonresponse . Journal of the American Statistical Association1994; 89: 44-51 .
43.
Molenberghs G , Goetghebeur EJT , Lipsitz SR , Kenward MG.Non-random missingness in categorical data: strengths and limitations . The American Statistician1999; 52. In press.
44.
Ekholm A , Skinner C.The Muscatine children's obesity data reanalysed using pattern mixture models . Applied Statistics1998; 47: 251-264 .
45.
Greenlees WS , Reece JS , Ziexhang KD.Imputation of missing values when the probability of nonresponse depends on the variable being imputed . Journal of the American Statistical Association1982; 77: 251-256 .
46.
Troxel AB , Harrington DP , Lipsitz SR.Analysis of longitudinal data with nonignorable non-monotone missing values . Applied Statistics1998; 47: 425-438 .
47.
Nelder JA , Mead R.A simplex method for function maximisation . Computing Journal1965; 7: 303-313 .
48.
Little RJA , Rubin DB.The EM algorithm and extensions. Chichester: John Wiley , 1997.
49.
Rubin DB. Discussion of Diggle, P. J. and Kenward, M. G. : Informative dropout in longitudinal data analysis . Applied Statistics1994; 43: 80-82 .
50.
Little RJA. Discussion of Diggle, P. J. and Kenward, M. G. : Informative dropout in longitudinal data analysis . Applied Statistics1994; 43: 78-78 .
51.
Laird NM. Discussion of Diggle, P. J. and Keuward, M. G. : Informative dropout in longitudinal data analysis . Applied Statistics1994; 43: 84-84 .
52.
Hogan JW , Laird NM.Mixture models for joint distribution of repeated measures and event times . Statistics in Medicine1997; 16: 239-257 .
53.
Wu MC , Carroll RJ.Estimation and comparison of changes in the presence of informative right censoring by modelling the censoring process . Biometrics1988; 44: 175-188 .
54.
Laird NM , Ware JH.Random-effects models for longitudinal data . Biometrics1982; 38: 963-974 .
55.
Schluchter MD.Methods for the analysis of informatively censored longitudinal data . Statistics in Medicine1992; 11: 1861-1870 .
56.
De Gruttola V , Tu XM.Modelling progression of CD4-lymphocyte count and its relationship to survival time . Biometrics1994; 50: 1003-1014 .
57.
Cowles MK , Carlin BP , Connett JE.Bayesian tobit modeling of longitudinal ordinal clinical trial compliance data with nonignorable missingness . Journal of the American Statistical Association1996; 91: 86-98 .
58.
Foliman D , Wu M.An approximate generalized linear model with random effects for informative missing data . Biometrics1995; 51: 151-168 .
59.
Robins JM , Rotnitzky A.Semiparametric efficiency in multivariate regression models with missing data . Journal of the American Statistical Association1995; 90: 122-129 .
60.
Robins JM , Rotnitzky A.Semi-parametric estimation of models for the means and covariances in the presence of missing data . Scandinavian Journal of Statistics1995; 22: 323-333 .
61.
Robins JM , Rotnitzky A , Scharfstein DO.Semiparametric regression for repeated outcomes with non-ignorable non-response . Journal of the American Statistical Association1998; 93: 1321-1339 .
62.
Little RJA.Pattern-mixture models for multivariate incomplete data . Journal of the American Statistical Association1993; 88: 125-134 .
63.
Little RJA , Wang Y.Pattern-mixture models for multivariate incomplete data with covariates . Biometrics1996; 52: 98-111 .
64.
Michiels B , Molenberghs G , Lipsitz S.Selection models and pattern-mixture models for incomplete categorical data with covariates . Biometrics1999; 55. In press.
65.
Wu MC , Bailey KR.estimation and comparison of changes in the presence of informative right censoring: conditional linear model . Biometrics1989; 45: 939-955 .
66.
Mon M , Woolson RF , Woodsworth GG.Application of empirical Bayes inference to estimation of rate of change in the presence of informative right censoring . Statistics in Medicine1992; 11: 621-631 .
67.
Cook RD.Assessment of local influence . Journal of the Royal Statistical Society, Series B1986; 48: 133-169 .