Sage Journals: Discover world-class research

Abstract

Consider the conventional multilevel model $Y = C γ + Zu + e$ where $γ$ represents fixed effects and $(u, e)$ are multivariate normal random effects. The continuous outcomes $Y$ and covariates $C$ are fully observed with a subset $Z$ of $C$ . The parameters are $θ = (γ, var (u), var (e))$ . Dempster, Rubin and Tsutakawa framed the estimation as a missing data problem, where $(Y, u)$ are the complete data and the random effects $u$ are conceived as missing data. Viewed in this way, the Expectation-Maximization (EM) algorithm has proven to be a natural and popular approach to estimation. However, when $C$ is partially observed or subject to measurement error, it is natural to formulate a multilevel model for $C$ that includes random effects, $ν$ . In this article, we extend this thinking to allow estimation of the joint distribution of data $Y = (Y, C) = (Y_{o}, Y_{m})$ and random effects $b = (u, v)$ from observed data $Y_{o} = (Y_{o}, C_{o})$ and to generate multiple imputations of missing data $(Y_{m}, b)$ based on the estimated distribution under the assumption that the data $Y$ are missing at random. This approach contributes to the literature on multiple imputation in three ways: (a) it allows random effects $ν$ to be conceived as latent covariates, thus addressing measurement errors of $C$ ; (b) it allows non-linearities, including random coefficients, interaction effects, and other polynomial effects involving partially observed covariates; (c) it imputes $(Y_{m}, b)$ using two-step importance sampling. In these cases, the joint distribution of $Y$ is not analytically tractable even if the analytic multilevel model of interest to the analyst follows a multivariate normal distribution. We prove that our method of maximizing the likelihood and imputing missing data ensures compatibility of the nonnormal joint distribution with the analytic normal theory multilevel model via provisionally known random effects. We present and evaluate a sufficient condition under which the produced imputations are compatible with the analytic model.

Keywords

multilevel model compatibility provisionally known random effects maximum likelihood two-step importance sampling measurement error

Get full access to this article

View all access options for this article.

References

Allison

P. D.

(2003). Missing data techniques for structural equation modeling, Journal of Abnormal Psychology, 112, 545–557. https://doi.org/10.1037/0021-843X.112.4.545

Arnold

B. C.

Press

S. J.

(1989). Compatible conditional distributions. Journal of the American Statistical Association, 84, 152–156.

Bartlett

J. W.

Seaman

S. R.

White

I. R.

Carpenter

J. R.

(2015). Multiple imputation of covariates by fully conditional specification: Accommodating the substantive model. Statistical Methods in Medical Research, 24, 462–487.

Bates

Maechler

Bolker

Walker

(2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01

Bodner

T. E.

(2008). What improves with increased missing data imputations? Structural Equation Modeling, 15, 651–175. https://doi.org/10.1080/10705510802339072

Carlin

B. P.

Louis

T. A.

(2009). Bayesian methods for data analysis (3rd ed.). CRC Press.

Carpenter

Kenward

(2013). Multiple imputation and its application. John Wiley & Sons.

Casella

Berger

R. L.

(2002). Statistical inference. Duxbury.

Collins

L. M.

Schafer

J. L.

Kam

(2001). A comparison of inclusive and restrictive strategies in modern missing data procedures. Psychological Methods, 6, 330–351.

10.

Dempster

A. P.

Laird

N. M.

Rubin

D. B.

(1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 76, 1–38.

11.

Dempster

A. P.

Rubin

D. B.

Tsutakawa

R. K.

(1981). Estimation in covariance components models. Journal of the American Statistical Association, 76, 341–353.

12.

Enders

C. K.

(2023). Missing data: An update on the state of the art. Psychological Methods, 30, 322–339. https://doi.org/10.1037/met0000563

13.

Enders

C. K.

Keller

B. T.

(2020). A model-based imputation procedure for multilevel regression models with random coefficients, interaction effects, and nonlinear terms. Psychological Methods, 25(1), 88–112.

14.

Enders

C. K.

Mistler

S. A.

Keller

B. T.

(2016). Multilevel Multiple Imputation: A Review and Evaluation of Joint Modeling and Chained Equations Imputation. Psychological Methods, 21, 222–240.

15.

Gelman

Rubin

D. B.

(1992). Inference from iterative simulation using multiple sequences, Statistical Science, 7, 457–472.

16.

Goldstein

Carpenter

J. R.

Browne

W. J.

(2014). Fitting multilevel multivariate models with missing data in responses and covariates that may Include interactions and non-linear terms. Journal of the Royal Statistical Society Series A-Statistics in Society, 177, 553–564.

17.

Goldstein

Carpenter

Kenward

Levin

(2009). Multilevel models with multivariate mixed response types, Statistical Modeling, 9, 173–197.

18.

Gueorguieva

R. V.

Agresti

(2001). A correlated probit model for joint modeling of clustered binary and continuous responses. Journal of the American Statistical Association, 96(455), 1102–1112.

19.

Hastings

W. K.

(1970). Monte Carlo sampling methods using Markov chains and their applications. Bioametrika, 57, 97–109.

20.

Hedeker

Gibbons

R. D.

(1994). A random-effects ordinal regression model for multilevel analysis. Biometrics, 50, 933–944.

21.

Horton

N. J.

Kleinman

K. P.

(2007). Much ado about nothing: A comparison of missing data methods and software to fit incomplete data regression models. The American Statistician, 61(1), 79–90.

22.

Ibrahim

J. G.

Chen

M. H.

Lipsitz

S. R.

(1999), Monte Carlo EM for missing covariates in parametric regression models. Biometrics, 55, 591–596.

23.

Ibrahim

J. G.

Chen

M. H.

Lipsitz

S. R.

(2002). Bayesian methods for generalized linear models with covariates missing at random. Canadian Journal of Statistics/Revue Canadienne De Statistique, 30, 55–78.

24.

Keller

B. T.

Enders

C. K.

(2021). Blimp user’s guide (Version 3). www.appliedmissingdata.com/multilevel-imputation.html

25.

Kim

Belin

T. R.

Sugar

C. A.

(2018). Multiple imputation with non-additively related variables: Joint-modeling and approximations. Statistical Methods in Medical Research, 27, 1683–1694.

26.

Kim

Sugar

C. A.

Belin

T. R.

(2015). Evaluating model-based imputation methods for missing covariates in regression models with interactions. Statistics in Medicine, 34(11), 1876–1888.

27.

Lee

V. E.

Bryk

A. S.

(1989). A multilevel model of the social distribution of high school achievement, Sociology of Education, 62, 172–192.

28.

Levy

Enders

C. K.

(2023). Full conditional distributions for Bayesian multilevel models with additive or interactive effects and missing data on covariates. Communications in Statistics-Simulation and Computation, 52(7), 28992923.

29.

Little

R. J. A.

Rubin

D. B.

(2002). Statistical analysis with missing data. Wiley.

30.

Liu

Gelman

Hill

Kropko

(2014). On the stationary distribution of iterative imputations. Biometrika, 101, 155–173.

31.

Liu

Taylor

J. M. G.

Belin

T. R.

(2000). Multiple imputation and posterior simulation for multivariate missing data in longitudinal studies. Biometrics, 56, 1157–1163.

32.

Lüdtke

Marsh

H. W.

Robitzsch

Trautwein

Asparouhov

Muthén

(2008). The multilevel latent covariate model: A new, more reliable approach to group-level effects in contextual studies. Psychological Methods, 13(3), 203–229.

33.

Meng

Rubin

(1992). Performing likelihood ratio tests with multiply-imputed data sets. Biometrika, 79, 103–111.

34.

Metropolis

Rosenbluth

A. W.

Rosenbluth

M. N.

Teller

A. H.

Teller

(1953). Equation of state calculations by fast computing machines. The Journal of Chemical Physics, 21(6), 1087–1092.

35.

Miyazaki

Frank

K. A.

(2006). A hierarchical linear model with factor analysis structure at level 2. Journal of Educational and Behavioral Statistics, 31(2), 125–156.

36.

Murray

J. S.

Reiter

J. P.

(2016). Multiple imputation of missing categorical and continuous values via Bayesian mixture models with local dependence. Journal of the American Statistical Association, 111(516), 1466–1479.

37.

Naylor

J. C.

Smith

A. F. M.

(1982). Applications of a method for the efficient computation of posterior distributions, Applied Statistics, 31(3), 214–225.

38.

Olsen

M. K.

Schafer

J. L.

(2001). A two-part random-effects model for semicontinuous longitudinal data, Journal of the American Statistical Association, 96, 730–745.

39.

Owen

Zhou

(2000). Safe and effective importance sampling, Journal of the American Statistical Association, 95, 135–143.

40.

Pinheiro

J. C.

Bates

D. M.

(1995). Approximations to the log-likelihood function in the nonlinear mixed-effects model, Journal of Computational and Graphical Statistics, 4, 12–35.

41.

R Core Team (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/

42.

Rabe-Hesketh

Skrondal

Pickles

(2002). Reliable estimation of generalized linear mixed models using adaptive quadrature. The Stata Journal, 2(1), 1–21.

43.

Raghunathan

T. E

Berglund

Solenberger

P. W.

(2017). Multiple imputation in practice: With examples using IVEware. CRC Press.

44.

Raghunathan

Lepkowski

Van Hoewyk

Solenberger

(2001). A multivariate technique for multiply imputing missing values using a sequence of regression models. Survey Methodology, 27, 85–95.

45.

Raudenbush

S. W.

Bryk

A. S.

(1986). Hierarchical model for studying school effects. Sociology of Education, 59, 1–17.

46.

Raudenbush

S. W.

Bryk

A. S.

(2002). Hierarchical linear models. Sage.

47.

Raudenbush

S. W.

Yang

Yosef

(2000). Maximum likelihood for generalized linear models with nested random effects via high-order, multivariate Laplace approximation. Journal of Computational and Graphical Statistics, 9(1), 141–157.

48.

Rubin

D. B.

(1976). Inference and missing data. Biometrika, 63, 581–592.

49.

Rubin

D. B.

(1987). Multiple imputation for nonresponse in surveys. John Wiley & Sons.

50.

Schafer

J. L.

(1997). Analysis of incomplete multivariate data. Chapman & Hall.

51.

Schafer

J. L.

Graham

J. W.

(2002). Missing data: Our view of the state of the art. Psychological Methods, 7(2), 147–177. https://doi.org/10.1037/1082-989X.7.2.147

52.

Schafer

J. L.

Yucel

R. M.

(2002). Computational strategies for multivariate linear mixed-effects models with missing values. Journal of Computational and Graphical Statistics, 11, 437–457.

53.

Shin

Hagiwara

(2025). Bayesian estimation of hierarchical linear models from incomplete data: Cluster-level interaction effects and small sample sizes, Statistics in Medicine, 44(10–12), e70051.

54.

Shin

Raudenbush

S. W.

(2007). Just-identified versus over-identified two-level hierarchical linear models with missing data. Biometrics, 63, 1262–1268.

55.

Shin

Raudenbush

S. W.

(2010). A latent cluster mean approach to the contextual effects model with missing data. Journal of Educational and Behavioral Statistics, 35, 26–53.

56.

Shin

Raudenbush

S. W.

(2013) Efficient analysis of Q-level nested hierarchical general linear models given ignorable missing data. The International Journal of Biostatistics, 9(1), 109–133.

57.

Shin

Raudenbush

S. W.

(2024). Maximum likelihood estimation of hierarchical linear models from incomplete data: Random coefficients, statistical interactions, and measurement error. Journal of Computational and Graphical Statistics, 33(1), 112–125. https://doi.org/10.1080/10618600.2023.2234414

58.

Reiter

J. P.

(2013). Nonparametric Bayesian multiple imputation for incomplete categorical variables in large-scale assessment surveys. Journal of Educational and Behavioral Statistics, 38(5), 499–521.

59.

Palta

Smith

(2020). Bayesian profiling multiple imputation for missing hemoglobin values in electronic health records. Annals of Applied Statistics, 14(4), 1903–1924. https://doi.org/10.1214/20-AOAS1378

60.

Sun

Shin

Lafata

J. E.

Raudenbush

S. W.

(2024). Variability in causal effects and noncompliance in a multisite trial: A bivariate hierarchical generalized random coefficients model for a binary outcome, Statistics in Medicine, 43(28), 5353–5365. https://doi.org/10.1002/sim.10229

61.

Tourangeau

Nord

Sorongon

AG.

Najarian

(2009). Early Childhood Longitudinal Study, kindergarten class of 1998–99 (ECLS-K), combined user’s manual for the ECLS-K eighth-grade and K–8 full sample data files and electronic codebooks (NCES 2009–004). National Center for Education Statistics, Institute of Education Sciences, US Department of Education.

62.

van Buuren

. (2018). Flexible imputation of missing data. CRC Press.

63.

van Buuren

Groothuis-Oudshoorn

(2011). MICE: Multivariate imputation by chained equations in R. Journal of Statistical Software, 45, 1–67.

64.

van Buuren

Brand

Groothuis-Oudshoorn

Rubin

(2006). Fully conditional specification in multivariate imputation. Journal of Statistical Computation and Simulation, 76, 1049–1064.

65.

von Hippel

P. T.

(2020). How many imputations do you need? A two-stage calculation using a quadratic rule. Sociological Methods & Research, 49(3), 699–718.

66.

Willms

J. D.

(1986). Social class segregation and its relationship to pupils’ examination results in Scotland. American Sociological Review, 51, 224–241.

Multiple Imputation to Estimate Hierarchical Models From Data Missing at Random: Latent Covariates,Random Coefficients,and Statistical Interactions

Abstract

Keywords

Get full access to this article

References