Abstract
Widely used methods for analyzing missing data can be biased in small samples. To understand these biases, we evaluate in detail the situation where a small univariate normal sample, with values missing at random, is analyzed using either observed-data maximum likelihood (ML) or multiple imputation (MI). We evaluate two types of MI: the usual Bayesian approach, which we call posterior draw (PD) imputation, and a little-used alternative, which we call ML imputation, in which values are imputed conditionally on an ML estimate. We find that observed-data ML is more efficient and has lower mean squared error than either type of MI. Between the two types of MI, ML imputation is more efficient than PD imputation, and ML imputation also has less potential for bias in small samples. The bias and efficiency of PD imputation can be improved by a change of prior.
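The two imputation schemes contrasted above can be sketched for a univariate normal sample. This is a minimal illustration, not the authors' implementation: it assumes missingness that carries no information (so the observed values alone identify the parameters), uses the standard Jeffreys prior for the PD posterior draws, and invents the helper names `ml_imputations` and `pd_imputations`. ML imputation fixes the parameters at their observed-data ML estimates before imputing; PD imputation first draws the parameters from their posterior, so each imputation reflects parameter uncertainty.

```python
import numpy as np

rng = np.random.default_rng(0)

def ml_imputations(obs, n_mis, m=5, rng=rng):
    """ML imputation: fix (mu, sigma) at the observed-data ML estimates,
    then draw every imputed value from N(mu_hat, sigma_hat^2)."""
    mu_hat = obs.mean()
    sigma_hat = obs.std(ddof=0)  # ML estimate of sigma (divide by n)
    return [rng.normal(mu_hat, sigma_hat, n_mis) for _ in range(m)]

def pd_imputations(obs, n_mis, m=5, rng=rng):
    """Posterior-draw (PD) imputation under the Jeffreys prior
    p(mu, sigma^2) proportional to 1/sigma^2: draw (mu, sigma^2) from the
    posterior, then impute from N(mu, sigma^2)."""
    n = len(obs)
    sum_sq = n * obs.var(ddof=0)  # sum of squared deviations from the mean
    imps = []
    for _ in range(m):
        # sigma^2 | data is scaled inverse chi-square with n - 1 d.f.
        sigma2 = sum_sq / rng.chisquare(n - 1)
        # mu | sigma^2, data ~ N(xbar, sigma^2 / n)
        mu = rng.normal(obs.mean(), np.sqrt(sigma2 / n))
        imps.append(rng.normal(mu, np.sqrt(sigma2), n_mis))
    return imps

# Toy data: a small normal sample with some values deleted at random.
x = rng.normal(10.0, 2.0, size=15)
miss = rng.random(15) < 0.3
obs, n_mis = x[~miss], int(miss.sum())
```

The key difference is visible in the code: `ml_imputations` draws all `m` imputed data sets from one fixed parameter estimate, while `pd_imputations` redraws `(mu, sigma^2)` for each imputation, which is what introduces the extra between-imputation variability (and, in small samples, the extra potential for bias) discussed in the abstract.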