The Dance of the Mechanisms: How Observed Information Influences the Validity of Missingness Assumptions

Abstract

Missing data in scientific research go hand in hand with assumptions about the nature of the missingness. When dealing with missing values, a set of beliefs has to be formulated about the extent to which the observed data may also hold for the missing parts of the data. It is vital that the validity of these missingness assumptions is verified and tested, and that the assumptions are adjusted when necessary. In this article, we demonstrate how observed data structures could a priori indicate whether it is likely that our beliefs about the missingness can be trusted. To this end, we simulate complete data and generate missing values according to several types of MCAR, MAR, and MNAR mechanisms. We demonstrate that in scenarios where the data correlations are either low or very substantial, strictly different mechanisms yield equivalent statistical inferences. In addition, we show that the choice of the quantity of scientific interest and the distribution of the nonresponse together govern the validity of the missingness assumptions.
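The simulation idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure (the article relies on a multivariate amputation approach in R); it is a simplified bivariate stand-in under stated assumptions: two standard normal variables with correlation `rho`, a logistic missingness model with a hypothetical `slope` of 2, roughly 50% missingness, and the complete-case mean of `y` as the quantity of interest. All function names are illustrative.

```python
import math
import random
import statistics


def simulate_pair(n, rho, rng):
    """Draw n (x, y) pairs from a standard bivariate normal with correlation rho."""
    data = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        e = rng.gauss(0.0, 1.0)
        # y has unit variance and correlation rho with x
        y = rho * x + math.sqrt(1.0 - rho ** 2) * e
        data.append((x, y))
    return data


def ampute(data, mechanism, rng, slope=2.0):
    """Return the y values with some entries replaced by None.

    MCAR: every y is missing with constant probability 0.5.
    MAR:  the missingness probability depends on the always-observed x.
    MNAR: the missingness probability depends on y itself.
    The logistic form and the slope are assumptions made for this sketch.
    """
    out = []
    for x, y in data:
        if mechanism == "MCAR":
            p = 0.5
        elif mechanism == "MAR":
            p = 1.0 / (1.0 + math.exp(-slope * x))
        elif mechanism == "MNAR":
            p = 1.0 / (1.0 + math.exp(-slope * y))
        else:
            raise ValueError(f"unknown mechanism: {mechanism}")
        out.append(None if rng.random() < p else y)
    return out


def observed_mean(values):
    """Complete-case mean: average of the non-missing values."""
    return statistics.fmean(v for v in values if v is not None)


def bias_table(rhos=(0.0, 0.5, 0.9), n=20000, seed=1):
    """Bias of the complete-case mean of y, per correlation and mechanism."""
    rng = random.Random(seed)
    table = {}
    for rho in rhos:
        data = simulate_pair(n, rho, rng)
        true_mean = statistics.fmean(y for _, y in data)
        table[rho] = {
            m: observed_mean(ampute(data, m, rng)) - true_mean
            for m in ("MCAR", "MAR", "MNAR")
        }
    return table


if __name__ == "__main__":
    for rho, row in bias_table().items():
        print(rho, {m: round(b, 3) for m, b in row.items()})
```

In this toy setting, the MAR bias of the complete-case mean is close to the MCAR bias when `rho` is 0 and moves toward the MNAR bias as `rho` grows, which is one simple way to see how the correlation structure can blur the practical distinction between mechanisms. A different quantity of interest, or an analysis that conditions on `x`, may behave differently.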

Keywords

missing data methodology, missingness assumptions, multivariate amputation


