On Ignoring the Random Effects Assumption in Multilevel Models: Review, Critique, and Recommendations

Abstract

Entities such as individuals, teams, or organizations can vary systematically from one another. Researchers typically model such data using multilevel models, assuming that the random effects are uncorrelated with the regressors. Violating this testable assumption, which is often ignored, creates an endogeneity problem and thus prevents causal interpretation. Focusing on two-level models, we explain how researchers can avoid this problem by including cluster means of the Level 1 explanatory variables as controls; we explain this point conceptually and with a large-scale simulation. We further show why the common practice of centering the predictor variables is mostly unnecessary. Moreover, to examine the state of the science, we reviewed 204 randomly drawn articles from macro and micro organizational science and applied psychology journals, finding that only 106 articles, with a slightly higher proportion from macro-oriented fields, properly dealt with the random effects assumption. Alarmingly, most models also failed on the usual exogeneity requirement of the regressors, leaving only 25 mostly macro-level articles that potentially reported trustworthy multilevel estimates. We offer a set of practical recommendations for researchers to model multilevel data appropriately.
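The cluster-mean remedy described above can be illustrated with a small simulation. The sketch below is not the article's own simulation; it is a minimal, standard-library Python example under stated assumptions: a balanced two-level design, a true Level 1 slope of 1.0, a cluster effect that is deliberately correlated with the predictor, and ordinary least squares in place of a full random-effects (GLS) estimator. All function names, sample sizes, and parameter values are hypothetical choices made for illustration.

```python
import random
from statistics import fmean


def simulate(n_clusters=300, n_per=10, beta=1.0, seed=42):
    """Two-level data in which the cluster effect u_j is correlated with x."""
    rng = random.Random(seed)
    rows = []
    for j in range(n_clusters):
        u = rng.gauss(0.0, 1.0)                 # cluster (Level 2) effect
        for _ in range(n_per):
            x = 0.8 * u + rng.gauss(0.0, 1.0)   # x depends on u: assumption violated
            y = beta * x + u + rng.gauss(0.0, 1.0)
            rows.append((j, x, y))
    return rows


def cluster_means(ids, values):
    """Return, for every observation, the mean of `values` in its cluster."""
    totals = {}
    for j, v in zip(ids, values):
        s, n = totals.get(j, (0.0, 0))
        totals[j] = (s + v, n + 1)
    return [totals[j][0] / totals[j][1] for j in ids]


def slope(y, x):
    """OLS slope of y on a single regressor x (intercept included)."""
    mx, my = fmean(x), fmean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx


def slopes_two(y, a, b):
    """OLS slopes of y on regressors a and b (intercept included)."""
    my, ma, mb = fmean(y), fmean(a), fmean(b)
    saa = sum((ai - ma) ** 2 for ai in a)
    sbb = sum((bi - mb) ** 2 for bi in b)
    sab = sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b))
    say = sum((ai - ma) * (yi - my) for ai, yi in zip(a, y))
    sby = sum((bi - mb) * (yi - my) for bi, yi in zip(b, y))
    det = saa * sbb - sab ** 2                  # determinant of the 2x2 normal equations
    return (say * sbb - sby * sab) / det, (sby * saa - say * sab) / det


def compare_estimators(rows):
    ids = [j for j, _, _ in rows]
    x = [xi for _, xi, _ in rows]
    y = [yi for _, _, yi in rows]
    xbar = cluster_means(ids, x)
    ybar = cluster_means(ids, y)
    xc = [xi - mi for xi, mi in zip(x, xbar)]   # cluster-mean-centered x
    yc = [yi - mi for yi, mi in zip(y, ybar)]   # cluster-mean-centered y
    return {
        "naive": slope(y, x),                   # ignores the cluster effect
        "mundlak": slopes_two(y, x, xbar)[0],   # raw x plus cluster mean as control
        "centered": slopes_two(y, xc, xbar)[0], # centered x plus cluster mean
        "within": slope(yc, xc),                # fixed-effects (within) estimate
    }


if __name__ == "__main__":
    for name, value in compare_estimators(simulate()).items():
        print(f"{name:>9}: {value:.3f}")
```

In this setup the naive slope lands noticeably above the true value of 1.0, whereas adding the cluster mean as a control recovers a slope that coincides with the within (fixed-effects) estimate. The "centered" row gives the same Level 1 slope as the raw-predictor row, which is one way to see why centering adds little once the cluster means are in the model.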

Keywords

random effects, fixed effects, multilevel, HLM, endogeneity, centering
