Editor in Chief’s Comment: External Validity in Systematic Reviews

Abstract

Get full access to this article

View all access options for this article.

References

Avellar

S. A.

Thomas

Kleinman

Sama-Miller

Woodruff

S. E.

Coughlin

Westbrook

T. P. R

. (2017). External validity: The next step for systematic reviews? Evaluation Review, 41, 283–325.

Banerjee

A. V.

Chassang

Snowberg

(2017). Decision theoretic approaches to experiment design and external validity. Handbook of Economic Field Experiments, 1, 141–174.

Casella

. (1985). An introduction to empirical Bayes data analysis. The American Statistician, 39, 83–87.

Chen

H. T.

(2010). The bottom-up approach to integrative validity: A new perspective for program evaluation. Evaluation and Program Planning, 33, 205–214.

Collins

A. M.

Briefel

Klerman

J. A.

Wolf

Rowe

Logan

… Lyskawa

. (2016). Summer Electronic Benefit Transfer for Children (SEBTC) Demonstration: Summary Report 2011-2014 (Summary; No. 15b5c7b9b0d7491fa0b5bcea08d978f9). Mathematica Policy Research. Cambridge MA, USA.

Cook

T. D.

(2014). Generalizing causal knowledge in the policy sciences: External validity as a task of both multiattribute representation and multiattribute extrapolation. Journal of Policy Analysis and Management, 33, 527–536.

Hedges

L. V.

Olkin

(2014). Statistical methods for meta-analysis. Orlando, FL: Academic Press.

Jaciw

A. P.

Lin

(2016). An empirical study of design parameters for assessing differential impacts for students in group randomized trials. Evaluation Review, 40, 410–443.

Kern

H. L.

Stuart

E. A.

Hill

Green

D. P.

(2016). Assessing methods for generalizing experimental impact estimates to target populations. Journal of Research on Educational Effectiveness, 9, 103–127.

10.

Klerman

J. A

. (2017). Special issue editor’s overview essay: Systematic reviews. Evaluation Review, 41, 175–182.

11.

Leviton

Trujillo

. (2017). Interaction of theory and practice. Evaluation Review, 41, 436–471.

12.

Michalopoulos

Schwartz

Adams-Ciardullo

(2000). National evaluation of welfare-to-work strategies: What works best for whom: Impacts of 20 welfare-to-work programs by subgroup: Executive summary. Washington, DC: U.S. Department of Education, Office of Educational Research and Improvement, Education Resources Information Center.

13.

Morris

C. N.

(1983). Parametric empirical Bayes inference: Theory and applications. Journal of the American Statistical Association, 78, 47–55.

14.

Olsen

R. B.

Bein

Judkins

. (2017). Sample size requirements for education multi-site RCTs that select sites randomly. Unpublished manuscript.

15.

Olsen

R. B.

Orr

L. L.

(2016). On the “where” of social experiments: Selecting more representative samples to inform policy. New Directions for Evaluation, 152, 61–71.

16.

Olsen

R. B.

Orr

L. L.

Bell

S. H.

Stuart

E. A.

(2013). External validity in policy evaluations that choose sites purposively. Journal of Policy Analysis and Management, 32, 107–121.

17.

Paulsell

Thomas

Monahan

Seftor

N. S

. (2017). A trusted source of information: How systematic reviews can support user decisions about adopting evidence-based programs. Evaluation Review, 41, 50–77.

18.

Rothwell

P. M.

(2005). Subgroup analysis in randomised controlled trials: Importance, indications, and interpretation. The Lancet, 365, 176–186.

19.

Stuart

E. A.

Bradshaw

C. P.

Leaf

P. J.

(2014). Assessing the generalizability of randomized trial results to target populations. Prevention Science, 16, 1–11.

20.

Stuart

E. A.

Cole

S. R.

Bradshaw

C. P.

Leaf

P. J.

(2011). The use of propensity scores to assess the generalizability of results from randomized trials. Journal of the Royal Statistical Society, Series A, Part 2, 174, 369–386.

21.

Stuart

Rhodes

. (2017). Generalizing treatment effect estimates from sample to population: A case study in the difficulties of finding sufficient data. Evaluation Review, 41, 357–388.

22.

Tipton

(2013). Improving generalizations from experiments using propensity score subclassification: Assumptions, properties, and contexts. Journal of Educational and Behavioral Statistics, 38, 239–266.

23.

Tipton

(2014). How generalizable is your experiment? Comparing a sample and population through a generalizability index. Journal of Educational and Behavioral Statistics, 39, 478–501.

24.

Tipton

Hallberg

Hedges

Chan

. (2017). Implications of small samples for generalization: Adjustments and rules of thumb. Evaluation Review.

25.

Tipton

Peck

. (2017). A design-based approach to improve external validity in welfare policy evaluations. Evaluation Review, 41, 326–356.

26.

Valentine

Wilson

Rindskopf

Lau

Tanner-Smith

Yeide

… Foster

. (2017). Synthesizing evidence in public policy contexts: The challenge of synthesis when there are only a few studies. Evaluation Review, 41, 3–26.

27.

Yusuf

Wittes

Probstfield

Tyroler

H. A.

(1991). Analysis and interpretation of treatment effects in subgroups of patients in randomized clinical trials. Journal of the American Medical Association, 266, 93–98.