AvellarS. A.ThomasJ.KleinmanR.Sama-MillerE.WoodruffS. E.CoughlinR.WestbrookT. P. R. (2017). External validity: The next step for systematic reviews?Evaluation Review, 41, 283–325.
2.
BanerjeeA. V.ChassangS.SnowbergE. (2017). Decision theoretic approaches to experiment design and external validity. Handbook of Economic Field Experiments, 1, 141–174.
3.
CasellaG. (1985). An introduction to empirical Bayes data analysis. The American Statistician, 39, 83–87.
4.
ChenH. T. (2010). The bottom-up approach to integrative validity: A new perspective for program evaluation. Evaluation and Program Planning, 33, 205–214.
5.
CollinsA. M.BriefelR.KlermanJ. A.WolfA.RoweG.LoganC…LyskawaJ. (2016). Summer Electronic Benefit Transfer for Children (SEBTC) Demonstration: Summary Report 2011-2014 (Summary; No. 15b5c7b9b0d7491fa0b5bcea08d978f9). Mathematica Policy Research. Cambridge MA, USA.
6.
CookT. D. (2014). Generalizing causal knowledge in the policy sciences: External validity as a task of both multiattribute representation and multiattribute extrapolation. Journal of Policy Analysis and Management, 33, 527–536.
JaciwA. P.LinL.MaB. (2016). An empirical study of design parameters for assessing differential impacts for students in group randomized trials. Evaluation Review, 40, 410–443.
9.
KernH. L.StuartE. A.HillJ.GreenD. P. (2016). Assessing methods for generalizing experimental impact estimates to target populations. Journal of Research on Educational Effectiveness, 9, 103–127.
10.
KlermanJ. A. (2017). Special issue editor’s overview essay: Systematic reviews. Evaluation Review, 41, 175–182.
11.
LevitonL.TrujilloM. (2017). Interaction of theory and practice. Evaluation Review, 41, 436–471.
12.
MichalopoulosC.SchwartzC.Adams-CiardulloD. (2000). National evaluation of welfare-to-work strategies: What works best for whom: Impacts of 20 welfare-to-work programs by subgroup: Executive summary. Washington, DC: U.S. Department of Education, Office of Educational Research and Improvement, Education Resources Information Center.
13.
MorrisC. N. (1983). Parametric empirical Bayes inference: Theory and applications. Journal of the American Statistical Association, 78, 47–55.
14.
OlsenR. B.BeinE.JudkinsD. (2017). Sample size requirements for education multi-site RCTs that select sites randomly. Unpublished manuscript.
15.
OlsenR. B.OrrL. L. (2016). On the “where” of social experiments: Selecting more representative samples to inform policy. New Directions for Evaluation, 152, 61–71.
16.
OlsenR. B.OrrL. L.BellS. H.StuartE. A. (2013). External validity in policy evaluations that choose sites purposively. Journal of Policy Analysis and Management, 32, 107–121.
17.
PaulsellD.ThomasJ.MonahanS.SeftorN. S. (2017). A trusted source of information: How systematic reviews can support user decisions about adopting evidence-based programs. Evaluation Review, 41, 50–77.
18.
RothwellP. M. (2005). Subgroup analysis in randomised controlled trials: Importance, indications, and interpretation. The Lancet, 365, 176–186.
19.
StuartE. A.BradshawC. P.LeafP. J. (2014). Assessing the generalizability of randomized trial results to target populations. Prevention Science, 16, 1–11.
20.
StuartE. A.ColeS. R.BradshawC. P.LeafP. J. (2011). The use of propensity scores to assess the generalizability of results from randomized trials. Journal of the Royal Statistical Society, Series A, Part 2, 174, 369–386.
21.
StuartE.RhodesA. (2017). Generalizing treatment effect estimates from sample to population: A case study in the difficulties of finding sufficient data. Evaluation Review, 41, 357–388.
22.
TiptonE. (2013). Improving generalizations from experiments using propensity score subclassification: Assumptions, properties, and contexts. Journal of Educational and Behavioral Statistics, 38, 239–266.
23.
TiptonE. (2014). How generalizable is your experiment? Comparing a sample and population through a generalizability index. Journal of Educational and Behavioral Statistics, 39, 478–501.
24.
TiptonE.HallbergK.HedgesL.ChanW. (2017). Implications of small samples for generalization: Adjustments and rules of thumb. Evaluation Review.
25.
TiptonE.PeckL. (2017). A design-based approach to improve external validity in welfare policy evaluations. Evaluation Review, 41, 326–356.
26.
ValentineJ.WilsonS.RindskopfD.LauT.Tanner-SmithE.YeideM.…FosterL. (2017). Synthesizing evidence in public policy contexts: The challenge of synthesis when there are only a few studies. Evaluation Review, 41, 3–26.
27.
YusufS.WittesJ.ProbstfieldJ.TyrolerH. A. (1991). Analysis and interpretation of treatment effects in subgroups of patients in randomized clinical trials. Journal of the American Medical Association, 266, 93–98.