Sage Journals: Discover world-class research

Abstract

Evaluators report effects of education initiatives as standardized effect sizes, a scale that has merits but obscures interpretation of the effects’ practical importance. Consequently, educators and policymakers seek more readily interpretable translations of evaluation results. One popular metric is the number of years of learning necessary to induce the effect. We compare years of learning to three other translation options: benchmarking against other effect sizes, converting to percentile growth, and estimating the probability of scoring above a proficiency threshold. After enumerating the desirable properties of translations, we examine each option’s strengths and weaknesses. We conclude that years of learning performs worst, and percentile gains performs best, making it our recommended choice for more interpretable translations of standardized effects.

Keywords

education program effects education policy program evaluation research utilization standardized effect sizes translation of research results years of learning

Get full access to this article

View all access options for this article.

References

Bloom

H. S.

Hill

C. J.

Black

A. B.

Lipsey

M. W.

(2008). Performance trajectories and performance gaps as achievement effect-size benchmarks for educational interventions. Journal of Research on Educational Effectiveness, 1(4), 289–328.

Briggs

D. C.

(2013). Measuring growth with vertical scales. Journal of Educational Measurement, 50(2), 204–226.

Childress

Amrofell

(2017). Reimaging learning: A big bet on the future of American education. Accessed September 7, 2017 from http://www.newschools.org/bigbet/.

Dadey

Briggs

D. C.

(2012). A meta-analysis of growth trends from vertically scaled assessments. Practical Assessment, Research & Evaluation, 17(14).

Dorn

(2015, September 24). “Weeks/days of learning” is well-intended bad interpretative factoid [Blog post]. Sherman Dorn: Work to understand how schools have been social institutions. Retrieved from http://shermandorn.com/wordpress/?p=8079.

Hanushek

E. A.

Woessmann

Peterson

P. E.

(2012). Is the US catching up? Education Next, 12(4).

Hedges

L. V.

(1981). Distribution theory for Glass’s estimator of effect size and related estimators. Journal of Educational Statistics, 6(2), 107–128.

Hill

C. J.

Bloom

H. S.

Black

A. R.

Lipsey

M. W.

(2008). Empirical benchmarks for interpreting effect sizes in research. Child Development Perspectives, 2(3), 172–177.

Krueger

A. B.

(1999). Experimental estimates of education production functions. The Quarterly Journal of Economics, 114(2), 497–532.

10.

Lee

Finn

Liu

(2018). Time-indexed effect size for educational research and evaluation: Reinterpreting program effects and achievement gaps in K–12 reading and math. Journal of Experimental Education. doi:10.1080/00220973.2017.1409183

11.

Lipsey

M. W.

Puzio

Yun

Hebert

M. A.

Steinka-Fry

Cole

M. W.

. . . Busick

M. D.

(2012). Translating the statistical representation of the effects of education interventions into more readily interpretable forms. (NCSER 2013-3000). Washington, DC: National Center for Special Education Research, Institute of Education Sciences, U.S. Department of Education.

12.

Luyten

Merrell

Tymms

(2017). The contribution of schooling to learning gains of pupils in Years 1 to 6. School Effectiveness and School Improvement, 28(3), 374–405.

13.

Martineau

J. A.

(2006). Distorting value added: The use of longitudinal, vertically scaled student achievement data for growth-based, value-added accountability. Journal of Educational and Behavioral Statistics, 31(1), 35–62.

14.

Maul

McClelland

(2013). Review of National Charter School Study 2013. Retrieved from National Education Policy Center, University of Colorado Boulder: https://nepc.colorado.edu/thinktank/review-credo-2013

15.

National Academies of Sciences, Engineering, and Medicine. (2017). Evaluation of the achievement levels for mathematics and reading on the national assessment of educational progress. Washington, DC: The National Academies Press. doi:10.17226/23409

16.

Northwest Evaluation Association. (2015). Smarter balanced preliminary performance levels: Estimated map scores corresponding to the preliminary performance levels of the Smarter Balanced Assessment Consortium. Portland, OR. Retrieved from https://www.nwea.org/content/uploads/2015/01/SBAC-Preliminary-Cut-Scores-MAY15.pdf.

17.

Pane

J. F.

Steiner

E. D.

Baird

M. D.

Hamilton

L. S.

(2015). Continued progress: Promising evidence on personalized learning. Santa Monica, CA: RAND Corporation.

18.

Pane

J. F.

Steiner

E. D.

Baird

M. D.

Hamilton

L. S.

Pane

J. D.

(2017). Informing progress: insights on personalized learning implementation and effects. Santa Monica, CA: RAND Corporation.

19.

Quinn

D. M.

Polikoff

(2017). Summer learning loss: What is it, and what can we do about it? Retrieved from Brookings Institution: http://brook.gs/2wWJKIN.

20.

U.S. Department of Education. (2014). What Works Clearinghouse: Procedures and standards handbook (Version 3.0): Institute of Education Sciences.

21.

Woodworth

Raymond

Chirbas

Gonzalez

Negassi

Snow

Von Donge

(2015). Online charter school study 2015. Center for Research on Educational Outcomes. Accessed September 7, 2017 from https://credo.stanford.edu/pdfs/Online%20Charter%20Study%20Final.pdf.

22.

Yen

(1986). The choice of scale for educational measurement: An IRT perspective. Journal of Educational Measurement, 23(4), 299–325.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.12 MB

Translating Standardized Effects of Education Programs Into More Interpretable Metrics

Abstract

Keywords

Get full access to this article

References

Supplementary Material