Abstract
Background/Context:
Public monitoring of educational progress and inequality often involves tracking changes in the percentage of “proficient” students across groups and over time. These trends are important signals of state and district provision of educational opportunity. I show how known flaws of this percentage metric, sometimes assumed to be negligible, interacted with COVID-19 pandemic conditions to create a “perfect storm,” masking real declines and growing inequality. I use this to motivate three metrics necessary for fuller documentation of educational progress and inequality when tested populations change.
Purpose/Objective/Research Question/Focus of Study:
I present three metrics for measuring and contextualizing changes in educational achievement over time: the “match rate,” the “fair trend,” and the “equity check.” Just as doctors or pilots rely on multiple instruments to diagnose and navigate, I argue that these three metrics are necessary for a holistic understanding of educational progress when tested populations change. I show how neglecting these metrics leads to misclassification of schools that do and do not need support. These metrics have their foundations in the statistical literature on missing data and causal inference. I adapt them to the context of public reporting of educational test scores for monitoring educational equity.
Research Design:
I use publicly available data from the California Department of Education from 2019 through 2022 to show how poor reporting metrics led state officials to conclude that test score gaps were closing when they were in fact widening. Drawing from statistical theory, I show how these issues generalize to other contexts. I use statistical models to define three metrics that avoid biases and provide necessary context when tested populations change. The first metric is a percentage I call the “match rate.” The second and third metrics, the “fair trend” and the “equity check,” are regression-adjusted trends for changing populations. I explain how education officials can use these metrics to improve diagnosis, like doctors supplementing a patient’s pulse with their temperature, blood pressure, and oxygen saturation.
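The core composition problem described above can be illustrated with a small, entirely hypothetical example. The sketch below uses made-up student records (not the paper's data) and a simplified matched-sample comparison standing in for the regression-adjusted “fair trend”; the actual metrics are defined by statistical models, so this is only a minimal sketch of why a “match rate” and an adjusted trend matter when the tested population changes.

```python
# Hypothetical student scores by id; none of this reflects real data.
prior = {"s1": 310, "s2": 295, "s3": 330, "s4": 305}   # earlier-year scores
current = {"s1": 320, "s2": 300, "s5": 280}            # later-year scores

# Match rate: share of current test takers also tested in the prior year.
matched = [sid for sid in current if sid in prior]
match_rate = len(matched) / len(current)
print(f"match rate: {match_rate:.0%}")

# Naive trend: change in unadjusted means, ignoring who was tested.
naive_trend = (sum(current.values()) / len(current)
               - sum(prior.values()) / len(prior))

# Adjusted trend (in the spirit of a "fair trend"): restrict both years to
# the matched students, so composition change does not drive the result.
adj_trend = (sum(current[s] for s in matched) / len(matched)
             - sum(prior[s] for s in matched) / len(matched))
print(f"naive trend: {naive_trend:+.1f}, matched-sample trend: {adj_trend:+.1f}")
```

Here the unadjusted means fall while the matched students actually improve: the naive trend reflects which students showed up for testing, not changes in achievement, which is exactly the distortion the three metrics are designed to expose.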
Conclusions/Recommendations:
Public reporting using simple metrics like “percent proficient” only yields defensible trend interpretations under conditions that are increasingly narrow and rare: when leaders care only about progress for a single stable population of students. The predictable biases and distortions of percent-proficient metrics necessitate more sophisticated metrics, simply explained, as complements. States and testing agencies should report metrics like the three I propose for transparency in technical documentation and wield them for decision-making, particularly when monitoring equity for changing populations.
