Technical Adequacy and Cost Benefit of Four Measures of Early Literacy

Abstract

Technical adequacy and information/cost return were examined for four early reading measures: the Dynamic Indicators of Basic Early Literacy Skills (DIBELS), STAR Early Literacy (SEL), Group Reading Assessment and Diagnostic Evaluation (GRADE), and the Texas Primary Reading Inventory (TPRI). All four assessments were administered to the same students in each of Grades K through 2 over a 5-week period; the samples included 200 students per grade from 7 states. Both SEL and DIBELS were administered twice to establish their retest reliability in each grade. We focused on the convergent validity of each assessment for measuring five critical components of reading development identified by the U.S. National Reading Panel: phonemic awareness, phonics, vocabulary, comprehension, and fluency. Both DIBELS and TPRI are asserted to assess all five of these components; GRADE and STAR Early Literacy explicitly measure all except fluency. For all components, correlations among relevant subtests were high and comparable. The pattern of intercorrelations of nonfluency measures with fluency suggests that the tests of fluency, vocabulary, comprehension, and word reading are measuring the same underlying construct. A separate cost-benefit study was conducted and showed that STAR Early Literacy was the most cost-effective measure among those studied. In terms of the time required per test administration and the demands on teachers' time, computerized adaptive testing (CAT) in general, and STAR Early Literacy in particular, is an attractive option for early reading assessment.

Keywords

early literacy, assessment, computerized adaptive testing
