The measurement and analysis of individual change is one of the most enduring statistical problems in research on the efficacy of early intervention programs. This article describes conceptual and statistical issues in three approaches to the measurement of change: change scores, indexes of change, and residual change scores. Advantages and limitations of each approach are reviewed. Problems with the reliability of change scores are highlighted. Criteria are presented that can be used in selecting among analysis strategies.
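As an illustration of two of the approaches named above, the following minimal sketch computes simple change scores and residual change scores for paired pretest and posttest values. It also evaluates the classical test theory expression for the reliability of a difference score, which assumes uncorrelated measurement errors. The function names, the sample data, and the use of ordinary least squares for the residual scores are assumptions made for illustration; they are not taken from the article.

```python
from statistics import mean


def change_scores(pre, post):
    """Simple change (gain) score: posttest minus pretest."""
    return [y - x for x, y in zip(pre, post)]


def residual_change_scores(pre, post):
    """Residuals from an ordinary least squares regression of post on pre.

    Each value is the part of the posttest score not linearly
    predicted by the pretest score.
    """
    mx, my = mean(pre), mean(post)
    sxx = sum((x - mx) ** 2 for x in pre)
    sxy = sum((x - mx) * (y - my) for x, y in zip(pre, post))
    slope = sxy / sxx
    intercept = my - slope * mx
    return [y - (intercept + slope * x) for x, y in zip(pre, post)]


def difference_score_reliability(r_xx, r_yy, r_xy, sd_x, sd_y):
    """Classical reliability of the difference score D = Y - X.

    r_xx, r_yy: reliabilities of pretest and posttest
    r_xy:       observed pretest-posttest correlation
    sd_x, sd_y: observed standard deviations
    Assumes measurement errors are uncorrelated across occasions.
    """
    cov = r_xy * sd_x * sd_y
    true_var = r_xx * sd_x ** 2 + r_yy * sd_y ** 2 - 2 * cov
    observed_var = sd_x ** 2 + sd_y ** 2 - 2 * cov
    return true_var / observed_var


# Hypothetical data: four children's pretest and posttest scores.
pre = [10, 12, 14, 16]
post = [13, 14, 18, 19]

gains = change_scores(pre, post)
residuals = residual_change_scores(pre, post)
rel = difference_score_reliability(0.8, 0.8, 0.7, 1.0, 1.0)
```

With equal variances, component reliabilities of 0.8, and a pretest-posttest correlation of 0.7, the expression gives a difference score reliability of about 0.33. This shows how two fairly reliable measures can produce a much less reliable change score when they are highly correlated.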