Test-retest correlations can lead to biased reliability estimates when there is instability of the true scores in the interval between tests and/or when the measurement errors are correlated. Using three occasion data on the Test of Standard Written English and essay ratings, an analysis is demonstrated which separates true score instability and correlated errors.
Get full access to this article
View all access options for this article.
References
1.
Breland, H. M.A study of college English placement and the test of Standard Written English. (College Entrance Examination Board Research and Development Reports RDR-76-77, No. 4). Princeton, N. J.: Educational Testing Service, January 1977.
2.
Godshalk, F. I., Swineford, F., and Coffman, W. E.The measurement of writing ability. New York, N. J.: College Entrance Examination Board, 1966.
3.
Jöreskog, K. G. and Sörbon, D.Statistical models and methods for analysis of longitudinal data. (Research Report 75-1. Uppsala, Sweden: Uppsala University, Statistics Department, 1975.
4.
4. Wiley, D. E. and Hornik, R.Measurement error and the analysis of panel data. (Studies of Educative Processes, Report no. 5. Chicago, Illinois : University of Chicago, 1973 .
5.
Costner, H. L. and Schonberg, R.Diagnosing indicator ills in multiple indicator models . In A. S. Goldberger & O. D. Duncan (Eds.), Journal of Mathematical and Statistical Psychology, 1973, 26, 90-97.
6.
Coffman, W. E.Essay examinations. In R. L. Thorndike (Ed.), Educational measurement (2nd ed.). Washington, D. C.: American Council on Education, 1971, 271-302.
7.
Heise, D. R.Separating reliability and stability in test-retest correlation. American Sociological Review; 1969, 34, 93-101.
8.
Humphreys, L. G.Investigations of the simplex . Psychometrika, 196025,313-323.
9.
Humphreys, L. G.The fleeting nature of college academic success. Journal of Educational Psychology, 1968, 59, 375-380.
10.
Jöreskog, K. G.Estimation and testing of simplex models. The British Journal of Mathematical and Statistical Psychology, 1970, 59, 375-380.
11.
Kenny, D. A.Cross-lagged and synchronous common factors in panel data. In A. S. Goldberger and O. D. Duncan, (Eds.), Structural Equation Models In the Social Sciences, New York: Seminar Press, pp. 167-199.
12.
Klein, S. P. and Hart, F. M.Chance and systematic factors affecting essay grades . Journal of Educational Measurement, 1968 , 5, 197-206.
13.
Linn, R. L., Klein, S. P. , and Hart, F. M.The nature and correlates of
14.
law school essay grades. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1972, 32, 267-279.
15.
Lunneborg, C. E. and Lunneborg, P. W.Relations between aptitude changes and academic success during college. Journal of Educational Psychology, 1970, 61, 169-173.
16.
Markham, L. R.Influences of handwriting quality on teacher evaluation of written work. American Educational Research Journal, 1976, 13, 277-283.
17.
Sörbon, D.Detection of correlated errors in longitudinal data. British Journal of Mathematical and Statistical Psychology, 1975, 28, 138-151.
18.
Werts, C. E., Jöreskog, K. G., and Linn, R. L. Comment on "the estimation of measurement error in panel data." American Sociological Review, 1971, 36, 110-113.
19.
Werts, C. E., Jöreskog, K. G., and Linn, R. L.Analyzing ratings with correlated intrajudge measurement error. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1976, 36, 319-328.
20.
Werts, C. E., Linn, R. L. , and Jöreskog, K. G.A simplex model for analyzing academic growth. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1977, 37, 745-756.
21.
Werts, C. E., Linn, R. L. , and Jöreskog, K. G.The reliability of college grades from longitudinal data. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1978 , 38, 89-95.
22.
Wiley, D. E. and Wiley, J. A.The estimation of measurement error in panel data. American Sociological Review, 1970, 35, 112-117.