Brennan, R.L. (2001). An essay on the history and future of reliability from the perspective of replications. Journal of Educational Measurement, 38, 295-317.
2.
Dorans, N. J. (Ed.). (2004). Assessing the population sensitivity of equating functions [Special issue] . Journal of Educational Measurement, 41(1).
3.
Dorans, N.J., & Feigenbaum, M.D. (1994). Equating issues engendered by changes to the SAT and PSAT/NMSQT. In I. M. Lawrence, N. J. Dorans, M. D. Feigenbaum, N. J. Feryok, A. P. Schmitt, & N. K. Wright (Eds.), Technical issues related to the introduction of the new SAT®and PSAT/NMSQT® (ETS Research Memorandum No. RM-94-10). Princeton, NJ: Educational Testing Service.
4.
Dorans, N.J., & Holland, P.W. (2000). Population invariance and equatability of tests: Basic theory and the linear case. Journal of Educational Measurement , 37, 281-306.
5.
Dorans, N.J., Liu, J., & Hammond, S. (2008). Anchor test type and population invariance: An exploration across subpopulations and test administrations. Applied Psychological Measurement, 32, 81-97.
6.
Feldt, L.S., & Brennan, R.L. (1989). Reliability. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 105-146). New York: Macmillan .
7.
Feuer, M.J., Holland, P.W., Green, B.F., Bertenthal, M.W., & Hemphill, F.C. (1999). Uncommon measures: Equivalence and linkage among educational tests. Washington, DC: National Academy Press.
8.
Flanagan, J.C. (1951). Units, scores, and norms. In E. F. Lindquist (Ed.), Educational measurement (pp. 695-763). Washington, DC: American Council on Education.
9.
Kolen, M.J. (2004). Population invariance in equating: Concept and history. Journal of Educational Measurement, 41, 3-14.
10.
Kolen, M.J., & Brennan, R.L. (2004). Test equating, scaling, and linking: Methods and practices (2nd ed.). New York : Springer-Verlag.
11.
Koretz, D. M., Bertenthal, M. W., & Green, B. F. (Eds.). (1999). Embedding questions: The pursuit of a common measure in uncommon tests. Washington, DC: National Research Council.
12.
Linn, R.L. (1993). Linking results of distinct assessments. Applied Measurement in Education, 6(1), 83-102.
13.
Liu, M. & Holland, P.W. (2008). Exploring Population Sensitivity of Linking Functions Across Three Law School Admission Test Administrations. Applied Psychological Measurement, 32, 27-44.
14.
Lord, F.M., & Wingersky, M.S. (1984). Comparison of IRT true-score and equipercentile observed-score ``equatings.'' Applied Psychological Measurement, 8, 452-461.
15.
Mislevy, R.L. (1992). Linking educational assessments: Concepts, issues, methods, and prospects (Policy Information Report). Princeton, NJ: Educational Testing Service.
16.
Pommerich, M., & Dorans, N. J. (Eds.). (2004). Concordance [Special issue]. Applied Psychological Measurement, 28(4).
17.
von Davier, A.A., Holland, P.W., & Thayer, D.T. (2003). Population invariance and chain versus post-stratification equating methods. In N. J. Dorans (Ed.), Population invariance of score linking: Theory and applications to Advanced Placement Program®examinations (ETS Research Rep. No. RR-03-27, pp. 19-36). Princeton, NJ: Educational Testing Service.
18.
von Davier, A.A., Holland, P.W., & Thayer, D.T. (2004). The chain and post-stratification methods for observed-score equating and their relationship to population invariance. Journal of Educational Measurement, 41, 15-32.
19.
von Davier, A.A., & Wilson, C. (2005). A didactic approach to the use of IRT true score equating (ETS Research Rep. No. RR-05-26). Princeton, NJ : Educational Testing Service.
20.
von Davier, A.A., & Wilson, C. (2008). Investigating the Population Sensitivity Assumption of Item Response Theory True-Score Equating Across Two Subgroups of Examinees and Two Test Formats. Applied Psychological Measurement, 32, 11-26.
21.
Yang, W.-L., & Gao, R. (2008). Invariance of Score Linkings Across Gender Groups for Forms of a Testlet-Based College-Level Examination Program Examination . Applied Psychological Measurement, 32, 45-61.
22.
Yi, Q., Harris, D.J., & Gao, X. (2008). Invariance of Equating Functions Across Different Subgroups of Examinees Taking a Science Achievement Test. Applied Psychological Measurement, 32, 62-80.
23.
Yin, P., Brennan, R.L., & Kolen, M.J. (2004). Concordance between ACT and ITED scores from different populations. Applied Psychological Measurement, 28, 274-289.