In many examinations, essays of different examinees are rated by different rater pairs. This paper discusses the estimation of rater effects for rating designs in which rater pairs overlap in a special way. Two models for rater effects are considered: the additive model and a nonlinear model. An illustration with empirical data is provided.