Two Simple Models for Rater Effects

Abstract

In many examinations, essays of different exami nees are rated by different rater pairs. This paper dis cusses the estimation of rater effects for rating designs in which rater pairs overlap in a special way. Two models for rater effects are considered: the additive model and a nonlinear model. An illustration with em pirical data is provided.

Get full access to this article

View all access options for this article.

References

Choppin, B.H. (1982). The use of latent trait models in the measurement of cognitive abilities and skills. In D. Spearritt (Ed.), The Improvement of Measurement in Education and Psychology. Melbourne: Australian Council for Educational Research.

De Gruijter, D.N.M. (1984). The estimation of examiner effects in designs with overlapping examiner teams. Kwantitatieve Methoden, 13, 148-155.

Efron, B. (1979). Bootstrap methods: Another look at the jackknife . The Annals of Statistics, 7, 1-26.

Engelhard, G. , & Osberg, D.W. (1983). Constructing a test network with a Rasch measurement model. Applied Psychological Measurement, 7, 283-294.

Mosteller, F. , & Tukey, J.W. (1968). Data analysis, including statistics. In G. Lindzey & E. Aronson (Eds.), Handbook of Social Psychology, (Vol. 2, 2nd Ed.). Reading MA : Addison-Wesley.

Paul, S.R. (1981). Bayesian methods for calibration of examiners . British Journal of Mathematical and Statistical Psychology , 34, 213-223.