An item response theory model of response stability is developed, based on the local independence principle. The model predicts response changes under repeated administrations of the same instrument using item and examinee parameter estimates as predictors. Real data were used to assess how the model functioned. Results indicated that the model predictions were approximately fulfilled. Limitations of the model and the empirical study are discussed.
Get full access to this article
View all access options for this article.
References
1.
Angleitner, A., J., O. P. , & Lö, F. J. (1986). It’s what you ask and how you ask it: An itemmetric analysis of personality questionnaires. In A. Angleitner and J. S. Wiggins (Eds.), Personality assessment via questionnaires(pp. 61–107). Berlin, Germany: Springer-Verlag.
2.
Birenbaum, M. (1986). Effects of dissimulation motivation and anxiety on response pattern appropriateness measures. Applied Psychological Measurement, 10, 167–174.
3.
Cattell, R. B. (1986). The psychometric properties of tests: Consistency, validity, and efficiency. In R. B. Cattell and R. C. Johnson (Eds.), Functional psychological testing(pp. 54–78). New York: Brunner/Mazel.
4.
Costa, P. T. , & McCrae, R. R. (1985). Concurrent validation after 20 years: The implications of personality stability for its assessment. In J. N. Butcher and C. D. Spielberger (Eds.), Advances in personality assessment(Vol. 4., pp. 31–54). Hillsdale NJ: Erlbaum.
5.
Dickman, S. J. (1990). Functional and dysfunctional impulsivity: Personality and cognitive correlates. Journal of Personality and Social Psychology, 58,95–102.
6.
Drasgow, F. , Levine, M. V., & Williams, E. A. (1985). Appropriateness measurement with polychotomous item response models and standardized indices. British Journal of Mathematical and Statistical Psychology, 38,67–86.
7.
Eysenck, H. J. (1952). The scientific study of personality.London: Routledge.
8.
Eysenck, H. J. , & Eysenck, S. B. G. (1969). Personality structure and measurement.London: Routledge.
9.
Eysenck, H. J. , & Eysenck, S. B. G. (1976). Psychoticism as a dimension of personality.New York: Crane.
10.
Finch, J. F. ,& West, S. G. (1997). The investigation of personality structure: Statistical models. Journal of Research in Personality, 31,439–485.
11.
Fiske, D. W. (1966). Some hypotheses concerning test adequacy. Educational and Psychological Measurement, 26,69–88.
12.
Fiske, D. W. , & Butler, J. M. (1963). The experimental conditions for measuring individual differences. Educational and Psychological Measurement, 23,249–266.
13.
Fiske, D. W. , & Rice, L. (1955). Intra-individual response variability. Psychological Bulletin, 52,217–250.
14.
Fraser, C. , & McDonald, R. P. (1988). NOHARM: Least squares item factor analysis. Multivariate Behavioral Research, 23,267–269.
15.
Fricke, B. G. (1957). A response bias (B) scale for the MMPI. Journal of Counseling Psychology, 4,149–153.
16.
Goldberg, L. R. (1963). A model of item ambiguity in personality assessment. Educational and Psychological Measurement, 23,467–492.
17.
Goldberg, L. R. (1978). The reliability of reliability: The generality and correlates of intra-individual consistency in responses to structured personality inventories. Applied Psychological Measurement, 2,269–291.
18.
Gulliksen, H. (1950). Theory of mental tests.New York: Wiley.
19.
Hanley, C. (1962). The “difficulty” of a personality inventory item. Educational and Psychological Measurement, 22,577–584.
20.
Jackson, D. N. (1986). The process of responding in personality assessment. In A. Angleitner and J. S. Wiggins (Eds.), Personality assessment via questionnaires(pp. 123–142). Berlin, Germany: Springer-Verlag.
21.
Jones, R. R. , & Goldberg, L. R. (1967). Interrelationships among personality scale parameters: Item response stability and scale reliability. Educational and Psychological Measurement, 27,323–333.
22.
Kendall, M. G. , & Stuart, A. (1977). The advanced theory of statistics(Vol. 1). London: Griffin.
23.
Knowles, E. S. (1988). Item context effects on personality scales: Measuring changes the measure. Journal of Personality and Social Psychology, 55,312–320.
24.
Kuncel, R. B. , & Fiske, D. W. (1974). Stability of response process and response. Educational and Psychological Measurement, 34,743–755.
25.
Lord, F. M. (1980). Applications of item response theory to practical testing problems.Hillsdale NJ: Erlbaum.
26.
Lumsden, J. (1977). Person reliability. Applied Psychological Measurement, 1,477–482.
27.
Mac Eaton, A. , & Fiske, D. W. (1971). Item stability as related to implicit set and subject-item distance. Journal of Consulting and Clinical Psychology, 37,259–266.
28.
MATLAB . (1999). MATLAB 5.3 Release 11.1.Natick MA: The Math Works Inc.
29.
McDonald, R. P. , & Mok, M. C. (1995). Goodness of fit in item response models. Multivariate Behavioral Research, 30,23–40.
30.
Meijer, R. R. (1997). Person fit and criterion-related validity: An extension of the Schmitt, Cortina, and Whitney study. Applied Psychological Measurement, 21,99–113.
31.
Mislevy, R. J. (1984). Estimating latent distributions. Psychometrika, 49,359–381.
32.
Mislevy, R. J. , & Bock, R. D. (1990). BILOG 3 item analysis and test scoring with binary logistic models.Mooresville IN: Scientific Software.
33.
Mitra, S. K. , & Fiske, D. W. (1956). Intra-individual variability as related to test score and item. Educational and Psychological Measurement, 16,3–12.
34.
Nandakumar, R. (1994). Assessing dimensionality of a set of item responses—comparison of different approaches. Journal of Educational Measurement, 31,17–35.
35.
Nering, M. L. , & Meijer, R. R. (1998). A comparison of the person response function and the lzperson-fit statistic. Applied Psychological Measurement, 22,53–69.
36.
Nowakowska, M. (1983). Quantitative psychology: Some chosen problems and new ideas.Amsterdam: North-Holland.
37.
Nunnally, J. C. (1970). Introduction to psychological measurement.New York: McGraw-Hill.
38.
Reckase, M. D. (1997). A linear logistic multidimensional item response model for dichotomous item response data. In W. J. van der Linden and R. K. Hambleton (Eds.), Handbook of modern item response theory(pp. 271–286). Berlin, Germany: Springer-Verlag.
39.
Reise, S. P. , & Waller, N. G. (1990). Fitting the two parameter model to personality data. Applied Psychological Measurement, 14,45–58.
40.
Reise, S. P. , & Waller, N. G. (1993). Traitedness and the assessment of response pattern scalability. Journal of Personality and Social Psychology, 65,143–151.
41.
Rogers, T. B. (1973). Toward a definition of the difficulty of a personality item. Psychological Reports, 33,159–166.
42.
Schmidt, F. L. , & Hunter, J. E. (1996). Measurement error in psychological research: Lessons from 26 research scenarios. Psychological Methods, 1,199–223.
43.
Schuerger, J. M. , Zarella, K. L., & Hotz, A. S. (1989). Factors that influence the temporal stability of personality by questionnaire. Journal of Personality and Social Psychology, 56,777–783.
44.
Smith, D. D. (1992). Longitudinal stability of personality. Psychological Reports, 70,483–498.
45.
Stout, W. (1987). A nonparametric approach for assessing latent trait unidimensionality. Psychometrika, 52,589–617.
46.
Stout, W., Douglas, J., Junker, B., & Roussos, L. (1993). DIMTEST manual.Urbana Il: Department of Statistics, University of Illinois.
47.
Stroud, A. H. , & Secrest, D. (1966). Gaussian quadrature formulas.Englewood Cliffs NJ: Prentice-Hall.
48.
Tanaka, J. S. , & Huba, G. J. (1985). A fit index for covariance structure models under arbitrary GLS estimation. British Journal of Mathematical and Statistical Psychology, 38,197–201.
49.
Thurstone, L. L. (1927). A law of comparative judgment. Psychological Review, 34,273–286.
50.
Turner, C. B. ,& Fiske, D. W. (1968). Item quality and appropriateness of response processes. Educational and Psychological Measurement, 28,297–315.
51.
Waller, N. G. , Tellegen, A., McDonald, R. P., & Lykken, D. T. (1996). Exploring nonlinear models in personality assessment: Development and validation of a negative emotionality scale. Journal of Personality, 64,545–576.
52.
Way, W. D. , Ansley, T. N., & Forsyth, R. A. (1988). The comparative effects of compensatory and noncompensatory two-dimensional data on unidimensional IRT estimates. Applied Psychological Measurement, 12,239–252.
53.
Wiggins, J. S. , & Goldberg, L. R. (1965). Interrelationships among MMPI item characteristics. Educational and Psychological Measurement, 25,381–397.