We propose an item response theory model to analyse psychiatric questionnaires that contain embarrassing items. We use Bayesian methods to estimate its parameters and consider a simulation study to evaluate the performance of the proposed estimators. The results are illustrated with the analysis of data collected to evaluate teenager depression, highlighting the gender difference in the probabilities of ‘crying crisis’, a trait known to embarrass some male populations.
Get full access to this article
View all access options for this article.
References
1.
Lord FMA theory of test scores. Psychometrika Monograph. 1952; 7.
2.
Lord FMApplications of item response theory to practical testing problems . Lawrence Erlbaum, New Jersey ; 1980 .
3.
Birnbaum A.Some latent trait models and their use in infering an examinee’s ability . In Lord FM, Novick MR, eds. Statistical theories of mental test scores. Addison-Wesley, Reading, MA; 1968 : 395-479.
4.
Rasch G.Probabilistic models for some intelligence and attainment tests. Danish Institute for Educational Research, Copenhagen ; 1960.
5.
Fischer GHEinfuhrung in die theorie psychologischer tests (Introduction to mental test theory). Huber, Berne; 1974.
6.
Andrich D.Application of a psychometric model to ordered categories which are scored with successive integers. Journal of Clinical Psychology1978; 57: 355-65.
Masters GNA Rasch model for partial credit scoring. Psychometrika1982; 47: 149-74.
9.
Samejima F.Estimation of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement1969; 34(17):1-100.
10.
Bock RD, Zimowski MF.Multiple group IRT. In van der Linden WJ, Hambleton RK, eds. Handbook of modern item response theory. Springer-Verlag , New York; 1997: 433-48.
11.
McKinley RL, Reckase MDThe use of the general rasch model with multidimensional item response data . American College Testing, Iowa City; 1983.
12.
Bock RD, Gibbons R., Muraki EJFull information item factor analysis. Applied Psychological Measurement1988; 12: 261-80.
13.
Reckase MDThe past and the future of multidimensional item response theory. Applied Psychological Measurement1997; 21: 25-36.
14.
Patz RJ, Junker BWA straightforward approach to Markov chain Monte Carlo methods for item response theory models. Journal of Educational and Behavioral Statistics1999; 24: 146-78.
15.
Patz RJ, Junker BWApplications and extensions of MCMC in IRT: multiple item types, missing data and rated responses. Journal of Educational and Behavioral Statistics1999; 24: 342-66.
16.
Baker FB, Kim S.Item response theory - parameter estimation techniques (2nd edn). Marcel Dekker, Inc., New York; 2004.
17.
Béguin AA, Glas CAW.MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika2001; 66: 541-62.
18.
Noortgate WV , Boeck P.Assessing and explaining differential item functioning using logistic mixed models. Journal of Educational and Behavioral Statistics2005; 30: 443-64.
19.
Glas CAW.Detection of differential item functioning using Lagrange multiplier test . Statistica Sinica1998; 8: 647-67.
20.
Fischer GH, Molenaar IW, (eds). Rasch models: their foundations, recent developments and applications . Springer-Verlag, New York; 1995.
21.
Wainer, (ed). Computerized adaptive testing: a primer. Lawrence Erlbaum Associates, Hilsdale, NJ; 1990.
22.
Weiss DJ, (ed). New horizonts in testing: latent trait theory and computerized adaptive testing. Academic Press, New York; 1983.
23.
Drasgow F., Olson-Buchanan JBInnovations in computerized assessment. Lawrence Erlbaum Associates, Mahwah, NJ; 1999.
24.
van der Linden WJ, Glas CAW, (eds). Computerized adaptive testing: theory and practice . Kluwer Academic Publishers, Dordrecht ; 2000.
von Davier M.WINMIRA: Software for analyses with the Rasch model, with the mixed Rasch model and with the latent class analysis . Kiel, Germany; 1994.
34.
Corporation ASUser’s manual for the XCALIBRE marginal maximum-likelihood estimation program. St. Paul, MN; 1996.
35.
Mislevy RJ, Bock RDBILOG-3: item analysis and test scoring with binary logistic models;1990.
36.
Zimowski MF , Muraki E., Mislevy RJ, Bock RDBILOG-MG: multiple-group IRT analysis and test maintenance for binary items;1996.
37.
Thissen D.MULTILOG user’s guide: multiple categorical item analysis and test scoring using item response theory. Chicago; 1991.
38.
Muraki E., Bock RDPARSCALE: IRT based test scoring and item analysis for graded open-ended exercises and performance tasks. Chicago;1993.
39.
Wilson DT, Wood R., Gibbons R.TESTFACT: Test scoring, item statistics, and item factor analysis . Mooresville, IN; 1991.
40.
Spiegelhalter D., Thomas A., Best N., Lunn D.WinBUGS User Manual Version 1.4. Cambridge; 2003.
41.
Kessler RC, Andrews G., Colpe LJ, et al. Short screening scales to minitor population prevalences and trends in non-specific psychological. Psychological Medicine2002; 32: 959-76.
42.
Langenbucher JW, Labouvie E., Sanjuan PM, Bavly L., Martin CS, Kirisci L.An application of item response theory analysis to alcohol, cannabis and cocaine criteria in DSM-IV. Journal of Abnormal Psychology2004; 113: 72-80.
43.
Coleman MJ, Matthysse S., Levy DL, et al. Spatial and object working memory impairments in schizophrenia patients: a bayesian item-response theory analysis. Journal of Abnormal Psychology2002; 111: 425-35.
44.
Embretson SE , Reise SPItem response theory for psychologists. Lawrence Erlbaum Associates, Inc., New Jersey; 2000 .
45.
Hays R., Morales LS, Reise SPItem response theory and health outcomes measurement in the 21st century . Medical Care2000; 38: 28-42.
46.
Azocar F., Arean P., Miranda J., Munoz RFDifferential item functioning in a Spanish translation of the Beck Depression Inventory. Journal of Clinical Psychology2001; 57: 355-65.
47.
Williams MT , Turkheimer E., Schmidt K., Oltmanns TFEthnic identification biases responses to the Padua Inventory for obsessive-compulsive disorder . Assessment2005; 12: 174-85.
48.
Thissen D., Steinberg L., Wainer H.Detection of differential item functioning using the parameters of item response models. Lawrence Erlbaum, New Jersey ; 1993.
49.
Crane PK, van Belle G., Larson EBTest bias in a cognitive test: differential item functioning in the CASI. Statistics in Medicine2004; 23: 241-56.
50.
Nuevo R., Dunn G., Dowrick C., et al. Cross-cultural equivalence of the Beck Depression Inventory: A five-country analysis from the ODIN study. Journal of Affective Disorders2009; 114: 156-62.
51.
Kirisci L., Hsu TC, Tarter R.Fitting a two-parameter logistic item response model to clarify the psychometric properties of the Drug Use Screening Inventory for adolescent alcohol and drug abusers. Alcoholism: Clinical and Experimental Research1994; 18: 1335-41.
52.
Cappaliez P.Social desirability response set and self-report depression inventories in the elderly. Clinical Gerontologist1989; 9: 45-52.
53.
Gelman A., Roberts GO, Gilks WREfficient Metropolis jumping rules. In Bernardo JM, Berger JO, Dawid AP, Smith AFM , eds. Bayesian Statistics 5: Proceedings of the Fifth Valencia International Meeting. Oxford, New York; 1996: 599-608.
54.
Gelman A., Rubin DBInference for iterative simulation using multiple sequences. Statistical Science1992; 7: 457-511.
55.
Geyer CJPractical Markov Chain Monte Carlo. Statistical Science1992; 7: 473-83.
56.
Silveira DX , Jorge MRPropriedades psicométricas da escala de rastreamento populacional para depressão CES-D em populações clínica e não-clínica de adolescentes e adultos jovens . Revista de Psiquiatria Clínica1998 ; 25(5): 251-61.
57.
Gelman A., Carlin JB, Stern HS, Rubin DBBayesian data analysis. Chapman and Hall, London; 1995.
58.
Andrade DF, Tavares HRItem response theory for longitudinal data: population parameter estimation . Journal of Multivariate Analysis2005; 95: 1-22.
59.
Tavares HR, Andrade DFItem response theory for longitudinal data: item and population ability parameters estimation. Test2006; 15: 97-123.