Abstract
The asymptotic posterior normality (APN) of the latent vector in an item response theory model is an important argument in modeling and inference. For a single latent trait, Chang and Stout proved its APN for binary items under general conditions, which generalized Chang for polytomous data and Kornely and Kateri for multivariate latent traits (MLT) and binary items. As MLT and polytomous items are nowadays common in psychometry, an APN-theory covering both simultaneously remains an open and ongoing problem. We generalize the APN-theory accordingly, providing thus a broader foundation for developments relying on APN. We also prove consistency of common estimators for the MLT, extending the according results of Chang and Kornely and Kateri.
Keywords
1. Introduction
In the context of item response theory (IRT) methodology, statistical inference for the examinee’s ability relies often on the assumption that its posterior distribution given the test response is a normal distribution or exhibits certain properties, resulting from posterior normality. Fortunately, in several cases the abilities distribution can be proven to be asymptotic posterior normal, allowing the approximation of the posterior by a normal distribution, as shown by Chang and Stout (1993) for univariate latent traits (LTs) and binary items under general assumptions. Chang (1996) extended their results to polytomous items, remaining in the framework of a univariate LT. In the meantime, various IRT and relevant statistical modeling approaches have been developed, improving the fit and interpretability by considering multivariate LTs, assuming normality or aymptotic posterior normality (APN) (cf. e.g., Anderson and Vermunt, 2000; Hessen, 2012; Li, 2010; Pelle et al., 2016; Rabe-Hesketh et al., 2002; Schilling and Bock, 2005).
Kornely and Kateri (2022) proved APN for the case of multivariate LTs and binary items, under general assumptions, analog to those of Chang and Stout (1993). The recent work of Wang et al. (2022) regarding adaptive testing with polytomous items and multivariate latent traits (MLT) argues under the conjecture of posterior normality, indicating thus the timeliness need for developing APN theory for the case of polytomous items and multivariate LTs. About the same period, Sinharay (2022) revisits and extends approaches for the practical problem of estimating the passing probability of unfinished dichotomous and mixed format tests, some relying on the APN of the LT’s distribution. However, extensions of the analyzed modified Lord–Wingersky approach to multidimensional IRT models would require the APN for multivariate LTs in mixed format tests, which has not been proved yet. This need is further underlined in a recent revisit of log-multiplicative association models for IRT setups of mixed format items for multivariate LTs and their connection to IRT models under posterior normality by Anderson et al. (2023). In this note, we show that, similarly to the work of Chang (1996), the results of Kornely and Kateri (2022) can be extended naturally to polytomous items with possibly different numbers of response categories per item. It is important to note that the results for binary items cannot be directly extended to models for polytomous items, due to the fact that, though polytomous data can be reformulated to respective sets of binary items, these binary items contain local dependencies that prevent the application of the theorems of Kornely and Kateri (2022). Hence, it is required to formulate a new proof. The general approach along with some advanced technical steps based on proved properties of the log-likelihood function can be transferred. However, the preliminary results establishing these properties cannot directly be carried over and have to be shown. For example, this involves proving certain bounds of the log-likelihood-ratio outside some neighborhood and quadratic approximability inside of it. With this work the results of Chang (1996) are extended for cases (a) with LTs of higher dimension of LTs and (b) with possibly different numbers of response categories. Furthermore, we provide proofs for the consistency of penalized MLE (maximum likelihood estimator)/MAP (maximum a-posteriori estimator) and EAP (expected a-posteriori estimator), which were not considered. The work of Kornely and Kateri (2022), on the other side, is extended (a) with regard to the response variables, from binary to polytomous ones, and (b) by weakening the requirements on the penalization function for the penalized MLE.
The article is organized as follows. After setting the assumed IRT framework and the notation in Section 2, the required regularity conditions are formulated and briefly explained in Section 3. The main result regarding the APN for multivariate LTs, along with results on the existence and consistency of the MLE, penalized MLE/MAP, the posterior and the EAP as well as the APN with convergence in manifest probabilities, is given and commented in Section 4. The way the proof for multivariate APN extends from the case of binary items to that of polytomous items is discussed in Appendix 1. Finally, the results are summarized in Section 5.
2. Preliminaries
This work extends the results of Kornely and Kateri (2022) to polytomous items, adapting their setup and notation accordingly. The set of positive integer numbers is denoted by
where
where
Assuming Equation 2, the marginal probability mass function of
where
Given a realization
The test (or Fisher) information matrix for a test on
where ∇ denotes the gradient of a function, that is,
Kornely and Kateri (2022) have studied the APN of
for a sequence
Next, we define the functions
Note that
is the negative Kullback–Leibler divergence between the conditional distributions of
3. Regularity Conditions for Asymptotic Properties of Latent Vectors
Throughout, we assume that
(C1) [i]The set
[ii] The prior density
(C2)
(C3) For each
and if
where
(C4) If restricted to any compact set
is uniformly bounded.
(C5) For all
where
The regularity conditions are essentially the same as that of Chang and Stout (1993), Chang (1996), and Kornely and Kateri (2022), adjusted for MLT and polytomous items. Thus, we just briefly commend them. Comparing these conditions to respective versions for univariate LTs, notice that under the current multivariate setup, we are required to enforce properties on more abstract subsets of the latent space, whereas intervals were appropriate in the univariate case. When considering binary items, all important information is contained in each item’s log-odds as a function of the LTs (or a single function for the probability of responding one, respectively). For polytomous items, in contrast, this is not possible and a function for each item response category is assumed (plus the sum to one condition for each
4. Main Results
Our main result is the direct extension of the APN for multivariate LTs given binary items of Kornely and Kateri (2022, Theorem 5(c)) to polytomous items. At the same time, we extend Theorem 1 of Chang (1996) to MLT. We formulate this result in the following theorem:
The first key challenge for proving Theorem 4 is to prove that
The second key challenge is to prove that
The subsequent steps can then follow from proofs for binary items and MLTs. Our general approach is adopted from Chang (1996) and Kornely and Kateri (2022) and adjusted for the combined setup. The proof of Theorem 1 is further discussed in Appendix 1. The APN in Theorem 1 is the semiproper centering of the MLE, that is, the set-wise convergence of the probabilities of the normalized posterior, centered at the MLE (cf. Definition 2 of Ghosal et al., 1995). If
Several preliminary results and by-products of the proof of Theorem 1 are of own interest due to their usage independently of the APN of LTs. We formulate them in the following theorem, noting the interesting relation of these results to APN:
(i) There is a sequence
and
(ii) Statement (i) remains valid if
for some continuously differentiable and positive function
(iii) If (C1[ii]), (C4) and (C5) are additionally satisfied and if there is a mapping
The proof of Theorem 2 is discussed in the Supplemental Appendix (available in the online version of this article). This second theorem has a few implications. Part (i) ensures the asymptotic existence and consistency of the MLE. Part (ii) is a criterion for consistency of the MAP as we can set
5. Conclusion
In this work, we proved that, for MIRT models for polytomous items with possibly different numbers of response categories per item, under certain assumptions, latent vectors are asymptotic posterior normal distributed. The assumptions are not restrictive and thus the result applies to a large class of MIRT models.
Furthermore, we also extended the results of Kornely and Kateri (2022) from binary to polytomous items regarding (a) the existence and consistency of the MLE and penalized MLE/MAP (with a further weakening of the requirements), (b) the consistency of the posterior and the EAP, and (c) APN with convergence in
The consistency of the MLE is often considered as commonly known. However, we are only aware of asymptotic theoretic results for LTs that either restrict to stricter conditions on the model or to special cases, like binary items or univariate LTs (e.g., Sinharay [2015]; Kornely and Kateri [2022]). Thus, this work contributes to the certainty of the maximum likelihood approach for ability estimation in more general setups.
Supplemental Material
sj-pdf-1-jeb-10.3102_10769986241283687 – Supplemental material for A Note on the Asymptotic Posterior Normality of Multivariate Latent Traits in an IRT Model for Polytomous Items of Mixed Format
Supplemental material, sj-pdf-1-jeb-10.3102_10769986241283687 for A Note on the Asymptotic Posterior Normality of Multivariate Latent Traits in an IRT Model for Polytomous Items of Mixed Format by Mia Johanna Katharina Kornely and Maria Kateri in Journal of Educational and Behavioral Statistics
Footnotes
Appendix 1
Acknowledgements
The authors sincerely thank the reviewers for their constructive and useful comments on an earlier version of the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Authors
MIA JOHANNA KATHARINA KORNELY was a research associate at the Manufacturing Technology Institute, RWTH Aachen University, Templergraben 55, 52062 Aachen, Germany; e-mail:
MARIA KATERI is a professor (Chair of Statistics and Data Science) at the Institute of Statistics, RWTH Aachen University, Templergraben 55, 52062 Aachen, Germany; e-mail:
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
