Sage Journals: Discover world-class research

Abstract

In cognitive diagnosis assessments, examinees frequently skip some items due to various reasons, such as time constraints, lack of confidence, or the perception of item difficulty. These skipped items are usually due to lacking specific cognitive attributes or knowledge structures. However, many research studies conventionally rely on complete-case analysis, potentially compromising the validity of results by ignoring skipped items. This omission can lead to invalid inferences about the attributes profiles of examinees and introduce bias into item parameters. This study aims to develop statistical models for effectively handling missing data resulting from skipped items. Specifically, we employ an item response theory model for missing indicators and utilize the deterministic inputs, noisy “and” gate (DINA) model to describe cognitive item responses. Furthermore, we introduce a higher-order structure to describe the correlation between the higher-order ability parameters and skipping propensity parameters and the correlation of the item parameters. The proposed new model fills the gaps in missing data handling, providing a more precise evaluation for the examinees who skip items and yielding more accurate research results. This enhances our understanding of cognitive processes and strengthens support for educational policies, aligning them better with student needs and optimizing decision-making.

Plain language summary

We propose a missing data model for skipped items in cognitive diagnostic assessments. The significance of our findings is illustrated with an application to a real data set.

Keywords

cognitive diagnosis assessments missing data mechanism skipped items Bayesian analysis

Introduction

In cognitive diagnostic assessments, researchers typically utilize responses from multiple items to gain a comprehensive understanding of individuals’ cognitive attributes or knowledge structures. However, a common challenge emerges when examinees choose to respond to some items while skipping others, resulting in missing data at item level. This behavior can be attributed to various reasons, including perceived difficulty of specific items, allocating excessive time to one item leading to the omission of others, or simply skipping some items. Ignoring such missing data can lead to information loss and potentially bias the interpretations. It is important to note that many research investigations commonly employ an analytical approach that relies solely on available data, evaluating examinees based exclusively on their observed response data. This approach may potentially provide incomplete diagnostic feedback to individuals with missingness in their data, and importantly, may lead to biased conclusions when comparing them to examinees with complete datasets.

The process of making a cognitive diagnosis has received extensive attention in psychology and educational assessment recently. A great deal of cognitive diagnosis models (CDMs) has been developed, for instance, the deterministic inputs, noisy “and” gate (DINA) model (Junker & Sijtsma, 2001), the log-linear cognitive diagnostic (LCDM) model (Henson et al., 2009), and the general DINA model (de la Torre, 2009; de la Torre & Douglas, 2004; Junker & Sijtsma, 2001). De la Torre and Douglas (2004) proposed a higher-order DINA (HO-DINA) model to connect latent attributes through higher-order ability.

In practice, cognitive diagnosis assessments could investigate examinees’ skills and knowledge, which provide fine-grained diagnosis and feedback information, such as attribute mastery information and cognitive processes. For example, even if the scores of two examinees are the same, their knowledge structures and learning paths may be different. Thus, teachers’ instructional approaches should also be distinct. Consequently, examinees with the same score tend to have distinct cognitive structures and cognitive processes. In this case, if examinees’ strengths and weaknesses can be identified, much more meaningful and much richer information about the individuals’ attribute profiles can be provided. Meanwhile, such information can also be used to adapt instruction to the examinees and determine their skills and knowledge that need to be developed further.

Cognitive diagnosis assessments (de la Torre, 2011) are effective tools for evaluating examinees’ mastery of skills or attributes. However, in practical test scenarios, some examinees often leave a few items blank for various reasons. For example, they may perceive some items as too difficult, they may have insufficient time to answer some questions, or their test-taking motivation may decline over the course of the assessments. Usually, the skipped/omitted items occur within a block of responses. According to the Programme for the International Assessment of Adult Competencies (PIAAC) in 2012, skipped item rates varied significantly; in Korea, only 2% of items in the numeracy domain were skipped, while in Chile, 25.9% of items in the literacy domain were left unanswered (OECD, 2013). This highlights the widespread issue of missing data in educational assessment and psychological measurement (Rose et al., 2017; Yuan et al., 2018). Consequently, if these missing responses are not handled properly, they can pose a significant threat to the validity of the assessments (Pohl et al., 2014; Rose et al., 2015).

To address the missing data issue more effectively, Rubin (1976), Little and Rubin (2002) proposed three mechanisms: not missing at random (NMAR), missing at random (MAR), and missing completely at random (MCAR). Let Y be a response matrix with the elements of observed response data $Y_{observe}$ and missing data $Y_{missing}$ , and let $D$ be the corresponding missing indicator matrix. Consequently, MCAR is described as $P (D | Y_{observe}, Y_{missing}) = P (D)$ , MAR is expressed as $P (D | Y_{observe}, Y_{missing}) = P (D | Y_{observe})$ , and NMAR denotes that the missing data are correlated with both $Y_{observe}$ and $Y_{missing}$ .

In the framework of IRT, the issue of missing data has garnered substantial attention, leading to the development of various methodologies to address item nonresponse based on different missing mechanisms (Dai, 2021; Du et al., 2022; Pohl & Becker, 2020; Sperrin & Martin, 2020; Xiao & Bulut, 2020). The contributions include the multidimensional IRT (MIRT) models explored by scholars (Glas & Pimentel, 2008; Grund et al., 2021;Holman & Glas, 2005; Moustaki & Knott, 2000; O’Muircheartaigh & Moustaki, 1999), as well as latent regression models (LRM) proposed by Rose et al. (2010). The literature has also witnessed sophisticated integrations, such as the combination of MIRT with LRM as presented by Rose et al. (2017), and innovative models like the IRTree proposed by Debeer et al. (2017). A recent development, as emphasized by Pohl et al. (2019) and Ulitzsch et al. (2020a, 2020b), focuses on incorporating response time to model item nonresponses (Guo et al., 2022; Lu et al., 2023; Lu & Wang, 2020).

Moving into the realm of cognitive diagnosis assessments (Ma, 2022; Templin & Henson, 2006), which intricately analyze examinees’ skills and knowledge, the need to address missing data becomes increasingly urgent. While several methods have emerged within the cognitive diagnosis framework, most remain relatively straightforward. Ömür Sünbül (2018) investigated four methods to address missing data in the DINA model under the MCAR and MAR mechanisms. These methods include treating missing data as incorrect, person mean imputation, two-way imputation, and expectation-maximization imputation. While these methods are conceptually clear, their effectiveness tends to diminish as the proportion of missing items increases. Specifically, when dealing with non-ignorable missing data, these methods can introduce biases and errors into statistical inference. Following this, Dai et al. (2018) examined the influence of missing responses on two Q matrix validation methods. However, they did not delve into the impact of missingness on specific CDMs. Shan and Wang (2020) proposed a joint CDM model that addresses item-level missing data by introducing the categorical missingness propensity of examinees, but they did not elaborate on this potential categorization. Dai and Svetina Valdivia (2022) assessed the performance of different strategies for managing missing data, including the full information maximum likelihood estimation in CDMs. However, their focus remained on methods to treat missing responses in the CDMs, without constructing a dedicated model for these missing responses. More recently, Liang et al. (2022) introduced a joint cognitive model for not-reached missingness. Yet, when it comes to skipped items, to our knowledge, no model has been developed within the cognitive diagnoses framework to particularly address them.

Recognizing the absence of such models, this paper introduces a novel missing data model for skipped items within the CDM framework. Our aim extends beyond merely addressing this gap; we aim to present a model that offers a comprehensive understanding, departing from the more simplistic approaches employed previously. In essence, our work not only assesses skipped items but also delivers a nuanced analysis, representing a significant advancement. To capture the underlying processes of the cognitive diagnosis, we bring together research on modeling cognitive item responses and research on missing data models in a joint modeling framework. This framework is characterized by a bivariate normal distribution between examinees’ higher-order ability parameters and skipping propensity parameters, and a multivariate normal distribution of the item intercept parameters, interaction parameters, and item skipping difficult parameters. The parameters of the model were estimated by the Bayesian Markov chain Monte Carlo (MCMC) method. The results of simulations showed that the proposed model improved attribute profile correct classification rates and produced precise estimation of item parameters when the missing mechanism is nonignorable.

The remainder of this paper is structured as follows. First, a missing indicator model is presented to model the skipped item responses. Second, the higher-order DINA model is used to model the response data. The missing model and response model are linked by the correlation of higher-level ability parameters and the skipping propensity parameters, and the correlation of item parameters. Third, the MCMC algorithm was used to estimate the model parameters and two model selection criteria were given to evaluate the model fit. Fourth, simulation studies are conducted to evaluate the performance of the proposed model under different conditions. Finally, concluding remarks and some suggestions for future research are provided.

Model Construct

Let $Y$ be a two-dimensional data matrix, if we observe the response data for the $i$ th examinee to the $j$ th item, $Y_{ij}$ corresponds to the observation $y_{ij}$ . Otherwise, it is missing data, and we use the sign “s” to express the missing data and the relevant parameters.

Modeling Skipped Items

The missing data mechanism of skipped items is modeled using an additional manifest or latent variable, which represents the examinees’ tendency to skip items (Holman & Glas, 2005; Rose et al., 2017). The skipping indicator $D_{ij}$ contains information on whether examinee $i$ generated a response to item $j$ , with 1 indicating a skipped response and 0 representing an observed response. The missing indicator matrix D is defined as:

D_{ij} = {\begin{matrix} 0, if y_{ij} was observed, \\ 1, if y_{ij} was not observed . \end{matrix}

(1)

The Rasch model (Rasch, 1966) is employed to express the probability of skipping items. That is, the probability of item skipping is modeled as a function of an examinee’s skipping propensity $θ_{i}^{s}$ and the $j$ th item skipping difficulty $β_{j}^{s}$ . The probability of item skipping is modeled as:

p (s_{ij} = 1 | θ_{i}^{s}, β_{j}^{s}) = \frac{\exp (θ_{i}^{s} - β_{j}^{s})}{1 + \exp (θ_{i}^{s} - β_{j}^{s})}

(2)

HO-DINA Model

The DINA model describes the probability of item response as a function of latent attributes, and the probability of the $i$ th examinee responding to item $j$ correctly is expressed as follows:

p (Y_{ij} = 1) = g_{j} + (1 - s_{j} - g_{j}) Π_{k = 1}^{K} {α_{ik}}^{q_{jk}},

(3)

where $g_{j}$ and $s_{j}$ are the guessing and slipping probabilities of the $j$ th item, respectively; $1 - s_{j} - g_{j} = ID I_{j}$ is the $j$ th item discrimination index (De La Torre, 2008); and $α_{ik}$ describes the attribute mastery statue for examinee $i$ ,that is, the value of $α_{ik}$ is 1 when examinee $i$ masters attribute $k$ , otherwise, the value of $α_{ik}$ is 0. The Q matrix (Tatsuoka, 1983) is a $J \times K$ matrix with element $q_{jk}$ ; when the attribute $k$ is required for answering the $j$ th item correctly, the value of $q_{jk}$ is 1, otherwise it is 0.

Equation 3 can be re-parameterized by the logit scale as the re-parameterized DINA model (DeCarlo, 2011):

β_{j} = logit (g_{j}),

(4)

δ_{j} = logit (1 - s_{j}) - logit (g_{j}) .

(5)

In addition, $logit (x) = \log (\frac{x}{1 - x})$ , thus, Equation 3 can be expressed as:

logit (P (y_{ij} = 1)) = β_{j} + δ_{j} Π_{k = 1}^{K} α_{ik}^{q_{jk}},

(6)

where $β_{j}$ and $δ_{j}$ are the item intercept and interaction parameters, respectively. The higher-order structure is flexible: on one hand, it can produce more accurate higher-order ability and attribute structure estimates; in addition, it also reduces the number of model parameters. On the other hand, the attributes are usually correlated, and it would be more beneficial to formulate a higher-order structure to link attributes (de la Torre & Douglas (2004); Zhan et al., 2018), which is expressed as:

logit (P (α_{ik} = 1)) = {θ_{i}}^{h} γ_{k} - λ_{k},

(7)

where $P (α_{ik} = 1)$ represents the probability that the $i$ th examinee possesses the $k$ th attribute, $θ_{i}^{h}$ is the higher-order ability for examinee $i,$ and $γ_{k}$ and $λ_{k}$ are the slope and intercept parameters for attribute $k$ , respectively. The slope parameter $γ_{k}$ is positive, indicating that higher values of an ability $θ_{i}^{h}$ correspond to better knowledge mastery of the attribute by the examinee.

The Hierarchical Model

Following the hierarchical modeling framework, item parameters $β_{j}$ and $δ_{j}$ of the DINA model and item skipping difficult parameter $β_{j}^{s}$ are assumed to follow a multivariate normal distribution with the mean vector and covariance matrix, which is expressed as follows:

(\begin{matrix} β_{j} \\ δ_{j} \\ β_{j}^{s} \end{matrix}) ~ N ((\begin{matrix} μ_{β} \\ μ_{δ} \\ μ_{β^{s}} \end{matrix}, Σ_{I})), Σ_{I} = (\begin{matrix} σ_{β}^{2} & σ_{β δ} & σ_{β β^{s}} \\ σ_{β δ} & σ_{δ}^{2} & σ_{δ β^{s}} \\ σ_{β β^{s}} & σ_{δ β^{s}} & σ_{β^{s}}^{2} \end{matrix})

(8)

In addition, the joint distribution of the person parameters is assumed to follow a bivariate normal distribution with mean vector $μ_{P}$ and covariance matrix $Σ_{P}$ :

μ_{P} = (μ_{θ^{h}}, μ_{θ^{s}})', Σ_{P} = (\begin{matrix} σ_{θ^{h}}^{2} & σ_{θ^{h} θ^{s}} \\ σ_{θ^{h} θ^{s}} & σ_{θ^{s}}^{2} \end{matrix}) .

(9)

Model Identifications

To solve the identifications of the proposed model, we have made the following restrictions. On the one hand, to eliminate the trade-off between ability $θ_{i}^{s}$ and skipping threshold parameter $β_{j}^{s}$ , we set $μ_{θ^{s}} = 0$ . Similarly, to eliminate the trade-off between the higher-order ability person parameter $θ^{h}$ and the attribute intercept $λ_{k}$ , we set $μ_{θ^{h}} = 0$ . On the other hand, $σ_{θ^{h}}$ =1 was fixed to eliminate the scale trade-off between $θ_{i}^{h}$ and $γ_{k}$ . These restrictions were based on the method to fix the mean of ability to 0 and the variance of ability to 1 (Fox, 2010; Lord & Novick, 1968).

Model Selection

We used two common Bayesian model evaluation criteria: the deviance information criteria (DIC; Spiegelhalter et al., 2002) and the logarithm of the pseudo marginal likelihood (LPML, Geisser & Eddy, 1979; Ibrahim et al., 2001); They can be employed for comparing the missing mechanism models through MCMC sampling. The $DIC$ is given by:

De v_{(Y, D | Ω)} = - 2 \log L (Y, D | Ω)

= - 2 \sum_{i = 1}^{N} \sum_{j = 1}^{J} \sum_{k = 1}^{K} [(Y_{ij} = s) \log (P (Y_{ij} = s))

+ (Y_{ij} = 1) \log ((1 - P (Y_{ij} = s)) P (Y_{ij} = 1))

+ (Y_{ij} = 0) \log ((1 - P (Y_{ij} = s)) P (Y_{ij} = 0))] .

(10)

Based on the posterior distribution of $Dev (Y, D | Ω)$ , the $DIC$ is defined as:

DIC = \bar{Dev} + p_{D} = \bar{Dev} + (\bar{Dev} - \hat{Dev}),

(11)

where $\bar{Dev} = E (Dev (Y, D, Ω) | Y, D) ≅ \frac{1}{R} \sum_{r = 1}^{R} Dev (Y, D, Ω^{r})$ , which is the posterior mean deviance; a Bayesian measure of fit, $r = 1, . . ., R$ denotes the $r$ th iteration of the algorithm; and $\hat{Dev} = Dev (Y, D, \bar{Ω})$ , which is the effective number of parameters, is a Bayesian measure of complexity, with $\bar{Ω} = E (Ω | Y, D) ≅ \frac{1}{R} \sum_{r = 1}^{R} Ω^{r}$ . A smaller $DIC$ indicates a better model fit.

Another way to compare the two model fits is LPML. The conditional predictive ordinate index (Chen et al., 2000) of the two models is computed. Let $Q_{ij, \max} = ma x_{1 \leq r \leq R} {- logf (Y_{ij}, D_{ij} | Ω^{r})}$ . Then,

\begin{matrix} \log (\hat{CP O_{ij}}) = - Q_{ij, \max} \\ - \log [\frac{1}{R} \sum_{r = 1}^{r} \exp {- logf (Y_{ij}, D_{ij} | Ω^{r}) - Q_{ij, \max}}] . \end{matrix}

(12)

Note that the maximum value adjustment used in $\log (\hat{CP O_{ij}})$ plays an important role in numerical stabilization in computing $\exp {- logf (Y_{ij}, D_{ij} | Ω^{r}) - Q_{ij, \max}}$ in Equation 12. The summary statistic for log $(\hat{CP O_{ij}})$ is the sum of their logarithms, and the LPML is:

LPML = \sum_{i = 1}^{N} \sum_{j = 1}^{J} \log ({\hat{CPO}}_{ij})

(13)

where the model with a larger LPML signifies a better fit to the data.

Simulation Studies

We conducted four simulation studies to evaluate different aspects of the proposed model. Simulation study I was conducted to assess whether the Bayesian MCMC algorithm could successfully recover parameters for the proposed model under different numbers of examinees with fixed numbers of items and attributes. Simulation study II was conducted to investigate the parameter recovery of different numbers of attributes for the same numbers of examinees and items. Simulation study III was intended to show the differences in model parameter estimates between the NMAR and MAR models for different skipping proportions and correlations between person parameters. Simulation study IV aimed to show the performance of the model selection using DIC and LPML for the NMAR and MAR models with different skipping proportions and correlations between person parameters.

We employed the Bayesian MCMC method to estimate the model parameters. The convergence of the Bayesian algorithm was checked by monitoring the trace plots of the parameters for 10,000 iterations. The first 5,000 iterations were set as the burn-in period. There were R = 30 replications for each simulation condition. In terms of convergence, we found that the potential scale reduction factor (PSRF; Brooks & Gelman, 1998) was less than 1.1 for all parameters, which means that the chains converge well. The bias and root mean square error (RMSE) were computed to evaluate the parameter recoveries. To evaluate item parameter recoveries, the bias was averaged across all items. Regarding the classification of individual attributes and profiles, the attribute correct classification rate (ACCR) and the pattern correct classification rate (PCCR) were computed.

Data Generation

In the four simulation studies, the item parameters were sampled from the following distributions:

\begin{matrix} (\begin{matrix} β_{j} \\ δ_{j} \\ β_{j}^{s} \end{matrix}) ~ MVN ((\begin{matrix} μ_{β} \\ μ_{δ} \\ μ_{β^{s}} \end{matrix}), Σ_{I}), μ_{β} = - 2.197, μ_{δ} = 4.394, \\ Σ_{I} = (\begin{matrix} 1 & - 0.8 & 0.25 \\ - 0.8 & 1 & - 0.15 \\ 0.25 & - 0.15 & 0.25 \end{matrix}), \end{matrix}

which were consistent with Liang et al. (2022). The low and high skipping proportions were varied by setting the value of $μ_{β^{s}}$ . That is, the skipping proportion was 12% (low) when $μ_{β^{s}} = 2.1$ ; it was 25% (high) when $μ_{β^{s}} = 1.3 .$

The intercept parameters of the attributes were set as $λ = (- 1, - 0.5, 0, 0.5, 1),$ and the slope parameters were $γ_{k} = 1.5$ for all attributes, consistent with Liang et al. (2022). We considered three Q matrices (Shan & Wang, 2020; Xu & Shang, 2018), as expressed in Figure 1.

Figure 1.

$Q_{1}$ presents a matrix with 30 items and five attributes, $Q_{2}$ represents a matrix with 20 items and five attributes, and $Q_{3}$ represents a matrix with 20 items and three attributes.

The person parameters $θ_{i}^{h}$ and $θ_{i}^{s}$ were simulated from the bivariate normal distribution $(\begin{matrix} θ^{h} \\ θ^{s} \end{matrix}) ~ MVN ((\begin{matrix} 0 \\ 0 \end{matrix}), (\begin{matrix} 1 & σ_{θ^{h} θ^{s}} \\ σ_{θ^{h} θ^{s}} & σ_{θ^{s}}^{2} \end{matrix})), where σ_{θ^{s}}^{2}$ = 0.25. Three levels of the correlation between $θ_{i}^{h}$ and $θ_{i}^{s}$ (i.e., $ρ_{θ_{i}^{h} θ_{i}^{s}}$ ) were considered:0 (uncorrelated), −.5 (medium), and −.8 (high). Two levels of skipping proportions were 12% (low) and 25% (high).

Q_{1} = (\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \\ 1 & 1 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 1 & 0 \\ 1 & 0 & 0 & 0 & 1 \\ 0 & 1 & 1 & 0 & 0 \\ 0 & 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 & 1 \\ 0 & 0 & 1 & 1 & 0 \\ 0 & 0 & 1 & 0 & 1 \\ 0 & 0 & 0 & 1 & 1 \\ 1 & 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 1 & 0 \\ 1 & 1 & 0 & 0 & 1 \\ 1 & 0 & 1 & 1 & 0 \\ 1 & 0 & 1 & 0 & 1 \\ 1 & 0 & 0 & 1 & 1 \\ 0 & 1 & 1 & 1 & 0 \\ 0 & 1 & 1 & 0 & 1 \\ 0 & 1 & 0 & 1 & 1 \\ 0 & 0 & 1 & 1 & 1 \end{matrix}) Q_{2} = (\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \\ 1 & 1 & 0 & 0 & 0 \\ 0 & 1 & 1 & 0 & 0 \\ 0 & 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 1 & 1 \\ 1 & 0 & 0 & 0 & 1 \\ 1 & 1 & 1 & 0 & 0 \\ 0 & 1 & 1 & 1 & 0 \\ 0 & 0 & 1 & 1 & 1 \\ 1 & 0 & 0 & 1 & 1 \\ 1 & 1 & 0 & 0 & 1 \end{matrix}) Q_{3} = (\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 1 \\ 1 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 1 \\ 1 & 1 & 0 \\ 1 & 0 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \\ 1 & 1 & 1 \end{matrix})

The Prior and Posterior Distributions

Model parameters were estimated using the Bayesian MCMC algorithm. The joint posterior distributions for the model parameters are obtained through observed data likelihood and prior distributions of model parameters.

Let $Ω = {θ_{i}^{h}, θ_{i}^{s}, β_{j}^{s}, α_{ik}, β_{j}, δ_{j}, γ_{k}, λ_{k}, μ_{β}, μ_{δ}, μ_{β^{s}}, Σ_{I}, σ_{θ^{h} θ^{s}}, σ_{θ^{s}}^{2}}$ be the parameters of the proposed model. The priors and hyperpriors of the model parameters are presented in Table 1.

Table 1.

The Priors and Hyperpriors of the Model Parameters and Their Values in Simulation Studies.

Parameters	Priors/hyperpriors	Values
$μ_{β}$	$N (μ_{β}, σ_{β}^{2})$	$μ_{β} = - 2.197, σ_{β}^{2} = 2$
$μ_{δ}$	$N (μ_{δ}, σ_{δ}^{2}) I (μ_{δ} > 0)$	$μ_{δ} = 4.394, σ_{δ}^{2} = 2$
$μ_{β}^{s}$	$N (μ_{β^{s}}, σ_{β^{s}}^{2})$	$μ_{β^{s}} = 2.1, σ_{β^{s}}^{2} = 2$
$λ_{K}$	$N (μ_{λ}, σ_{λ}^{2})$	$μ_{λ} = 0, σ_{λ}^{2} = 4$
$γ_{K}$	$N (μ_{γ}, σ_{γ}^{2}) I (γ_{K} > 0)$	$μ_{γ} = 0, σ_{γ}^{2} = 4$
$Σ_{I}$	$Inv - Wishart (Σ_{I 0}^{- 1}, v_{I 0})$	$Σ_{I 0} = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}), v_{I 0} = 2, k_{I 0} = 1$
$σ_{θ^{h} θ^{s}}$	$σ_{θ^{h} θ^{s}} ~ U (- a, a)$	$a = 1$
$σ_{θ^{s}}^{2}$	Inv $- γ (v_{0}, c_{0})$	$v_{0} = 2, c_{0} = 2$

The posterior distribution of the proposed model can be obtained as follows:

\begin{matrix} p (Ω | Y, D) = Π_{i = 1}^{N} Π_{j = 1}^{J} Π_{k = 1}^{K} P (Y_{ij} = s)^{Y_{ij} = s} \\ \times {[1 - P (Y_{ij} = s)] P (Y_{ij} = 1)}^{Y_{ij} = 1} \\ \times {[1 - P (Y_{ij} = s)] P (Y_{ij} = 0)}^{Y_{ij} = 0} \times p (θ^{h}, θ^{s}; μ_{P}, Σ_{P}) \end{matrix}

p (β_{j}, δ_{j}, β_{j}^{s}; μ_{β}, μ_{δ}, μ_{β^{s}}, Σ_{I}) p (Σ_{P}) p (μ_{β}) p (μ_{δ}) p (μ_{β^{s}}) p (Σ_{I})

p (α_{ik}; θ_{i}^{h}, γ_{k}, λ_{k}) p (λ_{k}) p (γ_{k}) .

(14)

The details of the MCMC algorithm are given in the Appendix.

Simulation Study I

In simulation study I, the aim was to investigate the impact of the sample size on the performance of the parameters and attribute profiles. The conditions were set as follows: 1,000 examinees responded to 30 items with five attributes, and this was compared to a scenario where 500 examinees answered 30 items with five attributes. Moreover, the correlation between $θ_{i}^{h}$ and $θ_{i}^{s}$ was −.3, and the skipping proportion was low (12%).

Table 2 presents recoveries of the ability parameters and item parameters as well as the attribute parameter estimates. Given the test length and the number of attributes, when the number of examinees increased from 500 to 1,000, the bias and RMSE of the item parameters $β, δ$ , and $β^{s}$ decreased. Specifically, 30 items and five attributes were fixed, the number of examinees increased from 500 to 1,000, and the average RMSE of all the item intercept parameters ranged from 0.233 to 0.191. For the interaction parameters, the average bias of all item interaction parameters ranged from 0.014 to 0.001, and the average RMSE ranged from 0.366 to 0.291. For the skipping item difficulty parameters, the average bias ranged from 0.021 to 0.018, and the average RMSE ranged from 0.133 to 0.099. Moreover, the RMSE of the item mean parameters and item covariance matrix elements decreased noticeably. In addition, the first two columns of Table 4 show the ACCRs and PCCRs for simulation study I. With the test length and number of attributes held constant, an increase in the number of examinees from 500 to 1,000 led to higher ACCRs and PCCRs, owing to the larger sample size. Notably, the PCCRs exhibited a distinct variation, ranging from 0.848 to 0.852. Figure 2 shows the PSRF values of several items with 500 examinees, 30 items, and five attributes. We found that the item intercept parameters $β$ , the item interaction parameters $δ,$ and the skipping difficulty parameters $β^{s}$ converged at 5,000 iterations.

Table 2.

The Parameter Estimates in Simulation Study I.

	N = 1,000		N = 500
	J = 30		J = 30
	K = 5		K = 5
Parameter	Bias	RMSE	Bias	RMSE
$β$	0.011	0.191	0.007	0.233
$δ$	0.001	0.291	0.014	0.366
$β^{s}$	0.018	0.099	0.021	0.133
$μ_{β}$	0.036	0.179	0.032	0.186
$μ_{δ}$	−0.129	0.228	−0.118	0.236
$μ_{β^{s}}$	0.016	0.091	0.021	0.102
$λ_{1}$	0.012	0.148	−0.014	0.213
$λ_{2}$	−0.024	0.135	−0.079	0.232
$λ_{3}$	−0.043	0.128	−0.003	0.195
$λ_{4}$	−0.023	0.156	−0.077	0.223
$λ_{5}$	−0.014	0.177	−0.019	0.248
$γ_{1}$	0.041	0.298	0.036	0.381
$γ_{2}$	0.014	0.265	0.037	0.378
$γ_{3}$	−0.001	0.234	0.166	0.516
$γ_{4}$	0.037	0.297	−0.046	0.416
$γ_{5}$	0.043	0.279	−0.014	0.399
$σ_{β}^{2}$	−0.129	0.298	−0.148	0.319
$σ_{δ}^{2}$	−0.043	0.325	−0.069	0.352
$σ_{β^{s}}^{2}$	−0.043	0.073	−0.033	0.073
$σ_{β δ}$	0.165	0.298	0.207	0.337
$σ_{β β^{s}}$	−0.077	0.120	−0.077	0.123
$σ_{δ β^{s}}$	0.063	0.112	0.074	0.124
$σ_{θ^{h} θ^{s}}$	−0.005	0.044	0.001	0.061
$σ_{θ^{s}}^{2}$	0.022	0.042	0.027	0.053
$θ^{h}$	−0.005	0.628	−0.014	0.635
$θ^{s}$	0.002	0.376	0.003	0.372

The bold elements in the table highlight significant differences in the results.

Figure 2.

The trace plots of PSRF values in simulation study I.

In simulation study I, our findings indicate: First, a larger sample size provides a more robust dataset that captures individual responses comprehensively, thereby enhancing the precision of item parameter estimates. For instance, the bias and RMSE for item parameters were significantly reduced when the sample size was increased to 1,000. Second, when focusing on attributes, ACCR reflects the accuracy of classifying each attribute, while PCCR measures the overall accuracy of pattern classification. Our analysis revealed that as the sample size increased, both ACCR and PCCR values improved. For detailed results, please refer to Table 4. These findings emphasize the significance of sample size in improving the precision of item parameter estimation and enhancing the accuracy of attribute classification.

Simulation Study II

The aim of simulation study II was to investigate the parameter recoveries of different attributes under two conditions: (a) 500 examinees, 20 items, five attributes; and (b) 500 examinees, 20 items, three attributes. In both cases, the correlation between $θ_{i}^{h}$ and $θ_{i}^{s}$ was −.3, and the skipping proportion was low (12%).

Table 3 shows the bias and RMSE values for the parameters. The ability parameter $θ^{h}$ improved when the number of attributes ranged from K = 3 to K = 5. Furthermore, the ACCRs and PCCRs are displayed in the last two columns of Table 4. Under the condition of 500 examinees and 20 items, a transition from five attributes to three attributes resulted in changes in ACCRs and PCCRs. ACCRs were higher with three attributes, averaging at 0.975, compared to an average of 0.950 with five attributes. Moreover, the PCCRs significantly increased when the number of attributes decreased. Specifically, the PCCR under five attributes was 0.791 compared with the PCCR under three attributes was 0.931. Consequently, both ACCRs and PCCRs were notably higher under three attributes compared to five attributes when considering the fixed condition of 500 examinees and 20 items.

Table 3.

The Parameter Estimates in Simulation Study II.

	N = 500		N = 500
	J = 20		J = 20
	K = 5		K = 3
Parameter	Bias	RMSE	Bias	RMSE
$β$	0.011	0.291	−0.007	0.259
$δ$	0.015	0.428	0.046	0.354
$β^{s}$	0.022	0.145	0.022	0.134
$μ_{β}$	−0.142	0.275	−0.160	0.276
$μ_{δ}$	0.109	0.289	0.138	0.291
$μ_{β^{s}}$	−0.111	0.191	−0.112	0.191
$λ_{1}$	0.047	0.215	−0.025	0.216
$λ_{2}$	−0.021	0.193	−0.025	0.198
$λ_{3}$	−0.015	0.198	−0.022	0.202
$λ_{4}$	−0.011	0.202	—	—
$λ_{5}$	−0.067	0.257	—	—
$γ_{1}$	0.032	0.473	0.074	0.664
$γ_{2}$	0.034	0.398	0.211	0.782
$γ_{3}$	0.209	0.531	0.116	0.551
$γ_{4}$	0.035	0.398	—	—
$γ_{5}$	−0.023	0.386	—	—
$σ_{β}^{2}$	−0.028	0.424	−0.065	0.389
$σ_{δ}^{2}$	0.170	0.549	0.162	0.501
$σ_{β^{s}}^{2}$	0.180	0.238	0.177	0.235
$σ_{β δ}$	0.018	0.410	0.038	0.373
$σ_{β β^{s}}$	0.032	0.194	0.027	0.178
$σ_{δ β^{s}}$	−0.008	0.199	−0.006	0.191
$σ_{θ^{h} θ^{s}}$	0.001	0.065	−0.006	0.069
$σ_{θ^{s}}^{2}$	0.046	0.071	0.045	0.068
$θ^{h}$	−0.008	0.640	−0.021	0.705
$θ^{s}$	0.010	0.396	0.010	0.395

The bold elements in the table highlight significant differences in the results.

Table 4.

ACCRs and PCCRs in Simulation Studies I and II.

N = 1,000	N = 500	N = 500	N = 500
J = 30	J = 30	J = 20	J = 20
K = 5	K = 5	K = 5	K = 3
0.961	0.960	0.926	0.964
0.956	0.956	0.946	0.982
0.964	0.964	0.968	0.980
0.961	0.962	0.948	—
0.980	0.979	0.964	—
0.852	0.848	0.791	0.931

The bold elements in the table highlight significant differences in the results.

In summary, a larger sample size and longer item length resulted in improved ACCR and PCCR recoveries. Conversely, when the number of examinees or the item length was reduced, both ACCRs and PCCRs decreased. Note that these changes became more apparent when the test length was decreased.

Simulation Study III

Simulation study III aims to investigate the parameter recoveries under different skipping proportions and correlation coefficients. We considered three models: the NMAR, MAR, and HO-DINA models. Two skipping proportions were considered: 12% (low) and 25% (high). In addition, three correlations of $θ_{i}^{h}$ and $θ_{i}^{s}$ were 0 (uncorrelated), −0.5 (medium), and −0.8 (high). As a result, there were 2 × 3 simulation conditions. The data were generated from the proposed model under the NMAR mechanism. There were 30 items answered by 500 examinees, and each item associated with five attributes.

For the conditions with a low skipping proportion, Tables 5 and 6 show the bias and RMSE values for the parameters, including person, item, and attribute parameters of the three models, based on the different correlations between $θ_{i}^{s}$ and $θ_{i}^{h}$ . When the correlation between $θ_{i}^{s}$ and $θ_{i}^{h}$ was 0, the parameter estimates for the three models were quite similar. However, the NMAR model exhibited an advantage as the correlation between $θ_{i}^{h}$ and $θ_{i}^{s}$ increased. For instance, the bias and RMSEs of parameters $β, δ, μ_{β}, μ_{δ}, λ, γ, θ^{h}, and θ^{s}$ for the NMAR model were consistently the smallest in the NMAR model compared to the MAR and HO-DINA models. As the correlation between $θ_{i}^{h}$ and $θ_{i}^{s}$ increased, the performance of the attribute parameters became particularly evident. For example, the RMSE of the attribute parameters $λ and γ$ in the NMAR model were smaller compared with those of the MAR and HO-DINA models. As expected, the NMAR model demonstrated higher accuracy in terms of parameter estimation. Furthermore, because $θ_{i}^{h}$ and $θ_{i}^{s}$ in both the MAR and HO-DINA models were uncorrelated, we observed that the parameter estimates in these models were relatively consistent in simulations that did not account for skipped items.

Table 5.

The Bias of Parameter Estimates in the Models With Low Skipping Proportions and Different $ρ_{θ_{i}^{h} θ_{i}^{s}}$ in Simulation Study III.

Parameter	$ρ =$ 0			$ρ = -$ 0.5			$ρ = -$ 0.8
Parameter	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA
$β$	0.014	0.011	0.010	0.002	0.167	0.167	0.017	0.178	0.177
$δ$	0.001	0.005	0.025	0.001	−0.063	−0.046	0.001	−0.061	−0.041
$β^{s}$	0.019	0.017	—	0.019	0.018	—	0.015	0.013	—
$μ_{β}$	0.041	0.037	0.037	0.027	0.191	0.192	0.045	0.201	0.203
$μ_{δ}$	−0.133	−0.127	−0.107	−0.132	−0.195	−0.178	−0.134	−0.191	−0.173
$μ_{β^{s}}$	0.018	0.017	—	0.018	0.017	—	0.015	0.011	—
$σ_{β}^{2}$	−0.113	−0.106	−0.100	−0.131	−0.212	−0.213	−0.142	−0.223	−0.218
$σ_{δ}^{2}$	−0.056	−0.052	0.145	−0.155	−0.201	−0.019	−0.097	−0.150	0.029
$σ_{β^{s}}^{2}$	−0.031	−0.031	—	−0.036	−0.036	—	−0.030	−0.030	—
$σ_{β δ}$	0.170	0.164	0.144	0.216	0.294	0.288	0.199	0.287	0.279
$σ_{β β^{s}}$	−0.075	−0.075	—	−0.079	−0.078	—	−0.078	−0.075	—
$σ_{δ β^{s}}$	0.062	0.062	—	0.078	0.085	—	0.067	0.071	—
$λ_{1}$	0.042	0.042	0.035	0.051	0.282	0.280	−0.002	0.193	0.196
$λ_{2}$	−0.031	−0.036	−0.028	−0.022	0.218	0.221	−0.058	0.187	0.197
$λ_{3}$	0.055	0.055	0.051	−0.021	0.189	0.187	−0.033	0.163	0.164
$λ_{4}$	0.021	0.017	0.016	−0.013	0.268	0.268	0.012	0.274	0.280
$λ_{5}$	0.001	0.004	0.008	0.023	0.178	0.174	−0.041	0.092	0.093
$γ_{1}$	0.070	0.066	0.067	0.092	0.707	0.696	0.154	0.698	0.690
$γ_{2}$	0.012	0.014	0.027	−0.004	0.488	0.486	0.012	0.562	0.563
$γ_{3}$	0.110	0.110	0.098	0.119	0.510	0.501	0.075	0.429	0.425
$γ_{4}$	0.040	0.036	0.028	−0.102	0.344	0.349	−0.012	0.400	0.400
$γ_{5}$	−0.029	−0.024	−0.012	0.062	0.255	0.248	−0.093	0.070	0.068
$σ_{θ^{h} θ^{s}}$	−0.006	—	—	−0.021	—	—	−0.015	—	—
$σ_{θ^{s}}^{2}$	0.035	0.031	—	0.043	0.040	—	0.024	0.014	—
$θ^{h}$	0.004	0.003	0.003	−0.008	−0.011	−0.012	−0.008	−0.011	−0.010
$θ^{s}$	0.001	0.001	—	−0.003	−0.001	—	0.001	0.001	—

Note. NMAR = not missing at random model; MAR = missing at random model; HO-DINA = higher-order DINA model.The bold elements in the table highlight significant differences in the results.

Table 6.

The RMSE of Parameter Estimates in the Models With Low Skipping Proportions and Different $ρ_{θ_{i}^{h} θ_{i}^{s}}$ in Simulation Study III.

Parameter	$ρ =$ 0			$ρ = -$ 0.5			$ρ = -$ 0.8
Parameter	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA
$β$	0.230	0.232	0.234	0.252	0.385	0.386	0.240	0.372	0.375
$δ$	0.384	0.385	0.391	0.398	0.461	0.454	0.394	0.446	0.448
$β^{s}$	0.132	0.132	—	0.144	0.144	—	0.131	0.132	—
$μ_{β}$	0.188	0.187	0.188	0.182	0.257	0.257	0.184	0.263	0.265
$μ_{δ}$	0.243	0.240	0.245	0.231	0.268	0.267	0.235	0.267	0.269
$μ_{β^{s}}$	0.099	0.100	—	0.101	0.101	—	0.101	0.102	—
$σ_{β}^{2}$	0.313	0.312	0.316	0.325	0.328	0.328	0.315	0.330	0.331
$σ_{δ}^{2}$	0.360	0.358	0.422	0.359	0.364	0.343	0.360	0.360	0.364
$σ_{β^{s}}^{2}$	0.072	0.072	—	0.073	0.073	—	0.072	0.072	—
$σ_{β δ}$	0.324	0.321	0.328	0.341	0.371	0.373	0.333	0.370	0.373
$σ_{β β^{s}}$	0.124	0.123	—	0.125	0.121	—	0.124	0.120	—
$σ_{δ β^{s}}$	0.118	0.118	—	0.123	0.125	—	0.120	0.121	—
$λ_{1}$	0.241	0.237	0.239	0.226	0.361	0.358	0.228	0.325	0.325
$λ_{2}$	0.198	0.199	0.199	0.183	0.293	0.294	0.193	0.293	0.294
$λ_{3}$	0.191	0.190	0.188	0.202	0.284	0.283	0.187	0.266	0.263
$λ_{4}$	0.206	0.205	0.206	0.196	0.343	0.343	0.225	0.355	0.358
$λ_{5}$	0.235	0.236	0.239	0.232	0.285	0.285	0.229	0.247	0.251
$γ_{1}$	0.429	0.434	0.438	0.396	0.839	0.824	0.445	0.847	0.836
$γ_{2}$	0.421	0.426	0.434	0.386	0.638	0.639	0.368	0.722	0.716
$γ_{3}$	0.418	0.417	0.408	0.404	0.668	0.648	0.397	0.603	0.590
$γ_{4}$	0.404	0.401	0.394	0.412	0.539	0.546	0.391	0.578	0.580
$γ_{5}$	0.343	0.347	0.351	0.394	0.444	0.441	0.365	0.364	0.369
$σ_{θ^{h} θ^{s}}$	0.069	—	—	0.063	—	—	0.054	—	—
$σ_{θ^{s}}^{2}$	0.057	0.055	—	0.062	0.061	—	0.051	0.045	—
$θ^{h}$	0.630	0.629	0.629	0.621	0.638	0.638	0.589	0.634	0.633
$θ^{s}$	0.379	0.379	—	0.364	0.381	—	0.329	0.380	—

Note. NMAR = not missing at random model; MAR = missing at random model; HO-DINA = higher-order DINA model.The bold elements in the table highlight significant differences in the results.

For the conditions with a high skipping proportion, as presented in Tables 7 and 8, we observed similar results to those observed in conditions with low skipping proportions. Across both scenarios, the NMAR model consistently outperformed the other two models, particularly in the accuracy of person and item parameter estimates. Attribute parameter estimation also displayed notable differences among the three mechanism models, highlighting the NMAR model’s superior performance.

Table 7.

The Bias of Parameter Estimates in the Models With High Skipping Proportions and Different $ρ_{θ_{i}^{h} θ_{i}^{s}}$ in Simulation Study III.

Parameter	$ρ =$ 0			$ρ = -$ 0.5			$ρ = -$ 0.8
Parameter	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA
$β$	0.021	0.020	0.019	0.018	0.282	0.290	0.023	0.291	0.296
$δ$	0.007	0.008	0.028	−0.017	−0.140	−0.129	−0.019	−0.148	−0.135
$β^{s}$	0.001	0.0001	—	0.004	0.003	—	0.004	0.004	—
$μ_{β}$	0.048	0.044	0.047	0.043	0.303	0.313	0.051	0.313	0.318
$μ_{δ}$	−0.126	−0.123	−0.106	−0.150	−0.269	−0.260	−0.155	−0.278	−0.262
$μ_{β^{s}}$	0.001	−0.002	—	0.003	0.001	—	0.003	0.002	—
$σ_{β}^{2}$	−0.156	−0.152	−0.148	−0.147	−0.208	−0.213	−0.153	−0.217	−0.222
$σ_{δ}^{2}$	−0.084	−0.081	0.113	−0.092	−0.116	0.066	−0.089	−0.109	0.071
$σ_{β^{s}}^{2}$	−0.084	−0.084	—	−0.084	−0.084	—	−0.083	−0.084	—
$σ_{β δ}$	0.210	0.207	0.192	0.203	0.272	0.270	0.205	0.272	0.270
$σ_{β β^{s}}$	−0.074	−0.073	—	−0.071	−0.076	—	−0.074	−0.083	—
$σ_{δ β^{s}}$	0.025	0.025	—	0.017	0.025	—	0.016	0.029	—
$λ_{1}$	0.038	0.040	0.035	0.017	0.487	0.510	0.023	0.566	0.602
$λ_{2}$	−0.030	−0.029	−0.024	−0.110	0.451	0.478	−0.055	0.455	0.467
$λ_{3}$	0.041	0.041	0.037	0.004	0.361	0.362	0.031	0.397	0.389
$λ_{4}$	0.009	0.006	0.017	0.038	0.485	0.499	0.016	0.440	0.458
$λ_{5}$	0.011	0.009	0.003	0.004	0.234	0.230	0.003	0.196	0.185
$γ_{1}$	0.119	0.119	0.104	0.084	1.191	1.230	0.179	1.628	1.717
$γ_{2}$	−0.016	−0.002	0.0003	0.070	1.285	1.367	0.016	1.159	1.158
$γ_{3}$	0.101	0.096	0.086	0.080	0.668	0.672	0.0001	0.633	0.627
$γ_{4}$	0.086	0.081	0.106	0.046	0.733	0.753	0.039	0.689	0.723
$γ_{5}$	0.014	0.012	0.002	0.010	0.301	0.283	0.018	0.220	0.202
$σ_{θ^{h} θ^{s}}$	−0.006	—	—	−0.002	—	—	−0.006	—	—
$σ_{θ^{s}}^{2}$	0.024	0.022	—	0.018	0.016	—	0.023	0.022	—
$θ^{h}$	−0.006	−0.005	−0.006	−0.014	−0.023	−0.022	0.002	−0.006	−0.006
$θ^{s}$	−0.002	−0.003	—	−0.001	−0.002	—	0.003	0.003	—

Table 8.

The RMSE of Parameter Estimates in the Models With High Skipping Proportions and Different $ρ_{θ_{i}^{h} θ_{i}^{s}}$ in Simulation Study III.

Parameter	$ρ =$ 0			$ρ = -$ 0.5			$ρ = -$ 0.8
Parameter	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA	NMAR	MAR	HO-DINA
$β$	0.272	0.276	0.282	0.247	0.511	0.530	0.267	0.544	0.559
$δ$	0.428	0.431	0.438	0.390	0.522	0.534	0.394	0.539	0.552
$β^{s}$	0.108	0.107	—	0.103	0.103	—	0.102	0.102	—
$μ_{β}$	0.188	0.187	0.190	0.187	0.348	0.357	0.190	0.356	0.362
$μ_{δ}$	0.247	0.246	0.254	0.249	0.332	0.334	0.253	0.339	0.339
$μ_{β^{s}}$	0.086	0.086	—	0.084	0.084	—	0.085	0.083	—
$σ_{β}^{2}$	0.331	0.335	0.337	0.328	0.326	0.328	0.334	0.337	0.339
$σ_{δ}^{2}$	0.395	0.395	0.432	0.357	0.358	0.383	0.377	0.367	0.395
$σ_{β^{s}}^{2}$	0.097	0.097	—	0.097	0.097	—	0.097	0.097	—
$σ_{β δ}$	0.358	0.359	0.364	0.340	0.362	0.368	0.349	0.368	0.375
$σ_{β β^{s}}$	0.114	0.114	—	0.113	0.113	—	0.114	0.117	—
$σ_{δ β^{s}}$	0.097	0.097	—	0.093	0.093	—	0.093	0.094	—
$λ_{1}$	0.237	0.234	0.236	0.218	0.551	0.576	0.233	0.641	0.675
$λ_{2}$	0.196	0.198	0.199	0.218	0.527	0.552	0.195	0.528	0.541
$λ_{3}$	0.195	0.193	0.193	0.193	0.415	0.419	0.220	0.456	0.448
$λ_{4}$	0.241	0.238	0.247	0.245	0.551	0.566	0.229	0.505	0.525
$λ_{5}$	0.260	0.252	0.254	0.239	0.332	0.330	0.256	0.310	0.301
$γ_{1}$	0.460	0.448	0.445	0.436	1.346	1.404	0.454	1.862	1.922
$γ_{2}$	0.431	0.445	0.442	0.396	1.457	1.559	0.408	1.376	1.333
$γ_{3}$	0.462	0.444	0.453	0.431	0.787	0.792	0.354	0.754	0.749
$γ_{4}$	0.507	0.496	0.521	0.458	0.864	0.882	0.421	0.818	0.856
$γ_{5}$	0.470	0.462	0.464	0.381	0.448	0.436	0.391	0.430	0.412
$σ_{θ^{h} θ^{s}}$	0.052	—	—	0.055	—	—	0.048	—	—
$σ_{θ^{s}}^{2}$	0.042	0.041	—	0.040	0.039	—	0.042	0.043	—
$θ^{h}$	0.646	0.646	0.645	0.623	0.654	0.654	0.579	0.650	0.651
$θ^{s}$	0.336	0.336	—	0.321	0.332	—	0.296	0.333	—

The bold elements in the table highlight significant differences in the results.

Figures 3 and 4 show the bias of attribute parameters $λ$ and $γ$ in the NMAR and MAR models across different skipping proportions and correlations between $θ_{i}^{h}$ and $θ_{i}^{s}$ . The results indicate that the parameter estimates between the NMAR and MAR models were similar when the correlation between $θ_{i}^{h}$ and $θ_{i}^{s}$ was 0. Regardless of correlation level or skipping proportion, the bias of attribute parameters $λ$ and $γ$ remained close to 0 in the NMAR model, indicating accurate and effective estimation of these attributes. In contrast, when the skipping proportion was high, the bias of the attribute parameters $λ$ and $γ$ of the MAR model were significantly larger than those of the NMAR model. In addition, the RMSE of the attribute parameters $λ$ and $γ$ are shown in Figures 5 and 6. Comparing the NMAR and MAR models under these conditions, we observe that the RMSEs of attribute parameters in the NMAR model exhibited significant improvement. The estimates of attribute parameters became notably more precise with increasing correlation level or higher skipping proportions. In contrast, the RMSEs of attribute parameters in the MAR model were exceptionally large.

Figure 3.

Bias of parameter estimates about $λ$ with different skipping proportions and $ρ_{θ_{i}^{h} θ_{i}^{s}}$ .

Figure 4.

Bias of parameter estimates about $γ$ with different skipping proportions and $ρ_{θ_{i}^{h} θ_{i}^{s}}$ . .

Figure 5.

RMSE of parameter estimates about $λ$ with different skipping proportions and $ρ_{θ_{i}^{h} θ_{i}^{s}}$ . .

Figure 6.

RMSE of parameter estimates about $γ$ with different skipping proportions and $ρ_{θ_{i}^{h} θ_{i}^{s}}$ . .

Figure 7 presents the ACCRs and PCCRs in the NMAR and MAR models under six simulation conditions: low and high skipping proportions, as well as low, medium, high correlations between $θ_{i}^{h}$ and $θ_{i}^{s}$ . Detailed results of the ACCRs and PCCRs are provided in Table S1 in the Supplemental Material. The ACCRs and PCCRs in the NMAR model consistently exhibit a noticeable advantage, especially in scenarios with high skipping proportions and high correlations. This indicates that the NMAR model is more effective in recovering attribute patterns when the missing data mechanism is non-ignorable. Conversely, the MAR model struggles to effectively recover attribute patterns in such scenarios.

Figure 7.

The ACCRs and PCCRs for MAR and NMAR models in simulation study III.

In brief, our results consistently indicate that the NMAR model outperforms both the MAR model and the HO-DINA model, regardless of whether the skipping proportion is high or low, or whether the person correlations are strong or weak. Furthermore, across all scenarios, the NMAR model consistently achieves higher ACCRs and PCCRs compared to the other models. This highlights that the NMAR model excels in delivering more precise parameter estimation and attribute profiles.

Simulation Study IV

In simulation study IV, our aim was to conduct the model selection using Bayesian model assessment criteria, namely, DIC and LPML, based on the six conditions from simulation III. We used 500 examinees who answered 30 items with five attributes. The model selection results are shown in Table 9. There were no significant differences in DIC and LPML among the models when $ρ_{θ_{i}^{h} θ_{i}^{s}}$ was 0. However, when $ρ_{θ_{i}^{h} θ_{i}^{s}}$ deviated from 0, the DIC of the NMAR model was consistently smaller, and the LPML was relatively higher. This suggests that the DIC and LPML criteria effectively facilitated the accurate selection of the true model.

Table 9.

Model Selections Under Different $ρ_{θ_{i}^{h} θ_{i}^{s}}$ and Skipping Proportions.

		Low skipping proportion		High skipping proportion
Correlation		NMAR	MAR	NMAR	MAR
$ρ = 0$	DIC	20,278.23	20,277.16	24,192.70	24,194.22
	LPML	−10,625.71	−10,622.35	−12,639.62	−12,640.24
$ρ = - 0$ .5	DIC	20,077.83	20,373.61	24,100.94	24,563.02
	LPML	−10,481.75	−10,582.50	−12,551.96	−12,680.42
$ρ = - 0$ .8	DIC	20,245.78	20,577.46	24,137.64	24,578.39
	LPML	−10,563.83	−10,677.20	−12,566.58	−12,722.20

In conclusion, the results demonstrate that under various conditions, the NMAR model provides a better fit to the data compared to the MAR model, as evidenced by DIC and LPML values.

Empirical Example

We analyzed the data from the computer-based PISA 2018 mathematics cognitive test in Albania. According to the 2018 PISA mathematics assessment framework, four attributes belonged to mathematical content knowledge: change and relationship $(α_{1})$ , quantity $(α_{2})$ , space and shape $(α_{3})$ , and uncertainty and data $(α_{4})$ . The item IDs and Q matrices are shown in Table 10; these items were dichotomous. Item responses were coded 0 (no credit), 1 (full credit), 6 (not reached), 7 (not applicable), 8 (invalid), or 9 (nonresponse). Because the nonresponses were discrete, we treated code 9 as skipped/omitted responses. Furthermore, with some examinees’ responses removed due to being coded as not applicable or invalid, 798 examinees remained in the data set. Furthermore, another 40 examinees coded 6 (not-reached) were removed. The final sample size is 758. The overall skipping proportion was 4.9%, and the skipping proportions of each item ranged from 0% to 14.8%.

Table 10.

The Q Matrix in the Real Data.

Attribute	CM033Q01	CM474Q01	CM155Q01	CM155Q04	CM411Q01	CM411Q02	CM803Q01	CM442Q02	CM034Q01
$α_{1}$	0	0	1	1	0	0	0	0	0
$α_{2}$	1	0	0	0	0	0	0	0	1
$α_{3}$	0	1	0	0	1	0	0	1	0
$α_{4}$	0	0	0	0	0	1	1	0	0

Tables 11 and 12 show the parameter estimates and standard deviations of the person, item, and attribute parameters. The results indicate that the estimated correlation coefficient between the higher-order ability and skipping ability parameter $σ_{θ^{h} θ^{s}}$ was negative. This suggests that as the higher-order ability decreased, the latent person parameter associated with skipping increased, indicating that examinees with higher higher-order abilities found it more challenging to skip items. The positive estimates for the attribute slope parameters imply that as the higher-order ability parameter $θ_{i}^{h}$ increases, examinees demonstrate a better mastery of the knowledge attribute, which aligns with real-world expectations. Furthermore, the estimated item mean parameter $μ_{β}$ was −2.067, which implies that the mean guessing probability was approximately .11. The correlation coefficient between the item intercept parameter $β_{j}$ and interaction parameter $δ_{j}$ was negative ( $σ_{β δ} < 0$ ). In addition, all item intercept parameters $β_{j}$ were negative, which suggests that the guessing probability of all the items was lower than .5. All item interaction parameters $δ_{j}$ were positive, which satisfied $g_{j} < 1 - s_{j}$ , as expected. The smaller the skipping difficulty parameter $β_{j}^{s}$ , the higher the skipping proportion for this item; for example, for CM155Q01, the estimation of $β_{j}^{s}$ was 2.621 and the skipping proportion was 14.8%. For CM442Q02, the estimation of $β_{j}^{s}$ was 2.998 and the skipping proportion was 11.6%. Although the overall skipping proportion in the real data was 4.9%, certain items exhibited notably high skipping proportions, indicating the need to consider them as missing data.

Table 11.

Estimates and Standard Deviations of the Parameters for the Real Data.

Statistics	$μ_{β}$	$μ_{δ}$	$μ_{β}^{s}$	$σ_{β}^{2}$	$σ_{δ}^{2}$	$σ_{β^{s}}^{2}$	$σ_{β δ}$	$σ_{β β^{s}}$	$σ_{δ β^{s}}$	$σ_{θ^{h} θ^{s}}$
Est.	−2.067	2.471	4.787	2.543	1.537	3.302	−0.870	0.475	−0.634	−1.122
SD	0.475	0.405	0.560	1.688	1.208	2.093	1.141	1.206	0.997	0.142
Statistics	$σ_{θ^{s}}^{2}$	$λ_{1}$	$λ_{2}$	$λ_{3}$	$λ_{4}$	$γ_{1}$	$γ_{2}$	$γ_{3}$	$γ_{4}$
Est.	3.076	−0.218	0.450	0.779	1.414	2.416	2.396	2.171	1.702
SD	0.568	0.397	0.525	0.398	0.653	0.572	0.609	0.416	0.357

Note. Est. = estimated value; SD = standard deviation of the posterior distribution.

Table 12.

Estimates and Standard Errors of the Item Parameters for the Real Data.

Parameter	Statistics	033Q01	474Q01	155Q01	155Q04	411Q01	411Q02	803Q01	442Q02	034Q01
$β_{j}$	Est.	−0.157	−0.532	−1.546	−1.452	−1.837	−1.354	−3.913	−4.430	−3.386
	SD	0.246	0.145	0.537	0.198	0.270	0.182	0.550	0.535	0.734
$δ_{j}$	Est.	2.440	1.618	4.219	1.344	2.573	0.825	3.203	2.990	2.801
	$SD$	0.499	0.250	0.829	0.257	0.375	0.369	0.632	0.588	0.797
$β_{j}^{s}$	Est.	5.907	6.807	2.621	4.637	3.665	5.026	4.887	2.998	7.808
	SD	0.402	0.575	0.185	0.289	0.233	0.325	0.311	0.199	0.904

Note. Est = estimated value; SD = standard deviation of the posterior distribution.

Discussion

This study addresses the prevalent issue of item-level missing data in cognitive diagnostic assessments, where examinees skip certain items. Such missing data poses a risk of providing inaccurate diagnostic feedback and may lead to biased outcomes when compared to those with complete data. Therefore, this study aims to develop a missing data model for skipped items, ultimately providing more accurate cognitive feedback for examinees. This not only facilitates a deeper comprehension of individual cognitive processes but also furnishes more robust data support for educational policies and practices.

Our proposed model combines the DINA model, which captures responses, and the 1PLM, which represents the missing indicator model. A bivariate normal distribution for person parameters and a multivariate normal distribution for item parameters were used to capture the relationships between these two models in a higher-level framework. Our model not only strives for more precise knowledge structures derived from diagnostic feedback but also sheds light on examinees’ response behaviors, offering valuable insights into their cognitive processes.

We employed the Bayesian MCMC algorithm to estimate the model parameters. Through comprehensive simulation studies, we found our model consistently yielded more accurate parameter estimates compared to other methods. These simulation studies also facilitated an in-depth evaluation of the model’s performance, considering various missing mechanisms, proportions of skipped items, and correlations between higher-order ability and potential skipping tendency. Under these diverse simulation scenarios, our model consistently demonstrated reliable estimation accuracy. Furthermore, the HO-DINA model, which characterizes examinees’ cognitive attributes, further enhanced the accuracy of our parameter estimates. Notably, we observed substantial improvements in ACCRs and PCCRs, specifically, ACCRs and PCCRs were significantly higher in the NMAR model, indicating that our model better fits non-ignorable missing data mechanisms. To demonstrate the practical applicability of our model, we successfully applied it to the PISA 2018 computer-based test.

While this study offers valuable insights, it comes with certain limitations, suggesting potential directions for future research. Firstly, it’s essential to consider time constraints as a factor contributing to item skipping. Secondly, we have not explored other potential reasons for skipped items in cognitive assessments. Future research should delve into these additional factors. Lastly, cognitive assessments frequently exhibit various types of item nonresponses, including not-reached and skipped items. Future research could refine our model to address these diverse missing data scenarios.

Conclusion

In this study, we introduced a novel model to address skipped items in cognitive diagnostic assessments, combining the missing data model with the item response model under a cohesive hierarchical structural framework. By leveraging the Bayesian MCMC algorithm, robust parameter estimation results are obtained.

Extensive simulations demonstrated the superior performance of our model across a range of scenarios, consistently producing highly accurate parameter estimates. Notably, our model exhibited a significant advantage in ACCRs and PCCRs when compared to the NMAR and MAR models, particularly in the presence of non-ignorable missing data.

The validation of our model using the 2018 PISA mathematics dataset further highlights its practical effectiveness. Our research has profound implications for cognitive diagnostic assessments, offering a transformative approach that addresses skipped items comprehensively. It lays the foundation for potential advancements in the field, promising more accurate and insightful assessments in the future.

Supplemental Material

sj-docx-1-sgo-10.1177_21582440241288005 – Supplemental material for Modeling Skipped Items in Cognitive Diagnostic Assessments

Supplemental material, sj-docx-1-sgo-10.1177_21582440241288005 for Modeling Skipped Items in Cognitive Diagnostic Assessments by Lidan Liang, Jing Lu, Jiwei Zhang and Guanghui Cheng in SAGE Open

Footnotes

Appendix: Details of the MCMC Algorithm for the Proposed Model

Let $Ω = {θ_{i}^{h}, θ_{i}^{s}, β_{j}^{s}, α_{ik}, β_{j}, δ_{j}, γ_{k}, λ_{k}, μ_{β}, μ_{δ}, μ_{β^{s}}, Σ_{I}, σ_{θ^{h} θ^{s}}, σ_{θ^{s}}^{2}}$ . The posterior distribution of the proposed model is expressed as follows:

p (Ω | Y, D) = Π_{i = 1}^{N} Π_{j = 1}^{J} Π_{k = 1}^{K} P (Y_{ij} = s)^{Y_{ij} = s} \times {[1 - P (Y_{ij} = s)] P (Y_{ij} = 1)}^{Y_{ij} = 1}

\times {[1 - P (Y_{ij} = s)] P (Y_{ij} = 0)}^{Y_{ij} = 0} \times p (θ^{h}, θ^{s}; μ_{P}, Σ_{P})

p (β_{j}, δ_{j}, β_{j}^{s}; μ_{β}, μ_{δ}, μ_{β^{s}}, Σ_{I}) p (Σ_{P}) p (μ_{β}) p (μ_{δ}) p (μ_{β^{s}}) p (Σ_{I})

p (α_{ik}; θ_{i}^{h}, γ_{k}, λ_{k}) p (λ_{k}) p (γ_{k}) .

Following Patz and Junker (1999), a M-H algorithm is employed, and the acceptance rate $\min (1, R_{β, δ, β^{s}})$ , where

R_{β, δ, β^{s}} = \frac{π_{β, δ, β^{s}} (β_{j}^{*}, δ_{j}^{*}, β_{j}^{s *}) Π_{i = 1}^{N} p (Y_{ij} | β_{j}^{*}, δ_{j}^{*}, α_{ik}^{(r - 1)}) p (s_{ij} | β_{j}^{s *}, θ_{i}^{s (r - 1)})}{π_{β, δ, β^{s}} (β_{j}^{(r - 1)}, δ_{j}^{(r - 1)}, β_{j}^{s (r - 1)}) Π_{i = 1}^{N} p (Y_{ij} | β_{j}^{(r - 1)}, δ_{j}^{(r - 1)}, α_{ik}^{(r - 1)}) p (s_{ij} | β_{j}^{s (r - 1)}, θ_{i}^{s (r - 1)})},

and $π_{β, δ, β^{s}}$ is the normal density of the prior for parameter $β, δ, β^{s}$ .

R_{μ_{β}} = \frac{π_{μ_{β}} (μ_{β}^{*}) Π_{j = 1}^{J} P (β_{j}^{(r)}, δ_{j}^{(r)}, β_{j}^{s (r)} | μ_{β}^{*}, μ_{δ}^{(r - 1)}, μ_{β}^{s (r - 1)}, Σ_{I}^{(r - 1)})}{π_{μ_{β}} (μ_{β}^{(r - 1)}) Π_{j = 1}^{J} P (β_{j}^{(r)}, δ_{j}^{(r)}, β_{j}^{s (r)} | μ_{β}^{(r - 1)}, μ_{δ}^{(r - 1)}, μ_{β}^{s (r - 1)}, Σ_{I}^{(r - 1)})},

and $π_{μ_{β}}$ is the normal prior for parameter $μ_{β}$ .

R_{μ_{δ}} = \frac{π_{μ_{δ}} (μ_{δ}^{*}) Π_{j = 1}^{J} P (β_{j}^{(r)}, δ_{j}^{(r)}, β_{j}^{s (r)} | μ_{β}^{(r)}, μ_{δ}^{*}, μ_{β}^{s (r - 1)}, Σ_{I}^{(r - 1)})}{π_{μ_{δ}} (μ_{δ}^{(r - 1)}) Π_{j = 1}^{J} P (β_{j}^{(r)}, δ_{j}^{(r)}, β_{j}^{s (r)} | μ_{β}^{(r)}, μ_{δ}^{(r - 1)}, μ_{β}^{s (r - 1)}, Σ_{I}^{(r - 1)})},

and $π_{μ_{δ}}$ is the normal prior for parameter $μ_{δ}$ .

R_{μ_{δ}} = \frac{π_{μ_{β^{s}}} (μ_{β^{s}}^{*}) Π_{j = 1}^{J} P (β_{j}^{(r)}, δ_{j}^{(r)}, β_{j}^{s (r)} | μ_{β}^{(r)}, μ_{δ}^{(r)}, μ_{β}^{s *}, Σ_{I}^{(r - 1)})}{π_{μ_{β^{s}}} (μ_{β^{s}}^{(r - 1)}) Π_{j = 1}^{J} P (β_{j}^{(r)}, δ_{j}^{(r)}, β_{j}^{s (r)} | μ_{β}^{(r)}, μ_{δ}^{(r)}, μ_{β}^{s (r - 1)}, Σ_{I}^{(r - 1)})},

and $π_{μ_{β^{s}}}$ is the normal prior for parameter $μ_{β^{s}}$ .

$Σ_{I} ~$ Inverse-Wishart $(Σ_{I 0}^{- 1}, v_{I 0})$ ,

the posterior distribution is

$Σ_{I} | Ψ ~$ Inverse-Wishart $(Σ_{I *}^{- 1}, v_{I *})$ ,

where

Σ_{I *} = Σ_{I 0} + S_{Ψ} + \frac{k_{I 0} J}{k_{I 0} + J} (\bar{Ψ} - μ_{I 0}) {(\bar{Ψ} - μ_{I 0})}^{'}

v_{I *} = v_{I 0} + J,

and $S_{Ψ}$ is defined as

S_{Ψ} = \sum_{j = 1}^{J} (Ψ_{j} - \bar{Ψ}) {(Ψ_{j} - \bar{Ψ})}^{'} .

R_{λ} = \frac{π_{λ} (λ_{k}^{*}) Π_{i = 1}^{N} p (α_{ik} | λ_{k}^{*}, γ_{k}^{(r - 1)}, θ_{i}^{h (r - 1)})}{π_{λ} (λ_{k}^{(r - 1)}) Π_{i = 1}^{N} p (α_{ik} | λ_{k}^{(r - 1)}, γ_{k}^{(r - 1)}, θ_{i}^{h (r - 1)})},

and $π_{λ}$ is the normal density of the prior for parameter $λ$ .

R_{γ} = \frac{π_{γ} (γ_{k}^{*}) Π_{i = 1}^{N} p (α_{ik} | λ_{k}^{(r)}, γ_{k}^{*}, θ_{i}^{h (r - 1)})}{π_{γ} (γ_{k}^{(r - 1)}) Π_{i = 1}^{N} p (α_{ik} | λ_{k}^{(r)}, γ_{k}^{(r - 1)}, θ_{i}^{h (r - 1)})},

and $π_{γ}$ is the normal density of the prior for parameter $γ$ .

R_{θ} = \frac{π_{θ} (θ_{i}^{*}) Π_{k = 1}^{K} Π_{j = 1}^{J} p (α_{ik} | λ_{k}^{(r)}, γ_{k}^{(r)}, θ_{i}^{h *}) p (s_{ij} | β_{j}^{s (r)}, θ_{i}^{s *})}{π_{θ} (θ_{i}^{(r - 1)}) Π_{k = 1}^{K} Π_{j = 1}^{J} p (α_{ik} | λ_{k}^{(r)}, γ_{k}^{(r)}, θ_{i}^{h (r - 1)}) p (s_{ij} | β_{j}^{s (r)}, θ_{i}^{s (r - 1)})},

and $π_{θ}$ is the bivariate normal prior of parameter $θ$ , that is, $π_{θ} (θ_{i}^{*}) = p (θ_{i}^{h *}, θ_{i}^{s *} | μ_{P}, Σ_{P}^{(r - 1)})$ .

R_{α} = \frac{π_{α} (α_{i}^{*}) Π_{j = 1}^{J} p (Y_{ij} | β_{j}^{(r)}, δ_{j}^{(r)}, α_{i}^{*})}{π_{α} (α_{i}^{(r - 1)}) Π_{j = 1}^{J} p (Y_{ij} | β_{j}^{(r)}, δ_{j}^{(r)}, α_{i}^{(r - 1)})},

and $π_{α}$ is the prior for parameter $α$ , $π_{α} (α_{i}^{*}) = Π_{k = 1}^{K} p (α_{i}^{*}; λ_{k}^{(r)}, γ_{k}^{(r)}, θ_{i}^{h (r)})$ .

R_{σ_{θ^{h} θ^{s}}} = \frac{Π_{i = 1}^{N} p (θ_{i}^{s (r)} | θ_{i}^{h (r)}, σ_{θ^{s}}^{2 (r - 1)}, σ_{θ^{h} θ^{s}}^{*}) p (σ_{θ^{h} θ^{s}}^{*}) (Φ (\frac{p_{01} - σ_{θ^{h} θ^{s}}^{(r - 1)}}{c_{01}}) - Φ (\frac{- p_{01} - σ_{θ^{h} θ^{s}}^{(r - 1)}}{c_{01}}))}{Π_{i = 1}^{N} p (θ_{i}^{s (r)} | θ_{i}^{h (r)}, σ_{θ^{s}}^{2 (r - 1)}, σ_{θ^{h} θ^{s}}^{(r - 1)}) p (σ_{θ^{h} θ^{s}}^{(r - 1)}) (Φ (\frac{p_{01} - σ_{θ^{h} θ^{s}}^{*}}{c_{01}}) - Φ (\frac{- p_{01} - σ_{θ^{h} θ^{s}}^{*}}{c_{01}}))},

and $p (θ_{i}^{s} | θ_{i}^{h})$ is the conditional density function, and $p (σ_{θ^{h} θ^{s}})$ is a uniform prior density function.

R_{σ_{θ^{s}}^{2}} = \frac{Π_{i = 1}^{N} p (θ_{i}^{s (r)} | θ_{i}^{h (r)}, σ_{θ^{s}}^{2 *}, σ_{θ^{h} θ^{s}}^{(r)}) p (σ_{θ^{s}}^{2 *} | v_{0}, c_{0}) (1 - Φ (\frac{p_{0} - σ_{θ^{s}}^{2 (r - 1)}}{c_{02}}))}{Π_{i = 1}^{N} p (θ_{i}^{s (r)} | θ_{i}^{h (r)}, σ_{θ^{s}}^{2 (r - 1)}, σ_{θ^{h} θ^{s}}^{(r)}) p (σ_{θ^{s}}^{2 (r - 1)} | v_{0}, c_{0}) (1 - Φ (\frac{p_{0} - σ_{θ^{s}}^{2 *}}{c_{02}}))},

and $p (σ_{θ^{s}}^{2} | v_{0}, c_{0})$ is the density function of the inverse Gamma distribution with shape parameter $v_{0}$ and the scale parameter $c_{0}$ .

Acknowledgements

The authors are greatly indebted to the Editor, Associate Editor and three reviewers for their valuable comments and suggestions.

Correction (October 2024):

Article updated to correct the affiliation of Jiwei Zhang from 1 to 2 in the affiliation section.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the general projects of National Social Science Fund of China on Statistics (Grant No. 23BTJ067).

ORCID iD

Jing Lu

Data Availability Statement

The real data set was derived from the following resources available in the public domain: .

Supplemental Material

Supplemental material for this article is available online.

References

Brooks

S. P.

Gelman

(1998). General methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics, 7(4), 434–455.

Chen

M. H.

Shao

Q. M.

Ibrahim

J. G.

(2000). Monte Carlo methods in Bayesian computation. Springer.

Dai

(2021). Handling missing responses in psychometrics: Methods and software. Psyche, 3(4), 673–693.

Dai

Svetina

Chen

(2018). Investigation of missing responses in Q-matrix validation. Applied Psychological Measurement, 42(8), 660–676.

Dai

Svetina Valdivia

(2022). Dealing with missing responses in cognitive diagnostic modeling. Psyche, 4(2), 318–342.

Debeer

Janssen

De Boeck

(2017). Modeling skipped and not-reached items using IRTrees. Journal of Educational Measurement, 54(3), 333–363.

DeCarlo

L. T.

(2011). On the analysis of fraction subtraction data: The DINA model, classification, latent class sizes, and the Q-matrix. Applied Psychological Measurement, 35(1), 8–26.

De La Torre

(2008). An empirically based method of Qmatrix validation for the DINA model: Development and applications. Journal of Educational Measurement, 45(4), 343–362.

de la Torre

(2009). DINA model and parameter estimation: A didactic. Journal of Educational and Behavioral Statistics, 34(1), 115–130.

10.

de la Torre

(2011). The generalized DINA model framework. Psychometrika, 76(2), 179–199.

11.

de la Torre

Douglas

J. A.

(2004). Higher-order latent trait models for cognitive diagnosis. Psychometrika, 69(3), 333–353.

12.

Enders

Keller

B. T.

Bradbury

T. N.

Karney

B. R.

(2022). A Bayesian latent variable selection model for nonignorable missingness. Multivariate Behavioral Research, 57(2-3), 478–512.

13.

Fox

J. P.

(2010). Bayesian item response modeling: Theory and applications. Springer.

14.

Geisser

Eddy

W. F.

(1979). A predictive approach to model selection. Journal of the American Statistical Association, 74(365), 153–160.

15.

Glas

C. A. W.

Pimentel

J. L.

(2008). Modeling nonignorable missing data in speeded tests. Educational and Psychological Measurement, 68(6), 907–922.

16.

Grund

Lüdtke

Robitzsch

(2021). On the treatment of missing data in background questionnaires in educational large-scale assessments: An evaluation of different procedures. Journal of Educational and Behavioral Statistics, 46(4), 4309465.

17.

Guo

Ying

Zhang

(2022). Modeling not-reached items in timed tests: A response time censoring approach. Psychometrika, 87(3), 835–867.

18.

Henson

R. A.

Templin

J. L.

Willse

J. T.

(2009). Defining a family of cognitive diagnosis models using log-linear models with latent variables. Psychometrika, 74(2), 191–210.

19.

Holman

Glas

C. A.

(2005). Modelling non-ignorable missing-data mechanisms with item response theory models. British Journal of Mathematical and Statistical Psychology, 58(Pt 1), 1–17.

20.

Ibrahim

J. G.

Chen

M. H.

Sinha

(2001). Bayesian survival analysis. Springer.

21.

Junker

B. W.

Sijtsma

(2001). Cognitive assessment models with few assumptions, and connections with nonparametric item response theory. Applied Psychological Measurement, 25(3), 258–272.

22.

Liang

Zhang

Shi

(2022). Modeling not-reached items in cognitive diagnostic assessments. Frontiers in Psychology, 13, 889673.

23.

Little

R. J. A.

Rubin

D. B.

(Eds.). (2002). Statistical analysis with missing data (2nd ed.). Wiley.

24.

Lord

F. M.

Novick

M. R.

(1968). Statistical theories of mental test scores. Addison-Wesley.

25.

Wang

(2020). A response time process model for not-reached and omitted items. Journal of Educational Measurement, 57(4), 584–620.

26.

Wang

Shi

(2023). A mixture response time process model for aberrant behaviors and item nonresponses. Multivariate Behavioral Research, 58(1), 71–89.

27.

(2022). A higher-order cognitive diagnosis model with ordinal attributes for dichotomous response data. Multivariate Behavioral Research, 57(2-3), 408–421.

28.

Moustaki

Knott

(2000). Weighting for item non-response in attitude scales by using latent variable models with covariates. Journal of the Royal Statistical Society. Series A, (Statistics in Society), 163(3), 445–459.

29.

OECD. (2013). PISA 2013 technical report. OECD Publishing.

30.

Ömür Sünbül

(2018). The impact of different missing data handling methods on DINA model. International Journal of Evaluation and Research in Education (IJERE), 7(1), 77–86.

31.

O’Muircheartaigh

Moustaki

(1999). Symmetric pattern models: A latent variable approach to item non-response in attitude scales. Journal of the Royal Statistical Society. Series A, (Statistics in Society), 162(2), 177–194.

32.

Patz

R. J.

Junker

B. W.

(1999). A straightforward approach to Markov chain Monte Carlo methods for item response models. Journal of Educational and Behavioral Statistics, 24(2), 146–178.

33.

Pohl

Becker

(2020). Performance of missing data approaches under nonignorable missing data conditions. Methodology, 16(2), 147–165.

34.

Pohl

Gräfe

Rose

(2014). Dealing with omitted and not-reached items in competence tests: Evaluating approaches accounting for missing responses in item response theory models. Educational and Psychological Measurement, 74(3), 423–452.

35.

Pohl

Ulitzsch

von Davier

(2019). Using response times to model not-reached items due to time limits. Psychometrika, 84(3), 892–920.

36.

Rasch

(1966). An item analysis which takes individual differences into account. British Journal of Mathematical and Statistical Psychology, 19(1), 49–57.

37.

Rose

von Davier

(2010). Modeling nonignorable missing data with IRT. Research Report No. RR-10-11. Educational Testing Service.

38.

Rose

von Davier

Nagengast

(2015). Commonalities and differences in IRT-based methods for nonignorable item nonresponses. Psychological Test and Assessment Modeling, 57(4), 472–498.

39.

Rose

von Davier

Nagengast

(2017). Modeling omitted and not-reached items in IRT models. Psychometrika, 82(3), 795–819.

40.

Rubin

D. B.

(1976). Inference and missing data. Biometrika, 63(3), 581–592.

41.

Shan

Wang

(2020). Cognitive diagnosis modeling incorporating item-level missing data mechanism. Frontiers in Psychology, 11, 564707.

42.

Sperrin

Martin

G. P.

(2020). Multiple imputation with missing indicators as proxies for unmeasured variables: Simulation study. BMC Medical Research Methodology, 20(1), 185.

43.

Spiegelhalter

D. J.

Best

N. G.

Carlin

B. P.

Van Der Linde

(2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society Series B (Statistical Methodology), 64(4), 583–639.

44.

Tatsuoka

K. K.

(1983). Rule space: An approach for dealing with misconceptions based on item response theory. Journal of Educational Measurement, 20(4), 345–354.

45.

Templin

J. L.

Henson

R. A.

(2006). Measurement of psychological disorders using cognitive diagnosis models. Psychological Methods, 11(3), 287–305.

46.

Ulitzsch

von Davier

Pohl

(2020a). Using response times for joint modeling of response and omission behavior. Multivariate Behavioral Research, 55(3), 425–453.

47.

Ulitzsch

von Davier

Pohl

(2020b). A multiprocess item response model for not-reached items due to time limits and quitting. Educational and Psychological Measurement, 80(3), 522–547.

48.

Xiao

Bulut

(2020). Evaluating the performances of missing data handling methods in ability estimation from sparse data. Educational and Psychological Measurement, 80(5), 932–954.

49.

Shang

(2018). Identifying latent structures in restricted latent class models. Journal of the American Statistical Association, 113(523), 1284–1295.

50.

Yuan

K.-H.

Jamshidian

Kano

(2018). Missing data mechanisms and homogeneity of means and variances-covariances. Psychometrika, 83(2), 425–442.

51.

Zhan

Jiao

Liao

(2018). Cognitive diagnosis modelling incorporating item response times. British Journal of Mathematical and Statistical Psychology, 71(2), 262–286.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.01 MB