Abstract
Objective
The Montreal Cognitive Assessment (MoCA) Memory Index Score (MIS) is a supplemental assessment of memory composed of word list delayed free-recall followed by step-down category cued- and multiple-choice cued-recall. This paper reviews the MIS literature within Alzheimer’s and other neurodegenerative dementias to synthesize evidence regarding its clinical utility, identify gaps, and inform future research directions.
Method
We searched electronic databases of OVID Medline, Embase, PsycINFO, and PubMed from 2014, when the MIS was first described, to July 2025. Peer-reviewed studies that reported data on the diagnostic or prognostic utility of the MIS in assessing neurodegenerative dementia populations were included.
Results
We screened 278 articles, and 14 were included in the review. The current literature includes limited reporting on the diagnostic or prognostic utility of the MIS and is characterized by minimal diversity of samples and non-rigorous validation methods. Initial findings are promising and suggestive of incremental validity over the MoCA total score for identifying episodic memory impairment and therefore aiding in differentiation of suspected dementia etiology. However, evidence is insubstantial for the MIS as a tool for predicting progression and additional research is needed to evaluate the incremental validity of the MIS over the conventional MoCA five-word recall score.
Conclusions
Large literature gaps exist regarding the clinical utility of the MIS within neurodegenerative dementias. Additional research exploring the psychometric properties of the MIS using diverse samples with rigorous validation methods is needed to better inform its application.
Keywords
The Montreal Cognitive Assessment (MoCA) is a cognitive screening instrument developed to detect mild cognitive impairment (MCI), which was supported by initial findings demonstrating that the MoCA total score (TS) had excellent sensitivity (90%) and good specificity (87%) in the detection of MCI. 1 The MoCA-TS ranges from 0 to 30 points and assesses various cognitive domains, including visuospatial and executive function (5 points), naming (3 points), attention (6 points), language (3 points), abstraction (2 points), orientation (6 points), and delayed free-recall (5 points). The MoCA has steadily grown in popularity as a cognitive screener across clinical and research settings due to its cost-free availability, time-efficient administration, and effectiveness in detecting MCI, outperforming other cognitive screening instruments such as the Mini-Mental State Examination (MMSE).2,3 Julayanont and Nasreddine (2017) claimed the MoCA’s effectiveness in detecting amnestic MCI (aMCI) may be attributable to the MoCA’s delayed-recall component, which uses more words (5 vs 3), fewer opportunities for learning (2 vs up to 6), and a longer delay (5 minutes vs 2 minutes) than the MMSE. 4
Domain-specific MoCA Indices as Described by Julayanont et al (2014). 5
Julayanont and colleagues (2014) suggested that the addition of cued memory performance within the MIS may help assess underlying neurodegenerative progression. 5 They posited that in the early stages of Alzheimer’s disease (AD), executive functions compensate for hippocampal dysfunction, resulting in benefit from cues to aid with memory recall. As the disease progresses, executive regions are increasingly affected, and memory encoding as well as retrieval performance decline. Therefore, encoding deficits are indicative of more advanced disease progression and an increased likelihood of conversion from MCI to AD. Supporting this theory, research suggests that encoding deficits are the primary characteristic of aMCI and thus AD risk. 6 Further, impaired cued-recall performance is more closely associated with neuropathological AD changes and more accurately classifies aMCI to AD converters than free-recall performance.7,8 Therefore, the incorporation of cued-recall by the MIS may provide valuable prognostic information with those at the highest risk of AD conversion performing poorly.
In addition to this prognostic use, the MIS may have additional clinical value such as classifying individuals into stages of cognitive decline and informing inference about probable dementia etiology. Delayed word list recall tasks, such as the California Verbal Learning Test, Rey Auditory Verbal Learning Test, and Hopkins Verbal Learning Test, are instrumental to the neuropsychological assessment of neurodegenerative dementias, and tasks that assess the multidimensional nature of memory deficits (ie, encoding, consolidation, and retrieval) are particularly useful in the assessment of amnestic disorders.3,9-12 In AD, word list delayed free and cued-recall tasks have shown strong diagnostic accuracy, such as differentiating those with AD from healthy controls, suggesting that it may serve as a potential cognitive marker of AD. 13 Further, performance on free and cued-recall tasks can aid in differentiating between various forms of neurodegenerative disease.14,15 For example, Van Liew et al. (2016) found that individuals with Huntington’s disease (HD) and those with AD had poorer free recall than normal controls but those with HD showed greater improvement with cues (characteristic of a retrieval deficit) than those with AD (characteristic of an encoding deficit). 16 Given the well-established use of the MoCA and the theoretical advantages of the MIS, the potential incremental clinical utility of the MIS warrants further evaluation as it could improve the efficacy of MoCA assessment without the need for additional time or resources.
While the step-down retrieval cues of the MIS are not required to derive the MoCA TS, the MIS has been added to the standard MoCA 8.x record form as a supplemental measure, leaving researchers and clinicians with the decision to administer the cued-items and interpret this score. 5 However, its psychometric properties and clinical utility remain unclear. Clinicians and researchers would benefit from knowing the scope of existing evidence evaluating the MIS. Yet, to our knowledge, no review exists. Therefore, the current paper aims to survey the empirical literature on the utility of the MIS within neurodegenerative dementia populations. The research reviewed is summarized with respect to the following 4 broad questions that clinicians and researchers are likely to have: (1) whether the available evidence suggests that the MIS is useful above and beyond the MoCA-TS and MoCA free-recall score (ie, has incremental validity), (2) what applications of the MIS are empirically supported (ie, predicting dementia progression, differentiating dementia syndromes, etc.), (3) what methodological themes, limitations, and gaps exist, and (4) what future research is needed to augment understanding of the clinical utility of this supplemental score.
Methods
The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) scoping review guidelines were followed. 17 Review protocols were not pre-registered. A medical librarian was consulted to ensure a comprehensive database search strategy. Following pilot testing in the databases, an a priori search protocol was developed.
Search Strategy
OVID Medline, Embase, PsycINFO and PubMed were searched for publications between January 2014 to July of 2025. Results were limited to articles in the English language published since 2014, as the MIS was first described at this time. 5 Boolean operators (AND, OR, NOT) with a combination of Medical Subject Headings (MeSH) and search terms were used (see Supplemental Table 1 for the full list of search criteria). A manual search of reference sections of discovered articles was also performed to identify additional articles possibly omitted during the database inquiry process.
Selection Criteria and Screening
Articles were included if they were: (1) peer-reviewed, (2) utilized the MIS as defined by Julayanont et al (2014), and (3) reported upon its diagnostic or prognostic utility within neurodegenerative dementia populations. Articles were excluded if they were not: (1) available in full text and (2) written in English.
We managed our systematic screening process using Covidence. 18 After duplicate articles were automatically removed by this software, 2 authors (OK and AK) independently reviewed titles and abstracts to determine eligibility based on the above-outlined criteria. Any discrepancies were collaboratively resolved. Next, full-text articles were independently reviewed by 2 authors (OK and AK), and a third author (JS) resolved any differences. If provided, the following information was extracted from the included articles: authors, title, sample size, population, race, research aim(s), mean and variance of MoCA-TS, mean and variance of MIS, and results related to validity, reliability, or diagnostic accuracy. Data extraction was completed by one author (OK).
Synthesis of Evidence
Studies Examining the Clinical Utility of MoCA-MIS in Neurodegenerative Dementia Populations.
aA non-English version of the MoCA was utilized.
bOther etiologies of neurocognitive dementias (n = 832) were not included in the analysis and were rather used as a reference group.
Participant Demographics and MoCA Performance Across Studies by Diagnostic/Etiologic Group.
aThe study grouped those with MCI and dementia.
bA non-English version of the MoCA was utilized.
Results
Selection of Sources of Evidence
Figure 1 illustrates the screening process and final studies identified from each source. Of the 278 studies initially identified, only 10 met all selection criteria. During the study selection process, 4 additional articles were identified that met most inclusion criteria, but did not evaluate the MIS as computed according to Julayanont et al (2014) guidelines.16,19-21 Rather, these 4 articles examined MoCA free- and cued-recall items independently rather than aggregated as the MIS. These articles were relevant to this review and are therefore included in the narrative synthesis below. PRISMA flow diagram of database search.
Characteristics of Sources of Evidence
The majority of studies utilized English language versions of the MoCA (
Predicting Progression of Cognitive Decline
Classification of MCI to Alzheimer’s Dementia Converters Adapted From Julayanont et al (2014). 5
The authors proposed that sensitivity and specificity of the MIS for identifying MCI to AD converters could not be provided, as individuals within the normal range on MoCA scores were not followed up or included in the analysis. However, it is unclear why the sensitivity and specificity of the MIS and the combined algorithm was not provided within the specific context of an MCI memory clinic sample, a circumstance relevant to many clinicians and researchers assessing the likelihood that individuals with MCI will progress to develop AD. The low sensitivity is concerning given the use of the MoCA as a screening tool, as sensitivity is typically prioritized during screening to alert the need for comprehensive assessment.
26
Further, the study had several noteworthy limitations, including potential bias, given that the primary developer of the MoCA was an author. Additionally, a relatively short (
Differentiating Between Neurodegenerative Diseases and Dementia Syndromes
Two studies using NACC samples have reported upon the differential utility of the MIS in assessing neurodegenerative diagnoses.27,28 However, it is important to emphasize that a limitation of this research is its lack of in vivo or postmortem biomarker consideration, which are becoming increasingly central to making neurodegenerative etiologic diagnoses.29,30 Therefore, this research fails to make a direct comparison between neurodegenerative-specific pathologies and the MIS. Rather, it relies on clinician diagnosis of clinical syndromes, again resulting in circularity.
Ang and colleagues (2023) explored the ability of MoCA domain-specific index scores, including the MIS, to discriminate between various etiologies in the early stages of neurocognitive disorders including individuals with MCI or dementia (n = 6584).
27
Using multinomial logistic regression, findings supported the discriminative validity of the MoCA domain-specific index scores, as they were reflective of hallmark patterns observed in the most common forms of neurodegenerative dementias. Regarding the MIS, a relative risk ratio score of 1.15 for AD, 1.00 for Lewy body dementia, 1.01 for frontotemporal dementia, and 1.00 for vascular disease were observed, with AD being the only statistically significant result (
Another study examined whether MoCA-TS, domain-specific scores, and the MIS could effectively differentiate between individuals classified as having normal cognition, AD, or a language-based dementia, Primary Progressive Aphasia (PPA). 28 In the NACC sample, 153 individuals (n = 33 with mild AD, n = 36 with PPA, and n = 83 normal controls) completed the MoCA. Using a logistic regression model, they found that although MoCA-TS did not differ between AD and PPA patients, domain-specific index scores were different between the groups and helped differentiate by diagnosis. Specifically, findings suggested that higher MIS and orientation index scores predicted a lower likelihood of having an AD vs PPA diagnosis, with an MIS odds ratio of 0.53. These findings provide some support for the utility of domain-specific measures within the MoCA in differentiating diagnoses. Furthermore, they demonstrate that the MIS is sensitive to memory impairment, with individuals with diagnoses that are typified by memory impairment performing worse than those without central memory impairment.
Differentiating Between Stages of Cognitive Decline
Four studies have reported on the MIS’s ability to differentiate between stages of cognitive decline. The first study by Huang et al. (2018) compared the MoCA-Basic (a modified MoCA incorporating items that are thought to be less influenced by education effects
31
), MIS, and an index of non-memory items in differentiating Chinese participants into normal cognition (n = 520), MCI (n = 663), mild AD (n = 345), and moderate AD (n = 520).
32
Across low (≤6 years), mid (7-12 years), and high (>12 years) levels of education, the MIS differentiated MCI vs normal cognition with AUCs being 0.82-0.86 and an optimal cutoff of <8, outperforming a MoCA score comprised of non-memory items (AUCs 0.71-0.77,
A study by Goldstein et al. (2018) included participants with normal cognition (n = 295), MCI (n = 471), and AD (n = 150) from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database.
33
Analysis of covariance was used to compare MoCA-TS, MIS, and domain-specific index scores in their ability to differentiate between cognitive stages. Effect sizes were interpreted as the differential ability between groups. The MoCA-TS demonstrated the largest effect sizes (
The third study by Dodge et al. (2020) evaluated how MoCA scores differentiated between CDR scores in the NACC cohort, specifically among White and Black participants. 23 They also reported on the classification accuracy of various other neuropsychological assessment instruments and a Memory Composite Score (including Immediate Craft Story Recall with paraphrase scoring, Delayed Craft Story Recall with paraphrase scoring, and total score for delayed recall of Benson figure), allowing for comparison between the MIS and other variables. They utilized global CDR scores of 0, 0.5, and 1, corresponding to normal, very mild dementia, and mild dementia, respectively. 34
Results from Dodge et al (2020) showed that the MIS had AUC values of 0.78 in differentiating CDR 0 vs 0.5, 0.95 for CDR 0 vs 1, and 0.76 for CDR 0.5 vs 1. In comparison with the other MoCA domain-specific-index scores in the total sample, the MIS displayed the best classification accuracy at each stage, except for the MoCA orientation index more accurately classifying CDR 0.5 vs 1 (AUC = 0.79). In contrast, the MoCA-TS demonstrated superior classification accuracy when compared to the MIS (CDR 0 vs 0.5 AUC = 0.79, CDR 0 vs 1 AUC = 0.97, and CDR 0.5 vs 1 AUC = 0.81), suggesting that the MoCA-TS was a more useful tool overall. Similarly, the Memory Composite Score had better classification accuracy than the MIS across the groups (CDR 0 vs 0.5 AUC = 0.81, CDR 0 vs 1 AUC = 0.96, and CDR 0.5 vs 1 AUC = 0.80). However, when comparing the Memory Composite Score to the MIS, only the CDR 0 vs 0.5 had a significant difference, suggesting that the MIS performed similarly to the more comprehensive memory assessment in classifying CDR 0 vs 1 and CDR 0.5 vs 1. Notably, the MoCA-TS, MIS, and Memory Composite Score all displayed higher classification accuracy than a Global Composite Score, which included more comprehensive neuropsychological instruments assessing memory, language, attention, executive function, and visuospatial functioning (CDR 0 vs 0.5 AUC = 0.75, CDR 0 vs 1 AUC = 0.91, and CDR 0.5 vs 1 AUC = 0.72).
Importantly, findings from Dodge et al (2020) also suggested that racial differences may exist when differentiating individuals by CDR scores with the MIS. Within Black participants specifically, the findings were largely consistent, but some remarkable differences emerged. Namely, there was a significant difference between the CDR 0 vs 0.5 AUC values, with the MIS having lower accuracy in Black participants (AUC = 0.70) than in White participants (AUC = 0.80). This difference was not significant at other CDR score levels. Furthermore, the MoCA executive index showed higher accuracy in differentiating CDR 0 vs 1 (AUC = 0.95) and CDR 0.5 vs 1 (AUC = 0.85). Similarly, the MoCA orientation index more accurately differentiated CDR 0 vs 1 (AUC = 0.94) and CDR 0.5 vs 1 (AUC = 0.83). These differences indicate that the MIS may be less accurate in classifying early stages of cognitive decline in Black older adults. These discrepancies may be largely attributable to differences in educational attainment and quality. Therefore, normative educational adjustments to the MIS may help attenuate this inconsistency.
The fourth paper by Kaur and colleagues (2018) compared the MIS to Craft Story 21 delayed story recall, a measure which involves learning and remembering a short story after a 20-minute delay,35,36 for differentiating normal controls (n = 2205) from individuals with aMCI (n
Association with Other Neuropsychological Memory Measures
Regarding correlation with other neuropsychological memory variables, Ang et al. (2023; described above), reported on the construct validity of the MIS as it relates to other neurocognitive measures in the NACC dataset within a larger sample comprised of 14,571 individuals with normal cognition (n = 7987), MCI (n = 3534), and dementia (n = 3050).
27
Using z scores of common neuropsychological assessment tools, the correlation between the MIS and visuospatial (Benson Complex Figure Copy), immediate memory (Craft Score 21 Immediate Recall), delayed memory (Craft Story 21 Delayed Recall and Benson Complex Figure Recall), language (Multilingual Naming Test and Verbal Fluency – L-Words), attention (Number Span Test Forward and Number Span Backward), processing speed (Trail Making Test Part A) and executive function (Trail Making Test Part B) were computed. Findings supported the notion that the MIS is more highly associated with memory than non-memory measures, but the correlations with memory measures were only weak to moderate. Specifically, correlations of
Similarly, Kim et al (2021) evaluated the correlation between the MIS and the Seoul Neuropsychological Screening Battery, 2nd Edition (SNSB-II) in a Korean speaking population of participants with aMCI (n = 104), VaMCI (n = 74), AD (n = 73), and VaD (n = 41). They found that the MoCA MIS displayed moderate correlations with the memory measure of the SNSB-II within the aMCI (
Association with Anatomical Correlates of Memory
Some evidence exists supporting an association between the MIS and neuroanatomical structures implicated in memory. One study investigated the association between the MIS scores and hippocampal volume in a mixed clinical sample of 138 individuals presenting to a clinic with memory complaints.
38
Ritter and colleagues also investigated the performance on delayed-recall items alone, enabling evaluation of whether cued-recall was additive in this association. Findings supported the hypothesis that lower scores on the MIS and MoCA delayed-recall items were weakly to moderately associated with lower hippocampal volume in both the left and right hemispheres; however, there was not a statistically significant difference between the 2 MoCA memory scores. The association of MIS scores was similar for left hippocampus (
Association with Neuropathological Markers of Neurodegenerative Disease
Cayir et al (2025) analyzed the association of the MoCA TS and MIS with cerebrospinal fluid-derived tau markers in 61 patients with AD (n = 33) and FTD (n = 28). In the FTD group, total-tau (t-tau) was significantly inversely moderately associated with the MoCA TS (
Studies Investigating Free-, Category Cued-, and Multiple Choice Cued-Recall Performance Independently
During the search process, 4 studies were identified that investigated the memory components underlying the MIS, but these studies did not report on the MIS as scored using the Julayanont et al. (2014) MIS paradigm.
5
They are included here as they report information relevant to the utility of the MIS in neurodegenerative populations and are therefore relevant to the aims of this review. The first of the studies reported upon the association of MoCA memory scores and hippocampal volume. Hari and colleagues (2024) investigated the association between MoCA memory subdomains and hippocampal volume as measured by T7 MRI in various regions of interest within the hippocampus in individuals who were diagnosed with probable Alzheimer’s disease (n = 24) recruited from local memory clinics.
20
They examined associations between MoCA memory scores and the cornu ammonis (CA) areas (CA1, CA2, and CA3), the dentate gyrus, entorhinal cortex, the hippocampal tail, and the subiculum using Spearman’s correlations. After correcting for multiple comparisons, their findings demonstrated a significant association between the dentate gyrus and category-cued memory performance (
Li et al. (2018) investigated the Beijing version of the MoCA and its memory subdomain scores in differentiating single-domain aMCI (n = 56), multi-domain aMCI (n = 32), and non-amnestic MCI (n = 33) from those with normal cognition (n = 53), while also reporting on the association with Rey Auditory Verbal Learning Test (RAVLT) and the Rey-Osterrieth complex figure (ROCF) delayed recall scores.
21
Importantly, the authors reported the category and multiple-choice cued-recall scores but did not clarify how these scores were derived (ie, simply stating 0-5 points without specifying how it was scored if the word was recalled accurately at the prior recall level). In identifying single- or multi-domain aMCI, the free-recall (AUC = 0.78) and category cued-recall (AUC = 0.79) were similar to the MoCA TS (AUC = 0.79) but were inferior to the RAVLT (AUC = 0.91) and ROCF (AUC = 0.87). Although, it is important to acknowledge that the RAVLT and/or ROCF scores were used to classify participants as having aMCI. The multiple-choice cued-recall had the lowest discrimination (AUC = 0.69) of aMCI from normal cognition. As may be expected, a notable strength of the MoCA memory subdomain scores over the TS was in identifying single-domain aMCI (AUC values for TS = 0.63 vs free-recall = 0.72 and category cued-recall = 0.69), suggesting they may be more effective in identifying focal memory deficits. AUC values were lower for differentiating non-aMCI from normal cognition for all the MoCA memory subdomain scores (AUCs = 0.56-0.69) and the TS (AUC = 0.69). Notably, the MoCA memory subdomain scores showed moderate positive association with the RAVLT (Pearson’s
De Wit and colleagues (2022) reported upon the added utility of cued-memory items of the MoCA in detecting memory impairment.
19
Using data from the Alzheimer’s Disease Neuroimaging Initiative, they examined how the MoCA memory scores predicted impaired memory (classified by a RAVLT score 2 standard deviations below the mean) in a sample of 719 normal controls and 601 individuals with MCI. They used the free-recall score (ranging from 0-5 points) and a combined MoCA cued-recall score (also ranging from 0-5 points, with 1 point for
Another study evaluated the MoCA memory subdomains independently using maximum likelihood models. 16 They sought to assess performance on free-recall, category cued-recall, and multiple-choice cued-recall items in a sample of 327 individuals with Huntington’s Disease (HD; n = 70), AD (n = 64), and normal controls (n = 183) from California dementia research center samples. They observed that patterns of memory performance differed between the HD, AD, and normal control groups. While controlling for age and education, they found that participants with AD and HD both performed worse on the free-recall items of the MoCA than the normal controls, but the AD and HD participants displayed no difference from one another. In contrast, with category cueing, the normal controls and HD participants both benefited from cues, while the AD participants did not. Finally, on the multiple-choice cued-recall items participants with HD and AD performed worse than the normal controls, while individuals with HD were more likely to correctly respond than those with AD. This study supports the notion that the multidimensional memory items of the MoCA have utility for differentiating dementia etiologic diagnoses, indirectly offering support for the MIS for this purpose. However, the authors explained that the detail provided by examining performance at each level independently may be lost with an average across all levels, as is the case with the MIS.
Discussion
This comprehensive review suggests the MIS can add incremental validity to the MoCA as there is some empirical support for select use cases. However, the available evidence on the clinical utility of the MIS is sparse. Various gaps in the literature were identified, including limitations with methodological rigor (eg, lack of blinding to MoCA scores when evaluating diagnostic utility and lack of reporting on MoCA version utilized) and non-diverse research samples with heavy reliance on the NACC cohort. Therefore, future research using diverse community samples is needed to better evaluate the psychometric properties of the MIS and inform its utility in non-White individuals. While some studies provide normative data on the MIS for specific populations, such as Dutch, Brazilian, and Italian samples, additional research from other representative community standardization samples would enhance the inferences that could be derived from the MIS and lay the groundwork for wider application.40-42
Despite early claims that the MIS may be a useful predictor of progression from MCI to AD, 4 only one study has investigated this particular use. 5 The findings of this sole investigation were limited by a short follow-up period and a lack of reporting on sensitivity and specificity with the studies suggested algorithm displaying very low sensitivity for predicting MCI to AD progression as was highlighted in this review. Thus, evidence supporting this claim is currently unsubstantiated.
Regarding the broader utility of the MIS, published research provides initial support for its use in neurodegenerative dementia populations, as the MIS shows weak to moderate correlations with neuropsychological memory measures and patterns of hippocampal atrophy typically seen in this clinical group.21,27,38 Furthermore, the MIS may aid in differentiating dementia syndromes and stages of cognitive decline.21,23,27,28,33 However, it is noteworthy that the MoCA-TS outperformed the MIS as a stand-alone measure for classifying stages of dementia progression in the examined studies, while the MIS can theoretically help identify specific cognitive deficits in memory to aid in assessing different clinical syndromes with poor recall. A major limitation of the MIS is its narrow range, resulting in potential ceiling effects in healthy individuals and floor effects in populations with dementia, although scores differed substantially across studies (eg, 0.96 ± 2.04 in the “moderate AD sample” in Huang et al [2018] vs 5.88 ± 3.8 in the AD sample from Cayir et al [2025]).32,43 These findings suggest that the MIS may be a valuable supplement to TS but is clearly not a substitute for comprehensive memory assessment as part of a neuropsychological evaluation. Therefore, the MIS should be interpreted within the context of additional assessment data and clinical information when available.
This review is unable to sufficiently answer several important questions that arise regarding the MIS. Firstly, the added value of cued-recall items within the MoCA is unclear (ie, how does the MIS differ from delayed free-recall score, and does it warrant the extra time required). Only 2 of the included studies directly compared the MIS to the conventional five-word delayed-recall score.37,38 In addition, 4 studies were identified that investigated free-, category cued-, and multiple-choice cued-recall on the MoCA; however, the studies did not follow the MIS scoring recommendations and thus did not allow direct comparisons between independent subdomains scores and the MIS aggregate.16,19,20 Although limited, some evidence does suggest that MoCA cued-recall items improve identification of memory impairment above and beyond delayed free-recall performance. 19 Moreover, on average, performance on the various memory subdomains of the MoCA does differ between clinical diagnoses, suggesting that differential assessment of specific memory deficits may be possible. 16 These findings lend support to the notion that the MIS adds incremental validity to the MoCA beyond the conventional five-word recall score, but additional research is needed to understand the clinical value of this difference.
Another important question that cannot be conclusively answered currently is how future iterations of the MoCA might improve its memory components to increase its utility in neurodegenerative dementias. The MIS step-down cues provide aid to retrieval while simultaneously assessing memory subdomains, ultimately helping reduce floor effects which are common on the MoCA in amnestic populations.32,43,44 Nevertheless, this step-down procedure and scoring paradigm comes with limitations. Performance at a prior level subsequently affects performance at the following level. For example, if an individual fails to freely recall any of the 5 words, they will have more opportunities to obtain points during the cued-recall. This limits one’s ability to assess cued-recall independently of free-recall performance. In addition, the point value assigned to each level of the MIS appears to be selected based on difficulty (with 3 points for free-recall being the most difficult, and one point for multiple-choice cued-recall being easiest). However, it is not clear whether the proportional difference between these values reflects meaningful differences clinically. Moreover, as expressed by Ritter et al. (2017), free-recall performance is heavily weighted in the MIS. 38 Ultimately, assessing memory performance collectively with the MIS, rather than examining performance at each stage individually, limits the nuance of information that can be extracted. 16 Thus, modification of the scoring model of the MIS, the general structure of the MoCA memory measures, or future iterations of neurodegenerative dementia screening measures may be warranted. To this end, a clearly defined goal of the MIS is necessary. If the goal is indeed to predict progression to AD in MCI subjects as stated by Julayanont and Nasreddine (2017), then alterations may be warranted given the questionable results of the single investigation.4,5
While our review focuses on the clinical utility of the MIS, it is important to recognize that this score is not typically interpreted in isolation in clinical settings. Rather, the clinical determination of diagnosis or prognosis requires a comprehensive evaluation, factoring in a wide variety of confirmed diagnostic features and/or risk factors. Moreover, the MIS was developed as a brief screening tool and not through rigorous psychometric development methods (eg, item response theory, item internal consistency, or identifying ideal items through larger item pools). According to Julayanont and Nasreddine (2017), the MoCA was developed using “clinical intuition” to screen for MCI and later the MIS was added “to predict AD conversion among patients with MCI.” 4 However, this prognostic MIS application has not been validated through the existing literature. In addition to altering the task procedures, the MoCA may further improve its diagnostic and prognostic utility through robust empirical, rather than intuition-based, psychometric item development.
Furthermore, additional evidence demonstrating the test-retest reliability, reliable change, and sensitivity to change in clinical populations would strengthen evidence for the utility of this brief tool. Only one study investigated the test-retest reliability of the MIS which did not include a clinical sample. In a Dutch sample, the intra-class correlations across MoCA versions 7.1, 7.2, and 7.3 were poor between 7.1 to 7.2 (0.32) and fair for 7.1 to 7.3 (0.48), indicating a reliable change score of ±3.4 and ±3.0, respectively, after a mean interval of 22.1 days. 45 This suggests that there is substantial variability across versions, even within healthy participants over a short follow-up period. Therefore, a MIS change of greater than 3 or 4 points is needed to affirm statistical significance in this sample of healthy individuals which is a large portion of the overall range, and replication in cognitively impaired groups across MoCA versions and translations is needed.
This review, along with the broader literature on this topic, has several noteworthy limitations. First, as we have outlined, much of the investigation has utilized NACC data, meaning that several of these studies may have examined MoCA scores in overlapping subjects limiting the diversity represented in the MIS literature. Second, no formal risk-of bias or critical appraisal protocols were followed to assess the quality of included studies, and we are therefore unable to assure the quality of the research examined in this review. Third, it is possible that this review overlooked studies reporting on the clinical utility of the MoCA memory subdomains, as we did not aim to comprehensively capture all available literature on the MoCA free, category-cued, or multiple-choice cued-recall memory domains independently but rather as aggregated in the MIS. Fourth, much of the existing research on the MIS has limited interpretability, as studies report statistically significant differences between group or within regression models in large cohort studies rather than providing diagnostic accuracy statistics. 46 Finally, this review is limited by the available evidence on this topic. Despite broad inclusion criteria and search strategy, only 14 studies were ultimately retained and included in this review, some of which included the MIS as only a peripheral variable. These factors limit these findings and highlight the need for additional research on the MIS.
Future studies should address the limitations of the literature highlighted in this review. Firstly, research within diverse racial, ethnic, and diagnostic populations is needed. In addition, future research on the diagnostic or prognostic utility of the MIS should report results using clinically interpretable statistical methods, such as sensitivity, specificity, and positive/negative predictive values. Furthermore, research investigating the diagnostic utility of the MIS should ideally blind clinicians to MoCA scores during diagnostic consideration, as this would remove potential circularity involved in comparing MoCA scores to diagnosis. Relatedly, the association between the MIS and other measures, such as functional decline, collateral reports, in vivo and post-mortem neurodegenerative biomarkers, and functional and anatomical neuroimaging must be explored to extend understanding of how the MIS relates to features of neurodegenerative disease. Critically, if the goal of the MIS is to predict conversion from MCI to AD as suggested by Julayanont and Nasreddine (2017), longitudinal research over longer-term follow-up is needed to substantiate claims that it is an effective tool for identifying MCI to AD converters. 4 In doing so, it may be necessary to explore the scoring and task procedures of the MIS, tailoring them to the desired goal.
In conclusion, the MIS shows some promise as a supplemental tool within MoCA screening. However, research to date is limited, and well-substantiated conclusions cannot be made regarding its overall clinical utility within neurodegenerative dementias. The MIS provides novel information about cued-recall performance, with some findings suggesting value above and beyond the TS for detection of memory impairment and aiding in differentiation of dementia syndromes. Evidence supports the notion that the MIS correlates with expected neurocognitive variables and has potential to contribute to addressing various clinically relevant questions. Nevertheless, as a stand-alone measure, existing literature does not validate the MIS for differentiating between stages of cognitive decline, differentiating dementia etiologies, nor predicting MCI to AD progression at present. Therefore, the MIS should be interpreted within the context of the TS and more comprehensive neuropsychological assessment.
Supplemental Material
Supplemental Material - A Scoping Review of Clinical Utility From the Montreal Cognitive Assessment Memory Index Score
Supplemental Material for A Scoping Review of Clinical Utility From the Montreal Cognitive Assessment Memory Index Score by Oscar R. Kronenberger, Alyssa N. Kaser, Jeffrey Schaffert
Footnotes
Author Contributions
Oscar R. Kronenberger: Conceptualization; Methodology; Writing - Original Draft; Writing - Review & Editing. Alyssa N. Kaser: Conceptualization; Methodology; Writing - Review & Editing. Jeff Schaffert: Conceptualization; Methodology; Writing - Review & Editing; Supervision. Vishal J. Thakkar: Writing - Review & Editing. William Goette: Writing - Review & Editing. Christian LoBue: Writing - Review & Editing. Laura H. Lacritz: Writing - Review & Editing; Supervision.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
