Abstract
Background
Via the Vienna Visuoconstructional Test 3.0 (VVT 3.0) delayed recall we combined the assessment of visuoconstructive abilities and memory and investigated the test’s potential to support diagnostic processes, including staging and the elaboration of a cognitive profile.
Methods
We retrospectively analysed the data of 368 patients of the Department of Neurology, Medical University of Vienna, between 04/2014 and 10/2020 that had performed the VVT 3.0. Our sample involved 70 healthy controls (HC), 29 patients with subjective cognitive decline (SCD), 154 patients with mild cognitive impairment (MCI) and 115 patients with Alzheimer’s disease (AD). We investigated the differences in the VVT 3.0 scores, as well as the VVT’s ability to differentiate between AD and nonAD by calculating receiver-operating-characteristic (ROC) curves, ideal cut-offs and a logistic regression model.
Results
Results stated that the VVT 3.0 delayed recall scores were able to differentiate between all diagnostic groups, respectively, except HC-SCD and SCD-MCI. The ROC analyses determined an AUC of 0.890, 95% CI [0.855; 0.925], P < .001, and the ideal cut-off at 29.5 points that maximised sensitivity at 0.896 and specificity at 0.81. The logistic regression model classified 83.4% of AD patients correctly and delivered a significant Cohen’s Kappa of 0.619 (P < .001).
Conclusion
As the VVT 3.0 revealed satisfactory values of diagnostic accuracy in our sample, it could enrich clinical diagnosing. However, for more clarity about its informative value in other populations, there remains a need for future studies with other samples.
Keywords
Introduction
The International Working Group (IWG) 1 and the US National Institute on Aging-Alzheimer’s Association (NIA-AA) 2 each describe a concept to classify the stages within the progressive cognitive decline of AD before the final stage of major cognitive decline due to AD. The “asymptomatic at-risk stage of AD” (no symptoms but biomarker evidence for AD pathology) named by the IWG matches with the “preclinical stage of AD” (no impairment in cognition on standard assessments but biomarker evidence) of the NIA-AA’s concept. At the late stage of preclinical AD subtle cognitive decline can be reported subjectively (SCD) or by an informant, is, however, not detectable by typical neurocognitive tests.1,2 In case of evidenced AD pathology, this stage is called “SCD due to AD”, if there is no proof it is defined as “SCD” only.
As test scores of the earliest Alzheimer’s stages are at the border between normal and diseased, categorical definition by tests leads to wrong classifications. High-performing patients are deemed normal and cause low sensitivity, whereas poor performance in a single time point measurement results in misclassification of truly cognitively healthy subjects as declined. 3
When this “subjective” cognitive decline can also be sensed by objective cognitive tests the second stage, “prodromal AD” by the IWG, respectively “MCI due to AD” by the NIA-AA is reached.1,2 Despite MCI, general cognitive function is maintained, as well as functional activities. 4
At last, both concepts end with “dementia due to AD” in stage 3, standing for full dependence in instrumental activities of daily living, e.g., making calls, handling money.1,2,5 In other words, formulated in the DSM-5, major neurocognitive disorder interferes with independence in everyday activities. Moreover, all above-mentioned stages of cognitive deficit may not better be related to another psychiatric, neurological or systemic condition. 6
Various studies analysed the prevalence of each stage and the annual conversion rates (ACR) to more severe stages (for SCD MCI and dementia, for MCI only dementia).7-11
In his review Petersen et al. reported several prevalence of MCI, altogether steeply rising with age:” 6.7% for ages 60-64, 8.4% for 65-69, 10.1% for 70-74, 14.8% for 75-79, and 25.2% for 80-84” 12 : 1). They further report a 2-year conversion rate to dementia of 14.9% for MCI patients older than 65 years. This constitutes a three times higher risk to develop dementia due to AD than for HC. 13
Mitchell et al. found that 17.4% of elderly without objective cognitive impairment report subjective memory difficulties (not yet used standardized definition of “SCD”). 14 Later he conducted a meta-analysis that determined an ACR from SCD to MCI of 6,67% and 2,33% that converted from SCD to dementia, which presents a twofold higher risk to develop dementia of patients with SCD than of controls without SCD. 9
So although the majority does not develop dementia, 15 this higher risk, which was supported by multiple studies,7,9,16,17 makes SCD a potential indicator for first manifestations. 18
Cognitive decline in the amnestic domain is considered as typical in MCI due to AD and in AD.2,6,8,19,20 Moreover, also most studies on cognitive decline scrutinized memory and learning,15,16 which is ordinarily evaluated by tests that include an immediate and a delayed recall. However, for the differentiation between normal and different stages of cognitive decline due to AD, the combined assessment of various cognitive domains has been more and more proposed.2,15
According to the American Psychiatric Association cognition includes six distinct domains (complex attention, executive function, learning and memory, language, perceptual motor, or social cognition). Next to memory and learning the association states that dysfunction of executive abilities may co-occur in MCI. Perceptual motor ability and language is usually affected in later stages already belonging to major neurocognitive disorder. However, atypical cases resembling non-amnestic-MCI have been found that present with disabilities in these non-amnestic domains in earlier stages. Visuoconstruction is one ability that belongs to the non-amnestic domain perceptual-motor and one that our study and test involves. It can be assessed by copying, drawing and assembling that all demand hand-eye coordination. 6
To address this independent cognitive domain Lehrner et al. established the Vienna Visuoconstructional Test (VVT 1.0). 21 It combines three different copy tasks; firstly, the clock drawing test, 22 secondly, the pentagons task of the Mini-Mental State Examination (MMSE) 23 and thirdly, the cube task from the Alzheimer’s Disease Assessment Scale (ADAS-Cog). 24 In addition to this, the initial test version was amended by appending a delayed recall. The VVT 3.0 comprises the VVT 3.0 copy and the additional VVT 3.0 delayed recall. The add-on was conducted approximately 30 minutes after the first three tasks and encompasses again the same three patterns. A scoring system including 98 different items for both the immediate and the delayed task leads to the corresponding scores. Besides this, a quotient, the “DI-quotient” (delayed recall score divided by the immediate copy score = DI-quotient), was investigated as an additional VVT 3.0 score.
Since our add-on, the VVT 3.0 delayed recall, obviously is no immediate copy task, the underlying neurologic processes happen in additional, other domains next to visuoconstruction.25,26 On the one hand there are no patterns presented, which makes it a draw-to-command task. As such it affects higher order executive operations such as planning and active organisation, besides visuospatial abilities, attention, comprehension, and motor execution that are equally addressed as in copy tasks.27,28 On the other hand the instruction does not name the specific patterns or support recognition by any given cues, which claims the patient to recall them. By that, it requires autonomous commencement and for remembering the patterns approximately 30 minutes later, episodic recent memory is indispensable, more specifically the memory process of recollection or mnemonic strategies. 29
The focus of the VVT 3.0 delayed recall therefore rests on not only on visuoconstruction andexecutive functioning, but also memory and learning. To a certain degree the results might also depend on language, attention, visual perception and numeric knowledge,6,25,27 but this is not what the test should provide information about. Alike the delayed recall score, the DI-quotient represents several cognitive domains since it combines both VVT 3.0 scores.
The purpose of the current study is to make use of a combination of three already-known visuoconstructive tests applied in form of a delayed recall denoted as the VVT 3.0 delayed recall and DI-quotient scores and examine the differences between the different, currently distinguished stages of AD (SCD, MCI, AD) and healthy controls. We evaluated the potential of the VVT 3.0 delayed recall score and the DI-quotient to discriminate between nonAD (HC, SCD, MCI) and AD patients and its concurrent validity. Furthermore, we were able to analyse associations between the VVT 3.0, age, gender, premorbid intelligence quotient (IQ), global cognitive status (Neuropsychological Test Battery Vienna, NTBV) and depression.
Methods
Study Population and Clinical Assessments
This study analysed pseudoanonymized data from a retrospective view. Data collection consisted of patients who were under treatment at the Department of Neurology at the Medical University of Vienna between 04/2014 and 10/2020. This study followed the ethical principles of the Declaration of Helsinki and was again assented by the ethics committee of the Medical University of Vienna. The whole study population received information about legal backgrounds of the participation and signed a declaration of consent.
Our sample consisted of 368 patients in total and comprised 70 healthy participants, 29 patients with SCD, 154 patients with MCI and 115 with AD. Patients with cognitive dysfunction (SCD, MCI, AD) approached directly to the tertiary care setting seeking for help or got referred for examination of their cognitive status. The inclusion in the study proceeded consecutively and incorporated all patients who met eligibility criteria and approved.
Eligibility criteria for diseased patients encompassed possible classification into one of the three groups and not meeting any of the criteria. They received a standard neurological and neuropsychological evaluation and conducted the NTBV. 30 The following diagnostic criteria were applied for group-assignment: the research criteria of the Subjective Cognitive Decline Initiative 7 for SCD, the criteria of Petersen et al. 4 for MCI, the DSM-V criteria of the American Psychiatric Association 6 and the criteria of the NINCDS-ADRDA Work Group for AD. 31 Furthermore, the neuropsychological status included the MMSE 23 before conducting the more comprehensive NTBV. 30 The NTBV combines various measuring tools, each requiring primarily one specific cognitive domain Altogether they comprise memory, language, executive functions, psychomotor speed and attention. The concrete tasks are based on separate pre-existing tests listed in Table 3. Additionally, the Beck Depression Inventory II (BDI-II) 32 served to identify depressive symptoms, and the Wortschatztest (WST) 33 to evaluate premorbid intelligence.
Cognitively healthy subjects were tested on their suitability in a standardized clinical interview and cognitive screening. Normal cognition was determined with the aid of the Mayo Clinic criteria. 34
Exclusion criteria have been maintained from previous studies on the VVT 3.0 and encompassed:
(1). Neurological disorders like cortical stroke or traumatic brain injuries in the past, which were determined by neuroradiologic and clinical examination. (2). Medical conditions possibly interfering with normal cognitive abilities including renal, respiratory, cardiac and hepatic disease. (3). Current major psychiatric disorder according to ICD-10 20 other than depressive symptoms. (4). Significant auditory, visual, language or motor deficits. (5). “Less than 8 years of schooling.” (6). Other forms of dementia that are not related to AD. (7). Less than 50 years of age.
Vienna Visuoconstructional Test 3.0 - VVT 3.0
The VVT 3.0 comprised a copy and delayed task. In the immediate copy task participants were asked to copy three objects (analogue clock set at 11:10, two intercepting pentagons and a three-dimensional cube) as accurately as possible. After a delay of approximately 30 minutes the participant was asked to draw the three items from memory (delayed recall task) without giving any cue about the patterns. The retention interval (30 minutes delay) was chosen according to established neuropsychological visual memory tests such as e.g., Rey Complex Figure Test and Recognition Trial, where a time delay of 30 minutes is also used. Both the copy and the delayed recall task were scored broadly 35 (maximum 98 points) by study staff that had clinical information about the patients. A transparent foil functioning as a template was used to facilitate and standardize scoring. The VVT 3.0 can be obtained from www.psimistri.com.
Statistical Analyses
Statistical analyses were conducted using the IBM SPSS Statistics for Windows, Version 27.0, and Microsoft Excel for additional calculations. The alpha level was set to 0.05. In cases of pairwise comparisons the Holm-Bonferroni correction was applied (adjusted alpha = aa). In Kruskal-Wallis analyses the IBM SPSS programme revealed only one P-value for each analysis including 6 pairwise comparisons. Only in these and in one other analysis, a correlation analysis with 96 pairwise comparisons, the Bonferroni correction was chosen, as expenditure for ranking all coefficients would have exceeded benefits. For these calculations the adjusted alpha based on the classical Bonferroni correction (aac) was quoted. Analyses were performed on the whole study population and all available values. If any value was missing, it was excluded from analyses on this specific variable.
To compare the ranks of the VVT 3.0 scores (delayed recall and DI-quotient) of the diagnostic groups Kruskal-Wallis analyses coupled with a Dunn’s post hoc comparison test were performed. Ensuing, receiver-operating-characteristic (ROC) curves for differentiation between AD and nonAD patients were computed. Ideal cut- were set exploratorily by using the Youden Index. Test values that undercut the threshold were counted as positive results, indicating the presence of AD, scores that exceeded it were counted as the absence of AD (HC, SCD, MCI). Subsequently, sensitivity, specificity, together with positive and negative predictive values (PPV, NPV) and positive and negative likelihood ratios were evaluated for the threshold of the delayed recall score. Diagnosis according to the threshold versus diagnosis previously given by clinicians were depicted in cross tabulations and Cohen’s Kappa was determined.
This finding brought along novelty in so far as most other visuoconstructional assessment tests failed to discriminate between HC and MCI and only showed significant differences in test values of subjects with moderate-to-severe dementia and HC.36-39
Following this, logistic regression analyses were performed. For this purpose, the VVT 3.0 delayed recall scores were defined as independent variable and the diagnoses AD versus non-AD as binary dependent variable.
Afterwards, again cross tabulations were generated and Cohen’s Kappa was calculated to quantify the agreement between the predicted and the actual diagnoses according to the diagnostic criteria.
In addition to that, Spearman’s correlation analyses were conducted to investigate associations between the VVT 3.0 scores, age, the MMSE, the WST, the BDI-II and the different tested domains by the NTBV. A t-test served to explore correlations between the VVT 3.0 scores and gender. Moreover, we calculated the Spearman’s rho to determine the correlation between all three VVT 3.0 scores (copy, delayed recall, DI-quotient) among each other and to quantify and compare the overlap of the measured constructs.
Results
Sample Characteristics
Sample Characteristics of the VVT 3.0 delayed recall
Abbreviations: AD, Alzheimer’s disease; BDI-II, Beck Depression Inventory-II; HC, healthy controls; IQR, interquartile range; MCI, mild cognitive impairment; MMSE, Mini-Mental State Examination; n, absolute number; SCD, subjective cognitive decline; VVT, Vienna Visuoconstructional Test; WST, Wortschatztest.
Group Comparisons
As data was not normally distributed, non-parametric measures were chosen. The medians and IQRs of all three VVT 3.0 scores are listed in Table 1.
Kruskal-Wallis analyses depicted a significant difference between groups in the VVT3.0 delayed recall score with a H(3) = 167.734 (P < .001, aa = .009) and mean ranks of HC 274.08, SCD 253.03, MCI 204.57 and AD 85.82. Dunn’s post hoc comparison test specified this difference as significant between AD and all other diagnostic groups (HC and AD: P < .001, aa = .009, SCD and AD: P < .001, aa .0125, MCI and AD: P < .001, aa = .01) as well as between MCI and healthy controls (P < .001, aa = .0167). In comparisons of HC and SCD, such as SCD and MCI, the p-value, however, did not reach the adjusted alpha levels to be significant (HC and SCD: P = .24, aa = .025, SCD and MCI: P = .368, aa = .05).
Results of Kruskal-Wallis analyses and Dunn’s post hoc analyses on the VVT 3.0 DI-quotient revealed the same significant group differences as the delayed recall did. Exact values for the DI-quotient were: H(3) = 152.085 (P < .001, aa = .009); mean ranks: HC = 273.27, SCD = 238.57, MCI = 203.95, AD = 90.79; Dunn’s post hoc comparison - significant results; HC-AD: P < .001, aa = .009; SCD-AD: P < .001, aa 0.0125; MCI-AD: P < .001, aa = .01; HC-MCI: P < .001, aa = .0167; Dunn’s post hoc comparison - not significant results; HC-SCD: P = .138, aa = .05; SCD-MCI: p = .106, aa = .025.
The ensuing ROC curves for the delayed recall with AD as the positive actual state revealed an area under the curve (AUC) of .890, 95% CI [.855; .925], P < .001. By means of the Youden Index an ideal cut-off of 29.5 points was determined, corresponding with a sensitivity of .896 and a specificity of .81. Figure 1 shows the ROC curve that contains the ideal cut-off at the point of intersection of the two dashed lines. At this point the sum of sensitivity plus specificity is the greatest possible. ROC Curve of the VVT 3.0 delayed recall (positive actual state = AD)
The same procedure was applied for the DI-quotient, presenting an AUC of 0.870, 95% CI [.830; .911], P < .001, an ideal cut-off of 0.485, as it entails the maximal sum of sensitivity (.887) and specificity (.763) (see Figure 2). ROC Curve of the VVT 3.0 DI-quotient (positive actual state = AD)
Crosstabulation: Diagnosis According to the VVT 3.0 Delayed Recall - Cut-Off <29,5 Versus Diagnosis Made by Clinicians.
Abbreviations: AD, Alzheimer’s disease; n, absolute number; VVT, Vienna Visuoconstructional Test.
The binomial logistic regression comprised the VVT 3.0 delayed recall score as independent/predictor variable and the diagnoses HC, SCD, MCI, collectively referred to as nonAD, versus AD as dependent variable. It ascertained significance of P < .001 for a X2 = 182.406 and a Nagelkerke R2 = .550. Furthermore, the model classified 83.4% of the cases correctly with a sensitivity of .757 and a specificity of .87. The PPV resulted in .725 and the NPV yielded .887.
The B of -0.75 was significant (standard error = .006, P < .001). When inserting this value into the formula of the natural logarithm with the Euler’s number e as a base, it results in .928 with a 95% CI of [.914; .942]. The measure of agreement between the diagnoses predicted by the logistic regression model and the diagnoses previously made by clinicians according to the diagnostic criteria revealed a significant Kappa of .619 (P < .001).
Correlation Analyses
Further on, Spearman’s rank correlation analyses on diseased groups found significant negative associations between the VVT 3.0 scores and age: for the delayed recall -0.389 (P < .001, aa = .016), for the DI-quotient -0.368 (P < .001, aa = .025). In the control group correlations were quantified significantly with a Spearman’s rho of -.339 (p = .004, aa = .0125) for the delayed recall. Correlation between age and the DI-quotient did not reach significance in the HC group (DI-quotient: -.202, p = .094, aa = .016).
Spearman’s Rank Correlations of the Vvt 3.0 Scores With Age, the MMSE, the WST, the BDI-II, Years of Education and the Subtests of the NTBV
**Correlation is significant at the Bonferroni-Holm corrected level (2-tailed). For the subtests of the NTBV the adjusted alpha level is .0005 (2-tailed).
Abbreviations: AKT, Alters-Konzentrations-Test; BDI-II, Beck Depression Inventory-II; C.I., cerebral Insufficiency test; ID, identification number; mBNT, modified Boston Naming Test; MMSE, Mini-Mental State Examination, N, absolute number; NAI, Nürnberger Altersinventar; NTBV, Neuropsychological Test Battery Vienna; PWT, phonematic verbal fluency; r, Spearman’s rho; SWT, semantic verbal fluency; TMT A, Trail Making Test version A; TMT B, Trail Making Test version B; VSRT, Verbal Selective Reminding Test; VVT, Vienna Visuoconstructional Test; WST, Wortschatztest.
Analysed population includes only diseased patients.
Analysed population includes only HC.
Spearman’s rank correlations between the three VVT 3.0 scores
** Correlation is significant at the adjusted alpha level of 0.01 (2-tailed).
Abbreviations: N, absolute number, r, Spearman’s rho; VVT, Vienna Visuoconstructional Test.
According to the t-test gender showed no significant associations with any of the VVT 3.0 scores: delayed recall: t = -1.126 (P = .261); DI-quotient: t = -1.234 (P = .218).
Discussion
Our results provide insight into the potential of the VVT 3.0 delayed recall; on the one hand to assess the decline of visuoconstruction combined with memory and executive control in different stages of cognitive decline, on the other hand to determine the diagnostic group membership. Moreover, we studied correlations between various clinical variables and the VVT 3.0 scores.
The distribution of the four diagnostic groups in our study population was dominated by MCI patients and AD patients. This was foreseeable as the Department of Neurology at the University Hospital of Vienna is sought out mainly by patients that report cognitive decline, such as MCI and AD, with medical referral for further assessment or directly by self-initiative. For SCD patients, however, the demarcation between physiological cognitive deterioration as a concomitant of ageing and abnormal cognitive deficits is difficult. Before cognitive decline is not objectively recognisable, the presentation of SCD due to normal ageing or because of AD equal one another. 15 In view of this background it might had contributed to a more balanced distribution and higher sample of SCD patients, if there was more active recruitment of SCD patients. However, Jessen et al. explicitly recommended not to consciously search for affected persons, who do not seek out medical institutions by themselves. 15
When regarding the medians of all three VVT 3.0 scores, we found that the VVT 3.0 delayed recall showed stronger decline than the VVT 3.0 copy. The same applied for the DI-quotient which seemed to follow primarily the distribution of the delayed recall. This circumstance stresses the underlying difference of cognitive domains that are demanded for each subtest. The poorer performance in the delayed recall therefore could be traced back to the most commonly impaired domain in cognitive decline due to AD, memory and learning. 2 In addition to that, it constitutes a domain that is involved early in the process of cognitive decline due to AD. 6
Delayed recall task showed more distinct values of the MCI group compared to HC and SCD patients than the copy task. One reason for that could be the that the delayed recall teask taps memory and perhaps other functions, so that test scores not only assess and depend on the visuoconstructive abilities. Indeed, delayed recall was shown to be influenced by the functioning of executive abilities and memory.26,29
The post hoc comparison analyses provided results that the delayed recall and DI-quotient significantly differed in group comparisons including AD and the comparison between HC and MCI. No significant difference could be found between SCD and MCI, what we had not expected. HC-SCD did not reveal significance either and thus, did fulfil our expectation as SCD by definition does not show declined performance on an objective test. 7
The demarcation between SCD due to AD and MCI due to AD diagnostically consists only of an objectively measured decline. Practically, however, they are both part of a continuum in the disease progression of AD. 15
Moreover, although HC and SCD per definition both perform above cut-offs of objective tests, there is evidence that SCD patients score slightly lower than HC, but still within the range of normal reference standard. 40 Forcing the setting of a cut-off would likely lead to misclassifications of both diagnostic groups, producing both low sensitivity and specificity. 3 This might theoretically explain, why our results detected significant differences between HC and MCI, but not between SCD and MCI. Practically, there was no indication that the SCD group performed worse than HC. Enlarging sample size might enable detection of that subtle differences.
Reason for missing objective difference in scores between HC and SCD is an immanent part of the definition of SCD per se. 15 Results advocate appropriate diagnosing criteria, as SCD is diagnosed by subjective report only and cannot yet objectively be sensed by a neuropsychological testing method. 41
Comparing our results on the delayed recall with those of Valencia et al. 42 on the copy task revealed that the post hoc comparison test was significant in the same pairwise group comparisons as the other two VVT 3.0 scores, except for the comparison between HC-MCI. We thus can infer that the delayed recall achieved the aim to allow more gradations in the distinction of different disease stages. This enhancement may again be related to tapping additional cognitive domains, such as memory and executive functioning. While visuoconstructive impairment was not different in HC, SCD or MCI, VVT 3.0 delayed recall differed between MCI and HC as well as MCI and SCD. This supports earlier findings that memory functions are already impaired in MCI.1,4,6,43 Furthermore, there are studies that provide evidence regarding results of visuoconstructive tests to differentiate between MCI and AD patients.21,43-45 The inconsistent findings might be based on the different subtypes of MCI.4,42 As shortly outlined above, “MCI” might present in either the amnestic domain, in one or multiple nonamnestic domains or in both. 4 If a study sample includes various subtypes or many MCI patients without nonamnestic decline, this could explain missing differentiation by copy tasks. If it comprises a lot of nonamnestic declined patients, it might overestimate copy test scores.
According to the calculated values of sensitivity and specificity for the ideal cut-off of 29.5 points for the delayed recall score, 89.6% of patients with AD are classified as such, and 81% of all nonAD patients are identified correctly. According to the proposed standard of Hosmer and Lemeshow the AUC of the delayed recall and of the DI-quotient can be classified as “excellent”, 46 as cited in. 47
The Cohen’s Kappa quantified the strength of interrater reliability between the diagnosis according to the cut-off of 29.5 points and the diagnosis nonAD versus AD according to clinical criteria. This value incorporated incidental consensus 48 and can be graded as “substantial agreement” following the interpretation of Landis et al. 49
Despite these satisfactory results, they should be interpreted with caution. As the underlying cut-off was chosen post hoc and by maximizing the sensitivity and specificity through the Youden Index, the test performance might be overestimated.50,51 The cut-off of 29.5 is the ideal in our sample, but firstly may not provide the same accuracy when applied to another population, secondly may not be the best threshold in another population. Therefore, depending on its final intended use, different cut-offs could and should be evaluated. For instance, high sensitivity and a high PPV might be targeted, while a consequently lower specificity and lower NPV might be tolerated. 52 This could be obtained by raising the threshold.
Next to ROC analyses and the ideal cut-offs, also the binomial logistic regression model delivered satisfactory results. The Nagelkerke R2 indicated that the model explained 55% of the variance of the diagnosis nonAD versus AD by the VVT 3.0 delayed recall score. Altogether the model predicted 83.4% of the group membership correctly, 75.5% of the AD patients and 87% of the nonAD patients. The B of -0.75 (P < .001) predicted that increasing the VVT 3.0 delayed recall score was associated with decreasing odds of suffering from AD. Specifically, increasing the VVT 3.0 delayed recall score by 1 decreased the odds to have AD by a factor of .928, 95% CI [.914; .942]. Cohen’s Kappa revealed “substantial agreement” of 0.619 (P < .001) between the diagnosis met by the logistic regression model and the actual diagnosis observed by clinicians. 49
To differentiate within predementia stages, various authors recommend the combination of various testing methods.4,15,22,36 This should include the above-mentioned consideration of non-amnestic domains.2,7
The VVT 3.0 delayed recall could play a role in the process of diagnosing, specifically in staging and in the elaboration of the cognitive profile. A probable indication could be any type of cognitive impairment, no matter how far progressed. It may greatly serve as an add-on test within the first steps of staging before more costly, more complicated neuropsychological test batteries are applied. 52 Further steps should always follow, as only the combination of various assessment tools allows obtaining full clarity about the present status, more accurate staging and probable distinction between SCD and MCI.1,4,15,36
Although the VVT 3.0 delayed recall task requires a time gap of about 30 minutes, the task itself is still easily and quickly administered and scored. Hence participants usually accept it well.
Although the correlation coefficients between age and the VVT 3.0 scores in diseased groups turned out to be moderate, analyses on associations between age and the delayed recall in HC also determined a rho of -.339 (P = .004, aa = .016). The remaining VVT 3.0 scores did not provide significant correlations with age in the control group, which was most likely based on the small sample size. Especially the results in the HC group stressed that the moderate associations between the VVT 3.0 scores and age in diseased groups were not solely due to pathologic impairment. They corroborated findings that delayed recall, visuoconstructional and executive abilities decline as a concomitant of normal ageing. 53
If the HC group would not show almost the same size of correlation between age and the delayed recall score as did the diseased group, we could not differentiate if age correlated with the VVT 3.0 scores because of normal or because of pathologic changes. Although it was likely that associations between age and the VVT 3.0 scores in the diseased group were related to both types of changes (cognitive decline due to normal ageing 53 and cognitive decline due to AD, whose prevalence and incidence strongly depend on age) 54 , the difference in correlations with age between the HC group and the diseased group was small. Thus, the difference did not show a strongly confounding effect of pathologic changes that increase with age, such as cognitive impairment due to AD.
Even if the effect size was small to moderate for age, or not significant in case of gender, these variables might have confounded our results up to a moderate degree. Calculations were based on the observed scores and not adjusted for any variable, such as age or gender.
As expected, the MMSE revealed large associations with the VVT 3.0 scores. This was not surprising, as the MMSE constitutes a measure for global cognition and itself comprises visuoconstructional assessment by the pentagon task. 23 The nonsignificant results for the WST and the VVT 3.0 scores supported the assumption that the VVT 3.0 does not depend on premorbid intelligence. Further nonsignificant results on the BDI-II reassessed the VVT 3.0 as independent of depression, which corresponds with prior findings.42,55
The NTBV subtests allowed correlation analyses of different domains with the VVT 3.0. In the domain memory differences between the correlations of the two VVT 3.0 tasks were the strongest. While the delayed recall notably largely correlated with the subtests on memory, the copy task only yielded low to moderate coefficients. This supports our concept of the delayed free recall task addressing episodic memory. 29 Other than expected and other than reported before22,25 the domain executive function did not stand out as a domain that was considerably more required by the draw-to-command task than by the copy task, at least according to the subtests of the NTBV on executive control. The coefficient for the DI-quotient for almost all subtests of the NTBV was located between the coefficients of the delayed recall and of the copy task.
In sum, correlation analyses with the NTBV depicted that the VVT 3.0 delayed recall task did depend on the domains attention, psychomotor speed, language and executive functions up to a moderate extent. The inclusion of these domains had been reported before and was outlined above.27,28
The difference between the measured domains by the VVT 3.0 copy and the delayed recall could also be depicted by the Spearman’s rho between these two. If they affected completely the same domains, we would have expected a rho of almost 1. The coefficient of 0.490 (P < .001, aa = .025) showed that the domains only correlated moderately and provided evidence that the measured constructs were quite disparate.
The correlation analyses on the VVT 3.0 DI-quotient regarding the delayed recall yielded a three times higher Spearman’s rho than when regarding the copy task. These correlations were also present in our primary and secondary results. In most of our analyses results of the DI-quotient resembled the results of the delayed recall considerably stronger than the copy task. This is most likely based on the mathematical background of calculating the DI-quotient by dividing the delayed recall score by the copy task score.
In general, however, the results on the DI-quotient were in most cases located between the results on the copy task and those on the delayed recall and rarely more extreme. It follows that the quotient does not entail more informative value, than the results on the initial immediate copy and delayed recall scores. Consequently, we suggest dispensing with the additional calculation of the DI-quotient in future studies on the VVT 3.0.
Further Limitations
Further limitations of our design should be discussed. Our results are influenced by the severity of dementia, especially when diagnosis was analysed dichotomously, i.e. nonAD versus AD. For instance, sensitivity would rise and the rate of false positive classifications would simultaneously fall, if more patients with severe dementia were analysed. 56
Comorbidities or conditions that resemble symptoms of AD could show the same influence on test properties. If they affect subjects of the HC group, they would decrease diagnostic accuracy by increased false positive assignments. 56 Therefore, the majority of medical conditions that could produce biases in HC were averted by defining comprehensive exclusion criteria. In case of diseased participants, however, the incipient multi-morbidity of AD per se and of predementia stages of AD aggravates diagnosing and obtaining undisturbed results.6,57-59
Additional confounders such as apathy expressing as a lack of motivation or diminished initiative during the test administration, which are typical of cognitive decline due to AD, could not be controlled for either. 19 Hypothetically, they would lead to lower scores in AD patients and further to overestimation of the VVT 3.0 scores.
Another disturbance might have been produced by impairment in processing speed, difficulties with complex attention or fine motor skills. Their decline, however, must not be pathological, but is known to accompany normal ageing and further can impact various other cognitive abilities.52,60 Depending on whether these normal age-effects concern the HC or the diseased group, they could cause over- as well as underestimation of our results.
The fact that all our patients either autonomously decided to see a doctor or had social contacts that took action to seek medical advice, might have preselected healthier or socially better integrated subjects for our sample. All these possible biases constitute limitations of our results that should sound a note of caution about generalising our findings too generously.
Summary and Conclusion
To conclude, our newly added delayed recall enhanced the discriminating power and the diagnostic accuracy levels of the VVT 3.0. On the one hand the VVT 3.0 delayed recall scores were able to differentiate between all diagnostic groups, respectively, except HC-SCD and SCD-MCI. On the other hand, they revealed great diagnostic accuracy values in our analyses, including an AUC of .890, 95% CI [.855; .925], P < .001, a sensitivity of .896, a specificity of .81 for the cut-off of 29.5 points and a correct classification by the regression model of 83.4%, which all outperformed the immediate copy task. This improvement is most likely based on the additionally demanded domain memory by the free delayed recall.
Although results on accuracy are highly satisfactory in our study population, they should be interpreted with caution when regarding other population groups and with the background of possible biases. Therefore, there remains a need for further investigations on other study samples in order to assess its potential use in clinical diagnosing, staging and cognitive profiling. The VVT 3.0 might also serve as a convenient tool for monitoring and for prognosis. However, to analyse this use, longitudinal studies with a reasonable follow-up sample and ideally various follow-ups are needed.
Footnotes
Acknowledgments
This manuscript has been accomplished as part of the diploma thesis of Paula Knechtl. We would like to thank Mrs. Mag. Grete Sonnek for proofreading this manuscript.
Declaration of conflicting interests
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Open access was funded by the Medical University of Vienna.
Ethical Approval
All participants signed an informed consent. The study was approved by the Ethics Committee of the Medical University of Vienna (EC-number: 2227/2020) and was conducted in accordance with the 1964 Helsinki Declaration.
Informed Consent
Written informed consent was obtained from all participants.
Data Availability
The datasets analysed during the study are available from the corresponding author on reasonable request.
