Abstract
Background
Theory of mind (ToM) is crucial for socioemotional interaction. ToM deficits may explain behavioral changes in dementia, especially Alzheimer's disease (AD) and frontotemporal dementia (FTD).
Objective
This study examined the psychometrics of a new ToM test in healthy adults, identified ToM differences in dementia syndromes, and assessed if ToM scores predict neuropsychiatric function and real-life behavior.
Methods
The UCSF Cognitive and Emotional Perspective Taking Test (CEPT) was evaluated in 195 healthy adults (age: 42.69 ± 16.20) and in a clinic cohort of 304 participants (age: 64.07 ± 9.2). Participants included healthy controls, AD, behavioral-variant frontotemporal dementia (bvFTD), semantic variant primary progressive aphasia (svPPA), non-fluent PPA (nfvPPA), and progressive supranuclear palsy (PSP) patients. CEPT's psychometrics were assessed, and ToM differences and predictions of neuropsychiatric symptoms were analyzed using regression models.
Results
In controls, CEPT showed good validity and reliability. In patients, CEPT scores correlated with executive and emotional measures, but not language measures, showing good construct validity. Cognitive ToM was most impaired in AD and bvFTD, with less impairment in svPPA and PSP, and all patient groups showed impaired emotional ToM. ToM performance predicted real-life neuropsychiatric behavior, including anxiety, apathy, disinhibition, and aberrant motor behaviors.
Conclusions
ToM deficits appear early in dementia syndromes and predict neuropsychiatric behavior. Assessing ToM and social cognition with ecologically valid tasks may help identify altered social cognition in early neurodegeneration.
Keywords
Introduction
Theory of mind (ToM) is a complex cognitive mechanism that involves predicting the content of other people's minds (including knowledge, emotions, beliefs, and intentions), often in circumstances where that content may differ from that in one's own mind.1,2 ToM ability develops early during infancy and seems to stabilize during early childhood, though the accuracy of ToM continues to evolve into adulthood with increasing socioemotional experience.3,4 By allowing accurate representation of others’ minds, this core cognitive process supports conventional socioemotional interaction in everyday life by facilitating the correct interpretation of others’ behavior and adaptation to changing social contexts. 5
As a construct, ToM can be divided concerning whether the mental content being processed is cognitive or affective. Cognitive ToM can evaluate others’ knowledge, beliefs, or intentions, while emotional ToM involves understanding others’ moods and emotions. 6 Both cognitive and emotional ToM abilities are predicted by attention and affective intelligence during adolescence, and there are sex differences in how cognitive processes such as working memory and language comprehension influence ToM. 7 Additionally, both processes have unique but overlapping neural representations.6,8
Neuroimaging studies have demonstrated a ToM-related brain network that includes the medial prefrontal cortex, temporal poles, anterior and posterior midline regions, dorsal and ventral temporal-parietal junction (temporoparietal and supramarginal gyrus), and superior temporal gyrus.9–11 Multiple studies have found that attentional processes, mediated by the salience network, act as a gating mechanism controlling the activation of other cognitive networks that are needed to perform ToM effectively, such as set-shifting and memory.12,13
Research consistently reports ToM deficits in different clinical disorders, including schizophrenia, bipolar and unipolar depression, borderline personality disorder, anorexia, and neurodegenerative syndromes.14–18 In the latter case, understanding the effect of normal aging on ToM is crucial to elucidate how to assess it efficiently during early prodromal phases of neurodegeneration, during the normal-to-pathological aging transition. Existing literature reflects mixed findings in this area, with some showing no age-related difference in ToM abilities in older adults,19,20 but others indicating a decline in ToM abilities with aging, regardless of the task used.21–23
Many studies suggest ToM alterations commonly occur in Alzheimer's disease (AD), behavioral-variant frontotemporal dementia (bvFTD), dementia with Lewy bodies, and primary progressive aphasia syndromes (PPA).12,18,24–30 However, different imaging studies of ToM in patients with neurodegeneration tend to reveal inconsistent neural relationships, perhaps in part because there is no standardized test to assess ToM in this population, 31 and tests often do not control for the deficits in non-ToM aspects of cognition that are common in these patients, such as memory, executive functioning, or emotion reading.
Furthermore, studies of ToM encounter problems because there is not a clear consensus regarding the theoretical framework or scope of the construct of ToM. 12 Hence, research tasks used to study ToM differ across a number of dimensions, including (1) which mental states are considered to involve ToM (e.g., knowledge of visual or social perspective versus prediction of intention), (2) testing context and modality of stimuli (e.g., text-based, cartoon scenarios, and others), and (3) the goal or target for which ToM ability is implemented (e.g., understanding perspective versus selecting the most empathic behavioral response), which is particularly relevant to the overlap between ToM and other cognitive processes like emotional empathy and executive functioning.11,32 All of this testing diversity has further contributed to incongruence across MRI studies attempting to delineate the specific anatomic substrate of ToM in neurodegenerative disease patients and, more generally. 33
Ideally, ToM tests designed for use with patients with cognitive deficits would disambiguate the elements of ToM being tested and would reduce or explicitly account for overlap between canonical ToM elements and contributions from other cognitive domains. Of course, to fractionate the construct of ToM will limit its ecological validity, i.e., how well the test reflects the capacity to employ holistic ToM reasoning in a realistically challenging setting, but this trade-off may be necessary to enhance psychometric rigor for face-to-face testing. For example, ToM tests for these patients should have stratified elements clearly delineating the executive contribution to patients’ ToM deficits (e.g., set-shifting, inhibitory control, and working memory for social material). This might include prioritizing the “perspective taking” elements of ToM in the task, would improve the psychometric precision for this one specific cognitive function that is not directly measured by other executive tasks. A task might also include trials that intentionally range decision-making difficulty from simpler to more complex executive demands. Similarly, patients with language deficits might have difficulty processing long blocks of text, or those with visuospatial impairments might be less able to decode line drawings. Thus, contextualization and multimodal presentation of each item, perhaps through video stimuli or with simultaneous audio narration of written text, will aid patients in focusing on and comprehending key elements of test items so that variance on the test corresponds more directly to their ability to perform perspective taking. Also, because perspective taking is conceptually distinct from emotion reading but may still depend on it, 12 particularly when interpreting real-life situations, a task designed for patients with neurodegenerative disease might best dissociate emotional perspective taking (i.e., recognizing when not all individuals share the same information about other individuals’ emotional state) from pure emotion reading. In this way, a patient with focal deficits in face reading or emotion labeling who could still accurately dissociate perspectives could pass the test, and the clinical interpretation could be that their perspective taking is intact while their emotion reading is impaired. Thus, for patient evaluation, a test may be more precise if it incorporates a simplified, multimodal, but well-controlled approach that evaluates specific aspects of ToM processing, and uses realistic stimuli that distinguish cognitive from emotional elements where possible.
In accordance with these principles, our group at the Memory and Aging Center (MAC) at the University of California (UCSF) developed the Cognitive-Emotional Perspective Taking (CEPT) Test to identify focal impairments in cognitive (cToM) and emotional (eToM) perspective-taking in patients with neurocognitive disorders who show a wide range of non-ToM cognitive impairments in domains that could impact their test performance. Here, we describe the development and validation of the test, including whether the CEPT could detect early changes in social cognition in neurodegenerative conditions, and whether these changes corresponded to real-life social behavior. To achieve this, we first (1) identified the psychometric properties of the cToM and eToM components of the CEPT in a sample of healthy control adults, then (2) identified differences in CEPT performance across dementia syndromes, and finally (3) determined if CEPT performance predicted neuropsychiatric function and real-life social behavior as rated by an informant.
Methods
Sample
Participants were recruited from longitudinal observational studies at the UCSF MAC (San Francisco, CA, USA). We evaluated two different sub-samples: (1) a control cohort comprised of neurologically healthy individuals across the adult age span (n = 195; mean age 42.69 ± 16.2, range 19–78), selected to show the general psychometric properties of the task including normative data, and (2) a clinic cohort (n = 304) of individuals over 40 years old, (mean age 64.07 ± 9.2, range 46–87), selected to demonstrate the properties of the UCSF ToM task in aging adults with neurodegenerative disease, its intended population (41 neurologically healthy controls (NC); 34 with biomarker confirmed probable AD according to the McKhann criteria: 34 96 with bvFTD according to the ISFTD criteria; 35 38 with semantic variant primary progressive aphasia (svPPA) and 47 non-fluent PPA (nfvPPA) according to the ICFTD criteria: 36 and 48 with progressive supranuclear palsy (PSP) according to the Höeglinger criteria. 37
All patients also underwent a diagnostic evaluation, including complete clinical history, neurologic examination, neuropsychological evaluation, and functional/neuropsychiatric evaluation. AD patients were included only if they had confirmed amyloid pathology based on amyloid imaging or cerebrospinal fluid markers, as described in previous studies.38,39 The final diagnoses were made in a multi-disciplinary clinical consensus conference. All healthy controls were recruited from the community and underwent comprehensive neurologic evaluation, neuroimaging, and neuropsychological testing and were included only if they showed typical cognitive performance, did not have any neurologic, primary psychiatric, or significant medical illnesses, and had Clinical Dementia Rating (CDR®) score of zero.
Procedures: Participant evaluation
Participants underwent direct assessment in a quiet room administered by neuropsychologically trained study personnel. Participants were monitored for signs of distress or fatigue, which would prompt discontinuation of testing. Each participant also identified a study partner who was able to report on their daily functioning via semi-structured interviews and questionnaires. Patients in the clinical cohort were evaluated with the whole test battery once; while all healthy controls were assessed at least once, a subset (n = 60) underwent repeat testing with the cToM condition of the CEPT at two weeks and 1-year post-baseline to evaluate test-retest reliability and test stability.
Ethical statement
Written informed consent was obtained from all participants in consultation with their caregivers, after full evaluation of their capacity to provide consent. Human procedures followed Helsinki guidelines and were approved by the University of California San Francisco Committee on Human Research.
Direct socioemotional testing
Cognitive Emotional Perspective Taking (CEPT) test
The CEPT was developed to measure an examinee's ability to conduct the perspective-taking processes required to correctly infer others’ knowledge and beliefs based on watching narrated videos of characters interacting and answering structured questions. It contains two conditions measuring cognitive (cToM) and emotional (eToM) perspective-taking, with eight scenarios per condition (four “cheat” and four “no cheat”), and three questions per scenario. There is no sound directly from the video, but a voice-over narration is provided that describes the interaction on the screen. In the cToM condition, characters interact around an object, one character departs, and then that object is moved by the second character, who may or may not be unwittingly observed by the first character. The eToM condition follows the same structure, but instead of an object moving, the emotion of one character changes with or without the knowledge of the second character. To ensure any perspective-taking deficits are not conflated with deficits in emotion reading, the emotions of the characters are always explicitly named by the narrator; thus, the examinee does not need to be able to read emotions while performing the task, only to understand the perspective of one character on the other's named emotional state.
After each scenario, the examinee is shown a set of still photographs of key moments in the video to refresh their memory and focus their attention, below which they are asked three questions to assess (1) whether they understood and recall a fact about the video (control question: e.g., “Where is the bag?”/“What emotion is Ellen feeling now?”), (2) what one character thinks the second character knows or believes about the object (cToM) or emotion (eToM) (first order ToM question: e.g., “Where does Jack think the bag is now?”/“What does Jack think Ellen is feeling now?”), and (3) what one character thinks the second character thinks the first character believes about the object or emotion (second-order ToM question: e.g., “Where does Jack think Ellen thinks the bag is now?”/ “What does Ellen think Jack thinks Ellen is feeling now?”). Half of the scenarios involve a “cheat” condition, in which one of the characters is unwittingly observed by the other, making ToM deductions more executively complex. The maximum score for each condition is 8 points for the control question, and 16 total points for answering the first- and second-order ToM questions correctly.
This structure allows assessment of examinees’ ability to conduct perspective-taking in order to correctly identify characters’ knowledge and beliefs by making first- and second-order ToM inferences of graded executive complexity about non-emotional concrete and visually presented stimuli (cToM) and explicitly labeled emotional material, distinct from any concurrent emotion reading deficits (eToM). Performance on the control scale is designed to indicate whether the individual's other non-social cognitive deficits may have interfered with their ability to accurately perform the ToM elements of the task. Each condition takes approximately 10 min each to complete. The test videos, photos, and questions can be integrated into any standard questionnaire administration platform; for this study, the UCSF instance of the Qualtrics survey tool was used (http://www.qualtrics.com). While the test can be self-administered, for the purposes of this study a psychometrician was present to assist in case participants with cognitive disorders became confused or distracted while taking the task.
Emotion measures
We also administered face-to-face tests to evaluate participants’ capacity for emotional reading and mood. Patients also underwent testing with either of two video-based emotion reading tests over the multi-year time course of the study. The Awareness of Social Inference Test Emotion Evaluation Task (TASIT-EET) is designed for the clinical assessment of emotion reading based on videos of dynamic social interactions where basic emotions are expressed via semantically neutral scripts. 40 For this study, we used the first 14 of the 28 TASIT-EET videos for brevity. After watching the ∼20-s videos, participants were asked to select one of the six basic emotions (happy, surprised, angry, frightened, disgusted, or sad) plus neutral. This test is commonly used for evaluation of persons with dementia, showing distinct performance profiles across syndromes and strong correspondence with underlying brain function.41,42 The Dynamic Affect Recognition Test (DART) is a similarly structured video-based test for which participants watch twelve short (∼20-s) videos of an actor portraying an emotion via a semantically neutral script, then, they are given the same six basic emotions (without neutral) as options to describe the emotion the character displayed in the video, for a maximum total score of 12. Participants over the long timespan of the study had either one or the other task, but because they had similar structure and psychometrics, we combined them to create an Emotion Reading summary score. We used the EET score when available or used raw DART*1.17 if EET was not available, thus yielding a total score of 14 for either test. With the assistance of the examiner, participants also completed the Geriatric Depression Scale (GDS) to self-identify symptoms of depression. 43
Direct neuropsychological testing
We use the Mini-Mental State Examination (MMSE) as a general measure of cognitive impairment. The neuropsychological evaluation used cognitive measures of each primary cognitive domain (memory, executive functioning, language, and visuospatial functioning). Details on the neuropsychological protocol can be found elsewhere;15,44,45 briefly, tests included the California Verbal Learning Test (Short form) for verbal memory, the Benson Figure for visuoconstruction and visual memory; Stroop color-naming speed and color-word interference speed, Trailmaking test speed, digit span forwards and backwards, and the Design Fluency Condition 1 task from the Delis-Kaplan Executive Function Scales to measure executive functioning; and the 15-item abbreviated version of the Boston Naming task, Animal fluency, D-word fluency, and the Wide Range Achievement Test reading condition to evaluate the language domain.
Informant-based measures of real-life functioning
Behavior
To obtain an observer-based measure of participants’ real-life behavioral symptoms, family member informants were asked to report on participants via semi-structured interviews and questionnaires. We used the 12 subscale scores (frequency by severity product) and the total score from the Neuropsychiatric Inventory (NPI). 46 We also used the Revised Self-Monitoring Scale (RSMS) 47 as an informant-based measure of participants’ socioemotional sensitivity and responsiveness to subtle emotional expressions during face-to-face interactions. 48
Disease severity
The CDR Sum of Boxes Score and Dementia Staging Instrument and the Global CDR score were recorded as the primary measures of disease severity. 49
Procedures: Statistical analyses
Statistical analysis was performed using IBM SPSS Statistics software (Armonk, NY, USA) and Graph Pad Prism 6 software (San Diego, CA, USA). Depending on the number of independent variables, degree of normality of data distribution (Kolmogorov–Smirnov normality test), and the experimental groups compared, we used Student's t-test or two-way ANOVA followed by Dunnett's multiple comparison test. For non-normally-distributed data, we used the Mann–Whitney U test to compare two groups or a non-parametric alternative to multi-factorial ANCOVA (Quade's test) 50 with a post hoc analysis, using Dunnett's multiple comparison test. For Spearman's correlations, the following interpretations were used: 0.0–0.25 “very weak,” 0.26–50 “weak,” 0.51–0.75 “moderate to strong,” 0.76–1.0 “very strong to perfect”. 51
Evaluation of CEPT psychometric properties
To evaluate the content validity of the CEPT, we used Cronbach alpha, intraclass correlation coefficients, and subscale-test correlation approaches to assess the homogeneity and internal consistency of the CEPT in the control cohort. The Bland-Altman was then used to plot the test-retest reliability of cToM total scores in the control cohort at two-time intervals (2 weeks and one year after the first evaluation), reflecting the test stability in a healthy aging cohort. This graphical method offers a means to assess the concordance between repeated measurements. It involves plotting the disparities between test and retest scores against each participant's average test and retest scores. Confidence intervals are computed for the mean difference to ascertain whether it significantly departs from zero. 52 Means and standard deviations of all CEPT subscale scores were derived in the control cohort to provide normative data for clinical evaluation.
To examine the convergent and discriminant validity of the CEPT, i.e., the degree to which scores reflect predicted associations with other domains of function outside of Theory of Mind, we used Spearman's rank correlation coefficient, performed in each group (control cohort, clinic, and total). We expected higher degrees of association between CEPT scores and (1) disease severity (as measured by CDR®), (2) level of executive cognitive control (represented by Stroop interference score), and (3) level of socioemotional functioning (represented by Emotion Reading score). We also examined the association with (4) speech and language functions (represented by phrase repetition and verbal fluency scores) to show the discriminant validity of the CEPT, predicting they should have lower associations. Because the CEPT data was not distributed normally, concurrent validity was assessed by determining Spearman's rank correlation coefficient (ρ) to measure the strength of a monotonic relationship between the CEPT and each test score.
For criterion-related validity, i.e., the capacity of the CEPT to predict diagnostic group membership, we compared all CEPT summary scores from both cToM and eToM conditions across all groups (NC, AD, bvFTD, svPPA, nfvPPA, and PSP) using non-parametric ANCOVA, with the expectation that poorer scores on the CEPT would predict patient group membership and that syndromes with known behavioral disorders such as bvFTD and svPPA would show worse performance than those with non-behavioral syndromes such as AD and nfvPPA.
Additional analyses
Multiple regression modeling examined the relationship between CEPT performance and other cognitive functions in control and combined samples, controlling for age and sex. Initially, both forward and backward stepwise analyses were conducted for predictor validation utilizing all the assessed cognitive scores. Subsequently, the variables that remained significant via both methods were selected for final modeling to ascertain the stability of the models. A conclusive model was developed using the “entry method,” with 95% bias-corrected bootstrapped confidence intervals involving 10,000 repetitions (to address non-normally distributed data).
Finally, we analyzed CEPT scores to determine if they predict real-life behaviors reported by informants (i.e., neuropsychiatric symptom burden according to the NPI and poor interpersonal sensitivity according to the RSMS). We performed linear regressions with bootstrapping. We also controlled for bvFTD group membership because bvFTD patients are known to have significantly worse neurobehavioral symptoms, and we wished to identify any relationship between CEPT and behavior that generalized beyond individuals with bvFTD.
Results
Sample characteristics
Control cohort. In the neurologically healthy cohort, aged 19 to 78, we did not find a significant influence of age or education on CEPT scores. In the cToM condition, females performed slightly better on the Control Task Total score (M = 3.97 versus 3.87 in males; U: 4342.5, p < 0.05) and some eToM tasks (Table 1), though the magnitude of the score differences was so small they would be unlikely to have clinical relevance on an individual level. Data for normative CEPT performance clustered by age (≤45, 46–70, 71≤) can be also found in Table 1.
Demographic characteristics and CEPT performance in the normal control sample grouped by age and sex.
a = χ2, b = Kruskal-Wallis's test; #p < 0.10; *p < 0.05; **p < 0.01; and ***p < 0.001.
Clinic cohort
The subset of 41 healthy controls that were age-matched to our neurodegeneration patients (age 40+) were compared to our patient groups by age, sex, and education and were found to be somewhat younger (F: 18.925, p < 0.01), though education did not statistically differ. Sex distribution significantly differed across NC and patient groups (F: 62.348, p < 0.001). Age and sex were therefore included as confounding covariates in subsequent comparative and regression analyses. Predictably, clinical test scores were significantly worse in the patient groups than the NCs, including the MMSE, Global CDR® score, and CDR® Sum of Boxes score (p < 0.001). However, this cohort was generally very early in their disease progression (CDR®: 0.80 ± 0.53), which is optimal for demonstrating focal cognitive deficits across individuals, since with at more advanced disease stages, patients with neurodegenerative diseases will fairly uniformly fail executively demanding tasks.
Psychometric properties of the CEPT
The primary psychometric characteristics of the CEPT were first evaluated in the control cohort. Additional reliability and construct validity analyses were then performed within the clinic cohort.
Reliability (control cohort)
Cronbach's alpha for cToM Total score (α = 0.847) and eToM Total Score (α = 0.813) reflected a good internal consistency 53 (see Supplemental Table 1). Additional information about internal consistency in the clinic cohort, with alphas ranging from 0.91–0.94, can be found in Supplemental Table 2, and correlations between each subscale and the overall test for the control cohort can be found in Supplemental Table 3.
The retest reliability of the cToM Total score was evaluated using the Bland-Altman test (Supplemental Figure 1). The mean difference between test and retest responses across control participants after two weeks (0.08 ± 0.69; 95% CI: −1.28 to 1.45) indicated no significant change in results, showing good test-retest reliability. Similarly, no significant difference was found between baseline and one year (0.07 ± 0.09; 95% CI: −1.24 to 1.38), suggesting good test stability in healthy older adults.
Concurrent validity
Control cohort: In this sample, cToM and eToM Total scores were not significantly correlated. We analyzed convergent and divergent validity for cToM Total and eToM Total scores against Stroop (executive) and Emotion Reading (socioemotional) scores. We found that the CEPT scores did not significantly correlate with the executive or socioemotional domains. The lack of correlation among these measures reflects some ceiling effects and a lack of variance in performance in the control cohort. Clinical cohort: The clinical group, in contrast, did show a significant correlation between the cToM and eToM Total scores (r = 0.646, p < 0.001) (Supplemental Table 2). A moderate negative correlation was found between CDR® and both cToM (r = −0.424, p < 0.001) and eToM (r = −0.447, p < 0.001), suggesting that poorer CEPT scores correspond with increasing clinical symptom severity. Significant, moderate correlations were found between the Stroop score and both cToM (r = 0.441, p < 0.001) and eToM Total scores (r = 0.304, p < 0.001), as well as between the Emotion Reading score and both cToM Total (r = 0.413, p < 0.001) and eToM Total scores (r = 0.440, p < 0.001). Thus, in a clinical sample, performance on the CEPT is distinct from, but related to, performance on executive and socio-emotional measures.
Clinical correlates of the CEPT in neurodegenerative disease patients
Differential performance across patient groups
The comparisons between NC and the clinical groups suggest good criterion validity for predicting neurodegenerative group membership using the CEPT (see Table 2). AD and bvFTD groups performed significantly worse than NCs for all the CEPT scores (p < 0.001), scored quantitatively worse than other patient groups, and were more likely to fail the Control condition, suggesting a significant influence of general cognitive dysfunction on their CEPT theory of mind scores. svPPA and PSP scored better than AD and bvFTD patients and did not show impairment on the Control condition but still performed significantly worse than controls on both cToM Total score (p < 0.01) and eToM Total score (p < 0.001). nfvPPA patients performed usually on the cToM condition but scored significantly worse than controls on eToM Total scores (p < 0.05). cToM and eToM distribution between clinic groups is in Figure 1.

CEPT performance, including cognitive and emotional theory of mind conditions, across neurodegenerative disease syndrome groups. (a) cToM Total ToM score (out of 16); (b) eToM Total ToM score (out of 16).
Demographic characteristics and CEPT performance in the clinical cohort by diagnostic group.
a = χ2, b = Kruskal-Wallis's test; #p < 0.10; *p < 0.05; **p < 0.01; and ***p < 0.001.
Influence of cognition on CEPT performance
We used regression modeling to study the relationship between domain-specific cognitive deficits and CEPT performance in our patients with neurocognitive disorders (see Supplemental Table 4). We found that Emotion Reading, Stroop Interference (i.e., cognitive control), Benson Figure Copy (visuospatial functioning), and Benson Figure Delay (visuospatial memory) all contributed independently to the prediction of cToM performance in the clinical cohort (p < 0.0001). eToM performance was predicted by Emotion Reading, Benson Figure Copy, California Verbal Learning Trials 1–4 Correct (auditory attention), and Design Fluency score (generation of nonverbal material) (p < 0.0001).
CEPT as a predictor of real-life behavior
We analyzed the relationship between cToM Total score and behavior using models controlling for age, sex, and bvFTD group membership (Table 3). In the case of NPI subscales, cToM predicted Apathy and Disinhibition scores with a medium-size effect, 54 with adjusted R2 of 26% and 20%, respectively, and showed a very weak prediction of anxiety (adjusted R2 = 3%). cToM Total score statistically significantly predicted RSMS socioemotional sensitivity score with an adjusted R2 = 28%, a medium-size effect according to Cohen. 54
Regression models analyzing real-life behaviors (NPI subscales and RSMS) predicted by cToM in the clinical cohort.
#p < 0.10; *p < 0.05; **p < 0.001; NPI: Neuropsychiatric Inventory, RSMS: Revised Self-Monitoring Scale Total Score; cToM: Cognitive Theory of Mind scale; eToM: Emotional Theory of Mind scale; bvFTD: behavioral variant frontotemporal dementia syndrome.
Performing the same analysis with eToM Total score and behavior results (Table 4) showed that eToM significantly predicted several NPI neurobehavior scores, including NPI Total score (adj R2 = 24.5%), apathy (adj R2 = 28.3%), disinhibition (adj R2 = 19.3%, Aberrant Motor Behavior (adj R2 = 19.3%), and Aberrant Eating Behavior (adj R2 = 24.9%). eToM predicted RSMS score (adj R2 = 27.6%) at a medium-size effect, according to Cohen, 54 even after controlling for bvFTD group membership.
Multiple regression model analyzing real-life behaviors (NPI subscales and RSMS) predicted by eToM in the clinical cohort.
#p < 0.10; *p < 0.05; **p < 0.001; NPI: Neuropsychiatric Inventory, RSMS: Revised Self-Monitoring Scale Total Score; cToM: Cognitive Theory of Mind scale; eToM: Emotional Theory of Mind scale; bvFTD: behavioral variant frontotemporal dementia syndrome.
Discussion
Our validation of the CEPT task for individuals with neurocognitive disorders showed that the CEPT has good internal consistency, 2-week test-retest reliability, and one-year test stability in a neurologically healthy older adult sample. The CEPT was particularly sensitive to ToM deficits in clinical populations for whom ToM is known to be significantly impaired, i.e., bvFTD and AD.55,56
The cognitive condition of the CEPT (cToM) showed a moderate correlation with tests of cognitive control, visuospatial processing, and socioemotional functioning and predicted real-life apathy, disinhibition, and socioemotional sensitivity. Performance on the emotional perspective-taking condition (eToM), intentionally designed to avoid any demand that the examinee infer or identify implicit emotions, also correlated with auditory attention and predicted real-life neuropsychiatric behaviors, including disinhibition and socioemotional sensitivity. Thus, the CEPT may be useful for measuring how the ability to predict others’ beliefs and intentions becomes altered in individuals with neurodegenerative conditions.
Theory of mind in healthy aging
When the CEPT was used to examine cognitive and emotional perspective-taking comprehensively in our neurologically healthy control sample, using subscales representing tasks of increasing executive complexity (i.e., first and second-order ToM deliberations, with and without a cheat condition), we saw no effect of age on performance, with young-, middle-, and older-adults performing equally well. 57 These results support the stability over the adult lifespan of the cognitive processes involved in understanding others’ beliefs and intentions and the robustness of socioemotional functioning even in late adulthood.19,20,55
Our study did not find differences in cognitive and emotional ToM capacity in healthy individuals, at least as measured by the CEPT test. Furthermore, we found no correlation between CEPT scores and other cognitive processes in our neurologically healthy sample, such as inhibitory control and emotion reading, though these correlations were found in our clinical sample. Potentially, the lack of results reflecting age-related nuances in ToM functioning in our study may have been because our healthy cohort showed some ceiling effects in their performance of the CEPT. This result contrasts with other reports, where investigators found ToM alterations associated with aging.21,23 This discrepancy may derive from multiple factors, such as differences in the measurement tasks across studies 58 (e.g., written stories, pictures, or videos as stimuli; asking participants to show comprehension of narrative histories versus asking them to perform a social events analysis; measuring first- or second-order false beliefs in contrast to higher-order ToM). The degree to which a task is considered to be the speed of response and accuracy may also have resulted in some studies showing age-related ToM alterations. 23 Aging and brain health are not uniform; thus, any other task differences that placed particular demands on the aspects of executive function that show greater individual differences and instability with age (e.g., working memory, information processing speed, inhibitory control) could result in age-related ToM differences, though it remains unclear the degree to which these findings truly reflect healthy participants’ ability to maintain ToM as they age.59,60
Our control participants were carefully selected and underwent a much more rigorous inclusion screening than is typically performed in studies of ToM in community-based samples, including comprehensive screening for neurologic disease, neuropsychological testing, neurologic examination, and structural MRI scanning. Arguably, this may have resulted in an unusually healthy control sample that does not genuinely reflect the full range of aging-related neurologic problems seen in community samples; however, it also suggests that ToM declines, and by extension declines in executive function and socioemotional processing, may reflect subclinical neurologic disease, and may not be a part of genuinely healthy aging.
Impact of neurodegeneration on the theory of mind
Testing the ToM in individuals with concurrent cognitive deficits has always been challenging practically, both in clinical and research settings. Decoding others’ thoughts and intentions is a higher-order cognitive process that depends on numerous other subordinate cognitive functions, including memory, executive processing, and the ability to read emotions and social cues. Performance on ToM tests will also rely on the examinee's capacity to perform perceptual processing of the stimuli, whether presented in verbal or pictorial format. Thus, studies in clinical samples have repeatedly found correlations between ToM performance and numerous other cognitive domains, and our study further confirms this relationship. We found that in the individuals with neurodegenerative disease in our study, CEPT scores were strongly associated with inhibitory control, visual-constructional ability, auditory attention, visual memory, and the ability to spontaneously generate non-verbal material, suggesting an involvement of a wide range of circuits throughout the brain. However, this also confirms that when patients with neurodegeneration and other neurologic diseases have deficits in non-social cognitive domains, particularly in executive functioning, this not only impacts test performance but likely directly alters their capacity for cognitive and emotional perspective taking in real life. 60
Many existing emotional ToM tasks require that emotion be inferred from the stimuli, either from facial or vocal expressions or contextual cues. The eToM condition of the CEPT was intentionally designed to dissociate the “perspective-taking” elements of ToM from pure emotion reading, instead explicitly telling examinees what emotions the actors were feeling but asking them to recognize when one character had never observed the emotion and to identify the limitations of their knowledge. This allows the eToM task to strongly parallel the cToM condition structurally but also makes it one of the first ToM tasks designed specifically for individuals with diverse patterns of cognitive deficits. Individuals with emotion reading impairments but no additional perspective-taking impairments should typically perform this task; conversely, an individual with an actual perspective-taking impairment will show deficits on the CEPT, even if their emotion reading remains intact.
Many brain regions focally disrupted by neurodegeneration seem to be required for ToM reasoning. There is a growing consensus among researchers that a widely distributed set of brain regions are involved in ToM, including crucial nodes in the medial prefrontal cortex, the temporal-parietal junction, the posterior cingulate cortex, the medial orbitofrontal cortex, and the precuneus.61,62 In addition to this core “mentalizing network,” other brain networks give support to ToM processing, including the salience network (involved in detecting salient stimuli and allocating attention to those stimuli, focally affected in bvFTD 63 and default mode network (involved in memory processes, imagination, and scenario prediction, focally affected in AD64,65 For example, there is evidence that the mentalizing network may rely on the salience network to detect social cues, and the default mode network may provide key personal memories for building inferences about others’ intentions.12,66,67 Developing a more nuanced understanding of the neural basis of ToM will deepen not only our knowledge of how this ability develops in childhood but also how it can become impaired in neurodegenerative and neuropsychiatric disorders.
Clinical implications
We found that both the cToM and eToM conditions of the CEPT could discriminate between neurologically healthy individuals and patients in the early phase of neurodegenerative disease. Persons with AD and bvFTD displayed more significant impairments in recognizing others’ beliefs on the cToM, which aligns with previous studies evaluating the theory of mind in neurodegeneration.16,27,68 Both persons with svPPA and PSP displayed somewhat less severely reduced capacity for cognitive perspective-taking, similar to previous reports, 15 and persons with nfvPPA performed at the same level as controls.
The eToM condition of the CEPT detected impairments in persons with AD, bvFTD, svPPA, and PSP relative to controls, whereas nfvPPA displayed minor changes. While deficits in what has been termed “emotional Theory of Mind” have previously been seen in various neurodegenerative syndromes,55,56,69–71 other studies have almost exclusively used stimuli that require spontaneous emotion identification as a part of the test. However, the eToM condition of the CEPT does not require explicit emotion identification. Thus, our result shows for the first time that the ToM deficits seen in these patient groups are not simply due to difficulties with emotion reading but stem from impairments in cognitive processes directly related to perspective-taking, regardless of the content of the target individual's knowledge (i.e., location of an object, or an emotional state).
Somewhat worse performances on the eToM than cToM condition across patient groups suggests that following others’ knowledge of an explicitly identified emotion is potentially more difficult than following knowledge of the location of an object, perhaps due to the comparative lack of concreteness of the emotion, or because more complex neural processes are required to maintain the cognitive representation of another's emotion during working memory manipulations. We also found that the capacity to accurately read emotions still strongly correlated with performance on both the cToM and eToM conditions. This may reflect the fact that many patients with perspective-taking deficits also have emotion reading deficits, despite our evidence in this study that they are clinically dissociable symptoms.
The fact that both conditions of the CEPT showed strong correlations with tests in several other cognitive domains also has important clinical implications. Rather than a purer test that detects focal changes in a single functioning domain, the perspective-taking capacities measured by the CEPT depend on executive functioning, memory, visuospatial skills, attention, and some aspects of language. Thus, performance will be perturbed by alterations in any of those cognitive domains. As a result, low scores on this task during neuropsychological evaluations will not necessarily yield clear data on brain-behavior relationships or even specific diagnostic syndrome classification; however, reduced scores will be more likely to correspond to realistic deficits in social perspective-taking.
We found that among patients with neurodegenerative disease, performance on the cToM and eToM conditions of the CEPT could predict informant-reported neuropsychiatric symptoms and socioemotional sensitivity in real life; while other ToM tests used in persons with dementia may also correspond to real-life social deficits, studies often do not include this explicit comparison to allow this to be evaluated.16,27 Face-to-face tests that are able to reflect socioemotional and behavioral deficits in a more ecologically realistic manner are more likely to be of practical benefit to practitioners in both clinical research and clinical settings, particularly those where informant reports are unavailable. Rather than being a purely academic measure of perspective-taking, performance on the CEPT seems to reflect real-life socioemotional changes, even considering the early stage of the patients in our study.
In patients like those with bvFTD for whom the earliest neurologic signs of disease are not cognitive, but fall in the domain of social cognition, tests that can detect and quantify such impairments may be a pivotal part of diagnostic evaluation and disease staging.42,72 While the patient may not have insight into any changes, and close informants may not have the language to describe the changes they are perceiving (e.g., “She is more distant; her personality has changed.”), neuropsychological testing using validated social cognition tests like the CEPT can yield quantitative corroboration for the clinician or researcher. 73 The role of these tests in early detection of neurodegenerative disease will become increasingly urgent as disease-modifying treatments for tau and TDP pathologies become publicly available.
Limitations and conclusions
One limitation of this study is that it was performed in a population of predominantly white, well-educated individuals from the US, and we do not yet have evidence of the utility or generalizability of these results to more ethnically, culturally, or socioeconomically diverse samples. Further validation is needed to expand on normative and clinical data in these broader samples and determine if specific test materials or scoring adaptations are required for valid testing outside our reference sample. Also, though we were able to show good test-retest reliability and test stability in healthy individuals with the cToM condition, these psychometric features have not yet been evaluated in the eToM condition, which was constructed after considerable data collection with the cToM had already been conducted. Further studies are needed to determine the longitudinal characteristics of CEPT performance among patients throughout disease progression and whether the test remains useful at later stages. Analysis of the collinearity between CEPT scores and tests of executive function and emotion reading showed that both the cToM and eToM conditions corresponded to cognitive and emotional deficits in the patients. This may have been an artifact of the manner in which cognitive deficits co-occurred in our patient sample (e.g., patients with perspective taking deficits almost always showed concurrent emotion reading deficits because the disease causes stereotypical patterns of concurrent neural damage to distinct circuits), though this finding suggests further studies disambiguating the cToM and eToM conditions are needed. In particular, examining the specific neuroanatomic correlates of performance on each condition would be useful, and further would provide valuable data supporting the clinical interpretation of results during neuropsychological evaluations, by providing evidence for how test performance reflects the function of specific brain circuits affected by early neurodegeneration. For any neuropsychological test, psychometric precision in a face-to-face testing scenario does not necessarily indicate that the test will reliably or accurately reflect an individual's capacities in diverse and complex real-life situations, and the CEPT is no exception. Despite our comparatively small sample size, however, this study showed that the CEPT differentiates between healthy controls and participants in the early stages of neurodegenerative disease, and that test performance had a meaningful relationship with patients’ real-life behavior, including alterations in socioemotional and neuropsychiatric functioning. Given these useful characteristics, the CEPT may be a helpful addition to neuropsychological batteries attempting better to characterize social cognition in patients with early neurodegenerative disease.
Supplemental Material
sj-docx-1-alz-10.1177_13872877251317683 - Supplemental material for Validation of the Cognitive-Emotional Perspective Taking test in patients with neurodegeneration
Supplemental material, sj-docx-1-alz-10.1177_13872877251317683 for Validation of the Cognitive-Emotional Perspective Taking test in patients with neurodegeneration by Jonathan Adrián Zegarra-Valdivia, Tal Shany-Ur, Myrthe Gwen Rijpma, Patrick Callahan, Pardis Poorzand, Scott Grossman, Bailey McEachen, Joel H Kramer, Bruce L Miller and Katherine P Rankin in Journal of Alzheimer's Disease
Footnotes
Acknowledgments
We gratefully acknowledge the participants and their families for contributing to this research.
ORCID iDs
Author contributions
Jonathan Adrián Zegarra-Valdivia (Formal analysis; Methodology; Writing – original draft; Writing – review & editing); Tal Shany-Ur (Data curation; Investigation; Methodology; Project administration); Myrthe Gwen Rijpma (Data curation; Formal analysis; Investigation; Methodology; Project administration); Patrick Callahan (Data curation; Investigation; Methodology; Project administration); Pardis Poorzand (Data curation; Investigation; Methodology; Project administration); Scott Grossman (Data curation; Investigation; Methodology; Project administration); Bailey McEachen (Data curation; Investigation; Project administration); Joel H Kramer (Funding acquisition; Investigation; Methodology; Project administration; Resources; Supervision; Validation); Bruce L Miller (Funding acquisition; Investigation; Resources; Validation; Visualization; Writing – review & editing); Katherine P Rankin (Conceptualization; Data curation; Funding acquisition; Investigation; Methodology; Project administration; Resources; Supervision; Validation; Writing – original draft; Writing – review & editing).
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The National Institutes of Health supported this work under Grant (numbers R01AG029577/RF1AG029577, K23-AG021606 (PI: Rankin), P01AG019724 (PI: Miller), P50AG023501 (PI: Miller), P30 AG062422 (PI: Rabinovici); Larry L. Hillblom foundation under Grant (2014-A-004-NET (PI: Kramer)).
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability
Data is available upon reasonable request.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
