Abstract
Background:
Individuals with amnestic mild cognitive impairment (aMCI), especially those with multidomain cognitive deficits, should be clinically examined to determine their risk of developing Alzheimer’s disease. English-speakers with aMCI exhibit language impairments mostly at the lexical–semantic level. Given that the language processing of Mandarin Chinese is different from that of alphabetic languages, whether previous findings for English-speakers with aMCI can be generalized to Mandarin Chinese speakers with aMCI remains unclear.
Objective:
This study examined the multifaceted language functions of Mandarin Chinese speakers with aMCI and compared them with those without cognitive impairment by using a newly developed language test battery.
Methods:
Twenty-three individuals with aMCI and 29 individuals without cognitive impairment were recruited. The new language test battery comprises five language domains (oral production, auditory and reading comprehension, reading aloud, repetition, and writing).
Results:
Compared with the controls, the individuals with aMCI exhibited poorer performance in the oral production and auditory and reading comprehension domains, especially on tests involving effortful lexical and semantic processing. Moreover, the aMCI group made more semantic naming errors compared with their counterparts and tended to experience difficulty in processing items belonging to the categories of living objects.
Conclusions:
The pattern identified in the present study is similar to that of English-speaking individuals with aMCI across multiple language domains. Incorporating language tests involving lexical and semantic processing into clinical practice is essential and can help identify early language dysfunction in Mandarin Chinese speakers with aMCI.
Keywords
INTRODUCTION
Mild cognitive impairment (MCI) is a condition associated with an increased risk of developing dementia [1, 2]. Because of heterogeneity in its clinical presentation, MCI is classified into four subtypes on the basis of two dimensions: primary episodic memory impairment (i.e., amnestic and nonamnestic) and impairment in single or multiple cognitive domains [3]. Amnestic MCI (aMCI) with impairment in single or multiple domains might be involved in the progression of Alzheimer’s disease (AD) [4]. Compared with individuals with only memory impairment, those with aMCI in multiple domains, including language, are more likely to develop the AD type of dementia [5–8]. Thus, in individuals with aMCI, impairment in cognitive domains apart from episodic memory impairment should be clinically examined to determine their risk of dementia [9, 10].
Patients with dementia due to AD commonly exhibit language impairments in the aspects of verbal fluency [8, 11], semantic knowledge [12, 13], and connected speech processing [14, 15]. Similar to the pattern observed during the early stage of AD, language function deficits in aMCI are characterized at the lexical and semantic levels by impairments in verbal fluency and confrontation naming [15–19]. Moreover, individuals with aMCI make more semantic coordinate errors (e.g., naming a tiger as a bear) than other types of errors in naming tasks [13, 20]. The lexical-semantic deficits observed in aMCI are likely attributable to difficulties in lexical access and semantic processing, encompassing semantic representation and control [13, 21]. These difficulties in semantic processing are likely linked to compromised brain networks involving temporal and frontal regions and their connections [22, 23], as well as volumetric reductions in the perirhinal cortex, which is essential for integrating cross-modal object representations, especially in patients with aMCI [24, 25].
Although lexical–semantic impairments appear to be a crucial characteristic of language impairments in individuals with aMCI, conflicting results concerning individuals with aMCI have been reported [13, 18]. The verbal fluency task is among the most commonly used to assess lexical–semantic function, and it typically involves semantic and phonemic fluency tasks. In semantic verbal fluency tasks, participants are instructed to provide as many exemplars as possible for a given semantic category within a specific time interval. In phonemic tasks, participants are asked to provide as many words as possible on the basis of a phonological feature (e.g., a specific letter) within a specific time interval. Compared with controls without cognitive impairment, individuals with aMCI exhibited poorer performance in both fluency tasks; however, the difference in performance was more prominent in the semantic fluency task than in the phonemic fluency task [19, 26–29]. Conversely, other studies have reported either no differential deficit between semantic and phonemic fluency in individuals with single-domain aMCI [30] or greater impairment in phonemic fluency than in semantic fluency [31] in individuals with aMCI compared with controls without cognitive impairment. These mixed results were likely due to the inclusion of heterogeneous MCI samples across studies (e.g., the inclusion of individuals with single-domain versus those with multidomain aMCI) and specific task methods (e.g., use of different semantic categories) in each study [18, 30].
Although many studies have examined the lexical-semantic aspect of language impairments in MCI, such as in naming and fluency tests, other language deficits have been observed. Comprehensive language test batteries, employed by Jokel et al. [32] and Tsantali et al. [33], have demonstrated that individuals with aMCI exhibit deficits in tasks involving language functions beyond lexical-semantic processing, such as sentence comprehension and auditory spelling. In addition, studies utilizing language tests that focus on aspects other than lexical and semantic processing, such as reading aloud, repetition, and writing abilities, have found more significant impairments in older adults with aMCI compared to healthy controls [34–38]. Poor performance in these tasks may be partially attributed to decreased executive control and working memory span, which can negatively impact processes involved in understanding various syntactic structures and temporarily storing separate letters for different words [39]. These findings suggest that a multifaceted test battery can provide valuable information about the language function of individuals with aMCI.
The aforementioned studies have primarily enrolled English-speaking participants. Thus, whether their findings can be generalized to non-English-speaking participants, such as Mandarin Chinese speakers, remains unclear [40]. Mandarin Chinese, a logographic language, differs from English, an alphabetic language, in its phonological, semantic, and orthographical processing features. For example, the phonological processing unit in Mandarin Chinese is the syllable rather than the phoneme, as in English, and such syllable-based processing aligns with the fact that each Chinese character is an orthographical processing unit that also maps to a syllable [41]. In semantic processing, each Chinese character is a morpheme that directly represents meaning, whereas letters in alphabetic languages primarily represent sounds [42]. Moreover, lexical tone is a specific suprasegmental feature in Mandarin Chinese and reflects the categorical perception of pitch contours for native Mandarin Chinese speakers [43]. Studies examining neural networks for language processing have revealed common and distinct patterns between Mandarin and English [19, 44]. Thus, given the differences in linguistic features between the two languages, the extent to which the findings based on English speakers with aMCI are applicable to Mandarin Chinese speakers with aMCI should be explored.
Some studies enrolling Mandarin Chinese speakers have demonstrated that compared with healthy control participants (HCs), individuals with aMCI had deficits in lexical semantics [19, 45], tonal discrimination [46], or sentence repetition [47], or made more semantic and phonemic errors in writing samples [48]. However, comparing the results of the aforementioned studies is difficult because they used diverse inclusion criteria for individuals with aMCI. Moreover, the aforementioned studies examined only a single aspect of language function without evaluating various language functions in the same individuals. Thus, the detailed and comprehensive language profile of Mandarin Chinese speakers with aMCI should be determined using a multifaceted language test battery.
Currently, few test batteries are available to evaluate the multifaceted language functions of Mandarin Chinese-speaking populations. Translations of test batteries from other languages may be insufficient because they neglect language features specific to Mandarin Chinese, such as challenges in tonal discrimination and dyslexia [40, 49]. To the best of our knowledge, only the Concise Chinese Aphasia Test (CCAT) [50] has been developed and standardized to evaluate multifaceted language functions in Mandarin Chinese speakers. The CCAT consists of nine subtests that cover expressive and receptive language domains, including the tasks of answering simple questions regarding participants’ personal information and life experiences, auditory and reading comprehension, and writing. However, the CCAT was primarily developed for patients with focal brain lesions, and its test materials do not cover language aspects such as semantic knowledge and verbal fluency that are crucial for evaluating neurodegenerative disorders, such as dementia due to AD.
To address these research gaps, this study investigated and compared multifaceted language functions between older Mandarin Chinese speakers with aMCI and older adults without cognitive impairment by using a newly developed language test battery. This test battery covers five domains of language functions, namely oral production, auditory and reading comprehension, reading aloud, repetition, and writing, based on the input and output modalities during the language processing in each test [51]. Notably, incorporating writing tests into our battery development is imperative, especially when accounting for the distinctive features of Chinese in comparison to English. The Chinese writing system is distinguished by heightened character complexity, necessitating robust orthographic memory, a meticulous adherence to stroke order and structure, and heightened demands for lexical precision. This is particularly evident during a written dictation test, where participants are tasked with distinguishing between distinct characters that share the same phonetic representation (homophones) or exhibit similar phonetic variations (tonal variations). This challenge underscores the need for precise lexical discrimination in the context of the test.
Previous studies have indicated that individuals with aMCI primarily had lexical and semantic impairments. Accordingly, we hypothesized that compared with older adults without cognitive impairment, older adults with aMCI would exhibit impairments in the language function domains of oral production and of auditory and reading comprehension, which require effortful lexical and semantic processing. In addition, we hypothesized that both groups would demonstrate comparable performance in other language functions, including reading aloud, repetition, and writing, that require less effort in lexical and semantic processing. Further, we hypothesized that the aMCI group would primarily have language deficits in lexical and semantic processing and perform poorly in semantic verbal fluency and naming tests. Moreover, given the predominance of lexical–semantic deficits in aMCI, we hypothesized that compared with HCs, individuals with aMCI would make more semantic errors but comparable phonological errors in the naming task. In light of the evidence suggesting that executive dysfunction may contribute to difficulties in certain language tasks in aMCI [19, 39], we sought to investigate the associations between performances on executive function-related tests and those in each language domain.
MATERIALS AND METHODS
Participants
In this study, we included 23 individuals with aMCI (age: range = 61–82 years,
According to standards proposed by the International Working Group [2], individuals were diagnosed as having MCI when they met all the following conditions: 1) being neither normal nor having dementia, 2) having preserved activities of daily living and either intact or minimally impaired complex instrumental functions, and 3) having evidence of objective cognitive impairment on neuropsychological tests. Objective cognitive impairment was determined if participants scored 1
Neuropsychological evaluation
To evaluate baseline functions, the participants were administered a comprehensive neuropsychological test battery covering four cognitive domains: attention and processing speed, learning and memory, language, and executive function. Attention and processing speed were measured using the Digit Span Forward length of the Wechsler Adult Intelligence Scale–Third Edition (WAIS-III) [57] and the word condition of the Color–Word Interference Test (CWIT) of the Delis–Kaplan Executive Function System (D-KEFS) [58]. Learning and memory were evaluated using the immediate and delayed recall conditions of the Logical Memory (LM) subtest of the Wechsler Memory Scale–Third Edition (WMS-III) [59] and the immediate and delayed recall conditions of the Rey Complex Figure Test (RCFT) [60]. Language function was evaluated using the Vocabulary subtest of the WAIS-III and the 30-item Boston Naming Test [61]. Executive function was measured using the Part 2-minus-Part 1 reaction time of the Color Trails Test (CTT) [62] and the inhibition and switching condition of the CWIT. Participants’ depressive status was assessed using the short form of the Geriatric Depression Scale (GDS) [63].
Mandarin Chinese language test: Construction, administration, and scoring
The Mandarin Chinese language test battery was constructed on the basis of the English language–processing framework from previous studies [64, 65]. Specific language features in Mandarin Chinese, such as lexical tones and Mandarin phonetic symbols (i.e., Zhuyin), were included to comprehensively investigate the processing of Mandarin Chinese. The test battery consisted of 20 tests belonging to five domains: 1) oral production, 2) auditory and reading comprehension, 3) reading aloud, 4) repetition, and 5) writing ability. Table 1 summarizes the 20 tests. Each domain score was computed as a composite score by averaging
Summary of 20 tests included in the Mandarin Chinese language test battery
See the Supplementary Material for detailed information on construction, administration, and scoring of each language test.
Picture stimuli were black-and-white line drawings developed on the basis of prior studies including sets of standardized pictures [51, 67]. Auditory stimuli were prerecorded using a male voice for tests relying on auditory input processing to standardize test administration. Because the characteristics of language stimuli may affect test performance in clinical populations [51, 66], an independent group of participants without cognitive impairment (
Statistical analysis
The independent
A 2 (group) × 5 (domain) mixed-design analysis of variance (ANOVA) was performed to analyze group differences among the five language domains. The group served as the between-subject factor, comprising the aMCI and HC groups, while the language domain acted as the within-subject factor, encompassing oral production, auditory and reading comprehension, reading aloud, repetition, and writing domains. Greenhouse–Geisser correction was employed when the assumption of sphericity was violated in the ANOVA. We analyzed the planned simple main effects of the groups on each language domain and the effects of the domains on each group. Whenever a simple main effect reached significance,
When the main effect of the language domains or the interaction of group and language domains reached significance according to the ANOVA results, an independent
Total numbers of semantic and phonological errors on the noun and verb naming test were calculated. The semantic errors were further classified into four distinct types, namely coordinate (e.g., “dog” for cat), superordinate (e.g., “pet” for cat), subordinate (e.g., “British shorthair” for cat), and associate (e.g., “scratchers” for cat). A more comprehensive definition of these four semantic errors can be found in the supplementary materials. Independent
To evaluate the associations between language domains and executive function, we generated a composite variable for executive function by standardizing individual scores on the two executive function measures using the means and standard deviations of the entire cohort. The z-scores from both measures were then averaged. Subsequently, Pearson correlations were calculated between each language domain and the composite executive function score, using data from the full cohort. An
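The composite-score and correlation steps described above can be illustrated with a minimal Python sketch. It is not the authors' analysis code. The function names and the example numbers are hypothetical, the use of the sample standard deviation (n − 1) is an assumption because the text does not specify it, and no sign reversal is applied to time-based measures because the text does not state whether one was used.

```python
from math import sqrt
from statistics import mean, stdev


def z_scores(values):
    """Standardize values using the mean and SD of the whole cohort."""
    m = mean(values)
    s = stdev(values)  # sample SD (n - 1); this choice is an assumption
    return [(v - m) / s for v in values]


def composite_score(measure_a, measure_b):
    """Average the z-scores of two measures, participant by participant."""
    za, zb = z_scores(measure_a), z_scores(measure_b)
    return [(a + b) / 2 for a, b in zip(za, zb)]


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for five participants (not data from this study).
ef_measure_1 = [41.0, 55.0, 38.0, 62.0, 47.0]
ef_measure_2 = [70.0, 92.0, 66.0, 101.0, 80.0]
language_domain = [0.4, -0.3, 0.6, -0.8, 0.1]

ef_composite = composite_score(ef_measure_1, ef_measure_2)
r = pearson_r(ef_composite, language_domain)
```

Participants with a missing executive function measure would need to be excluded before standardization; the sketch assumes complete data.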
To ensure the reliability and validity of the language tasks utilized in this study, we conducted several psychometric evaluations, including test-retest reliability, inter-rater reliability, and convergent validity. Test-retest reliability was assessed by calculating the intraclass correlation coefficients (ICCs) and 95% confidence intervals (CIs) of each language domain’s performance during the first and second administrations, utilizing a single-rating, absolute agreement, and 2-way mixed-effects model [68]. To evaluate inter-rater reliability, the spontaneous speech section and the idiom comprehension test were examined given their potentially subjective scoring involvement. To do so, we randomly selected five cases from the sample and had another rater, who was blind to the clinical diagnosis, rate them in addition to the original scores rated by the first author. The inter-rater reliability was determined using ICCs and 95% CIs based on a single-rating, absolute agreement, and 2-way random-effects model. To validate the new language tasks used in this study, we employed Pearson correlations to assess the relationship between the newly developed language battery and the CCAT battery across the entire cohort. Specifically, scores within the oral production domain underwent correlation analysis with the combined scores of the CCAT Simple Response, Expository Speech, and Naming subtests. Additionally, correlational analyses were conducted between the domain scores of auditory and reading comprehension in our developed test and the aggregate scores derived from the CCAT Auditory Comprehension and Reading Comprehension subtests. We also performed a correlational analysis on the reading aloud domain scores and the scores from the CCAT Reading Comprehension subtest. Furthermore, repetition domain scores were correlated with the CCAT Repetition subtest scores. 
Lastly, scores within the writing domain underwent correlation analysis with the aggregated scores of the CCAT Copying and Spontaneous Writing subtests. The
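For readers unfamiliar with the single-rating, absolute-agreement ICC used in the reliability analyses, the following Python sketch shows how the point estimate can be computed from two-way ANOVA mean squares. It follows the standard McGraw and Wong formulation, in which the two-way random-effects and two-way mixed-effects models share the same formula for absolute agreement and differ only in interpretation. The function name and the example ratings are hypothetical, and confidence intervals are not computed here.

```python
def icc_absolute_single(ratings):
    """ICC(A,1): two-way model, single rating, absolute agreement.

    `ratings` is a list of rows, one per participant, each holding the
    scores from k administrations or raters.
    """
    n = len(ratings)        # number of participants
    k = len(ratings[0])     # number of administrations or raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical test-retest scores for four participants (not study data).
scores = [[0.2, 0.3], [-0.5, -0.4], [1.1, 0.9], [-0.1, 0.0]]
icc = icc_absolute_single(scores)
```

Because the absolute-agreement form penalizes systematic shifts between administrations, a constant practice effect lowers the estimate even when the rank order of participants is unchanged.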
Effect sizes (Cohen’s
RESULTS
Differences in demographic, clinical, and cognitive characteristics between the groups
Table 2 presents the demographic, clinical, and cognitive data of the aMCI and HC groups. Age, years of education, sex distribution, and Mini-Mental State Examination (MMSE) and GDS scores did not significantly differ between the groups (all
Demographic, clinical, and cognitive characteristics of individuals with amnestic mild cognitive impairment (aMCI) and healthy controls (HCs)
CDR-SB, Clinical Dementia Rating-Sum of Boxes; CTT, Color Trails Test; CWIT, Color–Word Interference Test; GDS, Geriatric Depression Scale; LM, Logical Memory subtest; MMSE, Mini-Mental State Examination; RCFT, Rey Complex Figure Test; WAIS-III, Wechsler Adult Intelligence Scale–Third Edition; WMS-III, Wechsler Memory Scale–Third Edition. †raw scores based on the spontaneous naming condition. *
Regarding missing data, one participant from the aMCI group did not complete the CTT due to color blindness. His data were excluded while comparing the CTT score between the groups. Another participant in the HC group did not complete the Zhuyin blending and deletion test due to a lack of experience with Zhuyin symbols. His data were excluded from the analyses concerning Zhuyin blending and deletion test scores.
Group differences in the Mandarin Chinese language test battery
The two-way ANOVA (group × domain) revealed a significant main effect of group,

Language performance of the two groups in the five language domains. Error bars denote the standard error. *significant group difference within each domain at
Group differences at the test level
We analyzed group differences among the various tests for each language domain (Table 3). In the oral production domain, the aMCI group performed poorer than did the HC group on the noun and verb naming test,
Domain composite scores and z-scores of tests in the Mandarin Chinese language test battery
aMCI, amnestic mild cognitive impairment; HC, healthy control. aThe sample size of the HC group was 28 because one participant with missing data was excluded. *
In the auditory and reading comprehension domain, the aMCI group performed poorer than did the HC group on the attribute verification test,
Group comparisons of naming error types and item analyses of passing rates
For the naming errors on the noun and verb naming test, the aMCI group made significantly more semantic errors compared with the HC group,

Semantic and phonological naming errors between groups. Error bars denote the standard error. Panel A shows the total number of semantic and phonological errors by groups. *significant group difference in each type of naming errors at
In the item analysis, the passing rate cutoff was set at 90% for items on which the aMCI group performed poorly. For nouns in the noun and verb naming test, four items had low passing rates: 75% (three out of four items) of vegetable and fruit items (i.e., passing rates for the items “pepper,” “onion,” and “apple” were 30.4%, 78.3%, and 87.0%, respectively) and 33% (one out of three items) of tool items (i.e., the passing rate for the item “hammer” was 47.8%). No passing rate was below the cutoff for items belonging to the categories of animals and clothing. For verbs, 11 items had low passing rates: 67% (four out of six items) of intransitive verbs (i.e., passing rates for the items “pray,” “cry,” “jump,” and “crawl” were 82.6%, 73.9%, 56.5%, and 8.7%, respectively) and 70% (7 out of 10 items) of transitive verbs (i.e., passing rates for the items “zip,” “spilt,” “throw,” “write,” “stir,” “pull,” and “climb” were 69.6%, 30.4%, 34.8%, 87.0%, 39.1%, 82.6%, and 0%, respectively).
Five items in the attribute verification test had low passing rates in the aMCI group on the basis of the average percentage of auditory and written forms. These items included 100% (all three items) of vegetable items (i.e., passing rates for the items “corn,” “bitter gourd,” and “sweet potato” were 78.3%, 89.2%, and 84.8%, respectively), 50% (one out of two items) of fruit items (i.e., the passing rate for the item “watermelon” was 89.2%), and 50% (one out of two items) of animal items (i.e., the passing rate for the item “sparrow” was 82.7%). None of the items belonging to the tool category had a passing rate below the cutoff. Moreover, item analysis of the attribute verification test in terms of statement veracity revealed that four items had a low passing rate: 30% (3 out of 10 items) of false statements and 10% (1 out of 10 items) of factual statements.
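The item analysis reduces to computing a per-item passing rate and flagging items below the 90% cutoff. The Python sketch below illustrates this arithmetic. It is not the authors' code, and the function names are hypothetical. The counts of correct responses are an assumption, back-calculated from the reported noun-naming percentages with 23 participants in the aMCI group (e.g., 7 of 23 gives 30.4%); the entry "item_x" is a hypothetical item that every participant passed.

```python
def passing_rate(n_correct, n_participants):
    """Percentage of participants who answered the item correctly."""
    return 100 * n_correct / n_participants


def low_passing_items(correct_counts, n_participants, cutoff=90.0):
    """Return items whose passing rate falls below the cutoff (in percent)."""
    flagged = {}
    for item, n_correct in correct_counts.items():
        rate = passing_rate(n_correct, n_participants)
        if rate < cutoff:
            flagged[item] = round(rate, 1)
    return flagged


# Counts inferred from the reported percentages (an assumption), n = 23.
noun_counts = {"pepper": 7, "onion": 18, "apple": 20, "hammer": 11, "item_x": 23}
flagged_nouns = low_passing_items(noun_counts, n_participants=23)
# flagged_nouns == {"pepper": 30.4, "onion": 78.3, "apple": 87.0, "hammer": 47.8}
```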
Correlations between language domains and executive function scores
The correlations between the composite executive function score and the language domain scores are illustrated in Fig. 3. The composite executive function scores exhibited significant positive correlations with the following language domains: oral production (

Association between composite scores of executive function and of five language domains. aMCI, amnestic mild cognitive impairment; HC, healthy control.
Reliability analyses of the new language test battery
Reliability analyses were conducted on the new language test battery. The test-retest reliability ICCs were 0.917 (95% CI = 0.826–0.961) for the oral production domain, 0.894 (95% CI = 0.783–0.949) for the auditory and reading comprehension domain, 0.769 (95% CI = 0.557–0.886) for the reading aloud domain, 0.793 (95% CI = 0.600–0.899) for the repetition domain, and 0.835 (95% CI = 0.673–0.920) for the writing domain. Additionally, inter-rater reliability ICCs were 0.778 (95% CI = –0.243 to 0.975) for the spontaneous speech section and 0.755 (95% CI = –0.042 to 0.985) for the idiom comprehension test.
Correlations between scores on the new language test battery and CCAT battery
Correlation analyses between scores on the new language test battery and those on the CCAT battery revealed significantly positive correlations between the following pairs of language test scores: the oral production composite scores and the combined scores of CCAT Simple Response, Expository Speech, and Naming subtests (
DISCUSSION
The current study characterized a multifaceted language profile of Mandarin Chinese speakers with aMCI and compared it with that of the HCs. Four main findings were obtained using a newly developed language test battery. First, compared with the HC group, the aMCI group exhibited poorer performance in the oral production and auditory and reading comprehension domains; however, the performance of both groups was comparable in the reading aloud, repetition, and writing domains. Within the aMCI group, the participants’ language function in the oral production domain was poorer than in the other domains. Second, the aMCI group exhibited poorer performance on tests requiring effortful lexical and semantic processing and made significantly more semantic errors compared with the HC group. Third, item analysis findings demonstrated that more items concerning living objects had low passing rates compared with those related to nonliving items in the aMCI group. Fourth, executive function performance was positively correlated with all language domains except the repetition domain.
In line with our hypothesis, compared with the HCs, the older adults with aMCI exhibited poorer performance in the oral production and auditory and reading comprehension domains but comparable performance in the reading aloud, repetition, and writing domains. Furthermore, the individuals with aMCI experienced more difficulty in the verbal fluency test, noun and verb naming test, and attribute verification test compared with the other tests. This pattern of impairment suggests a common underlying difficulty in effortful lexical and semantic processing in individuals with aMCI [18, 21]. This finding is consistent with those of studies either employing a comprehensive language test battery [32, 33] or focusing on a single aspect of language function (e.g., [17, 71]).
In particular, in the oral production domain, both the verbal fluency test and the noun and verb naming test involved semantic processing (e.g., searching for the semantic associations of a superordinate target or a picture with attributes) and the lexical processing of retrieving specific words from semantic networks [13, 19]. We observed that the individuals with aMCI made many semantic naming errors but no phonological errors on the noun and verb naming test; this finding supports the notion that the noun naming problem of individuals with aMCI may have a combined lexical and semantic origin [18, 73]. Moreover, we determined that compared with the HC group, the aMCI group made more semantic coordinate errors, which indicate responses belonging to the same category as the target word (e.g., naming dogs as cats). This finding is in line with evidence that among semantic errors, individuals with aMCI make more semantic coordinate errors compared with other semantic error subtypes, such as superordinate errors (i.e., replacing the target word with a general category), suggesting specific difficulty in accessing detailed information on the lexical form of an object [13, 20]. Similar to the error pattern observed on the noun naming test, the individuals with aMCI made more semantic errors on the verb naming test, indicating a similar lexical and semantic origin of these errors; this finding accords with that of a study revealing that patients with AD exhibited a similar semantic error pattern on noun and verb naming tests [74].
In the oral production domain, the aMCI and HC groups exhibited comparable language functions in the Zhuyin blending and deletion test and sentence production and spontaneous speech test. Our findings suggest that individuals with aMCI have a relatively preserved ability to process orally produced Zhuyin symbols, syntactically simple sentences, and connected speech; this finding is consistent with those of studies revealing the involvement of intact brain regions in speech production and articulation even during the early stage of dementia due to AD [75, 76]. Recent studies have demonstrated differences in detectable linguistic changes between older adults with aMCI and their counterparts by performing detailed and automatic linguistic analysis of connected speech data (e.g., [47, 77]). Although future studies are warranted, the inconsistent findings indicate that novel behavioral analysis techniques may detect subtle changes that may not be visible in behavioral data analyzed at the coarse-grained level, which is typical of clinical behavioral evaluations.
In the auditory and reading comprehension domain, the individuals with aMCI performed poorly on the attribute verification test compared with the controls; such a finding has been reported in previous studies [39, 71]. The results may be biased because items in the four categories included in the attribute verification test differed in concreteness ratings, with the animal category having a lower concreteness rating than items in the other three categories. However, the group differences on this task remained after items in the animal category were excluded from the analysis,
The individuals with aMCI exhibited poorer performance, with marginal significance, compared with their counterparts on the sentence and syntactic comprehension test. Although some studies [79–81] have observed preserved syntactic comprehension performance in individuals with aMCI, other studies [38, 82] have indicated that when performing tasks requiring effortfully decoding syntactic structures and taxing cognitive resources, the aMCI group may demonstrate poorer performance than the control group. The aforementioned evidence is consistent with the finding of the item analysis of the sentence and syntactic comprehension test in our aMCI sample. We observed that the individuals with aMCI failed to comprehend more syntactically complex sentences (i.e., a sentence with an embedded clause) compared with syntactically simple sentences within reversible sentence items in the test. This pattern may be attributable to their decreased set-switching and inhibition abilities required for integrating various syntactic structures. For example, the individuals with aMCI appeared to experience more difficulty with sentences having an embedded clause for the subject in a sentence with a subject complement (e.g., “The cup which is under the fork is blue”). This sentence required the participants to switch to an unfamiliar structure and inhibit the false organization of phrases. Notably, the sentences that the individuals with aMCI had a problem with were not always the longest, suggesting that working memory capacity did not account for their difficulty with syntactically complex sentences. The involvement of inhibition and switching abilities was evident in our finding that compared with the HC group, the aMCI group had impairments on standardized neuropsychological tests that involved the use of switching and inhibition abilities, namely the CTT and CWIT.
The aMCI group exhibited comparable performance to the HC group on other tests in the auditory and reading comprehension domain that involved a lower load of intentional lexical and semantic processing, such as the written lexical decision test, relatedness judgment test, and word–picture matching test. Our finding of similar performance on these tests aligns with the findings of prior studies, in which individuals with MCI demonstrated comparable performance to controls on tasks that involved judging simple semantic associations between words or pictures and did not demand extensive cognitive resources [12, 39]. Notably, although both the attribute verification test and the relatedness judgment test evaluate semantic memory, the individuals with aMCI differed from the controls on the former but not on the latter. This discrepancy may be attributed to several potential reasons. First, this pattern aligns with the “bottom-up” or “attribute-first” theory, which suggests that attributes are affected earlier in the course of AD than higher-level information such as coordinate or superordinate connections [83, 84]. In support of this notion, Rogers and colleagues found that patients with AD exhibited significant superordinate and coordinate category member priming effects but showed no attribute priming effect [83]. Second, in contrast to the attribute verification test, the relatedness judgment test may impose lower cognitive demands, as participants were tasked with evaluating semantic associations among three words without the need to process syntactic information at the sentence level. Additionally, the relatedness judgment test, comprising six Chinese characters in total for each trial, featured shorter mean character lengths compared to the attribute verification test (
It is worth noting that we did not observe a group difference in our word–picture matching test. In contrast, a study reported significant impairment in the multidomain aMCI group on the Peabody Picture Vocabulary Test, which is also a word–picture matching task [32]. However, it is crucial to consider that the aforementioned result may be confounded by the inclusion of individuals with aMCI at a more advanced stage of the disorder (mean MMSE score = 25.50) than our aMCI sample (mean MMSE score = 27.35). This is because our operational criterion for objective cognitive impairment was –1 instead of –1.5
The current study demonstrated comparable language functions between the older adults with aMCI and their counterparts in the reading aloud, repetition, and writing domains. This result is consistent with those of other studies [8, 85]. Our finding of comparable performance may be attributable to the lower difficulty of the test items, such as the use of shorter and higher-frequency words for repetition tasks, and to the lower task demand of assessing writing ability at a simple sentence level. Other studies demonstrating significant impairment in the aMCI group [37, 47] have included stimuli with expanded word lengths for repetition tasks or used narrative stories as writing stimuli, which presumably required greater cognitive resources than the tasks used in our study. Alternatively, we may not have identified group differences in the writing and reading aloud domains because we measured only behavioral accuracy at a coarse-grained level, similar to the clinical assessment context, unlike other studies that have used finer-grained measurements of writing and reading latencies [34, 35] and analyzed data by using machine learning techniques [47]. Although semantic processing may be involved in the reading aloud, repetition, and writing domains, low task demands may enable older adults with aMCI to complete language tasks in an automatic manner by leveraging frequently learned connections between input and output lexicons in different modalities [32, 49].
Nevertheless, it is crucial to underscore that our writing tests incorporate a diverse range of stimuli intentionally designed to elicit various error types. We systematically manipulated character regularity, distinguishing between regular and irregular words. This deliberate design enables both quantitative and qualitative analyses of group disparities in error types and in the accuracy of regular versus irregular words across diverse clinical populations. Although we did not observe significant group differences in accuracy within the writing domain, qualitative analysis revealed a higher incidence of errors in the aMCI group than in the control group. Particularly noteworthy were phonologically plausible substitution errors, instances of transcribing only Zhuyin, and blank answers, all of which indicate challenges in orthographic memory for specific words [86]. These preliminary observations align with existing literature [48, 87] suggesting that individuals with aMCI are more prone to homophone errors, phonetically similar errors, and radical misplacements. These tendencies may be attributed to the heightened reliance on non-semantic pathways by individuals with aMCI, stemming from the progressive deterioration of their lexical-semantic pathways [88].
The present study demonstrated differences between domains within the aMCI group, with relatively poor language function in the oral production domain. This result suggests a primary deficit of lexical and semantic processing in individuals with aMCI; this finding is supported by the results of passing rate analyses. Overall, the aMCI group failed more items in the categories of living objects (i.e., vegetables, fruits, and animals) than in the categories of nonliving objects (i.e., tools and clothing) in the noun and verb naming test as well as in the attribute verification test. Our finding that the individuals with aMCI experienced relative difficulty processing living items is consistent with the findings of other studies [16, 39], suggesting that individuals with aMCI have difficulty processing items in semantic categories that contain more shared and fewer distinctive attributes, such as vegetables and animals [89]. A study has revealed a significant association between a semantic deficit in the MCI group and a volumetric reduction in the perirhinal areas [25]. These perirhinal areas have been identified as central regions that play a crucial role in integrating semantic representations [24]. Notably, in our study of individuals with MCI, we did not observe a syntactic deficit in the argument structures of the verb naming test. This was evident from their comparable performance between transitive and intransitive verbs in the noun and verb naming test. These results align with a previous finding concerning patients with AD [90].
In addition to core language ability, our results suggest that executive function plays a role in various language tasks used in this study, consistent with previous studies [37–39, 47]. Notably, the verbal fluency test heavily relies on executive functions, such as self-initiation, inhibiting previously produced responses, and efficiently organizing verbal retrieval [11, 92]. Moreover, the fluency test used in our study includes a switching condition that is presumed to heighten the demand for executive function, particularly switching ability [92, 93]. Studies have linked impairments in switching ability during the semantic verbal fluency task to the volumes of the superior frontal gyrus and inferior frontal gyrus in individuals with aMCI [94]. Therefore, it is plausible that both executive function and language ability contribute to impairments observed in the verbal fluency test among older adults with aMCI. However, we did not find any significant correlation between the repetition domain and the composite executive function score. A qualitative examination of this finding suggests that it may be attributed to the constrained variation in performance, possibly owing to a floor effect observed in the word and nonword repetition test. Participants, regardless of their MCI status, might have encountered challenges while repeating items within the nonword repetition section. This difficulty can be attributed to the relatively unfamiliar nature of two-to-six-syllable nonword items derived from low-frequency characters, which Mandarin Chinese-speaking older adults find challenging to recognize and repeat.
The psychometric properties of the Mandarin Chinese language test battery employed in this study were preliminarily established. Notably, all indicators of test-retest reliability and inter-rater reliability ranged from good to excellent [68]. Moreover, the scores on our Mandarin Chinese language test battery correlated with diverse subtest scores within the CCAT battery, supporting the validity of our language assessments. Although the use of CCAT Reading Comprehension subtest scores as the index for correlating with our reading aloud composite scores is open to discussion, this subtest is the most suitable CCAT subtest for comparison because both measures involve the intricate process of reading Chinese characters. To mitigate potential concerns, a thorough validity analysis of the reading aloud composite scores is necessary in future studies. In summary, the robust correlations observed across multiple tests between our test battery and the CCAT collectively support the validity of our assessment in evaluating the language function of Mandarin Chinese speakers.
This study has some limitations that should be addressed. First, the sample size was relatively small. Despite the sample size, our findings provide preliminary evidence that Mandarin Chinese speakers with aMCI exhibit significantly poor performance on tests requiring effortful lexical and semantic processing. The effect sizes of Cohen’s
Fifth, although our primary focus did not center on investigating the relationship between education and language performance, previous studies have shed light on a notable association between language test performance and the level of education (e.g., [95, 96]). Moreover, high educational achievement has been linked to the postponement of cognitive decline associated with neurodegeneration (e.g., [97, 98]). In the present study, we identified a positive correlation between education levels and the five domains of language function: oral production (
Overall, the language function profile of the Mandarin Chinese speakers with aMCI exhibited a behavioral pattern similar to that determined for English-speaking participants by using a multifaceted language test battery [32, 33]. This study extended the results to demonstrate a primary deficit of effortful lexical and semantic processing in Mandarin-speaking individuals with aMCI. Clinicians can incorporate language tests examining functions in oral production and comprehension domains to evaluate individuals suspected of having MCI. Compared to the CCAT, our Mandarin Chinese language test battery has several strengths. It covers a more comprehensive range of language aspects and utilizes a simpler scoring system, making it a potentially valuable clinical tool for evaluating the multifaceted language abilities of Mandarin Chinese speakers. Future studies should validate the construct of the five language domains by conducting factor analyses to confirm the latent structure of the language test battery. Other psychometric properties, including sensitivity, specificity, and the cutoff for each test, of the Mandarin Chinese language test battery should also be explored with a larger sample size. The comprehensive language test battery developed in the present study can enhance our understanding of how and to what extent language function may change over time in different domains, such as reading, repetition, and writing, during disease progression from MCI to dementia in Mandarin speakers [35, 103]. This battery also provides opportunities to investigate multifaceted language functions in different clinical populations, such as patients with primary progressive aphasia [104, 105] or amyotrophic lateral sclerosis [106], and further elucidate the complex brain–behavioral relationship.
Footnotes
ACKNOWLEDGMENTS
The authors would like to thank Dr. Lu Lu and Dr. Chi-Ting Chang for comments, and Chia-Hsing Chi and Jing-Rong Wang for assisting in data collection.
FUNDING
This work was supported by the National Science and Technology Council, Taiwan (grant numbers 112-2410-H-002-201-MY3, 111-2740-H-002-003-RE3, and 109-2629-H-002-001-MY3 to YLC). This research was also supported by the Center for Artificial Intelligence & Advanced Robotics, National Taiwan University (grant numbers 112-2223-E-002-019-, 111-2634-F-002-02 and 11-2223-E-002-008).
CONFLICT OF INTEREST
Yu-Ling Chang is an Editorial Board Member of this journal but was not involved in the peer-review process and had no access to any information regarding its peer review. The other authors have no conflicts of interest to report.
DATA AVAILABILITY
The data used in this study are restricted for privacy and ethical reasons. The data are available upon request from the corresponding author.
