Abstract
Introduction
Although narrative speech data are freely generated as part of the Autism Diagnostic Observation Schedule (ADOS) resulting language samples have rarely been utilized as linguistic data in their own right. Narrative elicitation allows us to profile within the same task both formal aspects of grammar (such as tense and agreement) and how grammatical devices are used in referential strategies in the development of a story. In the literature on language in ASC, analysis of formal syntax often relates to the discussion of specific difficulties with language in ASC (Boucher et al., 2012; Modyanova et al., 2017; Terzi et al., 2019) while differences in language use are discussed in relation to cognitive differences among this population, such as in mentalizing or theory of mind (ToM), cognitive flexibility (executive functioning) and global information processing (central coherence) (Happé & Frith, 2006). A priori, all of these cognitive mechanisms may relate to the organization of language at a discourse level: ToM, because it relates to our capacity to represent mental states, which enters in how we present information in discourse for an interlocutor; executive functioning, because it relates to managing information to be integrated over narrative; and central coherence, since information needs to be held together and be organized coherently around a topic. However, a comprehensive linguistic analysis of narrative data in ASC provides the essential linguistic baseline datapoints, which form the basis for testing specific predictions of neurocognitive models of ASC incorporating the foregoing constructs, and for assessing their explanatory scope (Hinzen, 2022).
Previous literature on narrative in ASC has revealed both variability and commonalities of ASC groups across studies relative to age-matched controls. For example, a reduction in syntactic complexity was found in some narrative studies (Banney et al., 2015; Norbury & Bishop, 2003), but not others (Colle et al., 2008; Diehl et al., 2006; Losh & Capps, 2003; Rumpf et al., 2012). There are indications of an increase in formal syntactic errors, yet also variability with regards to the way or extent to which such errors were assessed. For example, Norbury and Bishop (2003) found increased rates of tense-marking errors among their ASC group to be similar to controls with specific language impairment (SLI). In studies by Banney et al. (2015) or Rumpf et al. (2012), on the other hand, syntactic errors were either not under exploration as in the case of the latter or rather coarse-grained as in the former, such as assessing whether an error occurred at the lexical or utterance level. Looking towards commonalities, a meta-analysis by Baixauli et al. (2016) of studies of narrative in ASC found that measures of story cohesion were the most consistent area of weakness in the narrative of autistic children. How cohesive devices are defined varies across studies, though cohesion often broadly relates to the use of referential devices while constructing a narrative.
Aside from presenting more tense errors, the participants in Norbury and Bishop (2003) also differed from their typically developing peers on a standardized language assessment, the Clinical Evaluation of Language Fundamentals (CELF, Semel et al., 2003). This pattern of generalized weakness in productive and receptive language and presentation of SLI-like errors (i.e., morphological) has been characterized as autism with language impairment (ALI) (Kjelgaard & Tager-Flusberg, 2001), though the comparability of the language profiles seen in ASC and in SLI is also questioned (Boucher et al., 2012; Sukenik & Friedmann, 2018; Taylor et al., 2014). The other autistic groups in the studies surveyed here were similarly matched on chronological age but were not reported to differ from control groups on standardized verbal skills, however, comparability is limited as standard measures varied across studies. ALI entails a strict interpretation of structural language impairments, often at the sentence-level and which does not consider aspects such as cohesion. Narrative assessments have been demonstrated to uncover language difficulties that might not be noticeable through sentence-level tasks (King & Palikara, 2018) and therefore may help shed light on a more comprehensive profiling of language skills in autism more generally.
Against this background, our aim in the present study was to bring a more basic linguistic perspective to bear on the profiling of narrative in autistic children. We view this as prior to clarifying the ultimate cognitive basis of linguistic differences for this population and whether commonly posited neurocognitive differences in ASC in such constructs as ToM, weak central coherence, or executive functioning (Happé & Frith, 2006), are conceptually or empirically suited to explain them. A comprehensive and systematic linguistic analysis, carried out in terms of linguistically specific and independently motivated variables, could also provide a baseline also for assessing whether SLI is a suitable model for understanding language profiles in ASC. We, therefore, aimed to identify variables both within and across the main domains of structural organization of language (noun phrases, verb phrases, and clauses), along with error patterns.
In particular, in the case of clausal complexity, in previous studies, different clausal construction types have often entered indiscriminately in a diverse set of composite variables used to measure the construct of syntactic complexity, each of which included a range of different syntactic construction types with different functions (e.g., Norbury & Bishop, 2003; Norbury et al., 2014; Rumpf et al., 2012). This may have contributed to the aforementioned variability of previous findings. Cataloguing syntactic construction types by their functionality may prove a better strategy for grouping construction types in that their functionality may inform the cognitive processes involved. For example, in the syntactic domain of clauses, virtually any clause embedded as an argument specifies the content of a mental state or speech act (e.g., The frogs thought
The importance of referentiality for studies of cohesion similarly motivates distinguishing between different syntactic construction types within the domain of noun phrases (NPs). An increase in referential anomalies relative to TD controls or other clinical groups is among the most consistent findings among studies of narrative production in ASC – whether through an overproduction of ambiguous pronouns (Banney et al., 2015), a reduction of pronouns in favor of full definite noun phrases (NPs) (Rumpf et al., 2012), or an increase of both ambiguous nouns and pronouns (Norbury & Bishop, 2003). In the study by Banney et al. (2015), however, reference to characters was assessed through a measure of cohesion that tracked the means of introduction and maintenance of novel characters, through indefinites NPs, definite NPs, nouns only, pronouns, or omission. This measure did not distinguish between the ASC and TD groups.
As in the case of syntactic complexity, our aim in the domain of referentiality was not to start from a prior conceptualization of referential errors under the label of cohesive adequacy, but a comprehensive analysis of referential function in narrative across different NP types from a linguistic point of view. NPs systematically differ structurally, in line with how they function referentially in discourse (Hinzen & Sheehan, 2015). Thus, generic NPs are maximally non-specific (e.g.,
A linguistic analysis of narratives targeting this referential system as a whole has not yet been carried out to our knowledge. In experimental studies, however, definite NP comprehension weakness have been repeatedly noted (Fine et al., 1994; Modyanova, 2009; Schaeffer et al., 2014; Schroeder, 2019), and the classical case of pronoun reversals in ASC (Kanner, 1943) shows that problems with deictic forms of definite NPs are by no means restricted to problems of referential cohesion in narratives (see also Bartolucci et al., 1980; Mizuno et al., 2011; Shield et al., 2015). In fact, these may form just the tip of an iceberg, as referentiality is a dimension of language well-known to be affected throughout the spectrum, from the complete absence of declarative reference in non- or minimally verbal autistic people (Hinzen et al., 2020), to associative rather than referential forms of word use (Preissler & Carey, 2005; Preissler, 2008), anomalous lexical development (Arunachalam & Luyster, 2016), and delayed declarative pointing (which is closely related to language development: Colonnesi et al., 2010).
Moreover, referentiality in ASC has barely been explored in the verbal domain. In verb phrases (VP), the same events as depicted on the page of a story book can always be described using either ‘static’ verbs (e.g., The frogs
In sum, there is significant evidence that elicitation of spontaneous narratives is a sensitive tool to assess language anomalies across all broad domains of structural language in ASC. However, from a linguistic perspective, previous approaches have neither been very systematic (comprising construction types across nominal, verbal, and clausal domains), nor very specific (e.g., using generic or else composite measures, which conflate what often are linguistically very different phenomena, such as passives and embedded clauses). The fact that ASC can end up indistinguishable at a group level from SLI on language impairment (Norbury & Bishop, 2003; Norbury et al., 2014) motivates a deeper exploration using a fine-grained and comprehensive set of linguistic variables, which can provide potential differentiation.
Present study
We used ADOS-based assessments of narrative competence with the Tuesday book (Wiesner, 1991), both to allow for comparability with several previous studies and exploit linguistic data freely generated as part of clinical assessments. A new annotation scheme using linguistically defined variables was created to address the following specific research questions:
Can groups be distinguished by the quantitative distribution of syntactic construction types in a-c?
NPs (pronouns/clitics, anaphoric and nonanaphoric definite NPs, and indefinite lexical NPs); VPs (static vs. episodic); Clauses (embedded as adjuncts to NPs (relative clauses) and as clausal adjuncts or arguments (complement clauses)). Can groups be distinguished by error patterns according to either a or b below? Grammatical-level, further subdivided further into formal-syntactic errors and errors in the referential use of grammatical devices such as NPs; Lexical-level errors, further subdivided into relational, e.g., but, vs. non-relational, e.g., carpet. A quantitative reduction of embedded clauses (cf. Banney et al., 2015), specifically in relative and complement clauses (Durrleman & Delage, 2016; Durrleman et al., 2018), as well as of anaphoric NPs, including both third-person clitic pronouns and definite-anaphoric lexical NPs. More grammatical-level errors, particularly referential ones relating to specificity as required in definite-anaphoric and definite-non-anaphoric NPs, pronouns, clitics, and verbal inflection, with a potentially spared lexicon (e.g., Boucher et al., 2012).
To explore the cognitive basis of the linguistic patterns in question, correlations with verbal IQ, as assessed by the Peabody Picture Vocabulary Test (PPVT-III, Dunn et al., 2010), were also run, as well as with IQ measures (IQ, working memory and processing speed) in the ASC group, as assessed by the WISC-IV (Wechsler, 2003). We predicted that despite individual matching of groups on a language criterion, linguistic vulnerability in the ASC group would show in:
Methods
Participants
Eighteen children (mean age: 9;11, range: 7;4–12;6 years old) diagnosed with ASC participated in this study. ASC participants were required to score positively for autism in either the ADOS-G or the autism diagnosis interview-revised (ADI-R), as well as have a non-verbal IQ above the threshold for intellectual disability, as measured by the Wechsler Intelligence Scale for Children (WISC-IV). These assessments were conducted at the time of the data collection by the research team. All children were native speakers of Spanish and Catalan. ASC participants were recruited from a network of juvenile mental health centers in the Baix-Llobregat county of Barcelona, a region that tends to be Spanish-dominant. ASC participants were individually matched with 18 typically developing controls based on receptive vocabulary verbal IQ, as measured by the PPVT-III (see Krasileva et al., 2017, for an assessment of the validity of using the IQ generated by the PPVT as a proxy for verbal IQ for autistic individuals). Controls were identified as being a match if they were within one chronological year of age and five verbal IQ points. Controls were recruited from an elementary school in the Berguedà region of Barcelona. As the Berguedà region is Catalan-dominant, we further required participants to speak Spanish with at least one caregiver. All ASC participants spoke Spanish as a home language. Male to female sex ratios were similar across groups: the ASC group was comprised of 17 males and one female and the control group was comprised of fifteen males and three females. Independent t-tests showed that there were no significant differences between groups on either verbal IQ or age (see
Participant information.
ADOS and WISC-IV means and SD for ASC participants (n = 18).
Narrative elicitation procedure
All participants were tested individually in a quiet room and video-recorded so that their narrative production could be later transcribed and annotated. The participants were presented with the picture book Tuesday (Wiesner, 1991). Following the ADOS-G (Lord et al., 2000) administration procedure, the examiner began narrating the first page and then allowed the child to continue the narration until the final pages when the experimenter would take over again. The experimenter provided neutral feedback. If a child didn’t begin or stopped in the middle of the narrative, they were asked to continue the story. The book depicts a fanciful story in which frogs fly through a town at night on lily pads.
Coding scheme
Syntactic complexity in the nominal, verbal and clausal domains
The narratives were transcribed and annotated using CLAN (MacWhinney, 2000). All narratives were divided into utterances, which were defined as self-standing (grammatically independent) units of discourse providing new information. In the interest of profiling how the ASC participants refer to entities in the story and maintain reference, we categorized each NP according to NP type. NPs were first classified as either indefinite (e.g., some frogs) or definite (e.g., the frogs; he). Definite NPs were further classified as anaphoric (e.g., they; the frogs she had seen) or not (e.g., this/a policeman). Clitic pronouns were tagged as + clitic in addition to being classified as definite-anaphoric, with a further judgment of whether they were ‘successful’ or ‘unsuccessful’, depending on whether the clitic was involved in a referential error or not. Clitic errors were specified in order to be able to contrast the ratio of referential errors that were pronominal in nature. Finite VPs were classified as to whether they were stative or not (episodic). Syntactic complexity was scrutinized based on tagging clausal embedding and distinguishing between relative clauses, complement clauses, and (non-relative) adjunct clauses (see
Linguistic measures of construction types across syntactic subdomains.
As the length of narratives varied across participants, frequencies of all variables were normalized by the total number of words, with the exception of embedded clauses, which were normalized by the total number of utterances. We also compared the proportional use of each grammatical device type relative to its domain, creating nominal, verbal, and clausal domain ratios. For example, the definite-anaphoric NP ratio was calculated by dividing the total number of definite-anaphoric NPs by the total number of NPs.
Errors
We coded errors within the nominal (
Nominal domain error variables.
Verb domain errors.
Mental state content
Mental state and emotion words were annotated as ‘congruent’, if used correctly or plausibly, or ‘incongruent’, if there was a clear mismatch between the emotion or mental state described by the participant and what was depicted in the illustration.
Reliability
Ten percent of the transcripts were randomly selected and transcribed again by a new transcriber; another ten percent of transcripts were independently annotated. Prior to reliability calculation, the first author met with the independent transcriber and annotator to establish a consensus of disagreements. The total number of point-by-point agreements were divided by the sum of the total agreements and disagreements, resulting in an inter-transcriber reliability of 90.3% and an inter-annotator reliability was 90.1%.
Analysis plan
Statistical analyses were carried out using SPSS version 24. Where assumptions of normal distribution were met, independent t-tests were run to assess group differences. Where there were significant outliers or data were skewed as assessed by the Shapiro–Wilk test, we carried out non-parametric Mann–Whitney U-tests and compared medians. To adjust for multiple comparisons and strike an appropriate balance between a risk of type I and type II errors, we adjusted the alpha-level of significance by dividing .05 by the number of variables involved in a domain where comparisons were made: the clausal, nominal and verbal domains, and errors, leading to significance thresholds of .005 for both the grammatical profiling and error variables and a threshold of .01 for the remainder of variables. Only results meeting these significance thresholds, as well as statistical tendencies, are reported below. This quantitative analysis is followed by a qualitative analysis of two specific observed error patterns, which were added post-hoc to this analysis plan in order to provide further insight into a specific pattern that transpired during the annotation process.
Results
Quantitative analyses
Syntactic complexity
ASC participants produced significantly fewer embedded clauses: 42.9% of utterances in the TD group had an embedded clause in comparison to 16.8% in the ASC group. In particular, the ASC group produced significantly fewer relative clauses, while lower frequency of (non-relative) adjunct clauses was at the level of a tendency (see

Rate of embedded clauses per utterance: means and medians (gray bars). *indicates p = .001, **p = .0001.
Group comparisons of nominal, verbal and clausal domain variables using Mann-Whitney U-tests.
Errors
ASC participants produced lexical-level and grammatical-level errors at a significantly higher rate than TD controls (see

Frequency of error types: means and medians (gray bars). *indicates p ≤ .005; *p ≤ .001; **p ≤ .0001. Gray bars indicate medians.
Error scores normalized by number of words.
*indicates p ≤ .005; *p ≤ .001; **p ≤ .0001.
Results summary
Several significant differences across groups emerged, including (1) significantly more utterances in the ASC group while having similar quantities of total words; (2) a reduction in the rate of embedded clauses in ASC in relation to the TD, particularly in relative clauses but not (non-relative) adjunct clauses or complement clauses; (3) a greater rate of referential errors in ASC, particularly that of failed specific reference; (4) a greater rate of lexical errors in ASC, particularly in non-relational lexical choices. In contrast, the ASC participants did not differ significantly with regard to either the frequency of different NP and VP-types. Nor did they differ in terms of the frequency of verbal errors, or the rate of use of internal state language across groups. Regarding correlations, no linear relationships were identified between verbal IQ and key linguistic variables significant in the group comparisons, specifically failed specific reference, wrong lexical items (WLI), and embedded clause frequency. In the ASC group, none of the cognitive measures available as part of the WISC, including working memory and processing speed related to the above key linguistic variables (see
Correlation analyses, Spearman's rho.
Qualitative analyses
We provide a detailed qualitative analysis of two specific patterns of failed specific reference based on four examples. Consider (1) and (2), where a star in the translation localizes the error occurring—in bold in the original utterance and gloss:
…se fueron saltando al lago y se tiraron de cabeza y están allí, como pensando, Went3p−pl jumping back to the lake and jumped3p−pl head-first and are3p−pl there, like thinking, ‘(They) went jumping back to the lake and they jumped head-first and they are there, like thinking, (*he) is angry.’
Aquí Here calmly ‘Here (he) calmly flies and runs into a dog here xxx with a dog and they go and he eats *them.’
In these examples, the children appear to be referring through inflections and clitics to a specific individual or group, yet get entangled in how person and number in inflection and clitics are to be used in conjunction over a single stretch of discourse, so as to convey a coherent content. The pictures aid the listener in forming hypotheses about what is intended, and the use of deictics such as aquí and allí (here and there) indicates reliance on the shared visual context for referencing. Reference appears to be ‘exophoric’ in this sense, i.e., reliant on an entity in the shared visual domain, rather than ‘endophoric’, i.e., picking up a referent introduced previously into the discourse—it is as if the exophoric scope is continually being refreshed as the utterance proceeds.
In another error pattern, the participant's utterance in its literal interpretation does not match a viable reality in consideration of the visual support provided. Consider (3) and (4):
Entrabani por la ventana y Theyi entered through the window and Theyi entered through the window and
Cogieron el mando sin que se Theyi took the remote control without noticingi-3p−pl and were watching TV. They took the remote control without noticing and were watching TV.
Here, there is a mismatch of what is being said with what is seen in the picture. In (3) we see the frogs entering through the window no longer wearing capes. The child is saying that they entered the house through the window and that they took the capes off them. But it is unclear who they would be. More likely, the intended meaning was to say se quitaron los pañuelos (they took off their capes), a clitic-V-NP possessive construction where the clitic refers to the possessor of the entity denoted by the NP, which then would be appropriately co-referential with the subject of the coordinate clause and, in turn, with that of the initial clause. In (4), the illustration presents an older woman sleeping in front of her TV while several frogs are watching TV and one is controlling the remote with his tongue. In the utterance, there is a number mismatch, however, as the verb to notice should be singular rather than plural, if it is to refer to the woman. Instead of reference to the frogs and the woman to be disambiguated, we find the plural number inflection uniformly across the utterance. There also is a tense/aspect error as the verb den cuenta is in the present tense, which is incongruent with the past tense just prior in the utterance. Errors of the type seen in (3) and (4) suggest difficulty in handling and coordinating linguistic devices used in order to refer to entities at a grammatical level.
To further explore this distinction between errors that may relate to maintaining an exophoric scope and those which suggest challenges in the grammatical build-up of the child's discourse, we further categorized and counted all instances of failed specific reference across both groups. Total instances and proportions are presented in
Failed specific reference sub-type proportional use per group.
Discussion
These results reveal specific linguistic differences in the narratives produced by autistic children in comparison to TD peers of a similar age and matched on vocabulary-based verbal IQ. In this section we will discuss the syntactic complexity results first, followed by lexical-level and grammatical-level errors.
Syntactic complexity
In line with predictions, the ASC group produced fewer embedded clauses, and particularly fewer relative clauses. At the level of a tendency, there was a reduction in (non-relative) adjunct clauses as well. Complement clauses were rarely used in either group, so strength of comparisons between groups may have been limited. This result highlights the significance of clausal embedding as such for distinguishing these groups. The pronounced reduction in relative clauses in turn adds to a pattern of differences in the nominal domain as identified in the previous literature (Banney et al., 2015; Norbury & Bishop, 2003). Relatives embed under nouns and function so as to provide further descriptions of the referent of the NP. A reduction in relative and (at the level of a tendency) clausal adjuncts may thus form part of a pattern of reduced referential specificity (see more below), at the level of the object- and event-reference, respectively.
On the other hand, and contrary to our predictions, there were no group differences in the quantitative distribution of either NP or VP types as distinguished here. We speculate that this may relate to the use of a picture-book task, where the referents are visually present as and when they were being referenced. This allows definite NPs (whether full lexical NPs, clitics or null/inflectional) to be used exophorically, i.e., to refer to an entity in the shared visual domain, rather than endophorically, i.e., picking up a referent introduced previously into the discourse. While we distinguished definite-anaphoric NPs from definite-non anaphoric NPs (where a novel referent is introduced with a definite article), our annotation scheme therefore cannot fully exclude that some of the former (anaphoric) cases were in fact exophoric uses disguised as anaphoric NPs. This hypothesis is supported by previous findings from experimental studies that it is precisely (truly anaphoric) definite NPs that cause problems in ASC populations (Modyanova, 2009; Schaeffer et al., 2014; Schroeder, 2019). This predicts that they might be under- or mis-used in narrative production as well. Future results from studies of narratives both with and without visual support can address this crucial open question.
Error patterns
In line with predictions, error patterns distinguished groups, yet contrary to predictions, the lexical level (word choice) was not spared. A higher rate of lexical errors in the ASC group specifically appeared in what we called ‘non-relational’ lexical errors, which were virtually non-existent in the TD group. This was particularly surprising, as we matched participant groups on receptive vocabulary level. The finding also contrasts to claims in the literature that vocabulary is relatively preserved compared to grammar among autistic children without intellectual disability (e.g., Boucher et al., 2012; Eigsti et al., 2007; though see Arunachalam & Luyster, 2016). Wrong lexical choices (WLI) were often related to the target words in terms of appearance or function yet not quite matching the visual stimuli or the previous representation of the object (e.g., naming capes or sheets worn by the frogs as handkerchiefs or napkins, as validated by the immediate visual context, but not the discourse and story (this type of error further adds to our above suggestion that the ASC participants were referring exophorically, not relating the current scene with what had passed in their own discourse previously). Other common lexical errors included mismatches between the lexical choice and the visual stimuli, such as referring to the hang-drying sheets as blankets. Given that both groups were highly comparable on receptive vocabulary as measured by single-word comprehension (i.e., the PPVT-III), we speculate that anomalies of lexical choice likely relate to the constraints of lexical retrieval within grammatical structures as built online over narrative time. While cognitive capacities in ToM are often drawn upon to provide a potential explanation for referential errors in ASC narratives (e.g., Colle et al., 2008; Suh et al., 2014; though see Arnold et al., 2009 for a contrary view), it is unclear how mentalizing difficulties would account for lexical errors as found here. In particular, identifying accurately a picture in the story such as a sheet or a blanket would not seem to depend on the mental state of the listener of the story.
The role of failed specific reference
Among grammatical-level errors, it is noteworthy that morphosyntactic errors did not distinguish groups and were indeed rare. On the other hand, failed specific reference occurred significantly more frequently in the ASC group. Unlike the lexical errors above, these errors do not relate to the kind of object being referred to (i.e., how it is conceptualized or described lexically), but rather to mishandling the means through which it is being referenced, resulting in a referential breakdown. As described in the introduction, NP types in human language form a system, which provides means to create and maintain a shared system of reference over narrative time. Grammatically distinct NP types support distinct forms of reference, and system-internal constraints guide their co-occurrence and relative ordering. For example, pronouns lack lexical descriptive content and depend for their appropriate use on either a prior introduction of the referent through a lexical (definite or indefinite NP), or else a perceptually salient referent in the narrative content (the exophoric case). Previously documented difficulties in handling such devices in ASC (such as under-use of pronouns, Arnold et al., 2009) could indicate a breakdown of this coordinated system, but it could also be due to system-external deficits such as deficits in ToM (e.g., Colle et al., 2008).
Against the latter option, Arnold et al. (2009) already argued that coordinating the referential system serves to structure both the narrator's own representation of the discourse and to guide that of the interlocutor. Investigating this issue further in Spanish is useful, as strong (i.e., independent, non-clitic) pronouns in this language are actually mainly used for emphasis and are not the default option, which instead is the use of verbal morphology (inflection) and clitics for subjects and objects (direct and indirect), respectively. Both inflection and clitics encode person and number features (and structural case features in clitics), which enable reference in pronouns and, in this regard, they are as pronominal as strong pronouns. In this respect, we found that both groups had a similar frequency of clitics, both quantitatively and in terms of errors in them, along with similar frequencies in lexical NP types as discussed above. Overall, this may suggest that failure of specific reference in our study is best interpreted as reflecting a weakness in coordinating the referential system of grammar as a whole over narrative time. Insofar as this system is subject to intricate and system-internal constraints on how various grammatically different referential devices are to be handled together, this conclusion illustrates a challenge for current neurocognitive models of ASC and for understanding language impairment in ASC. The absence of morphosyntactic errors together with problems of referential specificity do not seem typical of an SLI-type language impairment. In turn, while difficulties in the domain of ToM might be thought to specifically explain difficulties with pronouns (Colle et al., 2008), this is less clear for the pattern above.
Our qualitative analysis of error patterns in failed specific reference further contributes to the foregoing interpretation, as it suggests an increased reliance on an exophoric narrative style where reference is implemented via visually provided referents, and processing difficulties specifically appear in the coordination of referential devices in complex sentences. Errors such as those described in examples (3) and (4) (Section Qualitative analyses) have the effect of confusing the listener, hampering the build-up of a coherent discourse representation. The cause of the problem appears to be a difficulty in the speaker's handling of a system of referential devices and the constraints on using inflections and clitics in the context of other NPs over the course of a complex utterance. The overall rate and proportion of these errors (see
Cognitive capacities such as verbal working memory, too, may be relevant to explore in relation to referential errors, as they have been argued to relate to the ability to maintain discourse representations and anaphoric links in ASC (Schaeffer, 2018). Interestingly, Stanford et al. (2019) show that training of working memory capacities can lead to gains in syntax and specifically in clitic placement, in language-impaired groups. Note, on the other hand, that a deficit in working memory would not naturally incorporate the lexical errors that we found. Furthermore, for the ASC group, cognitive measures were available as part of the WISC, including working memory and processing speed. None of these cognitive variables related to these key linguistic variables, suggesting some autonomy of the latter, at least in this part of the spectrum (without intellectual disability).
A number of non-significant group differences in our study contribute to a comprehensive linguistic profile of this group as well as highlighting specific strengths in their narrative development. We explored the verbal domain, where neither the distribution between episodic and stative VPs nor error patterns distinguished groups. Here we found that ASC participants presented episodic descriptions of the scenes with well-developed morphosyntactic skills. This highlights the importance of the nominal domain when building a linguistic profile potentially specific to ASC, since both relative clauses and referential errors relate to this domain. Comparing the strengths and weaknesses found here in narratives by autistic children to children with developmental language disorders may further refine our understanding of ASC linguistic profiles.
Overall, this analysis of narratives in diagnostically relevant ASC assessments has shed light on a distinctive linguistic profile comprising differences in the quantitative distribution and qualitative use of specific syntactic construction types, and errors in both lexical choices and in managing a system of referential devices involved in generating referential specificity over narrative time. This profile imposes a linguistic constraint on current and future cognitive models of ASC. In particular, it is difficult to see it naturally explained by mentalizing difficulties. Indeed, future neurocognitive models of ASC need to be sensitive to the possibility that language impairment in ASC may not be secondary to a set of other cognitive impairments viewed as primary, nor be ‘specific’ in the sense of reflecting SLI as traditionally conceived. Independent evidence of divergences between the ASC and SLI linguistic profiles, consistent with evidence from the present study, further supports the latter possibility. Instead, it may be that, in the context of ASC, language itself as an essential social-communicative tool mediates certain social-cognitive abilities, and is of relevance in neurocognitive models of ASC for this reason (Hinzen, 2022; Hinzen et al., 2020). Recent evidence that training in the domain of embedded clauses can improve ToM capacities (Durrleman et al., 2019) is a further pointer in this direction: language and cognitive domains traditionally viewed as non-linguistic may be closely linked, and potentially share a cognitive substrate (Schroeder et al., 2021). The same may apply to referential functions. Dependent on both lexical choices and handling these choices within a complex interdependent system of grammatical devices, reference exploits virtually the entire complexity of the linguistic system. Reference is vulnerable across mental health conditions (Hinzen, 2017), and whether impairment in non-linguistic cognitive domains can explain this breakdown, or else language itself is the crucial explanatory factor, is an important open question.
Summary
Linguistic analysis of spontaneous narratives regularly elicited as part of standardized assessments for ASC shows that bilingual autistic Spanish-Catalan children without intellectual disability diverge from their neurotypical verbal-IQ-matched controls on specific linguistic variables: fewer adjunct clauses and more frequent errors in referential specificity and non-relational content word choice, jointly with an absence of morphosyntactic errors. These findings invite new inquiries on how to situate language changes within neurocognitive models of ASC, as they appear to define a distinctive linguistic profile, which resists explanation through familiar nonverbal cognitive impairments in ASC as much as assimilation to a comorbid SLI.
Limitations and future work
Inclusion of a comprehensive language test would have allowed us to assess how our linguistic measures relate to linguistic measures used to identify a typical SLI profile as well as determine whether linguistic differences identified would have transpired against an even tighter linguistic group matching. In order to further explore the effects of online language processing differences, further studies may benefit from profiling groups on both expressive language skills as well as receptive vocabulary. Another limitation is that only a part of the autism spectrum was targeted here, given our IQ inclusion threshold, which shields an extensive range of language impairment in ASC from view (Hinzen et al., 2020; Jack & Pelphrey, 2017). Third, participants of this study were all bilingual, which complicates comparability with previous studies of monolinguals. While new evidence supports that bilingualism can enhance cognition in ASC without intellectual disability (Durrleman et al., 2022), it is an important question whether it also impacts on linguistic variables extracted here. Future work should also consider the practical implications of the present study. Many of the linguistic differences uncovered here are subtle (such as rate of embedded clauses), and not of a kind that necessarily meet a clinician's eye. Given their potential significance for training studies and diagnosis, objective assessments of language based on automated language processing options are therefore highly desirable. The measures used here do not yet lend themselves to such analyses.
Finally, future work should explore narratives or stories told by autistic children without visual support, as these may present greater interpretative challenges for the listener to overcome. As mentioned in the introduction, one of the aspects for which children are given narrative generation tasks is to assess sense-making skills. In order to build a narrative and draw inferences of how the story is unfolding, it is favorable to build endophoric representations which are continuous and involve anaphoric linking. If autistic children tend to favor exophora, as we suspect, we would expect non-visually supported narratives to show the characteristics identified here to a greater extent.
Footnotes
Credit author contribution statement
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was funded through the Spanish Ministry of Economy and Competitiveness [FFI2013-40526P] and the grant PID2019-105241GB-I00/AEI/10.13039/501100011033 provided by the Ministry of Science, Innovation and Universities (MCIU) and the Agencia Estatal de Investigación (AEI) of Spain.
