Abstract
Children with developmental language disorder (DLD) present with unexplained difficulties with both expressive and receptive language. A key conundrum is why their prognoses are often poorer than children with evidenced brain lesions. How can children with early focal lesions acquire language so well compared with children with DLD who do not present with gross neurological abnormalities? A possibility raised by Dell and Chang in 2013 is that learning in DLD is fundamentally derailed by an exemplar-based learning style. Such a process has been robustly demonstrated in studies of artificial neural networks. This theoretical article discusses exemplar theory, the role exemplars may play in language development, and surveys evidence suggesting an exemplar-based learning style in DLD. It concludes that this provides a plausible new approach to both explaining the condition and designing new therapies.
Introduction
Children with developmental language disorder (DLD) have severe difficulties acquiring, comprehending and producing language. Difficulties are evidenced across a wide range of domains, including word-learning, vocabulary, morphology (in particular verb morphology), use of complex sentences, pragmatics and phonology. A broad overview of linguistic difficulties in DLD is provided by Leonard (2014), while bearing in mind the shift in terminology from Specific Language Impairment (SLI) to DLD (see below).
In this review paper, I propose a new account of DLD, arguing that this condition is due to an exemplar-based style of learning and processing. The primary motivation for this argument is the ‘tough-but-fragile’ paradox. This is my term to describe an observation by Dell and Chang (2013). According to this paradox, the language system, in most individuals, is robust, being able to overcome severe obstacles such as early neurological damage. However, in individuals with DLD, it is extremely fragile, with severe long-term implications for language development. I then propose, based on Dell and Chang (2013), that exemplar-based learning enables us to explain this paradox. A novel aspect of this account is the fact that it is based on the principles of machine learning/artificial intelligence (AI). I will argue that, given substantial improvements in the power of generative AI systems, these may act as good models of human learning and cognition and may also be used to identify ways in which learning may break down.
A Note on Developmental Language Disorder
DLD is a recent label which revises the criteria used to diagnose SLI. A key change is the lowering of IQ cut-offs, with many studies of DLD using a cut-off of −2 standard deviations, as opposed to −1 standard deviations in the SLI literature. Consequently, the populations of children with SLI and children with DLD do not wholly overlap. This raises the question of whether research into SLI may inform our understanding of DLD. An argument in favour of comparing across research literatures is that linguistic profiles in children with language difficulties are not substantially influenced by IQ. For example, the linguistic phenotype in children with SLI closely resembles that of children who have a combination of language difficulties and moderate learning difficulties (e.g. Bishop, 1994; Rice et al., 2004). In fact, the finding that language profiles are largely unaffected by IQ constitutes an important motivation for lowering IQ cut-offs in the CATALISE consensus definition of DLD (Bishop et al., 2016, Criteria and Terminology Applied to Language Impairments: Synthesising the Evidence, statement 23, pg.15). In the light of these findings, Bishop (2020) argues that the research literature on SLI remains relevant to a discussion of DLD.
Given the above arguments, I will use the term ‘DLD’ when discussing research conducted using the recruitment criteria for SLI. The reader should bear in mind that most studies published prior to 2018 will use the ‘SLI’ label and should be aware that recruitment criteria (particularly with respect to IQ) differ slightly.
The Tough-But-Fragile Paradox
To my knowledge, Dell and Chang (2013) were the first authors to explicitly formulate the tough-but-fragile hypothesis. Regarding the ‘fragility’ of language in DLD, there is, indeed, evidence that children with DLD experience severe and protracted language difficulties. Botting (2020) found that 40% of a group of 84 24-year-old adults with DLD scored below <2 standard deviations on the Clinical Evaluation of Language Fundamentals (CELF) core language index (Semel et al., 2006). Given that the CELF is standardised up to 21;11, this study most likely overestimated their language ability. Unfortunately, due to the practicalities of running longitudinal studies, there are few studies of DLD in adulthood, and Botting’s (2020) study is unique in combining a large sample with standardised language assessments. Nonetheless, poor prognosis is widely recognised by clinicians and is emphasised by the CATALISE consensus definition of DLD, which specifies ‘language problems enduring into middle childhood and beyond’ (Statement 2: Bishop et al., 2017).
This poor prognosis contrasts with the better outcome of children with early focal lesions whose language systems are surprisingly ‘tough’. In a study of 38 children, aged 5 to 8 who experienced early focal lesions, Bates et al. (2001) observed normal range performance on measures derived from spoken language samples; word type frequency, number of propositions per utterance, Mean Length of Utterance (MLU) in morphemes and counts of lexical, morphological and grammatical errors. A more recent study by Newport et al. (2022) investigated 15 adolescents who suffered a left-lateralised perinatal ischaemic stroke. They did not significantly differ from age-matched peers on CELF 5 Word Structure (Semel et al., 2017), Test of Reception of Grammar (TROG) overall accuracy (Bishop, 2005), TROG complex sentences and a test assessing comprehension of actives and passives. This may reflect the reorganisation of language functions in the brain, with the right hemisphere taking over language processing.
While the research literature on prognosis in both DLD and early focal lesions is small and fraught with methodological issues (e.g. the lack of a standardised language test for adults), existing data support the tough-but-fragile hypothesis. Current evidence suggests that many children with early focal lesions will have a better language prognosis than many children with a diagnosis of DLD.
Exemplars as a Solution to The Tough-But-Fragile Problem
Dell and Chang (2013) propose that a dependence on exemplar-based representations may explain fragility in an otherwise tough language system. These representations are detailed, concrete, tied to specific usage events and most likely depend on episodic memory (Ambridge, 2020a). Many accounts of exemplar-based representations focus on word meaning (e.g. Ross & Makin, 1999). A child may initially underextend a word, for example assume that
Exemplars are often contrasted with prototypes. While a prototype is also derived from multiple usage events, it is a stored representation, for example a representation of the ‘ideal’ dog. By contrast, according to exemplar theory, categories are constructed spontaneously based on the simultaneous activation of multiple exemplars.
Exemplar theorists highlight the fact that we retain detailed information about our linguistic experiences. For example, phonological representations may contain subphonemic information. In frequent
Syntactic representations may also contain fine-grained contextual information. For example, constructional idioms exhibit detailed context-dependent meanings which go beyond or even contradict their literal interpretation. For example, the
Exemplar Representations in Child Language
Children exhibit a dependence on exemplars which may result in linguistic generalisations which are ‘faulty’ in the sense that they do not occur in the adult system. One example is children’s conflation of tense and aspect. Because achievement and accomplishment verbs (e.g.
Other studies find that children’s inflection of past tense is strongly influenced by the phonological properties of the stem. Young children often omit the regular past tense affix when the stem already ends in an alveolar plosive (/t/ or /d/), for example
Children’s use of the passive also demonstrates exemplar-based generalisation. In competent adult speakers, this involves the movement of a post-verbal argument into subject position. Almost any active transitive sentence may be passivised irrespective of verb semantics. Often a key motivation is to move an argument into a position where it is topicalised. However, early passives exhibit semantic constraints which do not play such a strong role in the adult system. A key property is that the subject must be highly affected (Pinker, 1989). Exactly when the passive becomes ‘abstract’, and consequently less influenced by semantic constraints, has been rigorously debated. Even as old as five or six, children struggle to comprehend passive sentences where the subject is not highly affected (Ambridge, 2020a).
How Important Are Exemplars to Language Acquisition?
There is strong evidence that at least some processes evident in early child language exhibit exemplar-like properties. However, this does not entail a central role for exemplars in language development. Critics of exemplar theory question whether we can store the kind of detailed representations required by the theory. Moreover, it is not clear how exemplar-based analogy would work without some kind of a priori specification of which dimensions are most relevant (Demuth & Johnson, 2020; Lieven et al., 2020; Schuler et al., 2020). In response to these criticisms, and others, Ambridge (2020b) acknowledges that exemplar-based representations must become more abstract over time. Given the many criticisms of exemplar theory, it might be best to regard exemplar-based phenomena as a characteristic of an early-stage of language acquisition, which is not a necessary precursor to the development of more abstract representations.
However, there is evidence that exemplar-based language use persists into adulthood. Though competent adult speakers rarely make past tense errors, product-oriented schemas may exist in the adult system. For example, Bybee and Slobin (1982) found that adult suffix-omission errors tended to occur on verbs where the stem form ended in a /t/ or /d/. Albright and Hayes (2003) also found an effect of stem phonology with respect to grammaticality judgements of past tense forms. Moreover, past tense forms which are in free variation in many varieties of English, for example
Turning to the passive, the concept of affectedness influences adults’ judgements of the grammaticality of passive sentences long after it has ceased to impact on performance in controlled tests of comprehension (Ambridge et al., 2016). The authors conclude that ‘a semantic constraint on the passive must be incorporated into accounts of the adult grammar’ (p. 1435). In addition, alongside the much-studied
Data from adult usage of the past tense and the passive indicate that exemplars do not suddenly give way to abstract representations, but exist, albeit in a limited fashion, within the adult system. Consequently, the shift from exemplar-based to abstract representations can be conceived as a gradual and graded process. This is an important principle when discussing the possibility of exemplar-based representations in DLD.
How Dell and Chang Evoke Exemplars to Explain Tough-But-Fragile Systems
Dell and Chang (2013) argue that exemplar-based learning can severely affect generalisation within a system which is ostensibly ‘tough’. This is demonstrated via an artificial neural network model. Such networks display excellent learning as they have a strong tendency to minimise prediction error. Once a model has reached a certain level of complexity, in terms of the number of nodes and training epochs, the removal of nodes has relatively little impact on the model’s capacity to learn and generalise. This ‘toughness’ mimics children’s recovery from focal legions.
However, neural networks are also fragile in the sense that they are prone to overfitting. This involves modelling the noise in the training data, which, in turn, impacts on generalisation. Here, the term ‘noise’ is used to describe information which is not relevant to the learning task and does not support generalisation. For example, using lexical aspect or stem phonology to determine past tense inflection misses the broader generalisation that regularity is an abstract feature in the verb lemma. Likewise, using affectedness to determine the suitability of the passive misses the broader generalisation that virtually any post-verbal argument may be passivised for discourse reasons.
Though Dell and Chang do not use the term
Chang’s (2002) study artificially induced overfitting in an artificial neural network model by adding redundant connections to create, a ‘linked path’ model. The additional connections linked the message/lexical system with the sequencing system. Such a manipulation caused the model to learn correspondences between noun lemmas and their thematic roles, for example assuming that
It must be acknowledged that overfitting is not a critical problem in machine learning. Engineers can employ various techniques to minimise overfitting, for example adding noise to the training data. Moreover, with substantial data, large networks recover from initial overfitting, a phenomenon called ‘double descent’ (Belkin et al., 2020). As our brains are more complex than the most sophisticated AI model to date, it seems unlikely that they are going to be worse at recovering from overfitting. Nonetheless, given that overfitting is an emergent property of computational systems directly modelled on human neural architectures, it provides a potentially important explanation for language learning difficulties.
Dell and Chang (2013) note that exemplar-based learning strategies may be relatively accurate and are therefore reinforced. In machine learning terms, the model is trapped in a ‘local minimum’, a term used to describe the entrenchment of a suboptimal generalisation. Product-oriented schemas for the past tense may be regarded as an example of this. This generalisation is at least partially successful as many verbs ending in an alveolar plosive do not take the past tense suffix. Albright and Hayes (2003) identify another suboptimal generalisation, noting that all verbs in English ending in a voiceless fricative (/f/, /θ/, /s/, /ʃ/) inflect for past tense via addition of the /t/ allomorph. Though this generalisation is completely reliable, it may foster an overdependence on stem phonology, which may prevent the child from attaining the broader generalisation, that regularity is encoded in the lemma.
Passive use may likewise exhibit the fossilisation of suboptimal learning strategies.
While, to my knowledge, Dell and Chang (2013) are the only researchers to propose an overfitting account of DLD, there have been numerous overfitting accounts of autism. These focus on sensory processing abnormalities, which are characterised as attempting to learn from the noise in the input (Cruys et al., 2013; Idei et al., 2020; Pellicano, 2010). In addition, autistic individuals often perform poorly on a visual categorisation task, which involves learning from noisy input. Categories consist of an arrangement of dots in two-dimensional space. Exemplars of these categories are generated by randomly displacing dots, but with noise carefully calibrated to make the category learnable. In comparison with non-autistic controls, adult autistic participants find it difficult to learn the categories under these noisy conditions (Gastgeb et al., 2012). Again, this learning style can be explained via a learning system which models noise rather than signal.
The research on autism described above suggests that individual differences can arise in the degree to which learning is based on exemplars. This is an important precondition of an exemplar-based account of DLD. It should be noted, however, that this does not presume a shared genetic basis for DLD and autism, a claim which is not supported by recent research (Nudel et al., 2024).
Examples of Exemplar-Based Learning in DLD
I will now explore whether widely observed linguistic difficulties in DLD may be explained via exemplar-based learning. For brevity’s sake, I will focus on two key linguistic characteristics of DLD, difficulties with tense-marking and complex sentences. In addition, I will investigate evidence for a chunk-based learning and processing style.
Tense-Marking
Tense-marking difficulties in DLD are well-evidenced and may constitute an important linguistic marker of DLD (Conti-Ramsden et al., 2001). A typical error is the Optional Infinitive (OI) whereby the infinitive is used instead of an inflected form (Rice et al., 1998). OI errors occur in both present and past tense contexts, affecting both regular and irregular verbs. Numerous accounts of tense-marking difficulties have attributed errors to the analysis of stored forms, and the development of linguistic schemas which are based on factors which do not guide generalisation in the adult system, for example phonological properties of the stem. These depict exemplar-like processes, though it is important to note that none, except Owen Van Horne et al. (2017), have explicitly used the term ‘exemplar’.
A frequent finding is that zero-affixation errors, for example
A second finding supporting an exemplar-based account is that, for regular past tense verbs, the input frequency of the inflected form impacts on children with DLD, but not language-matched typically-developing peers. Van der Lely and Ullman (2001) found that the input frequency of the inflected form was significantly negatively associated with zero affixation errors in the DLD group, but not the language-matched control group. In Dutch, Rispens and De Bree (2014) found that, for novel verbs, the DLD group were likely to make errors involving the erroneous inclusion of a
If past tense difficulties reflect exemplar-based processes, it follows that we can improve past tense inflections by deliberately exposing children to stimuli which disrupt these processes. This was observed by Owen Van Horne et al. (2017). Using a sentence imitation task, they trained 18 children with DLD to produce regular past tense forms. Verbs were divided into easy and difficult sets based on a norming study, which investigated zero-affixation rates for a wide range of verbs (Owen Van Horne & Green Fager, 2015). This manipulated (a) their phonological properties, for example whether they lent themselves to a product-oriented schema, (b) their lexical aspect; in particular whether they implied a natural end state, and (c) input frequencies for inflected forms. Those children who were initially trained on verbs which deliberately disrupted their exemplar-based learning, for example atelic verbs ending in alveolar plosives, demonstrated significantly greater learning. By contrast, those children who were initially trained on ‘exemplar-like’ verbs demonstrated weaker learning. The fact that ‘exemplar-busting’ stimuli promote more rapid learning provides further proof for the exemplar-based nature of language learning in DLD.
Complex Syntax
Children with DLD frequently struggle to produce and comprehend sentences with non-canonical word order such as passives (e.g. Van der Lely, 1996), object questions (e.g. Deevy & Leonard, 2004) and relative clauses (e.g. Novogrodsky & Friedmann, 2006). Most accounts of this phenomenon posit difficulties with syntactic movement processes (Novogrodsky & Friedmann, 2006), or short-term/working memory (e.g. Deevy & Leonard, 2004). To my knowledge, no one has yet proposed an exemplar-based account of such difficulties. However, the data are consistent with an exemplar-based interpretation which highlights the role of discourse.
While psycholinguists focus on the structure of complex sentences, they often overlook their discourse properties. Relative clauses ground the head noun in the discourse, making it identifiable by both speaker and hearer (Fox & Thompson, 1990). Consequently, relative clause internal Noun Phrases (NPs) are usually pronominal. Questions, which, in English, have a similar syntactic structure, exhibit similar discourse properties. For example, in object questions, subjects are usually pronominal (Riches & Garraffa, 2017). Passives likewise have an important discourse function. By enabling us to place an underlying non-subject in a topicalised position, they facilitate topic maintenance (Ferreira, 2021). Experimental support for this view is provided by Thompson et al. (2013) who found that competent adult speakers produce frequent passives when patients are topicalised. For this reason, passive subjects are usually pronominal.
Discourse fundamentally shapes our use of complex sentences. Contemori and Belletti (2014) identified an intriguing phenomenon whereby speakers strongly avoid producing object relatives with full NPs, for example
Note that the preferred structure,
Exemplar accounts are consistent with such discourse-related phenomena because they presuppose that stored grammatical constructions retain detailed contextual information. Goldberg (2014) demonstrates this phenomenon, arguing that in ditransitive sentences, for example
If grammatical constructions strongly specify discourse properties, it follows that complex sentences will be hard to interpret when their discourse properties are atypical. For example, object relative clauses which contain lexically-restricted NPs are more difficult to process than those containing pronouns (Gordon et al., 2004). Even choice of pronoun matters. When discourse-accessible
The degree to which processing is impacted by discourse is likely to reflect the degree to which representations are exemplar-based. When an individual has representations which are sufficiently abstract, they may have only minor difficulties processing complex sentences with atypical discourse properties. However, an individual whose representations are more exemplar-based will be strongly affected when discourse properties are atypical. It is therefore important, in controlled studies of syntactic processing, to create sentences which are naturalistic from a discourse perspective. However, studies of complex syntax in DLD employ stimuli which, in discourse terms, are highly atypical. One example is object relatives containing discourse-new NPs, which are frequent in studies of DLD (e.g. Novogrodsky & Friedmann, 2006). Likewise, many studies of passive comprehension which present passive sentences in isolation (e.g. Van der Lely, 1996) do not present them in a context where they have a topicalisation function. Such stimuli, though lacking in ecological validity, are chosen for practical reasons. Presenting these constructions with pronouns would involve the creation of a rich context, which is difficult within a controlled experimental setting. Moreover, the inclusion of pronouns would involve the introduction of case-marking cues, which add an additional and potentially confounding variable to already complex experimental designs. Consequently, many controlled studies of complex syntax in DLD assess an individual’s ability to adapt their syntactic representations to an atypical discourse context rather than their ability to process complex sentences in discourse contexts where they would naturally occur.
While it is difficult to run a controlled experiment on complex syntax with naturalistic stimuli, it is possible to create or facilitate a more naturalistic context in studies which elicit spoken language within a conversational setting. According to the logic of the above argument, children with DLD will demonstrate better abilities with complex sentences in such naturalistic contexts. Admittedly some naturalistic studies do suggest severe difficulties with complex sentences. For example, Marinellie (2004) sampled the language of 15 children with DLD in a conversational context, and found that rates of complex sentences, for example relative and adverbial clauses, were approximately half those of language-matched controls. Paradis et al. (2022), also using a conversational task, found a similar pattern for bilingual children with DLD.
However, open-ended conversational tasks may not be ideal for eliciting complex syntax, as sentence complexity may be influenced by factors unrelated to structural language, for example pragmatic abilities, conversational style, motivation, and rapport. Expository discourse tasks, where participants must create a monologue describing an inherently complex topic, provide a better means of eliciting syntactically-complex utterances. Using this paradigm, Nippold et al. (2008) identified very similar rates of complex sentences in adolescents with DLD (
Some studies of passive production in DLD have also employed naturalistic elicitation paradigms. These elicit passives via a prompt which topicalises the patient, for example
More serious difficulties with the passive were identified by Du et al. (2024) in a study of the
To summarise, while complex sentences have salient discourse characteristics, experimental studies of complex syntax in DLD tend to employ stimuli which lack appropriate discourse contexts, and may even have atypical discourse characteristics. Consequently, assuming that the discourse properties of complex sentences are encoded in their grammatical representations, such studies assess children’s flexibility in adapting their exemplars to atypical discourse contexts. By contrast, numerous studies which create or facilitate discourse contexts which support the use of complex sentences (Leonard et al., 2006; Nippold et al., 2008; Stanford & Delage, 2023) find only minor difficulties in DLD. Such a pattern can be easily explained via an exemplar-based account which argues that fine-grained contextual information, including discourse properties, is stored within grammatical representations.
The Use of Chunks During Language Processing
Some researchers have suggested that children with DLD have a dependence on high-frequency chunks during language production and comprehension. Such multi-word units are exemplar-like as they lack the abstraction which characterises adult language use. Bybee (2013) has characterised chunk-dependence as a consequence of exemplar-based learning. Moreover, an overdependence on input frequency is consistent with Chang’s (2002) linked-path model which predicts input-dependence regarding word order, though it should be noted that, in this model, such effects arose at the message level, for example making links between the lemma of
Some studies of DLD have presented novel items in specific syntactic frames, and observed an unwillingness to ‘go beyond the input’ by using that item in an unattested item-frame combination. Skipp et al. (2002) modelled novel verbs in a variety of frames, for example V, SV, VO, SVO. They found that children with DLD (
A further exploration of chunk dependence was conducted by Kueser and Leonard (2020). They performed an elicited imitation task using sentences which controlled the frequency of chunks, for example
This finding would suggest heightened chunk frequency in the DLD group. Unfortunately, for the purposes of the current discussion, the key interaction between group (DLD, age-matched and language-matched) and condition (presence vs. absence of a high-frequency chunk) did not reach significance (
Caveats
While there is, I believe, converging evidence for an exemplar-based account of DLD, there are some potential difficulties with this account.
First, the definition of what constitutes an exemplar, and the range of properties which may be used to characterise exemplars, could be more firmly operationalised. The key criterion is that exemplars exhibit constraints, or principles of generalisation which are ‘noisy’ in the sense of not existing within the adult system. Examples are semantic constraints on affixation in regular verbs, strong discourse constraints on the production and comprehension of complex sentences, and the possible existence of a chunk-based learning style. Further empirical and theoretical work is clearly needed to facilitate identification of exemplars.
Another issue is whether an exemplar-based learning style can explain co-occurring difficulties, for example motor difficulties, or dyslexia. Though the theory focuses primarily on language difficulties, it is possible that an exemplar-based learning style may be found across domains. For example, researchers have argued that performance on a visual pattern-learning task by autistic individuals may indicate atypical domain-general learning mechanisms (Gastgeb et al., 2012). According to this logic, the hypothesised exemplar-based learning style in DLD may have ramifications beyond language. However, a detailed discussion of how this might manifest is beyond the scope of this paper.
A further limitation of this account is that it does not readily explain inconsistent language use, for example the within-child co-occurrence of inflected and infinitival forms of the same verb. Such inconsistency would not be predicted by an exemplar account. One possibility is that inconsistent performance may reflect the application of a metalinguistic strategy. A child who is dependent on a product-oriented schema may become aware that most verbs in the past tense need to be inflected either via suffixation or a stem change. They may therefore develop a metalinguistic strategy which adds the suffix whenever their product-oriented schema results in no change. Because of the cognitive effort involved in such a process, they may only apply this intermittently. Though this goes beyond the exemplar account by proposing an alternative mechanism, exemplar-based processes remain a key underlying factor.
Many readers will question whether the study of artificial neural networks can serve as an empirical basis for making claims regarding language impairments. The emergence of powerful large language models (LLMs) has generated both excitement and controversy in the field of linguistics. Some have proclaimed that the incredible power of these models and the extremely naturalistic language they produce effectively rings the death knell for theories of Universal Grammar (Piantadosi, 2023). Others have argued that these models are nothing more than technically-sophisticated versions of the horse Clever Hans, and that they produce and comprehend language in a very superficial manner (Chomsky et al., 2023). For example, these models lack a genuine understanding of intentionality, and their semantic representations are, in effect, floating signifiers which are not linked to real-world objects or situations. Moreover, their training data greatly exceed the children’s linguistic input, making them poor models of acquisition.
While many of the criticisms of LLMs as models of human learning and cognition are valid, there are a number of reasons why we might wish to adopt them as psychologically plausible models. First, they mimic neural architectures and learning mechanisms. In terms of architectures, they involve neurons with multiple inputs, corresponding to dendrites, and a single output, corresponding to axons, and these neurons are arranged in layers as they are in the human brain. They also implement prediction-based learning, a process which many also assume to underly human learning (Friston, 2010). Finally, their linguistic behaviours closely resemble those of humans. For example, Cai et al. (2024) ran 12 classic psycholinguistic studies using ChatGPT and identified human-like performance in 10 of these. Paradigms included lexical and structural priming, and implicit causality (
A further issue is whether overfitting can genuinely happen in the human brain. Though this phenomenon occurs naturally in artificial neural networks, it is relatively easy to overcome. Therefore, is it plausible that this process also happens in the human brain, which is vastly more complex than the most complex artificial neural network? Two possible candidate mechanisms which are specific to the human brain are myelination and precision. Myelination, a neurological process which prunes weak connections and thereby fortifies strong connections plays an important role in brain development. In the context of overfitting, it reduces noise within neural networks. With regard to myelination, Chang’s (2002) linked-path model could be described as ‘under-myelinated’ as it exhibits a surfeit of connections due to the addition of paths between the message-level and sequential system. There is, moreover, evidence for reduced myelination (i.e. over-connectivity) in DLD (Krishnan et al., 2022), which could result in more noisy networks.
Precision is, in simple terms, an estimate of the learning potential of any unpredicted stimulus (Parr & Friston, 2017). Recent research has identified neurons in the auditory cortex which spike when an unpredicted stimulus is encountered, and consequently, it is likely that they encode estimates of precision (Pérez-González et al., 2024). Furthermore, recent data gathered by Blake et al. (2025) indicate that children with DLD find it difficult to identify collocations when presented with pairs consisting of collocations and a non-collocational distractor (
Possible Future Directions
As mentioned above, definitions of exemplars and descriptions of the range of characteristics that they display are relatively underspecified, and further work is needed in this area. Further behavioural studies could be conducted to explore the putative exemplar-related effects described above, for example the relationship between verb telicity and successful regular past tense affixation in DLD, the extent to which the production of complex sentences by children with DLD is influenced by discourse phenomena, and the degree to which children with DLD exhibit a dependence on high-frequency chunks. Neurobehavioural/neuroimaging studies may also be conducted in an attempt to identify neural signatures of exemplar-based learning. The kind of visual pattern-learning tasks which have been used with autistic children to explore exemplar-based process (e.g. Gastgeb et al., 2012) could also be administered to children with DLD to ascertain, first, whether they exhibit poorer generalisation, and second whether performance on this task correlates with linguistic measures.
A particularly exciting possibility is that exemplar-based theory will be applied in intervention. I have already discussed an intervention for the past tense which is closely informed by exemplar theory (Owen Van Horne et al., 2017). This employs what could be described as an ‘exemplar-busting’ approach, by initially introducing stimuli which are designed to deliberately disrupt exemplar-based representations. A similar technique could be applied in various grammatical domains. As mentioned above, when complex sentences are introduced into a clinical context, they typically contain full NPs and consequently are already potential ‘exemplar busters.’ What may be lacking is the application of a learning regime which manipulates item difficulty. Owen Van Horne et al. (2017) specifically found that a difficult-to-easy gradient was most effective. In the context of complex sentences, this would involve introducing more naturalistic complex sentences, containing pronouns rather than full NPs, at a later stage.
The Tough-But-Fragile Paradox Revisited
The evidence for an exemplar-based account of DLD can best be described as emerging. Nonetheless, a key attraction of this approach is that it provides a solution to the tough-but-fragile paradox, whereby the language system recovers well in cases of early neurological damage, but shows poor prognosis in children with DLD. This is a key paradox which all accounts of DLD must explain. The idea that powerful neural systems can be severely affected by overfitting has been consistently demonstrated in artificial neural networks, raising the possibility that this may also happen in the human brain. It remains to be seen whether existing accounts of language difficulties in DLD can provide such a persuasive explanation. One possibility is that subtle weaknesses in specific cognitive systems, for example phonological working memory, or auditory processing, may, via a series of cascading effects, lead to severe long-term consequences. But given what we know about neural plasticity, and the impressive language development of children with early focal lesions, these accounts are questionable. By contrast, an overfitting account offers an attractive alternative which is theoretically well-formulated, empirically grounded, and has important implications for intervention.
Supplemental Material
sj-docx-1-fla-10.1177_01427237251371762 – Supplemental material for Developmental Language Disorder: A Consequence of Exemplar-Based Learning?
Supplemental material, sj-docx-1-fla-10.1177_01427237251371762 for Developmental Language Disorder: A Consequence of Exemplar-Based Learning? by Nick Riches in First Language
Footnotes
Author Contributions
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
