Word Learning Ability Varies Across Contexts and Time: A Longitudinal Study of Primary School Children with Developmental Language Disorder

Abstract

Background and Aims

Children with developmental language disorder (DLD) have difficulty learning new words, but we know little about whether and to what extent their word-learning ability improves over time. Our primary goal was to compare the rate of development of word learning abilities, specifically the encoding of new lexical-semantic information, in children with DLD and their age-mates with typical language development (TLD). Secondary goals were to examine variation in outcomes according to the aspect of word knowledge under consideration, the word-learning context, and the cognitive abilities of the learners.

Methods

Children with DLD (n= 38) and TLD (n = 45) participated in two experiments each year from Grades 1 to 4. In each, they were taught a new set of 12 novel words and their referents. The experiments involved a cross-situational word learning context (Study 1) and ostensive naming and mutual exclusivity contexts (Study 2). Post-training probes measured knowledge of the phonological form of the word and the link between the word and its referent. In Year 1, we measured various aspects of cognition. Linear mixed models with fixed effects for diagnostic group, year, and context yielded main outcomes.

Results

Children with DLD were less able than peers with TLD to learn form- and link information in all three contexts and in all four years, but their relative rate of growth across the years was similar. Neither form- nor link-learning was consistently harder than the other for the DLD group; however, the size of the TLD-DLD performance gap was especially large for form learning in the ostensive naming (most direct) training context. The lower cognitive abilities of children with DLD, especially phonological short-term memory and receptive vocabulary knowledge, accounted for variance in form and link learning. With cognitive scores in the statistical models, the TLD-DLD gap in link learning was no longer significant.

Conclusions

As a group, primary school children with DLD present with weaknesses in word learning abilities but age-appropriate rates of improvement in those abilities over time. The problem reflects their lower verbal cognition. The extent of the problem varies with the context in which the word is learned and the aspect of word knowledge that is measured as an outcome.

Implications

The implication for scientists is that a diversity of contexts and outcome measures must be included in future research. The implication for clinicians is two-fold. First, brief opportunities to learn word forms in ostensive naming contexts may be leveraged for dynamic assessments, as it is this learning goal and this learning context that most effectively distinguished DLD from TLD. Second, when treating a child with DLD, the common practice of direct teaching is unlikely to be effective unless active engagement, sufficiently high dosage, optimally sized target sets, and rich vocabulary instruction are included.

Keywords

Developmental language impairment longitudinal word learning

Developmental language disorder (DLD) is a highly prevalent neurodevelopmental condition that limits spoken and written language learning, comprehension, and expression and leaves individuals vulnerable to broader impacts on academic performance (Ziegenfusz et al., 2022), employment (Conti-Ramsden et al., 2018), and mental health (Nudel et al., 2023). As a group, people with DLD have, among other symptoms, vocabularies characterized by less breadth and depth than expected for their age (McGregor et al., 2013). That is, they have less lexical-semantic knowledge than their peers with typical language development (TLD).

Lexical-semantic knowledge is the product of our exposure to words in the language in meaningful contexts and the sensory and cognitive mechanisms that allow us to learn from these exposures. Our lexicons vary greatly one from the other because exposures vary with amount and quality of schooling, what and how often we read, and numerous other experiences. That said, some variance clearly reflects inter-individual differences in the mechanisms that support word learning. In the case of DLD, we hypothesize that it is a mechanistic limitation, not a difference in experience, that results in low levels of lexical-semantic knowledge, simply put, many people with DLD have difficulties learning new words.

Evidence in support of this thesis involves multiple studies in which people with DLD and peers with typical language development (TLD) are given equivalent experiences to learn new words and the DLD group outcomes are lower. Only by altering those experiences by providing more exposure to new words (Gray, 2003; 2004; McGregor, Arbisi-Kelm et al., 2020) or more supportive learning contexts (Haebig et al., 2019; McGregor, Gordon et al., 2017; Pomper et al., 2022; van Berkel-van Hoof et al., 2019) will the DLD group approach comparable levels of word learning as their peers. The word learning problems of monolingual English-speaking children with DLD are well documented, but there is also growing evidence from other language communities including monolingual speakers of Dutch (Broedelet et al., 2023; van Berkel-van Hoof et al., 2019), Hebrew (Barak et al., 2022), and French (Krzemien et al., 2021), as well as bilinguals who speak Cantonese and English (Kan, 2024), Hebrew and Russian (Barak et al., 2022), Spanish and Catalan (Ahufinger et al., 2021), Spanish and English (Kapantzoglou et al., 2012), and Spoken Arabic and Modern Standard Arabic (Ghawi-Dakwar & Saiegh-Haddad, 2025).

In a meta-analysis of 28 cross-sectional studies (244 effect sizes) involving participants with mean ages as young as 3;7 (years; months) and as old as 12;3, Kan and Windsor (2010) concluded that children with DLD perform approximately 0.6 SDs lower on word learning tasks than their age-mates. As for the specific mechanistic limitations that may contribute, children with lower receptive language abilities and lower nonverbal IQs were more likely to have poor word learning outcomes than those stronger in those domains.

Lexical-semantic knowledge builds over time as exposures accrue and word-learning abilities develop. Three longitudinal studies demonstrate that the vocabulary knowledge of children with DLD accrues at age-expected rates as measured from ages 6 to 10 years (Norbury et al., 2021), 8 to 16 years (McGregor et al., 2013), and 2;6 to 15 years (Rice & Hoffman, 2015). Rice and Hoffman (2015) continued to follow their participants from 15 to 21 years, and, at that later stage of development, people with DLD began to lag behind their peers in receptive vocabulary knowledge. Typical accrual from 2;6 to 16 years should not be taken to mean that children with DLD learn as many words as their peers each year but, instead, that they are adding words to their lexicons at a rate that maintains, rather than narrows or widens, the lexical-semantic knowledge gap between TLD and DLD children. Unless children with DLD are consistently getting more word exposures than their peers, it would seem that their ability to accrue vocabulary knowledge at the same relative rate must reflect an age-appropriate rate of the development of word-learning abilities during these years.

To test this hypothesis, and to more fully understand the word-learning problems associated with DLD, we used a longitudinal design to determine how the ability to learn new words changes over the first four years of primary school in children with DLD or TLD. To our knowledge this is the first study to examine longitudinal change in spoken word learning ability among children with DLD, but interested readers may wish to see Factor and Goffman (2022) for a parallel study in the manual domain. To provide a more nuanced understanding, we focused on the encoding of both phonological word forms and the link between forms and their semantic referents in three different word learning contexts. Next, we motivate these methodological decisions.

Stages of Word Learning

Word learning is a gradual process (Wojcik, 2013). When a person encounters a word for the first time, a fragile memory trace may be formed. This stage, referred to as encoding (Melton, 1963), is supported by the medial temporal lobe and the hippocampus (Davis & Gaskell, 2009). With time (Walker, 2005), sleep (Dumay & Gaskell, 2007; Gais, Lucas & Born, 2006), and (at least in infants) food (Valiante et al., 2006), that memory may begin to consolidate. Consolidation involves the transfer of the memory trace to the neocortex (Dudai, 2004). A consolidated memory is stronger and less vulnerable to decay than a newly encoded memory. Once the word is reencountered, additional information may be added to the memory trace via processes of re-encoding, and that memory may then reconsolidate (Lee et al., 2017).

When we taught novel words to adults with DLD and TLD via passive study or active retrieval training, those with DLD recalled fewer of the new words immediately post training, suggesting poorer encoding. However, the two groups demonstrated similar consolidation abilities, in that both groups retained what they had encoded over shorter (20 min) or longer (1 day, 1 week) retention intervals (words learned via passive study at a 1-day retention interval being an exception), suggesting largely intact consolidation (McGregor, Gordon et al., 2017). In McGregor, Eden et al. (2020), we directly tested the hypothesis that the word learning problems associated with DLD are characterized by a weakness in encoding but strengths in consolidation and reconsolidation. Again, we found consistent evidence of poor encoding. The DLD and TLD groups were similarly able to retain what they had encoded when measured 1 day or 1 month after training; however, the DLD group did demonstrate more forgetting than their peers with TLD at the 1-week interval, again demonstrating that retention may not always be age-appropriate, but it is a relative strength for those with DLD. Finally, additional exposures to the words following the initial training boosted performance equivalently in both groups, revealing that re-encoding and re-consolidation are also strengths.

Although our participants were young adults, these patterns of relative strengths and weaknesses have been replicated in studies of children with DLD in multiple labs. Calabrese et al. (2025) conducted a meta-analysis of 46 studies to determine the extent to which children with DLD performed more poorly than their age-mates during encoding (78 effect sizes), consolidation (8 effect sizes), and reconsolidation (19 effect sizes). The mean effect for encoding was large and significant (d = .82), whereas the mean effects for consolidation (d = −.2) and reconsolidation (d = .23) were small and not significant.

Cognitive abilities in the domains of verbal working or short-term memory (often measured with nonword repetition tasks) and extant vocabulary knowledge (most often measured with receptive vocabulary tests) moderated the effect of encoding. In other words, even within the DLD group there were individual differences, and those who came to the task with higher vocabulary knowledge and verbal working memory tended to be the better word learners. The role of verbal short-term memory was also reflected in the finding that longer words yielded larger effect sizes than shorter words. Although not considered in Calabrese et al. (2025), other aspects of cognitive function, such as attention switching and inhibition are also associated with word encoding ability (Kapa & Erikson, 2020).

Aspect of Word Knowledge to be Learned

In the Kan and Windsor (2010) meta-analysis, word learning was measured in three different ways across studies: 1) In 12 studies, the child was given an array of toys and was asked to follow a command that included the target word (e.g, “Show me, Tigger is [newly taught verb] Minnie.” [Skipp et al., 2002]); 2) In another 12 studies, the child was shown a single referent and was asked, “What's this?;” and 3) In seven studies the child was given an array of (pictured) referents and was asked to “Point to the [newly taught noun].” To accomplish any of these tasks, one must have knowledge of the semantic referent, the phonological form of the target word, and the link between the two; however, the task in the first set of studies puts greater burden on semantic knowledge; the second set puts greater burden on phonological word form knowledge; and the third set puts greater burden on linking the word form to the semantic referent.

Children with DLD demonstrate poorer word learning than their age-mates with TLD on all three tasks; however, the effect sizes for the diagnostic group differences were larger for the tasks that depended highly on semantic knowledge (g = .52) and link knowledge (g = .64) than for those that depended highly on word form knowledge (g = .26). As Kan and Windsor note, this does not necessarily mean that learning to link words to referents or learning the meanings of words is harder for people with DLD than learning the word forms. Because it is more difficult to produce than recognize a newly learned word, the finding could reflect floor effects (see also Ghawi-Dakwar & Saiegh-Haddad, 2025).

In fact, studies published since the 2010 meta-analysis often report that, relative to link learning, word form learning represents a significant challenge for children and adults with DLD (Jackson et al., 2021; McGregor, Arbisi-Kelm et al., 2020; Pomper et al., 2022), although this might be especially true of those who have DLD with co-occurring dyslexia (Alt et al., 2019). As Benham et al. (2018) elegantly demonstrate, children with DLD do not configure novel phonological sequences as efficiently as children with TLD, and their attempted productions of newly trained word forms are less accurate and more variable.

Word Learning Contexts

Most—but certainly not all—research on word learning in the DLD population involves direct teaching contexts (i.e., structured tasks that involve repeated presentations of words and their referents, naming, and supportive cues, Jackson et al., 2019), but most word learning in the real world does not (Bloom, 2002). The word learning challenges associated with DLD have also been observed in less direct contexts including animated stories (Oetting et al., 1995; Rice et al., 1992; Rice et al., 1994), shared book reading (Nash & Donaldson, 2005; Smeets et al., 2014; Storkel et al., 2019), written texts (McGregor et al., 2024; Steele & Watkins, 2010), and college lectures (Becker & McGregor, 2016). These contexts provide varying supports for—and demands upon—the learner, but on average, those with DLD exhibit poorer learning in all contexts.

Experimental word learning paradigms involve manipulation of context to examine the effects of learner engagement (Haebig et al., 2019; Jackson et al., 2021; Leonard et al., 2021 [Studies 1 and 3]; McGregor, Gordon et al., 2017; Pomper et al., 2022) and the need for the learner to make inferences ((McGregor et al., 2024; Steele & Watkins, 2010) and aggregate information over time (Broedelet et al., 2023). Although the outcomes for the DLD groups in these studies are typically lower than those of the TLD groups, in a few cases, increased engagement (Haebig et al., 2019) or support (Pomper et al., 2022) ameliorated group differences on some aspects of word learning.

The Current Studies

To summarize, the extant literature supports the conclusions that 1) as a group, people with DLD have difficulty learning new words; 2) the extent of the word learning difficulty varies with receptive vocabulary, nonverbal IQ, phonological short-term memory, and executive function; 3) the encoding of new words seems to be the bottleneck, whereas consolidation and re-consolidation are relative strengths; 4) it is unclear whether phonological or semantic aspects of word learning are more problematic; and finally, 5) the difficulty is evident in a range of word learning contexts.

These conclusions motivate the current project, as summarized in Table 1. We compared children with DLD and their age-mates with TLD as they performed novel word learning tasks in each of four years, from grade 1 to grade 4 (roughly 7 to 10 years of age). Given the robust support for the encoding hypothesis, we chose to focus on performance during and immediately after training to capture the learning stage that is most vulnerable for people with DLD. We probed outcomes of both phonological and semantic aspects of word learning with task type held constant, specifically, alternative forced choice (3AFC) recognition probes. We also examined the cognitive mechanisms that explained variation in learning outcomes.

Table 1.

Overview of the Project.

Questions	How do word learning abilities vary by a) diagnostic group, b) year, c) contexts of learning, and d) aspect of word knowledge to be acquired?
	What cognitive capacities account for variation in outcomes?
Dependent variable	The primary dependent variable was accuracy on form-to-referent linking during training (Study 1 only) and accuracy on post-training probes of word form and link recognition (Studies 1 and 2).
Independent Variables	Diagnostic group: The project involved comparison of primary school children with developmental language disorder (n = 38) or typical language development (n = 45).
	Year: A longitudinal design in which word learning opportunities were repeated each year from Grades 1 through 4 allowed analysis of change over time.
	Context: Study 1 examined learning in a cross-situational context; Study 2: compared learning in contexts of ostensive naming and mutual exclusivity. Contexts in Studies 1 and 2 were compared indirectly via examination of between-diagnostic group effect sizes.
	Cognitive capacities: In year 1, cognitive capacities of interest were estimated by measures of receptive vocabulary, phonological short-term memory, verbal working memory, visual short-term and working memory, speed of processing, sustained attention and nonverbal IQ.
Controlled variables	Word forms: Each set of 12 nonwords comprised 6 monosyllables and 6 disyllables; Sets were equated for phonotactic probability.
	Word referents: Each set of 12 referents comprised unfamiliar objects selected from the familiar categories of birds, insects, mammals, and fruit.
	Timing of measurement: Probes to measure the linking of word forms to referents were administered during training (Study 1); probes to measure form and link recognition outcomes were administered immediately after training (Studies 1 and 2), thus isolating encoding from consolidation.
	Novelty: A different set of 12 words and referents was randomly assigned in each learning context and in every year per individual participant to isolate encoding from consolidation and re-encoding of familiar information.

One innovation of the current study is that we examined word learning abilities in more than one word learning context within the same sample of participants. Given that word-learning (and word-learning difficulties) emerge within a variety of contexts in the real-world, we have designed three laboratory tasks that vary in contexts and, thus, learning demands. The contexts are Cross-Situational (CS), which places heavy demands on the aggregation of information across episodes, Mutual Exclusivity (ME), which places heavy demands on inference, and Ostensive Naming (ON), which eliminates the ambiguity of the word form to referent linkage. CS and ME reflect aspects of the word-learning contexts we encounter in everyday life whereas ON captures aspects of word-learning that might take place in a classroom or language intervention session where direct teaching takes place. By examining word-learning in these three contexts, we aim to determine the extent to which the word-learning problems associated with DLD are context-dependent.

To ease the burden on the reader, we have divided the paper into two separate studies. The first examines learning during and immediately after CS exposures. The second compares learning outcomes in more implicit (ME) and explicit (ON) contexts. Because the same participants and outcome measures apply to both studies, we begin by describing those commonalities.

General Method

Ethics and Preregistration

To ensure ethical treatment of human subjects, this project was approved by the Internal Review Board of Boys Town National Research Hospital. It was preregistered in 2017 at The Research Registry under the title The Dynamics of Word Learning in Children with Developmental Language Disorder: A Prospective Cohort Study (preregistration ID#3425). This paper addresses Aims 1 and 2 of the preregistration; outcomes associated with Aim 3 were published in McGregor et al., 2024. The project, as carried out, differed from the preregistration in four ways:

The preregistered target sample size was 40 children with DLD and 40 age-mates with TLD, but due to recruitment challenges, we enrolled 38 children with DLD. We exceeded our target somewhat for the TLD group, enrolling 45.

The proposed measures of verbal working memory and visual short-term and working memory were subtests of the Automated Working Memory Assessment (AWMA, Alloway, 2007), but that assessment was phased out when we began the study. We were able to use the verbal working memory task from the AWMA, which involved backward digit recall stimuli, but we changed to the Corsi Block-Tapping task (Farrell Pagulayan et al., 2006) to measure visual short-term memory and the Odd-One-Out task (Henry, 2001) to measure visual working memory.

The preregistered cut-off for exclusion on the Perceptual Index of the Wechsler Abbreviated Scales of Intelligence–Second Edition (Wechsler, 2011) was 75 or below; we used 70 instead to be consistent with the Diagnostic and Statistical Manual of Mental Disorders (DSM-5: American Psychiatric Association, 2013).

The preregistered cut-off for categorizing participants into DLD or TLD groups was 80 or lower on the Test of Narrative Language-2. Instead, we used a cutoff of 92 because it has been shown to maximize sensitivity and specificity of DLD identification (Gillam & Pearson, 2017).

Participants

The participants were 38 children with DLD (16 girls) and 45 children with TLD (25 girls), enrolled when they had completed kindergarten and not begun second grade (i.e., during first grade, summer months inclusive). None of the participants repeated a grade during the study; therefore, years of education remained well matched by diagnostic group throughout. All lived in the United States, and all were monolingual speakers of English. The two groups were matched for age in months. They differed significantly on all other demographic and cognitive measures administered in Year 1 of the study, except for speed of visual processing. (Table 2).

Table 2.

Comparison of Diagnostic Groups on Intake Measures.

Construct	Descriptive Statistic	DLD (n = 38)	TLD (n = 45)	t	p	Effect Size D
Age in Months	M (SD)Range	86.47 (5.67)74–96	86.60 (4.58)76–98	−0.11	.9125	−0.03
Language	M (SD)Range	81.58 (8.19)61–91	111.3 (8.98)94–127	−15.77	<.0001	−3.45
Vocabulary	M (SD)Range	92.88 (87.84)75–125	111.40 (14.09)78–140	−5.69	<.0001	−1.31
Sustained Attention	M (SD)Range	0.68 (0.30)0–1	0.86 (0.21)0–1	−2.73	.0087	−0.68
Phonological STM	M (SD)Range	0.72 (0.14).3−.9	0.85 (0.09).6–1	−5.03	<.0001	−1.15
Verbal WM	M (SD)Range	1.71 (1.01)0–4	2.76 (0.80)2–5	−5.15	<.0001	−1.16
Visual STM	M (SD)Range	4.48 (1.03)2–6	5.18 (0.86)4–8	−3.13	.0026	−0.74
Visual WM	M (SD)Range	1.24 (0.56)0–3	2.18 (1.21)1–6	−4.56	<.0001	−0.94
Processing Speed	M (SD)Range	88.91 (20.66)54–120	94.93 (25.21)27–176	−1.15	.2535	−0.26
Nonverbal IQ	M (SD)Range	90.12 (11.70)71–120	107.5 (10.41)86–130	−6.84	<.0001	−1.58
Parent Education	M (SD)Range	14.24 (2.66)10–20	16.93 (2.23)12–22	−4.96	<.0001	−1.11

Note. Language was measured with the Test of Narrative Language-2 with outcomes expressed as standard scores (normative M = 100, SD =15; Gillam & Pearson, 2017). Vocabulary was measured with the NIH Toolbox Picture (receptive)Vocabulary Test with outcomes expressed as standard scores (normative M = 100, SD =15; Gershon et al., 2013). Sustained attention was measured with the Track-It task (Erickson et al., 2015; Fisher et al., 2013) and scored as the proportion of heterogeneous trials correct after excluding trials failed on the memory check. Phonological short-term memory was measured with the Nonword Repetition Test (Dollaghan & Campbell, 1998) and scored as the proportion of phonemes correctly produced (maximum of 96). Verbal working memory was measured with the Backwards Digit Test (Alloway, 2007) and scored as span (maximum number of digits recalled correctly on at least four of six trials). Visual short-term memory was measured with the Corsi Block-Tapping Task (Corsi; Farrell Pagulayan et al., 2006) and scored as span (the highest level at which participant correctly reproduces at least one sequence). Visual working memory was measured with the Odd-One-Out (OOO) task (Henry, 2001) and scored as span (the highest list length at which the child was able to recall the odd one out spatial location on at least three trials correctly out of four). Processing speed was measured with the visual Pattern Recognition Test from the NIH Toolbox (Gershon et al., 2013) with outcomes expressed as standard scores (normative M = 100, SD =15). NVIQ was estimated with the Perceptual Index of the Wechsler Abbreviated Scales of Intelligence–Second Edition (normative M = 100, SD =15; Wechsler, 2011). Parent education was measured as the total years of education of the most educated parent.

DLD = Developmental Language Disorder; TLD = Typical Language Development; STM = Short-term Memory; WM = Working Memory

Sample sizes varied over the years due to attrition and occasional technological malfunctions. We report the exact sample sizes in the results sections below.

Subsets of these children also participated in McGregor et al. (2021, 2023, 2024). Also, outcomes from the preregistered project in Year 1 are reported in McGregor et al. (2022) and Pomper et al. (2022) . Because the current paper builds on these by documenting change from Years 1 to 4, we will summarize the findings of the 2022 papers when we introduce the specific studies.

Stimuli

Each participant completed two word-learning tasks in each of the four years, for a total of eight tasks. Therefore, we created eight sets of novel words and assigned them in counterbalanced order across tasks, years, and individuals, so that no word learning targets were encountered by a given person in more than one learning context or more than one year. Each set included 12 words, half monosyllabic and half disyllabic. To reduce noise in the data, we took care to make the sets similar in distribution of onset segments, neighborhood density, and phonotactic probability. The novel words and details about their characteristics, are available in OSF at McGregor et al. (2026, November 28).

Each word was assigned to a pictured referent. The referents were color photographs of real but unfamiliar birds, insects, mammals, or fruits found via internet searches. Each set of 12 included three referents from each of these four categories.

Data Collection

The data collectors (NE and TAK) were assigned participant cases whom they saw for three or four data collection visits during each of the four years of the study. The visits involving novel word learning were scheduled at least one week apart, to reduce interference, but not more than three weeks apart, to control for maturation. In the first and second years of the project, data were collected in a mobile lab van equipped with a table, chairs, and video cameras. Because of the COVID-19 pandemic, we moved to virtual data collection via a secure Zoom link for the third and fourth years of the project. All data collection sessions were digitally recorded. All digital files were reviewed in full to determine whether glitches (of particular concern for the Zoom sessions) prevented the child from hearing or seeing the training or test trials. If a glitch occurred during training, the child's data from all outcome probes were excluded for that particular context and year. If a glitch occurred during the administration of an outcome probe that interfered with more than three of 12 trials, we also deleted those data. The extent of data loss per context, year and outcome measure appears in McGregor et al. (2026, November 28). The highest amount of data loss occurred in Study 1, where we lost five data sets for 3AFC form and five for 3AFC link in Year 3. The lowest amount occurred in Study 2 where we lost one data set for form and one data set for link, both in Year 3.

Outcome Measures

We used two probes to measure learning outcomes: 1) A 3AFC ‘dot task’ (Gordon & McGregor, 2014) measured learning of the phonological word forms (form recognition). The participant heard three novel words; one was a trained target, and the other two were foils created by changing one phoneme in the target. As each word was presented, a dot appeared on the computer screen. After all three word choices and dots were presented, the participant then pointed to the dot corresponding to the word they had learned. 2) A 3AFC picture pointing probe measured the ability to link word forms to their referents (link recognition). The participant saw three referents from the training set and heard one word form. They indicated which of three was named. The 3AFC form recognition task was administered five minutes after the final training cycle, and this was immediately followed by the 3AFC link recognition task. The reverse order would have provided the participants with an additional exposure to the correct word form just prior to the form recognition task. The timing of these probes, being soon after training, was critical as we intended them as measures of encoding, not consolidation. That said, we recognize that these are not pure measures of encoding—sensory and attentional processes are required to perceive the memoranda and retrieval processes are necessary to formulate a response.

With these commonalities explained, we now turn to the studies themselves. In Study 1, we examined word learning abilities in a cross-situational learning context (extension of McGregor et al., 2022). All children participated. In Study 2, we used a between-subject design, comparing word learning abilities in response to direct and indirect instructional contexts (extension of Pomper et al., 2022). We began with a within-subject design, but pilot data suggested that the attempt to learn numerous new words over a short period introduced interference. Therefore, we randomly assigned half of the children from the TLD and DLD diagnostic groups to direct instruction and the other half to indirect instruction. Those assignments held constant over the four years to enable evaluation of change in learning ability.

Study 1: Cross-Situational Learning

In any given real-world context, the intended referent of a new word is often unclear (Quine, 1960); however, learners can leverage word-referent co-occurrences over time and across multiple situations to arrive at the correct mapping. Cross-situational learning is thought to be accomplished via proposing links and confirming or correcting the proposal on subsequent trials (Medina et al., 2011; Trueswell et al., 2013; Woodard et al., 2016), by aggregating the statistics of co-occurring word and referent instances in a more implicit manner over time (Smith & Yu, 2008; Yurovsky et al., 2014), or via a combination of the two (Roembke & McMurray, 2016; Stevens et al., 2017; Xu & Tenenbaum, 2007; Yurovksy & Frank, 2015), with strategy varying according to task demands and the number of words to be learned (Roembke & McMurray, 2016).

Cross-situational word learning is evident from infancy (Smith & Yu, 2008; Vlach & Johnson, 2013; Woodard et al., 2016). However, within the population of learners with typical language development, there are individual differences across learners that pattern with their extant vocabulary knowledge and their memory abilities (Vlach & DeBrock, 2017). Given their weaknesses in these domains, it is not surprising that children with DLD are less able than their age-mates with TLD to learn word-to-referent links from cross-situational information (Ahufinger et al., 2021; Broedelet et al., 2023). Eye-tracking data in Ahufinger et al. (2021) also suggest that children with DLD are less confident in their responses than children with TLD, even when they have made the correct mapping.

McGregor et al. (2022) is a report of cross-situational word learning abilities in a subset of the current participants, 28 children with DLD and 44 with TLD. The data were collected when all participants were in Grade 1. The cross-situational word-learning context (the same one reported here) comprised six cycles of 14 trials each (12 novel word trials and two familiar word filler trials). In each novel word trial, the child saw two pictured referents, heard one novel word, and indicated the word-to-referent link by clicking on the referent that they thought was being named. The foil accompanying each target referent was randomly selected from the remaining 11 targets. In that way, a given referent appeared once as a target in each cycle but also could appear one or more times as a foil. Thus, the learner had two sources of information. First, when a given referent appeared as both a target and a foil within a cycle, the learners could narrow the hypothesis space and achieve above-chance performance. Second, as the given name-referent pairing appeared in each of the six cycles, the learner could acquire the correct mapping. The inclusion of six cycles (i.e., six exposures to the target word-referent pairs) was established via piloting with the goal of keeping learners away from floor or ceiling performance.

At the end of the first trial, the group with DLD performed at chance, and the group with TLD performed above chance. On the sixth and final learning cycle, the children with DLD were above chance but 15% less accurate than their peers with TLD. After training, each learner completed the 3AFC form and link measures. Contrary to prediction, form learning was not disproportionately harder than link learning for the children with DLD relative to their peers. Extant receptive vocabulary knowledge accounted for variance in performance whether measured by accuracy on the final cycle of learning, form recognition, or link recognition.

Hypotheses and Predictions

In Study 1, we repeated the training and outcome measures summarized above to yield data in each year from first to fourth grade. We were primarily interested in the extent to which the two groups of participants improved their word learning outcomes from year to year. Given the outcomes of previous word-learning studies, we hypothesized weaker learning outcomes for the DLD group. Given extant longitudinal comparisons of vocabulary knowledge, we hypothesized that the relative rate of growth in word-learning ability would be similar for the two diagnostic groups. We predicted:

Less accurate word-to-referent linking on the part of the DLD group during cross-situational learning as measured on training trials.

Lower form and link recognition outcomes on the part of the DLD group, as measured with 3AFC probes administered immediately after training.

Given outcomes in Year 1, a similar effect size for diagnostic group differences on form recognition and link recognition.

A main effect of year, such that both diagnostic groups improved word learning abilities over time.

We also aimed to identify cognitive mechanisms that relate to cross-situational learning. Given outcomes in McGregor et al. (2022), we predicted that performance would vary with extant vocabulary size. Per the preregistration, we also tested for relationships between learning outcomes and other cognitive variables including phonological short-term memory, verbal working memory, visual short-term and working memory, sustained attention, processing speed, and nonverbal IQ. We also tested the effect of years of parent education, a proxy for socioeconomic status. We were interested not only in whether any predictor was significant but also whether, with additional important predictors in the model, any effect of diagnostic group remained.

Method

Data Analysis

Analysis involved three steps: one to evaluate performance during learning, another to compare learning outcomes to chance, and a third to evaluate predicted learning outcomes. Code for all data analysis appears in McGregor et al. (2025, November 28).

Throughout the project, we defaulted to frequentist models wherever appropriate because they tend to be more familiar to readers. However, the complexity of the analysis of the within-training responses necessitated a hierarchical Bayesian approach. The outcome variable was expressed as an odds ratio (OR), specifically, the odds of a correct answer versus an incorrect answer, which is assumed to follow a Bernoulli distribution (Ber(p)). A logit transformation was then applied to “p” to set up a logistic regression for the probability of successfully identifying each word correctly. The terms in the logistic regression model included a lag effect to capture the fact that the probability of a correct answer may depend upon accuracy in the previous cycle. Other variables in the model included diagnostic group (DLD, TLD), sex (M, F), and cycle (1,…,6), and interactions between diagnostic group and sex, year, cycle, and lag. The model included a random intercept for subject to allow for correlation between repeated observations per participant. A random intercept for word did not improve model fit, so it was removed.

A distribution of vague but proper priors with a mean of 0 and a variance of 100 was placed on the coefficients representing diagnosis, sex, trial, cycle, lag and the interactions. The vague normal prior distribution essentially placed a non-informative prior on beliefs about these estimates, thus allowing the data to drive the model. For the subject effect, a normal prior distribution was used, N (0, 1), with the variance parameter receiving an inverse gamma prior of 0.01 for both shape and scale. There were three chains with random starting values, where each chain ran for 50,000 iterations after a burn-in of 1,000 iterations and thinning of 5. The model was fit using the R package nimble. Convergence was assessed using the Gelman-Rubin R-hat diagnostic.

Next, we examined form and link outcomes. We compared performance by diagnostic group and year against a baseline of chance (chance = 33%); all outcomes appear in the Supplemental Tables. For both form and link, we then ran a linear mixed effects model with a random intercept for each individual and fixed effects for diagnostic group, sex, and year. The dependent variable was proportion of items correctly identified on the recognition probes. Evaluation of residuals indicated that the assumption of normality was not violated by use of proportional data. If interactions occurred, we compared the differences in the least squares means to aid interpretation; these outcomes also appear in Supplemental Tables.

We then repeated those models to determine whether the cognitive variables and parent education (number of years of the more highly educated parent) accounted for variance in outcomes. All test scores were entered as raw scores (i.e., total correct responses) with the exception of the NIH Toolbox Picture Vocabulary Test for which standard scores uncorrected for age are more meaningful (see Gershon et al., 2013). Note that, although there were some correlations among these variables, they did not create a multicollinearity problem. The Variance Inflation Factor values were less than two, with four being the cutoff over which multicollinearity must be addressed (Hair et al., 2009). Therefore, we included each of the individual variables in the model. Estimates were generated using REML, and we used Satterthwaite degrees of freedom. Model selection was performed using AIC. Chronological age never improved model fit so it was omitted from all models.

Results

Learning During Training

In Figure 1, we present the raw data as the mean proportion of referents correctly selected per cycle for the two diagnostic groups in each of the four years. In all years, both groups performed significantly better than chance by the end of cycle 1 (Table S1). When examining referent selection accuracy across each cycle of training, there were main effects of diagnostic group, with the TLD group averaging 33% better learning than the DLD group, and year, such that word learning abilities improved by 6.6% each year (Table 3). There was no effect of sex and no interactions involving diagnostic group.

Figure 1.

Mean proportion of correct referent selections during cross-situational learning by diagnostic group (developmental language disorder, typical language development) and cycle (1–6). Note that chance is .5. Error bars refer to standard error.

Table 3.

Bayesian Logistic Regression Assessing the Effects of Diagnostic Group, Sex, Year, Cycle and Lag on Cross-Situational Learning.

Variable	Posterior Mean ^a	Posterior SD	95% Credible Interval	Odds Ratio	95% Odds Ratio Credible Interval	Inverted Odds Ratio^b
Dx (TLD is reference)	−0.329	0.149	(−0.617, −0.032)^c	0.720	(0.540, 0.968)	1.389
Sex (Male is reference)	−0.069	0.113	(−0.293, 0.159)	0.933	(0.746, 1.172)	1.072
Dx*Sex	−0.048	0.166	(−0.374, 0.278)	0.953	(0.688, 1.322)	1.049
Year	0.066	0.020	(0.027, 0.105)^c	1.068	(1.028, 1.111)
Dx*Year	0.015	0.028	(−0.040, 0.069)	1.015	(0.961, 1.071)
Cycle (1 is reference)
Cycle 2	−0.010	0.106	(−0.180, 0.161)	0.990	(0.835, 1.175)	1.010
Cycle 3	0.326	0.094	(0.142, 0.511)^c	1.385	(1.153, 1.667)
Cycle 4	0.378	0.102	(0.179, 0.580)^c	1.459	(1.196, 1.786)
Cycle 5	0.581	0.114	(0.359, 0.803)^c	1.789	(1.432, 2.233)
Cycle 6	0.469	0.121	(0.235, 0.711)^c	1.599	(1.265, 2.036)
Dx*Cycle
Dx*C2	0.115	0.126	(−0.134, 0.361)	1.122	(0.874, 1.435)
Dx*C3	−0.205	0.134	(−0.468, 0.056)	0.815	(0.626, 1.058)	1.227
Dx*C4	−0.140	0.139	(−0.410, 0.133)	0.869	(0.663, 1.143)	1.150
Dx*C5	−0.485	0.151	(−0.778, −0.192)^c	0.616	(0.459, 0.825)	1.624
Dx*C6	−0.404	0.156	(−0.715, −0.100)^c	0.668	(0.489, 0.904)	1.498
Lag1 (1 to 2)	0.365	0.097	(0.174, 0.556)^c	1.440	(1.190, 1.743)
Lag2 (2 to 3)	0.394	0.105	(0.188, 0.600)^c	1.483	(1.207, 1.821)
Lag3 (3 to 4)	0.724	0.115	(0.499, 0.947)^c	2.062	(1.646, 2.579)
Lag4 (4 to 5)	0.716	0.125	(0.474, 0.962)^c	2.047	(1.607, 2.616)
Lag5 (5 to 6)	0.990	0.134	(0.726, 1.248)^c	2.692	(2.066, 3.484)
Lag1*Dx	−0.248	0.141	(−0.527, 0.027)	0.781	(0.590, 1.028)	1.281
Lag 2*Dx	−0.223	0.149	(−0.515, 0.071)	0.800	(0.597, 1.074)	1.250
Lag 3*Dx	−0.501	0.157	(−0.812, −0.198)^c	0.606	(0.444, 0.821)	1.650
Lag 4*Dx	−0.263	0.165	(−0.585, 0.060)	0.769	(0.557, 1.062)	1.301
Lag 5*Dx	−0.384	0.173	(−0.718, −0.041)^c	0.681	(0.488, 0.960)	1.469

Note. Dx = diagnostic group; DLD = developmental language disorder; TLD = typical language development.

The posterior mean indicates the log odds of the probability of a correct answer.

For ease of interpretation, the negative odds ratio was inverted.

The credible interval does not include 0.

There was an effect of cycle: performance on training cycle 2 did not differ from training cycle 1, but performances on cycles 3, 4, 5, and 6 were reliably higher than cycle 1. There was an interaction between diagnostic group and cycle: Relative to cycle 1, the two groups demonstrated similar changes in level of accuracy on cycles 2, 3, and 4, but on cycles 5 and 6, the TLD group outperformed the DLD group (Table 3). In other words, the gap between the DLD and TLD group was largely the result of the TLD group demonstrating better learning by the end of training (Figure 2).

Figure 2.

Mean proportion of correct referent selections on the final cycle of cross-situational learning by diagnostic group (developmental language disorder, typical language development) and year. Chance is .50. Error bars refer to standard error.

There was a consistent effect of lag: accuracy on one cycle predicted accuracy on the next (Table 3). So, for example, participants were more likely to select a target correctly on cycle two if they had correctly selected that same target on cycle 1, more likely to select a target correctly on cycle three if they had correctly selected that same target on cycle 2, and so on. This suggests that the participants were accruing knowledge as they progressed through the learning cycles. However, there was an interaction between diagnostic group and lag such that the TLD group demonstrated a larger lag effect (i.e., greater accrual) than the DLD group in the lags between cycles 3–4 and 5–6. The interaction helps to explain why the DLD group had lower performance by the end of the training.

Learning Outcomes

Form

With the exception of the DLD group in Year 1, all performances on the 3AFC form recognition probe were significantly higher than chance (Table S2).

A linear mixed effects model revealed main effects of diagnostic group and year (Table 4, Figure 3). The DLD group averaged ∼17% lower accuracy compared to those in the TLD group, holding year and sex constant. On average, the proportion of correct scores increased by about 2% each year, holding diagnostic group and sex constant.

Figure 3.

Growth in form recognition after cross-situational learning by diagnostic group (developmental language disorder, typical language development) and year. Error bars refer to standard error.

Table 4.

Fixed Effects Model of Form Recognition.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	0.5179	0.03036	80	17.06	<.0001
Year	0.01974	0.008025	206	2.46	0.0147
Dx (TLD is reference)	−0.1693	0.02703	206	−6.26	<.0001
Sex (male is reference)	0.01048	0.02685	206	0.39	0.6966

Note. Dx = diagnostic group; TLD = typical language development.

Extant receptive vocabulary and visual working memory were significant predictors of form learning outcomes (Table 5) and, with these scores in the model, the effect of diagnosis was no longer significant. The effect of year remained.

Table 5.

Fixed Effects Model of Form Recognition with Predictors Added.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	−0.4281	0.1867	60.2	−2.29	0.0253
Dx (TLD is reference)	0.0028	0.0387	61.3	0.07	0.9429
Sex (Male is reference)	0.0194	0.0261	57.5	0.74	0.4615
Year	0.0214	0.0083	196	2.57	0.0109
Vocabulary	0.0078	0.0024	55.8	3.24	0.0020
Sustained Attention	−0.0341	0.0576	60	−0.59	0.5557
Phonological STM	0.0024	0.0012	62.2	1.95	0.0552
Verbal WM	0.0036	0.0036	56.4	0.97	0.3337
Visual STM	−0.0003	0.0045	55.3	−0.06	0.9509
Visual WM	0.0081	0.0040	55.5	1.95	0.0475
Processing Speed	−0.0004	0.0021	55.9	−0.21	0.8318
Nonverbal IQ	−0.0003	0.0037	58.1	−0.07	0.9437
Parental Education	0.0052	0.0056	58.7	0.93	0.3546

Note. Dx = diagnostic group; TLD = typical language development;

STM = short-term memory; WM = working memory.

Link

Both diagnostic groups performed above chance in all years (Table S3). There was a statistically significant difference between diagnostic groups and an interaction between diagnostic groups and year (Table 6), reflecting a slower rate of improvement in scores on the part of the DLD group (Figure 4). Female sex was also associated with a lower baseline proportion of correct scores.

Figure 4.

Growth in link recognition after cross-situational learning by diagnostic group (developmental language disorder, typical language development) and year. Error bars refer to standard error.

Table 6.

Fixed Effects Model of Link Recognition After Cross-Situational Learning.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	0.5791	0.04069	80	14.23	<.0001
Dx (TLD is reference)	−0.1404	0.05512	205	−2.55	<.0001
Sex (Male is reference)	−0.07300	0.03120	205	−2.34	<.0001
Year	0.06732	0.01258	205	5.35	<.0001
Dx*Year	−0.03955	0.01891	205	−2.09	0.0377

Note. Dx = diagnostic group; TLD = typical language development.

To understand the diagnostic group x year interaction, we compared the differences in the least squares means for each year and diagnostic group (Table S4). There was a significant difference in the mean proportion of correct scores between the DLD and TLD diagnostic groups at each year, with the DLD group consistently showing lower expected proportions; however, this disparity increased over time, from approximately 18% lower in the first year to nearly 30% lower by the fourth year, for a mean of 24% lower compared to the TLD group. Although the mean scores of the DLD group improved over time, a 1-year increase in time was associated with a 3.9% widening of the gap between the DLD and TLD groups.

Extant receptive vocabulary, phonological short-term memory, and verbal working memory scores accounted for significant variance in link learning (Table 7). With these scores in the model, the effects of diagnosis and sex were no longer significant, but the effect of year and the interaction between year and diagnosis remained.

Table 7.

Solution for Fixed Effects Model of Link Recognition After Cross-Situational Learning with Predictors Added.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	−0.4120	0.2238	60.4	−1.84	0.0705
Dx (TLD is reference)	0.0386	0.0658	188	0.59	0.5584
Sex (Male is reference)	−0.0356	0.0312	56.7	−1.14	0.2583
Year	0.0652	0.0126	193	5.18	<0.0001
Dx*Year	−0.0399	0.0198	195	−2.01	0.0455
Vocabulary	0.0068	0.0029	55.1	2.38	0.0208
Sustained Attention	0.0402	0.0695	60.4	0.58	0.5650
Phonological STM	0.0040	0.0015	61.5	2.74	0.0081
Verbal WM	0.0090	0.0044	55.7	2.07	0.0430
Visual STM	0.0025	0.0054	54.8	0.46	0.6509
Visual WM	0.0013	0.0047	54.3	0.28	0.7804
Processing Speed	−0.0029	0.0025	56.7	−1.15	0.2536
Nonverbal IQ	−0.0046	0.0044	57.7	−1.04	0.3034
Parent Education	0.0052	0.0067	58.1	0.77	0.4434

Note. Dx = diagnostic group; TLD = typical language development; STM = short-term memory; WM = working memory.

Study 1 Summary

As predicted, the DLD group demonstrated less accurate word-to-referent linking than the TLD group during cross-situational learning, although the gap was not evident until the final two training cycles. Also, as predicted, the DLD group demonstrated lower form and link recognition outcomes on the 3AFC probes administered immediately after training. Although the DLD group performed above chance on link learning but not form learning in Year 1, the mean size of the performance TLD-DLD gap was somewhat smaller for form learning (17%) than link learning (24%). Both diagnostic groups demonstrated growth in form and link learning over the four years, but, relative to the TLD group, the link learning abilities of the DLD group grew more slowly. Finally, we predicted that performance would vary with extant vocabulary size and that prediction held; however, form learning also varied with visual working memory and link learning also varied with phonological short-term memory and verbal working memory. With cognitive scores in the statistical model, the TLD-DLD performance gap was no longer significant, but the effects of time remained.

Study 2: Learning via Ostensive Naming and Mutual Exclusivity

Many studies of word learning among individuals with DLD involve direct instruction via ostensive naming. The examiner simultaneously presents a new word and its referent, making the learning goal, as well as the word-to-referent link, obvious. Direct instruction contexts are frequent in educational and clinical environments; however, everyday environments primarily comprise indirect learning contexts (Bloom, 2002; Nagy et al., 1985).

In indirect contexts, the primary goal is typically to communicate rather than to teach and learn a new word. When the communication partner utters an unfamiliar word, the referent is often ambiguous, so there is a greater burden on the learner to discern the correct linking. In Study 1, we examined one such context, cross-situational learning; here, we will examine another, mutual exclusivity.

In Carey's foundational work on children's use of the mutual exclusivity heuristic, a teacher asked her preschoolers, “Can you get me the chromium tray, not the red one, the chromium one?” (Carey, 2010, p. 185). When encountering a new word in such a situation, learners commonly assume the new word refers to an unfamiliar referent, rather than a referent for which they already have a word-to-referent link (see meta-analysis of this phenomenon in Lewis et al., 2020). Because they knew the meaning of “red,” the preschoolers were able to infer that chromium must refer to the olive-green tray that sat alongside the red one.

Marulis and Neuman (2010) conducted a meta-analysis of 67 vocabulary teaching studies involving preschoolers and kindergartners with TLD. They found that, relative to baseline, direct instructional contexts (that included not only ostensive naming but definitions and examples) yielded large effects on learning (Hedge's g = 1.11). In contrast, indirect contexts, which involved new words embedded in activities or story books (where more ambiguity and a greater need for inference exists), yielded moderate effects on learning relative to baseline (Hedge's g = .62).

During Year 1, as reported in Pomper et al. (2022), 36 of the current participants with DLD and all 45 of those with TLD were randomly assigned to a direct ON context or an indirect ME context. Both contexts involved five exposures to each of the 12 target word-referent pairs, five being the number of exposures required during piloting to keep learners away from floor or ceiling performance.

In the direct ON instructional context, the novel pictured referent appeared alone as its name was presented. In the indirect ME instructional context, the novel pictured referent appeared alongside a familiar referent from the same semantic category as the name of the novel referent was presented. One aim was to determine whether the DLD and TLD groups benefited equally from the more direct instruction.

After training, we administered the 3AFC form recognition probe followed by the 3AFC link recognition probe. The TLD group demonstrated stronger form and link learning in the ON context than the ME context, but the benefit of ostensive naming for the DLD group was limited to link learning only. The DLD group performed at chance on form recognition in both the ON and ME contexts. Within-group variance in encoding was accounted for by phonological short-term memory abilities.

Hypotheses and Predictions

In Study 2, we repeated the training and outcome measures summarized above in each year to determine the extent to which the two diagnostic groups improved their word learning abilities over the four years of the study. We hypothesized that the DLD group would demonstrate weaker word learning outcomes than the TLD group, but that the rate of growth would be similar for the two diagnostic groups; therefore, we predicted no group x year interaction.

Given Pomper et al. (2022), we predicted:

The TLD group will perform with higher accuracy than the DLD group on both encoding outcomes: 3AFC form and 3AFC link.

The effect of diagnostic group will be larger when the learning outcome is measured by form recognition than link recognition.

Link outcomes will be better in ON than ME contexts for both groups. For word form outcomes, we predict a diagnostic group x context interaction because only the TLD group will derive the relative benefit of ostensive naming instruction.

There will be a main effect of year such that both diagnostic groups demonstrate improved learning outcomes over time.

We also aimed to understand mechanisms of learning in the two diagnostic groups. Given outcomes in Pomper et al. (2022), we predicted that performance would vary with phonological short-term memory. As in Study 1, we also tested the relationships between learning outcomes and socioeconomic status as well as verbal and visual cognitive variables including receptive vocabulary, verbal working memory, visual short-term and working memory, sustained attention, processing speed, and nonverbal IQ.

Method

The participants were 38 children with DLD (16 girls) and 45 children with TLD (25 girls), the same children who participated in Study 1. Individuals from each group were randomly assigned to either the ON or ME contexts. Note that the random assignment did not yield well-matched groups on all variables (Table 8). The children with DLD who were assigned to ME had significantly better verbal memory scores than those assigned to ON. The children with TLD who were assigned to ON had significantly better scores on phonological- and visual short-term memory than those assigned to ME. That said, we entered these variables into the second regression model for all outcomes, thereby accounting for any effect that these differences had on learning outcomes.

Table 8.

Comparison of Intake Characteristics of Subgroups Assigned to the Ostensive Naming (oN) or Mutual Exclusivity (ME) Context.

Construct	Descriptive Statistic	DLD-ON (n = 20)	DLD-ME (n = 18)	t	p	Effect Size D	TLD-ON (n = 22)	TLD-ME (n = 23)	t	p	Effect Size D
Age	M (SD)Range	85.00 (5.47)74–94	88.11 (5.58)79–96	1.73	.0920	0.56	87.59 (4.49)79–97	85.65 (4.56)76098	−1.44	.1579	−0.43
Language	M (SD)Range	81.05 (8.44)64–91	82.17 (8.10)61–91	0.42	.6799	0.13	112.0 (9.77)94–127	110.7 (8.34)94–127	−0.45	.6566	−0.13
Vocabulary	M (SD)Range	90.44 (13.10)75–113	95.18 (15.21)75–125	0.96	.3442	0.33	114.9 (13.87)91–140	108.0 (13.73)78–128	−1.69	.0985	−0.50
Sustained Attention	M (SD)Range	0.75 (0.25)0.2–1	0.62 (0.34)0–1	−1.18	.2464	−0.42	0.89 (0.15)0.5–1	0.82 (0.26)0–1	−1.10	.2810	−0.33
Phonological STM	M (SD)Range	0.71 (0.14).30−.88	0.72 (0.14).40−.92	0.09	.9279	0.03	0.88 (.06).77–1	.81 (.10).56−.96	−3.02	.0046	−0.89
Verbal WM	M (SD)Range	1.40 (1.10)0–4	2.06 (0.80)1–4	2.12	.0414	0.68	2.68 (0.72)2–5	2.83 (0.89)2–5	0.60	.5507	0.18
Visual STM	M (SD)Range	4.25 (1.02)3–6	4.71 (1.10)2–6	1.28	.2086	0.45	5.55 (0.91)4–8	4.83 (0.65)4–6	−3.04	.0043	−0.91
Visual WM	M (SD)Range	1.25 (0.45)1–2	1.24 (0.66)0–3	−0.07	.9407	−0.03	2.36 (1.36)1–6	2.00 (1.04)1–4	−1.00	.3231	.30
Processing Speed	M (SD)Range	87.06 (18.93)54–120	90.65 (22.62)55–120	0.49	.6243	0.17	98.23 (26.40)55–176	91.64 (24.13)27–138	−0.86	.3923	−0.26
NonverbalIQ	M (SD)Range	88.79 (12.39)73–116	91.80 (10.94)71–120	0.75	.4580	0.26	110.4 (8.32)96–122	104.7 (11.57)86–130	−1.92	.0616	−0.57
Parent Ed	M (SD)Range	13.80 (2.89)10–20	14.72 (2.35)12–20	1.08	.2860	0.35	17.18 (2.48)13–22	16.70 (1.99)12–20	−0.72	.4735	−0.22

Note. Language was measured with the Test of Narrative Language-2 with outcomes expressed as standard scores (normative M = 100, SD =15; Gillam & Pearson, 2017). Vocabulary was measured with the NIH Toolbox Picture (receptive)Vocabulary Test with outcomes expressed as standard scores (normative M = 100, SD =15; Gershon et al., 2013). Sustained attention was measured with the Track-It task (Erickson et al., 2015; Fisher et al., 2013) and scored as the proportion of heterogeneous trials correct after excluding trials failed on the memory check. Phonological short-term memory was measured with the Nonword Repetition Test (Dollaghan & Campbell, 1998) and scored as the proportion of phonemes correctly produced (maximum of 96). Verbal working memory was measured with the Backwards Digit Test (Alloway, 2007) and scored as span (maximum number of digits recalled correctly on at least four of six trials). Visual short-term memory was measured with the Corsi Block-Tapping Task (Corsi; Farrell Pagulayan et al., 2006) and scored as span (the highest level at which participant correctly reproduces at least one sequence). Visual working memory was measured with the Odd-One-Out (OOO) task (Henry, 2001) and scored as span (the highest list length at which the child was able to recall the odd one out spatial location on at least three trials correctly out of four). Processing speed was measured with the visual Pattern Recognition Test from the NIH Toolbox (Gershon et al., 2013) with outcomes expressed as standard scores (normative M = 100, SD =15). NVIQ was estimated with the Perceptual Index of the Wechsler Abbreviated Scales of Intelligence–Second Edition and expressed as standard scores (normative M = 100, SD =15; Wechsler, 2011). Parent education was measured as the total years of education of the most educated parent.

DLD = Developmental Language Disorder; TLD = Typical Language Development; STM = Short-term Memory; WM = Working Memory

Stimuli and data collection for Study 2 mirrored Study 1. The same frequentist data analysis approaches described in Study 1 also applied to Study 2.

Results

Form

Both diagnostic groups performed above chance on the 3AFC form recognition probe in all years except for the DLD group in Year 1 (Table S5).

Overall results for form appear in Figure 5 and Table 9. There were main effects of diagnostic group favoring TLD and context favoring ON. The effect of the diagnostic group was qualified by two interactions. First, as predicted, there was an interaction between diagnostic group and context (Figure 5, Table S6), reflecting a larger gap between the ME and ON contexts for the TLD group (∼19%) than for the DLD group (<1%). The TLD group, but not the DLD group, benefitted from the direct teaching offered in the ON context. Second, and contrary to prediction, there was also an interaction between diagnostic group and year (Table S7), reflecting steeper growth in the DLD group that served to reduce the TLD-DLD performance gap from ∼25% to ∼17% over the course of the study.

Figure 5.

Growth in form recognition after learning in ostensive naming and mutual exclusivity contexts by diagnostic group (developmental language disorder, typical language development) and year. Error bars refer to standard error.

Table 9.

Fixed Effects Model of Form Performance After Learning in Ostensive Naming or Fast Mapping Conditions.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	0.7079	0.04795	78.9	14.76	<.0001
Dx (TLD is reference)	−0.4406	0.07088	72.5	−6.22	<.0001
Sex (Male is reference)	−0.00861	0.02857	70.8	−0.30	0.7640
Context (ON is reference)	−0.2090	0.06451	71.4	−3.24	0.0018
Year	0.003055	0.01566	64.7	0.20	0.8460
Dx*Context	0.3248	0.09907	72.6	3.28	0.0016
Year*Context	0.006645	0.02252	66.8	0.30	0.7688
Dx*Year	0.05602	0.02473	68.6	2.27	0.0267
DxContextYear	−0.05574	0.03482	68.4	−1.60	0.1141

Note. Dx = diagnostic group; TLD = typical language development;

ON = ostensive naming.

In Table 10, we ran the model again with all predictor variables included. All of the significant effects of the prior model remained, but a number of additional effects were obtained. First, the steeper growth of the DLD group in the ON context was now evinced by a three-way interaction between context, year, and diagnostic group (Table S8). Numerous cognitive abilities contributed to form learning. These were phonological short-term memory, verbal working-memory, visual short-term and working memory, speed of visual processing, and nonverbal IQ, which is also a largely visual task.

Table 10.

Fixed Effects Model of Form Recognition Performance After Learning in Ostensive Naming or Fast Mapping Conditions with Predictors Added.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	−0.2216	0.1777	58	−1.25	0.2173
Dx (TLD is reference)	−0.2100	0.0709	217	−2.96	0.0034
Sex (Male is reference)	−0.0170	0.0217	49.5	−0.78	0.4368
Context (ON is reference)	−0.1693	0.0602	204	−2.81	0.0054
Year	0.0053	0.0161	96.2	0.33	0.7418
Dx*Context	0.2692	0.0923	201	2.92	0.0039
Context* Year	0.0175	0.0234	96.6	0.75	0.4564
Dx*Year	0.0518	0.0249	98	2.08	0.0401
DxContextYear	−0.0782	0.0362	97	−2.16	0.0330
Vocabulary	0.0038	0.0020	49.5	1.85	0.0697
Sustained Attention	0.0266	0.0476	49.9	0.56	0.5780
Phonological STM	0.0052	0.0011	57.5	4.72	<.0001
Verbal WM	0.0079	0.0032	50.1	2.42	0.0192
Visual STM	−0.0125	0.00388	49.4	−3.29	0.0019
Visual WM	0.0087	0.0033	48.4	2.60	0.0125
Processing Speed	0.0045	0.0017	49.8	2.64	0.0112
Nonverbal IQ	0.0065	0.0031	51.6	2.07	0.0439
Parent Education	0.0015	0.0048	50.8	0.32	0.7531

Note. Dx = diagnostic group; TLD = typical language development; ON = ostensive naming; STM = short-term memory; WM = working memory.

Link

Both diagnostic groups performed above chance on the 3AFC link recognition task in all years (Table S9).

Link accuracy varied with diagnostic group, context, and year (Table 11, Figure 6). On average, the DLD group was ∼17% lower than the TLD group, and performance in the ME context was ∼19% lower than the ON context. Although the gap between the diagnostic groups was numerically larger in the ON condition, the diagnostic group by context interaction was not significant. Growth averaged 2.3% per year. The interactions by year were not significant and were removed from the final model.

Figure 6.

Growth in link recognition after learning in ostensive naming and mutual exclusivity contexts by diagnostic group (developmental language disorder, typical language development) and year. Error bars refer to standard error.

Table 11.

Fixed Effects Model of Link Recognition After Learning in Ostensive Naming or Fast Mapping Conditions.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	0.6775	0.03885	96.4	17.44	<.0001
Dx (TLD is reference)	−0.1732	0.04684	70.9	−3.70	0.0004
Sex (Male is reference)	−0.04260	0.03283	72.4	−1.30	0.1986
Context (ON is reference)	−0.1863	0.04254	70.4	−4.38	<.0001
Year	0.02264	0.008945	70.6	2.53	0.0136
Dx*Context	0.02911	0.06608	73	0.44	0.6609

Note. Dx = diagnostic group; TLD = typical language development;

ON = ostensive naming.

Receptive vocabulary predicted link recognition outcomes, and with all of the cognitive variables added to the model, the effect of diagnostic group was no longer significant, but the effects of year and context remained (Table 12).

Table 12.

Fixed Effects Model of Link Recognition After Learning in Ostensive Naming or Fast Mapping Conditions with Predictors Added.

Effect	Estimate	SE	Df	t Value	Pr > \|t\|
Intercept	−0.0766	0.2758	61.2	−0.28	0.7821
Dx (TLD is reference)	−0.0586	0.0706	62.1	−0.83	0.4097
Sex (Male is reference)	−0.0535	0.0346	57.7	−1.55	0.1275
Context (On is reference)	−0.1396	0.0479	59	−2.92	0.0050
Year	0.0236	0.0094	66.1	2.52	0.0143
Dx*Context	−0.0255	0.0728	60	−0.35	0.7274
Sustained Attention	−0.0987	0.0759	57.7	−1.30	0.1986
Vocabulary	0.0094	0.0033	56.2	2.90	0.0053
Phonological STM	0.0030	0.0017	63.8	1.75	0.0856
Verbal WM	0.0038	0.0051	57	0.73	0.4677
Visual STM	−00.0045	0.0061	56.4	−0.75	0.4585
Visual WM	0.0017	0.0053	54.8	0.32	0.7494
Processing Speed	0.0031	0.0028	57	1.11	0.2729
Nonverbal IQ	−0.0042	0.0050	58.3	58.3	0.4041
Parental Education	−0.0092	0.0076	57.7	−1.21	0.2328

Note: Dx = diagnostic group; TLD = typical language development; ON = ostensive naming; STM = short-term memory; WM = working memory.

Summary of Study 2

As predicted, the TLD group performed with higher accuracy than the DLD group on both encoding outcomes: 3AFC form and 3AFC link. Also, as predicted, both diagnostic groups demonstrated growth in link learning abilities over time; however, form learning abilities grew only for the DLD group and only in the ON context.

We had predicted a larger TLD-DLD performance gap for form than link learning, but that was the case in the ON context only. The TLD group, but not the DLD group demonstrated better form learning in the ON context than the ME context, thus the gap between the DLD and TLD groups in form learning was largest in the ON context.

Contrary to prediction, link learning varied significantly with extant receptive vocabulary size, not phonological short-term memory. Differences in receptive vocabulary (and the other cognitive variables in the model) reduced the TLD-DLD performance gap in link learning to a nonsignificant level, but the effects of time and context remained.

We were not able to account for the TLD-DLD performance gap (or year or context effects) in form learning, although numerous scores on verbal and visual cognitive tasks did predict form learning performance.

Discussion

Primary School Children with DLD Have Difficulty Learning Word Forms and Word-to-Referent Links

As a group, the children with DLD were weaker word learners than their age-mates who have typical language development. They performed less accurately on both form and link learning in all contexts—cross-situational, mutual exclusivity, and ostensive naming. Holding all other variables constant, effect sizes expressed as the difference in percent accuracy between the groups, ranged from a low of 12% to a high of 30% (Table 13). Given that the children were attempting to learn 12 words in each context, this equates to 1.4 to 3.6 fewer words for the DLD group, per year per context.

Table 13.

TLD-DLD Performance Gaps in Accuracy and Growth of Form and Link Learning Abilities.

Learning Context	Form				Link
	Accuracy		Growth		Accuracy		Growth
	%	wds	%	wds	%	wds	%	wds
Cross-situational	−17%	−2.0	0%	0	−24%	−2.9	−4%	−0.5
Mutual Exclusivity	−12%	−1.4	0%	0	−14%	−1.7	0%	0
Ostensive Naming	−30%	−3.6	6%	0.7	−17%	−2.0	0%	0

Note. Gap sizes reflect mean performance difference per year in percentage points (calculated from beta values in linear mixed models, see calculations in the OSF project folder) and mean performance difference per year in number of words (calculated by multiplying the total number of words trained [12] by the relevant percentage value).

Negative numbers indicate that the DLD group value was lower than the TLD group value; positive numbers indicate that the DLD group value was higher than the TLD group value.

DLD = developmental language disorder; TLD = typical language development; wds = number of trained words

To provide perspective, consider that Segbers and Schroeder (2017) estimated that children typically learn 1,278 new nouns per year between Grades 1 to 4, inclusive, for a total of 5,111 new nouns acquired. If we consider only the TLD-DLD learning gap as measured by link recognition (the type of task that yielded the estimate in Segbers & Schroeder), we can conclude that children with DLD learn approximately 230 fewer nouns each year (1,278 [mean # of nouns learned] x .18 [mean size of TLD-DLD learning gap] = 230) or 920 fewer nouns from first to fourth grade (230×4 = 920). This conclusion is surely wrong in the absolute sense. We have likely overestimated the size of the learning gap in terms of learning ability. Although 12 words per day is a reasonable estimate of children's learning rate (Segbers & Schroeder, 2017), it would seldom be the case that children would attempt to learn 12 completely novel words within five or 10 min, the approximate length of our trainings. It is possible that a longer period of time in which to learn the 12 words would afford greater benefit to the DLD than the TLD group. On the other hand, we may have underestimated the size of the effect in terms of words learned because the Segber and Schroeder estimates of learning per year are more conservative than prior estimates (see Anglin et al., 1993). Moreover, we have limited this study to the learning of nouns and thus compared to estimates of the accrual of nouns but, of course, children also learn verbs, adjectives, and function words.

Context Matters

In the meta-analysis conducted by Kan and Windsor (2010), the effect size for the TLD-DLD gap in word form learning was smaller than for link or semantic learning; however, that finding may be artifactual given that outcome probes with differing task demands are frequently used to measures these various aspects of word knowledge. Indeed, in two previous studies where the type of outcome probe was controlled (Jackson et al., 2021; McGregor, Arbisi-Kelm et al., 2020), the TLD-DLD gap in form learning was larger than link or semantic learning.

In the current study, where all outcomes were measured with 3AFC recognition tasks, we found a third pattern: the relative difficulty of form vs. link learning varied with context (Table 13). The TLD-DLD gap for link learning varied from 14% to 24% across contexts; whereas the gap for form learning varied from 12% to 30%. In other words, form learning represented both the smallest challenge and the largest challenge for the DLD groups. Moreover, in the mutual exclusivity context, the TLD-DLD gap for form and link learning was nearly identical, 12% and 14%, respectively. When task type is held constant, as in the current study, there is no evidence to suggest that form is consistently more difficult to learn than link, or vice versa. That said, when form learning is tapped via naming tasks, the TLD-DLD gap tends to be especially large (e.g., Haebig et al., 2019; Leonard et al., 2019, 2021; Souto et al., 2025).

It is notable that the largest TLD-DLD gap was for form learning in the ostensive naming condition. At first blush, this finding may seem counterintuitive because task demands seem lower in the ostensive naming than mutual exclusivity or cross-situational contexts. Ostensive naming involves direct teaching. Compared to the other two contexts, it highlights the target referent in a more obvious manner and requires no inferences. Moreover, ostensive naming enables explicit learning because the goal is clear: try to remember these new words.

That said, the ostensive naming context allowed the learner to be passive whereas the mutual exclusivity and cross-situational contexts obligated a response on each training trial. That response was not an attempt at form retrieval but, rather, an attempt to link the form to its referent (in the cross-situational context) or to answer a question about the referent (in the mutual exclusivity context). Therefore, the smaller TLD-DLD form learning gap in the mutual exclusivity and cross-situational contexts had nothing to do with overt naming practice but, perhaps, the need to pay attention, engage with the material, and make decisions. Such “desirable difficulty” may have helped the learners with DLD to establish more, or more robust, encoding and retrieval routes, (Bjork, 1994; Bjork & Bjork, 2011; Vlach & Sandhofer, 2010). In McGregor, Gordon et al. (2017), we found a similar outcome. Adults with DLD and their peers with TLD learned novel words passively or actively (in this case via retrieval practice). When naming accuracy was probed the next day, there remained a significant TLD-DLD performance gap for the words trained passively, but the gap for words trained actively had closed. The problematic learning of word form targets in the context of passive exposure may be a signature of DLD (see also Leonard et al., 2021).

This finding holds clinical implications for assessment and treatment. First, it suggests a route to the development of effective dynamic assessments of DLD. In many dynamic assessments, a test-teach-retest approach is used. Given the results of the current study, a useful initial test could be a brief, indirect teaching episode (e.g., determining what the child can learn from several mutual exclusivity exposures). The teaching segment would involve direct instruction that requires only passive listening on the part of the child. The final test probes the word learning outcomes. The current study suggests that, relative to the mutual exclusivity exposures, a typical learner will show enhanced word form learning with direct albeit passive instruction, but a person with DLD will not.

Because this approach to dynamic assessment involves learning novel words, it should be robust across language communities. For example, Kapantzoglou et al. (2012) found that a link recognition probe (termed word identification in the study) administered after direct teaching of three novel words and their referents distinguished bilinguals with DLD and TLD with a sensitivity of 76.9% and a specificity of 80%. Dynamic assessment of word learning should also be robust across communities defined by socioeconomics. Although, on average, children from low-resource communities have lower scores on measures of extant vocabulary size than their better-resourced peers (Guo & Harris, 2000; Lervåg et al., 2019; Norbury et al., 2021; Sirin & Rogers-Sirin, 2005), this relationship does not extend to learning ability (Nikolaeva, 2025). In the current study, years of parent education, a proxy for socioeconomic environment, never predicted learning performance. In short, dynamic assessments are a way to pull apart language learning abilities from vocabulary knowledge as measured by static tests, which we know are often biased (e.g., Stankova et al., 2021; Stockman, 2010). For those who wish to use dynamic assessments with evidence-based cut-points to aid diagnosis, standardized options are available (Petersen et al., 2024; Seymour et al., 2005).

It may be constructive to consider language treatment in light of the finding that children with DLD benefit less from direct teaching than their peers with TLD. Language treatments involve “intentional, systematic actions taken to accelerate, modify, or compensate for inadequate performance, beyond the supports provided in a typical learning environment” (Ukrainetz, 2024, p. 43). The treatment context is often constructed to support explicit learning by ensuring awareness of the learning goal, focused attention, and visual supports. The weak response to ostensive naming in the current study does not mean that language interventionists should abandon direct instruction, but it does suggest that the approach to direct instruction matters. If children with DLD are to benefit, active practice (which was absent in the current study) may be required. Other best practices to consider are optimally sized target sets (which were high and equivalent for DLD and TLD groups here), rich vocabulary instruction (which was absent in the current study), and sufficiently high dosage (which was low and equivalent for DLD and TLD groups here) (Ardanouy et al., 2023; Frizelle et al., 2021; Gordon et al., 2025; Levlin et al., 2022; Peters-Sanders et al., 2019; Storkel et al., 2019). For example, substantially higher cumulative exposures (e.g., 36 in Storkel et al., 2017) may optimize some vocabulary outcomes in young children with DLD (Frizelle et al., 2021).

Would application of any of these best practices reduce the TLD-DLD learning ability gap, or shift both groups upward while preserving the gap? The question should be tested directly, but extant evidence suggests the former. The active learning conditions in McGregor, Gordon et al. (2017) and Haebig et al. (2019) did serve to close the gap. This is, of course, the goal of language intervention.

Primary School Children with DLD Demonstrate age-Appropriate Growth in Word Learning Abilities

We predicted that growth rates in word learning abilities would be similar in the two diagnostic groups and, in six of eight outcome measures, that prediction held. The exceptions were link learning in the cross-situational context, which was characterized by a more modest growth rate for the DLD than the TLD group—about 4% lower— and form learning in the ostensive naming context, which was characterized by a more robust for the DLD than TLD group—about 6% higher (Table 13). We conclude that word learning abilities among primary school students with TLD and DLD grow at roughly comparable rates and the comparable growth in the ability to learn words is key to understanding how children with DLD are able to accumulate vocabulary knowledge at rates that are similar to their peers (McGregor et al., 2013; Norbury et al., 2021; Rice & Hoffman, 2015) despite slower learning over brief time spans (Gray, 2003; 2004). Thus, it seems that, given equivalent opportunities, they learn fewer words, but, each year, both groups of learners improve in their ability to learn words by a similar amount. In that way, the gap between the groups that emerged at some point between the onset of first words and age 2;6 (Rice & Hoffman, 2015) remains consistent.

The pattern would not have to work out this way. Theoretically, children with DLD could catch up to their peers, but by age four or five years, language learning abilities canalize (Bornstein, 2014; Hayiou-Thomas et al., 2014; McKean et al., 2017; Ukoumunne et al., 2012). Thus, it is hard to imagine how primary school children with DLD would be able to develop at a faster rate than typical, even within the context of language intervention. Alternatively, they could not only start later but also develop word learning abilities more slowly and, as a result, the TLD-DLD vocabulary gap would widen over time. Although this is not the case during the primary and secondary school years, it is interesting that Rice and Hoffman (2015) found a relative slowing of vocabulary accrual on the part of adolescents and young adults with DLD. Perhaps the development of word learning abilities asymptotes in adulthood or, perhaps, after leaving school, adults find themselves in more disparate educational and professional environments such that opportunities to learn new words are more limited for learners with DLD. The ultimate test of this account will require a longitudinal study of the development of word learning abilities that spans toddlerhood to adulthood.

Phonological Short-Term Memory and Extant Receptive Vocabulary Partially Account for the Word Learning Problems That Characterize DLD

Although several aspects of visual cognition were relevant to word learning abilities in some contexts, phonological short-term memory and extant receptive vocabulary knowledge were the most robust predictors of performance. Phonological short-term memory, here measured by accuracy of nonword repetition, accounted for variation in the ability to learn word forms in the ON/ME contexts and links in the ON/ME and CS contexts (Table 14). This outcome accords with that reported in Adlof and Patten (2017). They provided children, ages 5 to 12 years, with direct training of six disyllabic words and their referents via a script that included semantic description, 21 exposures to each target word, and three chances to produce each word. After controlling age and extant vocabulary knowledge, nonword repetition performance accounted for 8% of the variance in word form recognition (called phonological recognition in their study) and 13% of the variance in link recognition (called semantic recognition in their study).

Table 14.

Summary of Significant Predictors of Form and Link Outcomes in Cross-Situational (CS), Ostensive Naming (ON), and Fast Mapping (ME) Contexts.

Cognitive Predictor	Aspect of Word to Be Learned
	Form	Link
Receptive Vocabulary	CS	CA, ON/ME
Sustained Attention
Phonological Short-Term Memory	ON/ME	CS, ON/ME
Verbal Working Memory	ON/ME	CS
Visual Short-Term Memory	ON/ME
Visual Working Memory	CS, ON/ME
Visual Processing Speed	ON/ME
Nonverbal IQ	ON/ME

Phonological short-term memory has long been recognized as critical to word learning (Archibald & Joanisse, 2013; Gathercole and Baddeley, 1990; Gathercole et al., 1997). Magnetoencephalography recordings made during word learning suggest that individuals with stronger phonological short-term memory abilities are better able than others to process longer words, resist interference during word processing, and learn words more quickly (Ylinen et al., 2020). Many, but not all, people with DLD have phonological short-term memory limitations (Archibald & Joanisse, 2009), and many, but not all, have word learning problems (Gray, 2004; McGregor, Arbisi-Kelm et al., 2017). Those with phonological short-term memory limitations and those with poor word learning are largely overlapping subsets of the population (Jackson et al., 2021).

Extant receptive vocabulary also accounted for variation in the ability to learn word forms (in the cross-situational context) and word-to-referent links (in the mutual exclusivity and ostensive naming contexts). This finding accords with Gray (2004) who reported that preschoolers’ scores on the Peabody Picture Vocabulary Test-III correlated with post-training comprehension outcomes when probed with a task akin to our link recognition probe as well as word form outcomes when probed with a naming task. Receptive vocabulary scores also correlated with word form recognition performance for newly learned object labels in a study of preschoolers with DLD and TLD conducted by Alt et al. (2004) and with word form production performance in a similar study by Jackson et al. (2016).

The relationship between word learning and vocabulary knowledge is reciprocal. The better the learner, the more words learned, and, conversely, the more extensive the word knowledge, the better the subsequent learning. The influence of extant vocabulary on subsequent vocabulary learning is evident in studies demonstrating, for example, that learning outcomes are better when target word forms are similar to known words and target referents are similar to known referents (Borovsky & Elman, 2006; Hoover et al., 2010; James et al., 2021; McKean et al., 2013; Vitevitch et al., 2014).

Receptive vocabulary and phonological short-term memory work together to support word learning, and this is increasingly the case beyond the preschool years (Baddeley, 2003). In the Embedded-Process Model (Cowan, 2017; 2022), short-term memory is a response to a change in the verbal environment, for example, hearing a word. That stimulus, once perceived, activates relevant knowledge in long term lexical store, including similar structures (e.g., partially shared segments, phonotactics, and prosodic patterns) and meanings (e.g., partially shared functions, physical features, roles in event structures). Comprehension occurs when the activated knowledge matches the input. Learning begins when the input and activated long-term representations do not fully match (Schwering & MacDonald, 2020). In the case of novel word learning, phonological short-term memory supports word form learning by maintaining active representations of individual phonemes in a precise order so that the whole can be mapped onto its referent. A child with a rich vocabulary and sufficient knowledge of the phonotactic properties of their language will have stronger, more robust activated long-term memories on which to encode the new word form and to situate its referent.

Although phonological short-term memory and extant receptive vocabulary size were robust predictors, word learning depends on a complex of additional mechanisms. Visual memory abilities and visual processing speed played a role in some cases, as did nonverbal IQ, which we measured with a visual matrices task (Table 14). Words are learned in a visual world and many word referents are physical objects, features of visual objects, or visually salient actions that must be parsed and linked to word forms. However, that does not make for a satisfactory explanation of the current results because the scores on visual measures accounted for variance in form learning, not link learning. One could argue that visual attention directed to the mouth of the speaker facilitates word form learning—and it does (Fort et al., 2012; Tsang et al., 2018)—but that too fails as an explanation here given that all word forms were presented by the computer in the auditory modality only.

While searching for an explanation, it is useful to consider that seemingly visual measures of cognitive ability engage other aspects of cognition including language (Lancaster et al., 2025). Moreover, verbalization boosts performance on visual tasks and children with DLD are less able than their peers with TLD to leverage verbal mediation to this end (Arslan et al., 2020). Thus, it is possible that the relationships between visual cognition and form learning abilities reduces to additional evidence of the role of verbal cognition.

Finally, it is important to note that the various cognitive processes considered in this longitudinal study of growth in word-learning ability do, themselves, grow over developmental time. Our measures were taken in Year 1 of the study and, thus, they provide a useful baseline of TLD-DLD differences in word-learning mechanisms and the extent to which those mechanisms are associated not only with concurrent but also later word-learning ability. In the future, it will be useful to determine whether these mechanisms develop at similar rates in the two groups and whether their contribution to word-learning remains stable over time.

Limitations and Future Directions

This study is limited in a number of ways. Ours was a small sample, thus it is imperative that our findings are replicated in the future. The slower growth rate for link learning in the cross-situational naming context and the faster growth rate for form learning in the ostensive naming context on the part of the DLD group, if replicated, require explanation. Differential growth rates in the underlying cognitive mechanisms that support link and form learning in these contexts would seem the right place to begin.

Another limitation is that the words to be learned were all labels for object referents. This was a good decision in that noun referents comprise the bulk of word learning in the age range sampled here (Segbers & Schroeder, 2017). However, one should not generalize conclusions based on noun learning to the learning of other word classes or referent types. A third limitation is that we covered only the first four years of formal education and, because word learning abilities change in a nonlinear fashion (Ravid et al., 2020), a longer longitudinal study of growth trajectories will be necessary if we are to fully understand the word learning limitations associated with DLD.

There also remains more work to be done if we are to understand the cognitive underpinnings of the word learning problem associated with DLD. A suitable explanation for the role of visual mechanisms is necessary. Moreover, in several instances, the effect of diagnostic group remained even after entering scores on the eight cognitive measures that we included based on our review of the literature, suggesting that there are yet other mechanisms that we have not accounted for.

The current outcomes also highlight the need to expand the variety of word learning contexts that are included in research studies. The overwhelming number of studies examining word learning in direct teaching contexts in the DLD literature may have skewed our understanding of word learning abilities in this population. Research conducted on incidental learning in more ecologically valid environments will be especially pertinent.

Finally, the form-learning difficulties of the DLD group in the direct teaching contexts suggest that we can leverage such contexts for dynamic assessments. We stress that, while dynamic assessments are essential for populations who are mis-served by current assessments of static knowledge, they are likely to be best practice for the assessment of all people with DLD. We must continue to develop dynamic assessments because these, by definition, tap learning ability. At its roots, DLD is a learning problem.

Conclusions

To conclude, in this the first longitudinal study of spoken word learning abilities in the DLD population, we have garnered evidence of robust gaps in learning ability relative to peers with TLD but, at least in the primary school years, roughly equivalent rates of improvement in those abilities. Both word forms and the linking of forms to their semantic referents are vulnerable learning targets, and children with DLD who present with weak phonological short-term memory and a small extant receptive vocabulary are more likely than others to demonstrate these learning problems. Finally, not all word learning contexts are equivalent and that variation turns out to be more important than we, as a field, had realized.

Supplemental Material

sj-docx-1-dli-10.1177_23969415261448861 - Supplemental material for Word Learning Ability Varies Across Contexts and Time: A Longitudinal Study of Primary School Children with Developmental Language Disorder

Supplemental material, sj-docx-1-dli-10.1177_23969415261448861 for Word Learning Ability Varies Across Contexts and Time: A Longitudinal Study of Primary School Children with Developmental Language Disorder by Karla K. McGregor, Nichole Eden, Timothy Arbisi-Kelm and Jacob Oleson in Autism & Developmental Language Impairments

Footnotes

Acknowledgments

We are forever thankful for the generosity of the children and caregivers who participated in this project over the course of four years. The project benefited from the input of Drs. Angela AuBuchon, Katherine Gordon, Justin Kueser, Julia Nikolaeva, Ron Pomper, Claire Selin, and Erin Smolak.

ORCID iDs

Karla K. McGregor

Nichole Eden

Timothy Arbisi-Kelm

Ethical Considerations

Ethical approval: This research was approved by the Internal Review Board of Boys Town National Research Hospital (approval no. 17-04-XP) on June 21, 2017.

Consent to participate: Written informed consent to participate in this study was provided by the participants’ legal guardians/next of kin. Participants gave verbal assent.

Author Contributions

Conceptualization: KKM; Data curation: TAK, JO; Formal analysis: JO; Funding acquisition: KM; Investigation: NE, TAK; Methodology: KKM, TAK; Project administration: NE; Software: TAK; Supervision: KKM; Validation: NE, TAK: Visualization: JO; Writing-Original: KKM; Writing-Review and Editing: KKM, TAK, JO.

Funding

This research was supported by the National Institute on Deafness and Other Communication Disorders of the National Institutes of Health under (award no. 2R01DC011742, K. McGregor, P.I.). Members of the Technical Core at Boys Town National Research Hospital were instrumental in moving the data collection to a virtual platform during the COVID19 pandemic. They are funded by the National Institute of General Medical Sciences of the National Institutes of Health under award number P20GM109023 L. Liebold, P.I.) The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Institutes of Health, (grant number 2R01DC011742, P20GM109023).

Declaration of Conflicting Interest

Salary support for the authors was provided by the National Institutes of Health, Boys Town National Research Hospital (KKM, NE, TAK) and the University of Iowa (JO). Karla McGregor conducts volunteer advocacy work for the DLD community as the Chair of DLDandMe.org.

Data Availability Statement

All de-identified data and statistical analysis code are available at McGregor, K. K., Eden, N., Arbisi-Kelm, T., & Oleson, J. (2025, November 28). Dynamics of Word Learning. Retrieved from osf.io/r56s7

Supplemental Material

Supplemental material for this article is available online.

References

Adlof

S. M.

Patten

(2017). Nonword repetition and vocabulary knowledge as predictors of children's phonological and semantic word learning. Journal of Speech, Language, and Hearing Research, 60(3), 682–693. https://doi.org/10.1044/2016_JSLHR-L-15-0441

Ahufinger

Guerra

Ferinu

Andreu

Sanz-Torrent

(2021). Cross-situational statistical learning in children with developmental language disorder. Language, Cognition and Neuroscience, 36(9), 1180–1200. https://doi.org/10.1080/23273798.2021.1922723

Alloway

T. P.

(2007). Automated Working Memory Assessment. Pearson Assessment.

Alt

Gray

Hogan

T. P.

Schlesinger

Cowan

(2019). Spoken word learning differences among children with Dyslexia, concomitant Dyslexia and Developmental Language Disorder, and typical development. Language, Speech, and Hearing Services in Schools, 50(4), 540–561. https://doi.org/10.1044/2019_LSHSS-VOIA-18-0138

Alt

Plante

Creusere

(2004). Semantic features in fast-mapping. Journal of Speech, Language, and Hearing Research, 47(2), 407–420. https://doi.org/10.1044/1092-4388(2004/033)

American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders, Fifth Edition. https://doi.org/10.1176/appi.books.9780890425596

Anglin

J. M.

Miller

G. A.

Wakefield

P. C.

(1993). Vocabulary development: A morphological analysis. Monographs of the Society for Research in Child Development, 58(10), i-186. https://doi.org/10.2307/1166112

Archibald

L. M.

Joanisse

M. F.

(2009). On the sensitivity and specificity of nonword repetition and sentence recall to language and memory impairments in children. Journal of Speech, Language, and Hearing Research, 52(4), 899–914. https://doi.org/10.1044/1092-4388(2009/08-0099)

Archibald

L. M.

Joanisse

M. F.

(2013). Domain-specific and domain-general constraints on word and sequence learning. Memory & Cognition, 41(2), 268–280. https://doi.org/10.3758/s13421-012-0259-4

10.

Ardanouy

Delage

Zesiger

(2023). Effectiveness of a group intervention for lexical enrichment in 6-to-10-year-old children with developmental language disorder. Child Language Teaching and Therapy, 39(3), 218–233. https://doi.org/10.1177/02656590231188523

11.

Arslan

Broc

Mathy

(2020). Lower verbalizability of visual stimuli modulates differences in estimates of working memory capacity between children with and without developmental language disorders. Autism & Developmental Language Impairments, 5, https://doi.org/10.1177/2396941520945519

12.

Baddeley

(2003). Working memory and language: An overview. Journal of Communication Disorders, 36(3), 189–208. https://doi.org/10.1016/S0021-9924(03)00019-4

13.

Barak

Degani

Novogrodsky

(2022). Influences of bilingualism and developmental language disorder on how children learn and process words. Developmental Psychology, 58(5), 821. https://doi 10.1037/dev0001324 https://doi.org/10.1037/dev0001324

14.

Becker

T. C.

McGregor

K. K.

(2016). Learning by listening to lectures is a challenge for college students with developmental language impairment. Journal of Communication Disorders, 64, 32–44. https://doi.org/10.1016/j.jcomdis.2016.09.001

15.

Benham

Goffman

Schweickert

(2018). An application of network science to phonological sequence learning in children with developmental language disorder. Journal of Speech, Language, and Hearing Research, 61(9), 2275–2291. https://doi.org/10.1044/2018_JSLHR-L-18-0036

16.

Bjork

E. L.

Bjork

R. A.

(2011). Making things hard on yourself, but in a good way: Creating desirable difficulties to enhance learning. In Gernsbacher

M. A.

Pew

R. W.

Hough

L. M.

Pomerantz

J. R.

(Eds.), Psychology and the real world: Essays illustrating fundamental contributions to society, 2 (pp. 56–64). Worth Publishers.

17.

Bjork

R. A.

(1994). Memory and metamemory considerations in the training of human beings. In Metcalfe

Shimamura

(Eds.), Metacognition: Knowing about knowing (pp. 185–205). MIT Press.

18.

Bloom

(2002). How children learn the meanings of words. MIT press.

19.

Bornstein

M. H.

(2014). Human infancy… and the rest of the lifespan. Annual Review of Psychology, 65, 121. https://doi.org/10.1146/annurev-psych-120710-100359

20.

Borovsky

Elman

(2006). Language input and semantic categories: A relation between cognition and early word learning. Journal of Child Language, 33(4), 759–790. https://doi.org/10.1017/S0305000906007574

21.

Broedelet

Boersma

Rispens

(2023). Implicit cross-situational word learning in children with and without developmental language disorder and its relation to lexical-semantic knowledge. Frontiers in Communication, 8, 1021654. https://doi.org/10.3389/fcomm.2023.1021654

22.

Calabrese

Hedger

Pritchard

Stojanovik

Pagnamenta

(2025). Word learning in children with developmental language disorder: A meta-analysis testing the encoding hypothesis. Journal of Memory and Language, 145, 104678. https://doi.org/10.1016/j.jml.2025.104678

23.

Carey

(2010). Beyond fast mapping. Language Learning and Development, 6(3), 184–205. https://doi.org/10.1080/15475441.2010.484379

24.

Conti-Ramsden

Durkin

Toseeb

Botting

Pickles

(2018). Education and employment outcomes of young adults with a history of Developmental Language Disorder. International Journal of Language & Communication Disorders, 53(2), 237–255. https://doi.org/10.1111/1460-6984.12338

25.

Cowan

(2017). The many faces of working memory and short-term storage. Psychonomic Bulletin & Review, 24(4), 1158–1170. https://doi.org/10.3758/s13423-016-1191-6

26.

Cowan

(2022). Working memory development: A 50-year assessment of research and underlying theories. Cognition, 224, 105075. https://doi.org/10.1016/j.cognition.2022.105075

27.

Davis

M. H.

Gaskell

M. G.

(2009). A complementary systems account of word learning: Neural and behavioural evidence. Philosophical Transactions of the Royal Society B: Biological Sciences, 364(1536), 3773–3800. https://doi.org/10.1098/rstb.2009.0111

28.

Dollaghan

Campbell

T. F.

(1998). Nonword repetition and child language impairment. Journal of Speech, Language, and Hearing Research, 41(5), 1136–1146. https://doi.org/10.1044/jslhr.4105.1136

29.

Dudai

(2004). The neurobiology of consolidations, or, how stable is the engram? Annual Review of Psychology, 55(1), 51–86. https://doi.org/10.1146/annurev.psych.55.090902.142050

30.

Dumay

Gaskell

M. G.

(2007). Sleep-associated changes in the mental representation of spoken words. Psychological Science, 18(1), 35–39. https://doi.org/10.1111/j.1467-9280.2007.01845.x

31.

Erickson

L. C.

Thiessen

E. D.

Godwin

K. E.

Dickerson

J. P.

Fisher

A. V.

(2015). Endogenously and exogenously driven selective sustained attention: Contributions to learning in kindergarten children. Journal of Experimental Child Psychology, 138, 126–134. https://doi.org/10.1016/j.jecp.2015.04.011

32.

Factor

Goffman

(2022). Phonological characteristics of novel gesture production in children with developmental language disorder: Longitudinal findings. Applied Psycholinguistics, 43(2), 333–362. https://doi.org/10.1017/S0142716421000540

33.

Farrell Pagulayan

Busch

R. M.

Medina

K. L.

Bartok

J. A.

Krikorian

(2006). Developmental normative data for the Corsi block-tapping task. Journal of Clinical and Experimental Neuropsychology, 28(6), 1043–1052. https://doi.org/10.1080/13803390500350977

34.

Fisher

Thiessen

Godwin

Kloos

Dickerson

(2013). Assessing selective sustained attention in 3- to 5-year-old children: Evidence from a new paradigm. Journal of Experimental Child Psychology, 114(2), 275–294. https://doi.org/10.1016/j.jecp.2012.07.006

35.

Fort

Kandel

Chipot

Savariaux

Granjon

Spinelli

(2012). Seeing the initial articulatory gestures of a word triggers lexical access. Language and Cognitive Processes, 28(8), 1207–1223. https://doi.org/10.1080/01690965.2012.701758

36.

Frizelle

Tolonen

A. K.

Tulip

Murphy

C. A.

Saldana

McKean

(2021). The influence of quantitative intervention dosage on oral language outcomes for children with developmental language disorder: A systematic review and narrative synthesis. Language, Speech, and Hearing Services in Schools, 52(2), 738–754. https://doi.org/10.1044/2020_LSHSS-20-00058

37.

Gais

Lucas

Born

(2006). Sleep after learning aids memory recall. Learning & Memory, 13(3), 259–262. http://www.learnmem.org/cgi/doi/10.1101/NA. https://doi.org/10.1101/lm.132106

38.

Gathercole

S. E.

Baddeley

A. D.

(1990). The role of phonological memory in vocabulary acquisition: A study of young children learning new names. British Journal of Psychology, 81(4), 439–454. https://doi.org/10.1111/j.2044-8295.1990.tb02371.x

39.

Gathercole

S. E.

Hitch

G. J.

Martin

A. J.

(1997). Phonological short-term memory and new word learning in children. Developmental Psychology, 33(6), 966. https://doi.org/10.1037/0012-1649.33.6.966

40.

Gershon

R. C.

Slotkin

Manly

J. J.

Blitz

D. L.

Beaumont

J. L.

Schnipke

Wallner-Allen

Golinkoff

R. M.

Gleason

J. B.

Hirsh-Pasek

Adams

M. J.

Weintraub

(2013). Iv. NIH toolbox cognition battery (CB): Measuring language (vocabulary comprehension and Reading decoding). Monographs of the Society for Research in Child Development, 78(4), 49–69. https://doi.org/10.1111/mono.12034

41.

Ghawi-Dakwar

Saiegh-Haddad

(2025). Word learning in Arabic diglossia in children with typical language development and developmental language disorder. Journal of Speech, Language, and Hearing Research, 68(3S), 1533–1551. https://doi.org/10.1044/2024_JSLHR-23-00618

42.

Gillam

R. B.

Pearson

N. A.

(2017). Test of Narrative Language–Second Edition. Pro-Ed.

43.

Gordon

K. G.

McGregor

K. K.

(2014). A spatially-supported forced-choice recognition test reveals children’s long-term memory for newly learned word forms. Frontiers in Language Sciences, 5(164), 1–12. https://doi.org/10.3389/fpsyg.2014.00164

44.

Gordon

K. R.

Moss

Swinburne Romine

R. E.

Fleming

K. K.

Storkel

H. L.

(2025). Interactive book Reading to accelerate word learning by kindergarten children with developmental language disorder: Incorporating retrieval-based practice into training. Language, Speech, and Hearing Services in Schools, 56(4), 1110–1125. https://doi.org/10.1044/2025_LSHSS-24-00147

45.

Gray

(2003). Word-learning by preschoolers with specific language impairment: What predicts success?. Journal of Speech, Language, and Hearing Research, 46(1), 56–67. https://doi.org/10.1044/1092-4388(2003/005)

46.

Gray

(2004). Word-learning by preschoolers with specific language impairment: Predictors and poor learners. Journal of Speech, Language, and Hearing Research, 47(5), 1117–1132. https://doi.org/10.1044/1092-4388(2004/083)

47.

Guo

Harris

K. M.

(2000). The mechanisms mediating the effects of poverty on children’s intellectual development. Demography, 37(4), 431–447. https://doi.org/10.1353/dem.2000.0005

48.

Haebig

Leonard

L. B.

Deevy

Karpicke

Christ

S. L.

Usler

, …, & Weber

(2019). Retrieval-based word learning in young typically developing children and children with development language disorder II: A comparison of retrieval schedules. Journal of Speech, Language, and Hearing Research, 62(4), 944–964. https://doi.org/10.1044/2018_JSLHR-L-18-0071

49.

Hair

Black

Babin

Anderson

(2009). Multivariate data analysis (7th ed). Pearson Prentice Hall.

50.

Hayiou-Thomas

M. E.

Dale

P. S.

Plomin

(2014). Language impairment from 4 to 12 years: Prediction and etiology. Journal of Speech, Language, and Hearing Research, 57(3), 850–864. https://doi.org/10.1044/2013_JSLHR-L-12-0240

51.

Henry

L. A.

(2001). How does the severity of a learning disability affect working memory performance? Memory (Hove, England), 9(4-6), 233–247. https://doi.org/10.1080/09658210042000085

52.

Hoover

J. R.

Storkel

H. L.

Hogan

T. P.

(2010). A cross-sectional comparison of the effects of phonotactic probability and neighborhood density on word learning by preschool children. Journal of Memory and Language, 63(1), 100–116. https://doi.org/10.1016/j.jml.2010.02.003

53.

Jackson

Leitao

Claessen

(2016). The relationship between phonological short-term memory, receptive vocabulary, and fast mapping in children with specific language impairment. International Journal of Language & Communication Disorders, 51(1), 61–73. https://doi.org/10.1111/1460-6984.12185

54.

Jackson

Leitão

Claessen

Boyes

(2019). The evaluation of word-learning abilities in people with developmental language disorder: A scoping review. International Journal of Language & Communication Disorders, 54(5), 742–755. https://doi.org/10.1111/1460-6984.12490

55.

Jackson

Leitão

Claessen

Boyes

(2021). Word learning and verbal working memory in children with developmental language disorder. Autism & Developmental Language Impairments, 6, 1–20. https://doi.org/10.1177/23969415211004109

56.

James

Gaskell

M. G.

Pearce

Korell

Dean

Henderson

L. M.

(2021). The role of prior lexical knowledge in children's and adults’ incidental word learning from illustrated stories. Journal of Experimental Psychology: Learning, Memory, and Cognition, 47(11), 1856–1869. https://doi-org.proxy.lib.uiowa.edu/10.1037/xlm0001080. https://doi.org/10.1037/xlm0001080

57.

Kan

P. F.

(2024). Word learning in bilingual children at risk for developmental language disorder. American Journal of Speech-Language Pathology, 33(6), 2746–2766. https://doi.org/10.1044/2024_AJSLP-23-00489

58.

Kan

P. F.

Windsor

(2010). Word learning in children with primary language impairment: A meta-analysis. Journal of Speech, Language, and Hearing Research, 53(3), 739–756. https://doi.org/10.1044/1092-4388(2009/08-0248)

59.

Kapa

L. L.

Erikson

J. A.

(2020). The relationship between word learning and executive function in preschoolers with and without developmental language disorder. Journal of Speech, Language, and Hearing Research, 63(7), 2293–2307. https://doi.org/10.1044/2020_JSLHR-19-00342

60.

Kapantzoglou

Restrepo

M. A.

Thompson

M. S.

(2012). Dynamic assessment of word learning skills: Identifying language impairment in bilingual children. Language, Speech, and Hearing Services in Schools, 43(1), 81–96. https://doi.org/10.1044/0161-1461(2011/10-0095)

61.

Krzemien

Thibaut

J. P.

Jemel

Levaux

Maillart

(2021). How do children with developmental language disorder extend novel nouns?. Journal of Experimental Child Psychology, 202, 1–24. https://doi.org/10.1016/j.jecp.2020.105010

62.

Lancaster

H. S.

Smolak

Milne

Gordon

K. R.

Emerson

S. N.

Selin

(2025). Analyzing the impact of four cognitive constructs on nonverbal intelligence test performance: Implications for children with neurodevelopmental disorders. Language, Speech, and Hearing Services in Schools, 56(3), 834–846. https://doi.org/10.1044/2025_LSHSS-24-00056

63.

Lee

J. L.

Nader

Schiller

(2017). An update on memory reconsolidation updating. Trends in Cognitive Sciences, 21(7), 531–545. https://doi-org.proxy.lib.uiowa.edu/10.1016/j.tics.2017.04.006. https://doi.org/10.1016/j.tics.2017.04.006

64.

Leonard

L. B.

Christ

S. L.

Deevy

Karpicke

J. D.

Weber

Haebig

, …, & Krok, W. (2021). A multi-study examination of the role of repeated spaced retrieval in the word learning of children with developmental language disorder. Journal of Neurodevelopmental Disorders, 13(1), 1–16. https://doi.org/10.1186/s11689-021-09368-z

65.

Leonard

L. B.

Karpicke

Deevy

Weber

Christ

Haebig

, …, & Krok, W. (2019). Retrieval-based word learning in young typically developing children and children with developmental language disorder I: The benefits of repeated retrieval. Journal of Speech, Language, and Hearing Research, 62(4), 932–943. https://doi.org/10.1186/s11689-021-09368-z

66.

Lervåg

Dolean

Tincas

Melby-Lervåg

(2019). Socioeconomic background, nonverbal IQ and school absence affects the development of vocabulary and Reading comprehension in children living in severe poverty. Developmental Science, 22(5), e12858. https://doi.org/10.1111/desc.12858

67.

Levlin

Wiklund-Hörnqvist

Sandgren

Karlsson

Jonsson

(2022). Evaluating the effect of rich vocabulary instruction and retrieval practice on the classroom vocabulary skills of children with (developmental) language disorder. Language, Speech, and Hearing Services in Schools, 53(2), 542–560. https://doi.org/10.1044/2021_LSHSS-21-00101

68.

Lewis

Cristiano

Lake

B. M.

Kwan

Frank

M. C.

(2020). The role of developmental change and linguistic experience in the mutual exclusivity effect. Cognition, 198, 104191. https://doi.org/10.1016/j.cognition.2020.104191

69.

Marulis

L. M.

Neuman

S. B.

(2010). The effects of vocabulary intervention on young children’s word learning: A meta-analysis. Review of Educational Research, 80(3), 300–335. https://doi.org/10.3102/0034654310377087

70.

Mcgregor

K. K.

Arbisi-Kelm

Eden

(2017). The encoding of word forms into memory may be challenging for college students with developmental language impairment. International Journal of Speech-Language Pathology, 19(1), 43–57. https://doi.org/10.3109/17549507.2016.1159337

71.

Mcgregor

K. K.

Arbisi-Kelm

Eden

Oleson

(2020). The word learning profile of adults with developmental language disorder. Autism and Developmental Language Impairments, 5, 1–19. https://doi.org/10.1177/2396941519899311

72.

Mcgregor

K. K.

Eden

Arbisi-Kelm

Oleson

(2020). The fast-mapping abilities of adults with developmental language disorder. Journal of Speech, Language, and Hearing Research, 63(9), 3117–3129. https://doi.org/10.1044/2020_JSLHR-19-00418

73.

Mcgregor

K. K.

Eden

Arbisi-Kelm

Oleson

(2026), April 20. Dynamics of Word Learning, Retrieved from osf.io/r56s7

74.

Mcgregor

K. K.

Gordon

Eden

Arbisi-Kelm

Oleson

(2017). Encoding deficits impede word learning and memory in adults with developmental language disorders. Journal of Speech, Language, and Hearing Research, 60(10), 2891–2905. https://doi.org/10.1044/2017_JSLHR-L-17-0031

75.

Mcgregor

K. K.

Ohlmann

Eden

Arbisi-Kelm

Young

(2023). Abilities and disabilities among children with developmental language disorder. Language, Speech, and Hearing Services in Schools, 54(3), 927–951. https://doi.org/10.1044/2023_LSHSS-22-00070

76.

Mcgregor

K. K.

Oleson

Bahnsen

Duff

(2013). Children with developmental language impairment have vocabulary deficits characterized by limited breadth and depth. International Journal of Language and Communication Disorders, 48(3), 307–319. https://doi.org/10.1111/1460-6984.12008

77.

Mcgregor

K. K.

Pomper

Eden

Appenzeller

Arbisi-Kelm

Polese

Reed

D. K.

(2024). Inferring word class and meaning from spoken and written texts: A comparison of children with and without developmental language disorder. Journal of Speech, Language, and Hearing Research, 67(12), 4783–4798. https://doi.org/10.1044/2024_JSLHR-23-00743

78.

Mcgregor

K. K.

Pomper

Eden

Arbisi-Kelm

Ohlmann

Gajre

Smolak

(2021). Children’s language abilities predict success in remote communication contexts. Language Development Research, 1(1), 245. https://doi.org/10.34842/8jgf-r802

79.

Mcgregor

K K

Smolak

Jones

Oleson

Eden

Arbisi-Kelm

Pomper

(2022). What children with developmental language disorder teach us about cross-situational word learning. Cognitive Science, 46(2), e13094. https://doi.org/10.1111/cogs.13094

80.

McKean

Letts

Howard

(2013). Functional reorganization in the developing lexicon: Separable and changing influences of lexical and phonological variables on children's fast-mapping. Journal of Child Language, 40(2), 307–335. https://doi.org/10.1017/S0305000911000444

81.

McKean

Wraith

Eadie

Cook

Mensah

Reilly

(2017). Subgroups in language trajectories from 4 to 11 years: The nature and predictors of stable, improving and decreasing language trajectory groups. Journal of Child Psychology and Psychiatry, 58(10), 1081–1091. https://doi.org/10.1111/jcpp.12790

82.

Medina

T. N.

Snedeker

Trueswell

J. C.

Gleitman

L. R.

(2011). How words can and cannot be learned by observation. Proceedings of the National Academy of Sciences, 108(22), 9014–9019. https://doi.org/10.1073/pnas.1105040108

83.

Melton

A. W.

(1963). Implications of short-term memory for a general theory of memory. Journal of Verbal Learning and Verbal Behavior, 2, 1–21. https://doi.org/10.1016/S0022-5371(63)80063-8

84.

Nagy

W. E.

Herman

P. A.

Anderson

R. C.

(1985). Learning words from context. Reading Research Quarterly, 20(2), 233–253. https://doi.org/10.2307/747758

85.

Nash

Donaldson

M. L.

(2005). Word learning in children with vocabulary deficits. Journal of Speech, Language, and Hearing Research, 48(2), 439–458. https://doi.org/10.1044/1092-4388(2005/030)

86.

Nikolaeva

J. I.

(2025). Growth trajectories of expressive vocabulary size in toddlerhood: A comparison of late talker status and vocabulary growth phenotype in predicting later language [Doctoral dissertation]. Northwestern University]. ProQuest Dissertations & Theses, Ann Arbor, MI, USA.

87.

Norbury

Griffiths

Vamvakas

Baird

Charman

Simonoff

Pickles

(2021). Socioeconomic disadvantage is associated with prevalence of developmental language disorders, but not rate of language or literacy growth in children from 4 to 11 years: evidence from the Surrey Communication and Language in Education Study (SCALES). Available at SSRN: https://ssrn.com/abstract=3814832 or http://dx. https://doi.org/10.2139/ssrn.3814832

88.

Nudel

Christensen

R. V.

Kalnak

Schwinn

Banasik

Dinh

K. M.

, …, & DBDS Genomic Consortium. (2023). Developmental language disorder–a comprehensive study of more than 46,000 individuals. Psychiatry Research, 323, 115171. https://doi.org/10.1016/j.psychres.2023.115171

89.

Oetting

J. B.

Rice

M. L.

Swank

L. K.

(1995). Quick incidental learning (QUIL) of words by school-age children with and without SLI. Journal of Speech, Language, and Hearing Research, 38(2), 434–445. https://doi.org/10.1044/jshr.3802.434

90.

Petersen

D. B.

Spencer

T. D.

Konishi-Therkildsen

(2024). Dynamic Measures of Narrative Language and Decoding. Language Dynamics Group, TX.

91.

Peters-Sanders

L. A.

Kelley

E. S.

Biel

C. H.

Madsen

Soto

Seven

, …, & Goldstein, H. (2019). Moving forward four words at a time: Effects of a supplemental preschool vocabulary intervention. Language, Speech, and Hearing Services in Schools, 51(1), 165–175. https://doi.org/10.1044/2019_LSHSS-19-00029

92.

Pomper

Mcgregor

K. K.

Arbisi-Kelm

Eden

Ohlmann

(2022). Direct instruction improves word learning for children with developmental language disorder. Journal of Speech, Language, and Hearing Research, 65(11), 4228–4249. https://doi.org/10.1044/2022_JSLHR-22-00300

93.

Quine

W. V. O.

(1960). Word and object. MIT Press.

94.

Ravid

Keuleers

Dressler

W. U.

(2020). Emergence and early development of lexicon and morphology. In Pirrelli

Plag

Dressler

W. U.

(Eds.), Word knowledge and word usage (pp. 593–633). De Gruyter. https://doi.org/10.1515/9783110440577

95.

Rice

Buhr

Oetting

(1992). Specific language impaired children’s quick incidental word learning (QUIL) of words: The effect of a pause. Journal of Speech and Hearing Research, 35(5), 1040–1048. https://doi.org/10.1044/jshr.3505.1040

96.

Rice

Oetting

Marquis

Bode

Pae

(1994). Frequency of input effects on SLI children’s word comprehension. Journal of Speech and Hearing Research, 37(1), 106–122. https://doi.org/10.1044/jshr.3701.106

97.

Rice

M. L.

Hoffman

(2015). Predicting vocabulary growth in children with and without specific language impairment: A longitudinal study from 2; 6 to 21 years of age. Journal of Speech, Language, and Hearing Research, 58(2), 345–359. https://doi.org/10.1044/2015_JSLHR-L-14-0150

98.

Roembke

T. C.

McMurray

(2016). Observational word learning: Beyond propose-but-verify and associative bean counting. Journal of Memory and Language, 87, 105–127. https://doi.org/10.1016/j.jml.2015.09.005

99.

Schwering

S. C.

MacDonald

M. C.

(2020). Verbal working memory as emergent from language comprehension and production. Frontiers in Human Neuroscience, 14, 68. https://doi.org/10.3389/fnhum.2020.00068

100.

Segbers

Schroeder

(2017). How many words do children know? A corpus-based estimation of children’s total vocabulary size. Language Testing, 34(3), 297–320. https://doi.org/10.1177/0265532216641152

101.

Seymour

H. N.

Roeper

de Villiers

P. A.

(2005). Diagnostic evaluation of language variation–norm referenced. Pearson.

102.

Sirin

S. R.

Rogers-Sirin

(2005). Components of school engagement among African American adolescents. Applied Developmental Science, 9(1), 5–13. https://doi.org/10.1207/s1532480xads0901_2

103.

Skipp

Windfuhr

K. L.

Conti-Ramsden

(2002). Children’s grammatical categories of verb and noun: A comparative look at children with specific language impairment (SLI) and normal language (NL). International Journal of Language and Communication Disorders, 37(3), 253–271. https://doi.org/10.1080/13682820110119214

104.

Smeets

D. J.

Van Dijken

M. J.

Bus

A. G.

(2014). Using electronic storybooks to support word learning in children with severe language impairments. Journal of Learning Disabilities, 47(5), 435–449. https://doi.org/10.1177/0022219412467069

105.

Smith

L. B.

(2008). Infants rapidly learn word-referent mappings via cross-situational statistics. Cognition, 106(3), 1558–1568. https://doi.org/10.1016/j.cognition.2007.06.010

106.

Souto

Leonard

L. B.

Deevy

Christ

S. L.

Karpicke

J. D.

Schroeder

M. L.

(2025). Word learning in children with developmental language disorder: The use of retrieval practice during shared book Reading. Journal of Speech, Language, and Hearing Research, 68(7), 3305–3321. https://doi.org/10.1044/2025_JSLHR-24-00809

107.

Stankova

Rodríguez-Ortiz

I. R.

Matić

Levickis

Lyons

Messarra

, …, & Law

(2021). Cultural and linguistic practice with children with developmental language disorder: Findings from an international practitioner survey. Folia Phoniatrica et Logopaedica, 73(6), 465–477. https://doi.org/10.1159/000511903

108.

Steele

S. C.

Watkins

R. V.

(2010). Learning word meanings during Reading by children with language learning disability and typically-developing peers. Clinical Linguistics & Phonetics, 24(7), 520–539. https://doi.org/10.3109/0269920090353247

109.

Stevens

J. S.

Gleitman

L. R.

Trueswell

J. C.

Yang

(2017). The pursuit of word meanings. Cognitive Science, 41(S4), 638–676. https://doi.org/10.1111/cogs.12416

110.

Stockman

I. J.

(2010). A review of developmental and applied language research on African American children: From a deficit to difference perspective on dialect differences. Language, Speech, and Hearing Services in Schools, 41(1), 23–38. https://doi.org/10.1044/0161-1461(2009/08-0086)

111.

Storkel

H. L.

Komesidou

Pezold

M. J.

Pitt

A. R.

Fleming

K. K.

Romine

R. S.

(2019). The impact of dose and dose frequency on word learning by kindergarten children with developmental language disorder during interactive book Reading. Language, Speech, and Hearing Services in Schools, 50(4), 518–539. https://doi.org/10.1044/2019_LSHSS-VOIA-18-0131

112.

Storkel

H. L.

Voelmle

Fierro

Flake

Fleming

K. K.

Romine

R. S.

(2017). Interactive book reading to accelerate word learning by kindergarten children with specific language impairment: Identifying an adequate intensity and variation in treatment response. Language, Speech, and Hearing Services in Schools, 48(1), 16–30. https://doi.org/10.1044/2016_LSHSS-16-0014

113.

Trueswell

J. C.

Medina

T. N.

Hafri

Gleitman

L. R.

(2013). Propose but verify: Fast mapping meets cross-situational word learning. Cognitive Psychology, 66(1), 126–156. https://doi.org/10.1016/j.cogpsych.2012.10.001

114.

Tsang

Atagi

Johnson

S. P.

(2018). Selective attention to the mouth is associated with expressive language skills in monolingual and bilingual infants. Journal of Experimental Child Psychology, 169, 93–109. https://doi.org/10.1016/j.jecp.2018.01.002

115.

Ukoumunne

O. C.

Wake

Carlin

Bavin

E. L.

Lum

Skeat

, …, & Reilly, S. (2012). Profiles of language development in pre-school children: A longitudinal latent class analysis of data from the early language in Victoria study. Child: Care, Health and Development, 38(3), 341–349. https://doi.org/10.1111/j.1365-2214.2011.01234.x

116.

Ukrainetz

T. A.

(2024). The foundations of language intervention: Reasoning and research. In Ukrainetz

T. A.

(Ed.), School-age language intervention-second edition (pp. 43–72). Pro-Ed.

117.

Valiante

A. G.

Barr

R. G.

Zelazo

P. R.

Papageorgiou

A. N.

Young

S. N.

(2006). A typical feeding enhances memory for spoken words in healthy 2-to 3-day-old newborns. Pediatrics, 117(3), e476–e486. https://doi.org/10.1542/peds.2004-2859

118.

van Berkel-van Hoof

Hermans

Knoors

Verhoeven

(2019). Effects of signs on word learning by children with developmental language disorder. Journal of Speech, Language, and Hearing Research, 62(6), 1798–1812. https://doi.org/10.1044/2019_JSLHR-L-18-0275

119.

Vitevitch

M. S.

Storkel

H. L.

Francisco

A. C.

Evans

K. J.

Goldstein

(2014). The influence of known-word-frequency on the acquisition of new neighbors in adults: Evidence for exemplar representations in word-learning. Language, Cognition and Neuroscience, 29(10), 1311–1316. https://doi.org/10.1080/23273798.2014.912342

120.

Vlach

Sandhofer

(2010). Desirable difficulties in cross-situational word learning. Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 32, No. 32). Retrieved from https://escholarship.org/uc/item/2151507t.

121.

Vlach

H. A.

DeBrock

C. A.

(2017). Remember dax? Relations between children's cross-situational word learning, memory, and language abilities. Journal of Memory and Language, 93, 217–230. https://doi.org/10.1016/j.jml.2016.10.001

122.

Vlach

H. A.

Johnson

S. P.

(2013). Memory constraints on infants’ cross-situational statistical learning. Cognition, 127(3), 375–382. https://doi.org/10.1016/j.cognition.2013.02.015

123.

Walker

M. P.

(2005). A refined model of sleep and the time course of memory formation. Behavioral and Brain Sciences, 28(1), 51–64. https://doi.org/10.1017/S0140525X05000026

124.

Wechsler

(2011). Wechsler Abbreviated Scale of Intelligence--Second Edition (WASI-II) [Database record]. APA PsycTests. https://doi.org/10.1037/t15171-000

125.

Wojcik

E. H.

(2013). Remembering new words: Integrating early memory development into word learning. Frontiers in Psychology, 4, 151. https://doi.org/10.3389/fpsyg.2013.00151

126.

Woodard

Gleitman

L. R.

Trueswell

J. C.

(2016). Two-and three-year-olds track a single meaning during word learning: Evidence for propose-but-verify. Language Learning and Development, 12(3), 252–261. https://doi.org/10.1037/a0016134

127.

Tenenbaum

J. B.

(2007). Word learning as Bayesian inference. Psychological Review, 114(2), 245–272. https://doi.org/10.1037/0033-295X.114.2.245

128.

Ylinen

Nora

Service

(2020). Better phonological short-term memory is linked to improved cortical memory representations for word forms and better word learning. Frontiers in Human Neuroscience, 14, 209. https://doi.org/10.3389/fnhum.2020.00209

129.

Yurovsky

Frank

M. C.

(2015). An integrative account of constraints on cross-situational learning. Cognition, 145, 53–62. https://doi.org/10.1016/j.cognition.2015.07.013

130.

Yurovsky

Fricker

D. C.

Smith

L. B.

(2014). The role of partial knowledge in statistical word learning. Psychonomic Bulletin & Review, 21, 1–22. https://doi.org/10.3758/s13423-013-0443-y

131.

Ziegenfusz

Paynter

Flückiger

Westerveld

M. F.

(2022). A systematic review of the academic achievement of primary and secondary school-aged students with developmental language disorder. Autism & Developmental Language Impairments, 7, 1–33. https://doi.org/10.1177/23969415221099397

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.03 MB