Abstract
By leveraging Phonology-to-Semantics Consistency (PSC), which quantifies form-meaning systematicity as the semantic similarity between a target word and its phonological nearest neighbours, we document a unique effect of systematicity on Age of Acquisition (AoA). This effect is also found after controlling for the effect of neighbourhood density measured for word forms and lexical semantics and several other standard predictors of AoA. Moreover, we show that the effect of systematicity is not reducible to iconicity. Finally, we extensively probe the reliability of this finding by testing different statistical models, analysing systematicity in phonology and orthography and implementing random baselines, reporting a robust, unique negative effect of systematicity on AoA, such that more systematic words tend to be learned earlier. We discuss the findings in the light of studies on non-arbitrary form-meaning mappings and their role in language learning, focusing on the analogical process at the interface of form and meaning upon which PSC is based and how it could help children infer the semantics of novel words when context is scarce or uninformative, ultimately speeding up word learning.
Keywords
Introduction
The factors that influence how children learn their first language from exposure have been widely studied (Bates & MacWhinney, 1987; Pinker, 1984). The spate of studies which addressed this question highlighted several factors which predict Age of Acquisition (AoA), including word frequency (Braginsky et al., 2019; Ghyselinck et al., 2004), phonological neighbourhood density (PND; Jones & Brandt, 2019), semantic neighbourhood density (SND; Fourtassi et al., 2020), contextual diversity (Hills et al., 2010), perceptual features (Peters & Borovsky, 2019), valence, concreteness, and word length (Braginsky et al., 2019), salience in time, space, and, linguistic context (Roy et al., 2015), iconicity (Laing, 2019; Perry et al., 2015; Sidhu et al., 2021), sound symbolism (Imai & Kita, 2014; Kantartzis et al., 2011), and systematicity in form-meaning mappings (Monaghan et al., 2012, 2014). Building on recent studies that quantified the degree of form-meaning systematicity for individual lexical items (Hendrix & Sun, 2020; Marelli & Amenta, 2018), in this work, we assess the effect of systematicity on AoA, after controlling for known predictors of AoA itself. We are the first to assess the effect of form-meaning systematicity on AoA while controlling for measures of neighbourhood density in form and meaning as well as their interaction. Moreover, we rely on measures of form-meaning systematicity which have been already shown to reliably predict reaction times (RTs) in lexical decision studies as well as morphological priming studies (Amenta et al., 2017, 2020; Hendrix & Sun, 2020; Marelli & Amenta, 2018; Marelli et al., 2015) and reading times in a naturalistic setting (Amenta et al., submitted). Finally, our work is the first to sketch a mechanistic account of how systematicity could facilitate word learning, grounded in analogical processes at the interface between form and meaning (Marelli & Amenta, 2018).
The study of different aspects of the relation between word form and meaning has seen a huge increase over the last decades (see Dingemanse et al., 2015; Nielsen & Dingemanse, 2020; Sidhu & Pexman, 2018a, for reviews about this line of work). Contrary to the long-held position that natural languages need to be arbitrary to be effective communication tools (Hockett, 1960), numerous studies have shown the widespread presence of systematic relations between word form and meaning at the sub-morphemic level, painting a more similar picture to the original position held by de Saussure (1916) that, while being a cardinal organising principle of natural languages, arbitrariness is not absolute and form-meaning systematicity is present beyond morphology. Onomatopoeias, for example, are words that have a direct form-meaning relation because they imitate natural sounds (see Laing, 2019, and references therein for a more detailed discussion of onomatopoeias). Other well-known examples come from studies on sound symbolism (Hinton et al., 1994): in
A useful distinction to be made when discussing non-arbitrariness in natural languages is that between iconicity, which studies resemblances between form and meaning (Blasi et al., 2016; Perry et al., 2015), and
The study of the relation between systematicity and AoA is also not new. Gasser (2004) pointed out that while arbitrariness is useful when communicating, as it makes potentially confusable lexical items more discriminable by orthogonalising form and context, systematicity could help in learning the language. In the extreme, learning a perfectly arbitrary language is harder because no analogies can be formed and every item has to be learned individually. On the contrary, a completely systematic language would be prone to confusion, since similar words would denote similar concepts, which are likelier to occur in similar contexts, making them harder to discern (Eco, 1995). In line with this argument, it has been shown that when few words are known, systematicity helps map forms to referents, whereas with larger vocabularies, systematicity only supports a more effective learning of category structures (Brand et al., 2018; Monaghan et al., 2011, 2012). Moreover, it has been shown that sound-meaning mappings in the English vocabulary are more systematic than would be expected by chance (Monaghan et al., 2014), suggesting that systematicity is not simply due to pockets of sound symbolism and phonaesthemes, but rather is a more general property found across the entire language. The aforementioned studies, however, did not consider the potentially confounding effects of neighbourhood density at the phonological and semantic level on acquisition patterns, with both PND and SND having been recently shown to influence language acquisition (Fourtassi et al., 2020; Jones & Brandt, 2019). It could thus be the case that the effect of systematicity on acquisition is reduced to that of neighbourhood density in form and semantics, such that words are learned earlier simply because they happen to be deeply entrenched in the language network at multiple levels (Hills et al., 2009) rather than because their form cues their meaning in a reliable way.
Closely related to the goal of assessing whether form-meaning systematicity explains unique variance in AoA above and beyond neighbourhood density in form and meaning, we also explore whether there is an interaction between systematicity and SND. Following the hypothesis by Gasser (2004) that systematicity can harm communication efficacy due to the increased confusability of similar concepts that would also have similar forms, it would be expected that words found in sparser semantic neighbourhoods can afford less arbitrary form-meaning relations because they have a lower risk of being confused for semantically similar words. In line with this hypothesis, Sidhu and Pexman (2018b) reported that words with sparser semantic neighbourhoods were more iconic, even after partialling out the relation between AoA and iconicity. Thus, we first check whether we replicate this finding with systematicity rather than iconicity. Then, we ask ourselves whether SND and form-meaning systematicity interact: Is it the case that the effect of systematicity on AoA is modulated by SND?
Next to contributing to the study of the relation between form-meaning systematicity and AoA, we also set out to extend and validate the use of Phonology-to-Semantics-Consistency (PSC; Amenta et al., 2017), as well as its orthographic counterpart, Orthography-to-Semantics Consistency (OSC; Marelli et al., 2015), to study AoA dynamics. OSC and PSC (to which we will collectively refer as Form-to-Semantics Consistency [FSC]) are two measures of form-meaning systematicity which have been recently shown to predict RTs in lexical decision and morphological priming tasks (Hendrix & Sun, 2020; Marelli & Amenta, 2018) as well as reading times in naturalistic reading (Amenta et al., submitted). Both quantify form-meaning systematicity as the semantic similarity between a target word and its orthographic/phonological nearest neighbours, thus combining the similarity structure of a word in both form and semantic space. Since these measures have been shown to reliably predict lexical processing in a variety of tasks, we aim to probe whether they can also illuminate our understanding of acquisition dynamics.
OSC was first introduced to explain a consistent, yet overlooked, effect in morphological masked priming studies: Participants are faster at recognising stems from transparent sets (e.g.,
FSC measures considering neighbours which embed the target word, however, are primarily geared towards assessing form-meaning systematicity at the level of morphology, which is typically explicitly excluded in studies on form-meaning systematicity. In a more recent study, Hendrix and Sun (2020) investigated the effects of several variables in a lexical decision task, targeting both words and non-words. Having to establish a measure of form-meaning consistency for non-words, which are unlikely to be embedded in other words in the lexicon, they defined form-based neighbours as the words with the lowest Levenshtein edit-distance 1 from the target word (Levenshtein, 1966). This measure was shown to predict RTs in a lexical decision task for both words and non-words, above and beyond other covariates. FSC measures based on Levenshtein distance sidestep the morphological relations between stems and derived words. This formulation of FSC may thus be more informative to capture non-arbitrary patterns that bear relevance in acquisition and are less confounded by morphological productivity. The comparison of these two implementations of FSC will therefore be interesting to check whether a relation between systematicity and AoA is primarily driven by morphological regularities or by patterns with less obvious correlates in morphology. To control for systematicity based on morphology, we also control for morphological complexity, since many of the first words children learn are monomorphemic (Clark, 2017).
Next to predicting RTs in lexical decision tasks, OSC was also recently found to reliably predict reading times (Amenta et al., submitted) and to interact with surprisal in speeding up reading when the context was not sufficiently constraining in guiding prediction. Therefore, in this work, we take a measure to compute word-level form-meaning systematicity which has been shown to be a useful tool to investigate psycholinguistic phenomena such as reading patterns and lexical decision, and apply it to the analysis of acquisition patterns. If the same measure would prove to be a reliable predictor of AoA, we would thus have a single, effective tool to further analyse the role of form-meaning systematicity not just in lexical processing but also in word learning. Moreover, FSC can be seen as a mechanistic account of how to derive semantic representations from word form alone through an analogical process. Upon the first encounter with a novel word, possibly in an uninformative situational or linguistic context, learners could still gauge the semantics of this new word by leveraging the semantics of similar words they already know. If FSC reliably predicts AoA, we would have preliminary evidence that words for which the process of estimating lexical semantics from word form is more reliable and informative (reflected in higher systematicity) are learned earlier. This could mean that children indeed make educated guesses about lexical semantics by exploiting word forms, relying on systematic correspondences between form and meaning, and that learning benefits from situations where this guesstimation is more reliable, resulting in more systematic words being learned earlier.
From a methodological perspective, the approach underlying FSC is not dissimilar from the one introduced by Otis and Sagi (2008), but broadens its scope and explores different ways of assessing form similarity. Whereas Otis and Sagi (2008) used a similar approach to assess the strength of phonaesthemes, considering as orthographic neighbours all words which contained a letter sequence which was a candidate phonaestheme, FSC measures form-meaning systematicity considering less constrained similarities in form. Different approaches to the study of phonaesthemes using measures of form similarity and distributed semantic representations similar to Otis and Sagi (2008) and to FSC were also adopted by Liu and colleagues (2018) and Abramova and colleagues (2013). All measures were shown to capture known phonaesthemes while not falling prey to similarities in form which are not recognised as phonaesthemic. Moreover, these methods could also find new candidate phonaesthemes, suggesting that the general approach can inform our understanding of non-arbitrariness in language. These results further strengthen the viability of a measure grounded in analogical processes at the interface of form and meaning, which our study aims to extend from the study of systematicity per se to the analysis of its influence on language acquisition.
FSC measures are however not the only way in which form-meaning systematicity has been assessed in the literature. The most straightforward way to assess systematicity is to measure whether items with similar forms also tend to have similar semantic representations, thus measuring the correlation between pairwise similarities across different representations for the same words (Dautriche et al., 2017; Monaghan et al., 2014; Shillcock et al., 2001; Tamariz, 2008). All studies controlled for the effect of morphology by restricting attention to monomorphemic words and reported small yet significant correlations across several languages. In particular, Monaghan and colleagues (2014) quantified the degree of systematicity of each lexical item by measuring how the correlation between pairwise similarities in form and meaning changed when excluding each target word. The difference between the correlation computed including and excluding a given target word was taken to reflect the word’s degree of systematicity. This study shows that more systematic words tend to be overrepresented among early acquired words, pointing to a relation between systematicity and acquisition.
More recently, neural network methods have also been used to quantify systematicity (Pimentel et al., 2019). First, word forms were encoded using a deep learning model such that each word was represented as a probabilistic distribution over discrete units, i.e., letters. Then, a different model was trained to reconstruct the word form conditioned on the semantic representation of the word. The entropy between the two distributions, the true, unconditioned one and the one reconstructed on the basis of semantic information, was taken to reflect the degree of form-meaning systematicity of each word. A small yet reliable systematicity was found for a large number of typologically different languages.
These methods, however, are rather computationally expensive as compared to FSC measures, and crucially, only work for words for which a semantic representation estimated from language data is available, thus speaking only about a mature system and not being able to say much about learning patterns. Recent work has however started to show that systematicity is fruitfully studied using computational methods also for pseudowords (Baayen et al., 2019; Cassani et al., 2020; Chuang et al., 2021; Hendrix & Sun, 2020). In particular, OSC has been used by Hendrix and Sun (2020) to assess the semantic coherence of the form-based nearest neighbours of words and pseudowords alike. This is particularly interesting since, at some point in development, any word is a pseudoword to language learners, who need to be able to map the new word to a semantic representation: FSC could help illuminate this task, by sketching a process where children leverage word form and analogies across form and meaning to estimate lexical semantics even without an informative situational and linguistic context.
To sum up, this work leverages and extends two different yet connected lines of research. On one hand, we build on studies which investigate the factors influencing language acquisition (Braginsky et al., 2019; Ghyselinck et al., 2004; Hills et al., 2010; Jones & Brandt, 2019; Peters & Borovsky, 2019; Roy et al., 2015; Sidhu et al., 2021), particularly on studies showing that sound symbolism helps language learning (Imai et al., 2008; Imai & Kita, 2014; Kantartzis et al., 2011) and that more systematic words are overrepresented among early acquired words (Monaghan et al., 2014). In detail, we investigate whether systematicity predicts AoA while controlling for neighbourhood density in both form and meaning, since recent studies have shown that words that are highly entrenched in the phonological and semantic networks are learned earlier (Fourtassi et al., 2020; Jones & Brandt, 2019). On the other hand, we leverage recently introduced measures of form-meaning systematicity which have been extensively validated on other psycholinguistic tasks such as lexical decision and reading (Amenta et al., submitted; Hendrix & Sun, 2020; Marelli & Amenta, 2018) and extend their use to the analysis of acquisition phenomena. These measures are theoretically relevant because they hold promise to characterise a mechanistic process that children could rely upon when exploiting systematicity in language learning.
Materials and method
Data
Several existing large-scale datasets were used to conduct this study. Objective AoA norms were taken from the dataset provided by Brysbaert and Biemiller (2017), which relies on vocabulary tests to assess when a given proportion of learners reliably know a certain word, probing learners at 2, 4, 6, 8, 10, 12, 13, and 14 years of age. This dataset provides different AoA estimates for different word senses of the same word form; however, since our approach does not discriminate word senses, we took the earliest AoA score for each word form as our target variable. Next to providing objective AoA norms for a large set of words, this dataset was also recently investigated by Sidhu and colleagues (2021) who reported an effect of iconicity on AoA. It is thus interesting to investigate the effect of systematicity on AoA on the same dataset, to better situate any finding in the current literature on non-arbitrariness and language learning.
Other large-scale datasets were used to derive control variables necessary to assess a possible unique effect of form-meaning systematicity on AoA. Concreteness norms were taken from the dataset collected by Brysbaert and colleagues (2014). Valence was extracted from the norms made available by Kuperman and colleagues (2014). Word frequency values come from the SUBTLEX-US dataset (Brysbaert & New, 2009). This dataset was also used as a reference vocabulary to retrieve phonological neighbours when computing PSC. However, we filtered the SUBTLEX-US lexicon by only considering words that appeared in at least one of the aforementioned datasets, to ensure that neighbours were valid English words, for a total of approximately 60 K words. To control for the morphological complexity of target words, a binary variable was created which marked words as either monomorphemic or polymorphemic: to this end, we used the MorphoLEX dataset (Sánchez-Gutiérrez et al., 2018). Iconicity ratings were taken from the dataset provided by Perry and colleagues (2015). To minimise the degrees of freedom in the computational modelling, we extracted a measure of PND from the MALD dataset (Tucker et al., 2019). The target variable we used is
Next to leveraging several datasets which provide lexical variables, we need resources to encode semantics, to compute SND and PSC measures. We chose to represent lexical semantics relying on distributed semantic representations extracted from large-scale corpora on the basis of the assumption that the meaning of a word can be approximated via its co-occurrence patterns with other words in the lexicon (Firth, 1957; Harris, 1954). In a distributional semantic model (DSM; Turney & Pantel, 2010), words are represented as numerical vectors obtained from co-occurrences in large text corpora. Crucially, these vectors exist in a geometrical space in which relations of proximity can be computed, with words with closer meaning being also closer in semantic space. Proximity is computed using the cosine of the angle between the vectors (Bullinaria & Levy, 2007). To derive lexical semantic representations, we relied on the DSM provided by Mandera and colleagues (2017), which has been validated on several psycholinguistic tasks.
2
SND was quantified as the average cosine distance of the 20 nearest neighbours of a target word in semantic space; therefore, higher values on SND indicate that a word is found in a sparser semantic neighbourhood. Finally, to compute PSC, the phonological encoding of a word was extracted from the
Method
FSC
Figure 1 provides a general graphical representation of how FSC measures are computed and how they tap into the interface between word forms and lexical semantics, by quantifying the degree of systematicity as the coherence of the semantic representations of the words which resemble the target in form space. The figure highlights how the measure is conceptualised and how its value depends on the coherence of local neighbourhoods in both form and semantic space, while remaining agnostic to the exact implementation choices about how to encode word form, how to retrieve nearest neighbours in form space, how to represent lexical semantics, and how to compute semantic similarity. As the figure shows, a word can have high FSC regardless of whether it has high density in form or semantic space: FSC will be higher when the nearest neighbours of a target word in form space are semantically coherent with the target word’s semantic representation. At a conceptual level, a high FSC score entails that upon hearing a certain word form, it is comparably easier to form a reliable impression of what this word may mean, even without relying on linguistic or situational context, but rather leveraging the meaning of the words that sound more like the target word itself. In the previous section, we established that word form is going to be represented using phonological transcriptions while semantic space is operationalised using DSMs. In the following paragraphs, we go into more details about how we implemented FSC in this work, explaining how we retrieve nearest neighbours in form space and how we compute the semantic similarity between their semantic representations and that of the target word.

Schematic representation of Form-to-Semantics Consistency (FSC). (a) A word with high FSC and high neighbourhood density in form space: nearest neighbours in form space are also close neighbours of the target in semantic space, indicating a high consistency between the local neighbourhoods of the target word’s form and meaning. (b) A word with low FSC yet high neighbourhood density in form space: even though the target word occupies a dense portion of both form and meaning space, its nearest phonological neighbours are scattered in semantic space, resulting in low systematicity. (c) A word with high FSC in spite of low neighbourhood density in form space: even though the target word occupies a rather empty portion of form space, its nearest phonological neighbours cluster closely to the target in semantic space, showing that FSC and neighbourhood density in form space can be decoupled.
As previously mentioned in the introduction, two operationalisations of FSC have been previously presented in the literature. These definitions differ in how the neighbours in form space are determined as well as in how the similarity between target and neighbours in semantic space is computed. In the original OSC measure introduced by Marelli and colleagues (2015), neighbours are defined as words that begin with the target string (e.g.,
As previously discussed in the introduction, Hendrix and Sun (2020) computed OSC in a different way, which had also been tested by Marelli and Amenta (2018) but was found to be worse at predicting RTs in lexical decision. In this approach, form-based neighbours are words with the smallest Levenshtein distance to the target. Moreover, instead of weighing cosine similarities by word frequency, these are weighted by the Levenshtein distance value, such that semantic representations corresponding to orthographically more similar words contribute more to the OSC estimate. This implementation of OSC has one free parameter,
Statistical approach
All statistical analyses were carried out in R (R Core Team, 2020). First, we applied a Box-Cox transformation to all numerical independent variables using the MASS package (Venables & Ripley, 1999) and then
Moreover, to assess whether PSC explains any unique variance in AoA on top of known predictors, we ran several linear regression models. First, we fit a baseline statistical model where AoA is modelled as a linear combination of frequency, concreteness, word length in phonemes, valence, morphological complexity, PND, and SND. We then added target predictors and interactions to this baseline statistical model, measuring the difference in Akaike Information Criterion (AIC) between different models to assess whether adding a certain predictor indeed improved model fit.
Next to this main analysis, we also controlled for the effect of iconicity. To do this, we restricted our focus to the words for which all necessary ratings are available, resulting in 1,771 words. First, we fitted a baseline statistical model which included the same predictors as the main baseline plus iconicity ratings. We then added each PSC measure individually and measured the ∆
In both analyses, there is a risk of observing adverse effects of collinearity (Tomaschek et al., 2018). To ensure any reported effect is robust and trustworthy, we ran the same models using Random Forest (RF) regression using the
Results
The results are organised in four different subsections. First, we discuss the correlational analysis, focusing on the relation that PSC has with AoA as well as with other variables known to influence AoA. Second, we assess the unique effect of PSC on AoA after factoring in known predictors of AoA. Third, we assess whether any effect of PSC can be reduced to an effect of iconicity (Dingemanse et al., 2015; Perry et al., 2015). Finally, we zoom in on the relation between PSC and SND, following the hypothesis that sparser semantic neighbourhoods can afford more non-arbitrary relations between form and meaning (Gasser, 2004; Sidhu & Pexman, 2018b) and to test whether this relation influences AoA.
Correlational analysis
First, we computed pairwise Pearson’s correlation coefficients for all variables, displayed in Figure 2, together with histograms showing the distribution of each variable and scatterplots showing how variables relate to each other. We see that AoA norms have sizable correlations with all variables, including the expected negative correlations with PSC measures, indicating that words with higher systematicity tend to be acquired earlier. However, the correlation between

Pairwise Pearson’s correlations involving independent variables and AoA norms for phonological variables.
We also observe the expected relation between AoA and PND (
Finally, we observe collinearity across predictor variables, especially involving word length in phonemes and PND (
Form-meaning systematicity and AoA
In this section, we assess whether PSC measures explain any unique variance in AoA once control variables are accounted for: This analysis is particularly relevant considering that especially
Then, we compared
We thus proceeded to fit two further models: adding
Unique effect of PSC on objective AoA norms.
PSC: Phonology-to-Semantics-Consistency; AIC: Akaike information criterion.
The table displays regression coefficients (β) with associated standard errors (
PSC measures explain unique variance in AoA on top of control variables, suggesting that form-meaning systematicity has a unique relation with acquisition patterns.
The RF regressions confirmed that all PSC measures improve the model’s

Variable importance plots from RF regressions for models including measures of form-meaning systematicity on top of the baseline statistical model predicting objective AoA norms.
Form-meaning systematicity, iconicity, or both?
In this section, we focus on a further possible confound which was not addressed in the previous analysis, namely iconicity. We chose to present separate analyses because iconicity ratings are available for a considerably smaller subset of the target vocabulary. After the previous analyses, we can say that PSC measures have a unique effect on AoA. If we were not to find the same effect we have encountered so far after controlling for iconicity, we would be more confident that it is because PSC and iconicity tap into similar aspects of the acquisition processes and its relation to lexical properties. Since it came out as a more reliable and better predictor of AoA than
First of all, we checked to what extent
Once again, we fitted a baseline regression model,

Variable importance plots from RF regressions for models including
To sum up, we confirmed a relation between
Form-meaning systematicity and SND
In the previous analyses, we established that
We start by observing that
We again fitted a linear regression model adding an interaction between SND and

Interaction between Semantic Neighbourhood Distance (SND) and
Therefore, we only partially replicated previous findings, which reported a robust relation between non-arbitrary form-meaning mappings and SND. On one hand, we did see a positive correlation between iconicity and SND, indicating that more iconic words tend to exhibit lower semantic density, in line with evidence from Sidhu and Pexman (2018b). On the other hand, however, we saw a relation in the opposite direction than we predicted (Gasser, 2004) between
Robustness checks
We further carried out four different robustness checks (whose details and outcomes are available as online supplementary materials) to ensure that the reported unique relation between PSC and AoA is reliable. The first one was meant to ensure that the effects we reported do not depend on the assumption of a linear relation between our predictors and AoA. For this analysis, we used Generalised Additive Models (GAMs, Wood, 2017) to predict AoA, modelling each predictor as a simple smooth. We first fitted a baseline statistical model including control variables, then tested whether an interaction between PND and SND (implemented as a partial tensor product) improved the model fit by checking the ∆
The second robustness check tested whether the effect of PSC truly depends on systematic form-meaning correspondences in the lexicon and is not a by-product of other properties of the semantic representations. To probe this, we scrambled the form-meaning correspondences, such that, for example, the word form
The third robustness check considers the role of the reference lexicon used to retrieve nearest neighbours when computing PSC, to exclude the possibility that the relation we showed is only found with a large or specific reference lexicon. To this end, we randomly sampled 50% or 75% of the words in the reference lexicon without replacement. For each sampling rate, we carried out 500 iterations; in each of them, we retrieved phonological nearest neighbours from the vocabulary subset, and computed PSC measures as detailed in Equations 1 and 2. We then measured the ∆
The fourth robustness check we carried out was meant to further ensure that the effect of PSC is not an epiphenomenon of simpler neighbourhood measures. Even though the main analysis explicitly controls for PND and SND, and although the random baseline confirms that, even though PSC computed from random permutations of form-meaning correspondences correlates with AoA, it does not explain any unique variance after control variables are considered, we ran an extra analysis which more carefully disentangles the effects of PND, SND, and
Finally, we also replicated the analysis presented in the main text by replacing PSC with OSC, to ensure that patterns are reliable when changing the words’ encoding (Supplemental Appendix E) and also by predicting subjective rather than objective AoA norms (Kuperman et al., 2012), as reported in Supplemental Appendix F. All patterns we found for PSC are also reported for OSC, with some differences in magnitude pointing to a stronger effect for form-meaning systematicity at the phonological level. This is interesting since language learning, especially early on, relies a lot more on spoken rather than written input. Patterns were also largely replicated when predicting subjective AoA norms, although some differences emerged. First, the interaction between SND and PND was significant, with a stronger negative effect of PND on AoA for words in dense semantic neighbourhoods. In line with the analysis on objective AoA norms, however, the main effects of
Discussion
In this article, we investigated the effect of form-meaning systematicity (Amenta et al., 2017; Hendrix & Sun, 2020; Marelli & Amenta, 2018; Marelli et al., 2015) on word acquisition. We first documented a reliable correlation between Phonology-to-Semantics Consistency (PSC) and AoA. Then, we showed that this relation is not confounded by other known predictors of AoA such as frequency, concreteness, valence, word length, and morphological complexity. Crucially, we further showed that the effect of PSC holds when PND and SND are controlled for. Therefore, we showed that the effect of PSC on AoA is not just due to words being entrenched in the phonological and semantic networks but depends on form-meaning systematicity. Moreover, we showed the robustness of the reported patterns by replicating the analyses using RF regressions, which remove possible adverse effects of collinearity (Tomaschek et al., 2018). Finally, we showed that the relation between systematicity and AoA is robust to the inclusion of iconicity (Perry et al., 2015; Sidhu et al., 2021), suggesting that systematicity and iconicity tap into different aspects of acquisition dynamics.
The effect of form-meaning systematicity on AoA proved reliable also in a series of robustness checks. We made sure that the target variables were reliable predictors of AoA also when relaxing the assumption of a linear relation between the variables, using GAMs. We further tested a random baseline where form-meaning correspondences found in the English lexicon were randomly scrambled, showing that PSC only predicts AoA when computed on true form-meaning pairings. We also ensured that the effects we report are robust to changes in the size and composition of the reference vocabulary used to retrieve nearest neighbours when computing PSC. Moreover, we ran an extra analysis to disentangle the effects of PSC, PND, and SND, showing that the effect of PSC on AoA is not an epiphenomenon of neighbourhood density in form or meaning. We further controlled for changes in the encoding of word form (orthography and phonology) and probed subjective AoA norms. Across all these analyses, the main effect of systematicity proved robust. However, target interactions between PND and SND, and between SND and FSC did not; they were reliable predictors in some analyses while they did not improve the model fit in others, inviting caution in drawing firm conclusions about these patterns.
Our work builds on recent studies that quantified form-meaning systematicity above and beyond morphological productivity. The main advantage of FSC measures over alternative approaches to computing form-meaning systematicity (Dautriche et al., 2017; Monaghan et al., 2014; Pimentel et al., 2019; Shillcock et al., 2001; Tamariz, 2008) is that FSC measures work even when a word’s semantic representation is not available (Hendrix & Sun, 2020). This makes FSC more appealing to explore acquisition dynamics since a hallmark of form-meaning systematicity is the possibility of drawing on statistical regularities between form and meaning available in the lexicon when learning and processing language (Monaghan et al., 2012). Therefore, FSC measures can characterise how statistical regularities between form and meaning help children extrapolate meaning from form alone and use these informed guesses to better learn new words. While FSC measures have so far been used simply to quantify the degree of form-meaning consistency, it is rather straightforward to construe them as fully fledged mapping functions; the semantic vectors of the form-based neighbours could be averaged to obtain a new semantic representation, which would exist in the same semantic space derived from contextual information (Firth, 1957; Harris, 1954) and which would conflate the semantics of the most similar words in the vocabulary.
We can sketch a possible mechanistic process that relies on the principles upon which FSC is based to see the possible role of systematicity in word learning. Let us suppose that a learner encounters a novel word for the first time, e.g.,
The process by which one infers a novel word’s semantics leveraging the semantics of similar words, however, can only work if the semantic intuitions derived in this way prove to be generally correct, such that later encounters with the same word, in richer situational and linguistic contexts, do not disprove the form-based semantic intuitions. In other words, this process can only work if the lexicon exhibits form-meaning systematicity in the early stages of word learning. Our results show precisely this, i.e., that English words for which it is easier to derive coherent semantic impressions from similar words tend to be learned earlier, suggesting a place in word learning for an analogical process which bridges word form and meaning. Importantly, we do not contend that this process is always correct, such that the semantics inferred on the basis of word form reflects the true lexical semantics of every word, but rather that words whose phonological neighbours have more coherent semantics are easier to learn.
Therefore, the role of systematicity in language learning is taken to be different from what has been hypothesised in previous studies which target language processing. When considering lexical decision, the influence of FSC on RTs has been ascribed to a stronger activation of lexical items which display higher systematicity, which depends on the concurrent activation of coherent local neighbourhoods in form and meaning. Moreover, when considering reading times, it has been shown that FSC is particularly useful when the context is not particularly constraining, with form-based semantics offering an extra cue and filling in the gap left by an unconstraining linguistic context. When it comes to language learning, however, we hypothesise that the effect of systematicity works by offering a way to more quickly establish semantic representations by combining contextual information with information coming from the word form itself. Words for which this form-meaning systematicity does not exist would thus be harder to learn, since learners need to resolve a conflict, where form-based semantics does not align with contextual semantics. Whereas learners likely end up resolving this conflict by primarily relying upon contextual information, they may still have to understand which source of information is more reliable and find it easier to learn words for which these two sources of information agree. Furthermore, form-based semantic intuitions may be particularly useful with novel words, where contextual information may be scarce or unreliable.
If the process we have just outlined works, and the resulting semantic intuitions are consistent with the true lexical semantics of the words being learned, children could thus considerably speed up word learning by bootstrapping lexical learning using word forms and the semantics they convey. Under a full arbitrariness of the sign hypothesis, this process would be irrelevant since no useful semantic information could be derived from word form, except for morphological compositionality. However, we showed that PSC reliably and uniquely explains variance in acquisition patterns, with more systematic words being learned earlier. Therefore, our study provides preliminary evidence that words have a learning advantage when the form-based semantic inference reflects the lexical semantic representation derived from linguistic context. We can thus hypothesise that the exemplar-based process at the heart of FSC at least partly accounts for lexical acquisition. Importantly, this is the first work to show that a measure of form-meaning systematicity which can model how semantics is extracted from word form alone reliably predicts AoA patterns for large-scale datasets.
Through a number of analyses and robustness checks, we further showed that target-embedding FSC measures explain less variance in AoA than Levenshtein-distance FSC measures. This result contrasts with evidence from Marelli and Amenta (2018) about lexical decision data, where target-embedding measures were better. We suggest that this dissociation is theoretically relevant. Target-embedded neighbours are by necessity longer than the target itself. Thus, to estimate the degree of form-meaning systematicity for a shorter word, the learner should already know longer words than the target. While tenable when dealing with adult-sized lexicons, this assumption contrasts with developmental trajectories, where shorter words are learned earlier on average. On the contrary, Levenshtein-distance FSC does not depend on longer words being known and actually favours words with comparable lengths, offering a more plausible account of how similarity in form space is computed during learning. Thus, if the lexicon offers systematicity at a more coarse level, such that looser similarities in word form that can be leveraged when retrieving form-based nearest neighbours based on Levenshtein distance already cue something about word meaning, then learners can exploit word forms to infer semantics. Our results favour this interpretation.
Moreover, when reviewing the results of the correlational analysis, we noted how the two implementations of FSC correlated only moderately, suggesting they pick up on different aspects of form-meaning systematicity. It is however still interesting to note the predicted relation between AoA and target-embedding FSC, since it highlights a structural property of the lexicon. Target-embedding FSC measures may thus not characterise a mechanistic process by which learners infer novel words’ semantics based on phonological similarities, but still highlight that words for which phonological neighbours are semantically more coherent tend to be learned earlier. At the same time, however, Levenshtein-distance FSC measures may be too unconstrained, since neighbours may be retrieved for any string, phonotactically legitimate or not. Recently, Delgado and colleagues (2020) have provided preliminary evidence that sound-symbolic effects may depend on the degree to which non-words are acceptable in terms of phonotactics. Hence, future work should investigate and contrast different ways of deriving FSC measures, with a focus on which method best characterises how learners analogise over word forms during development.
The conceptualisation we have presented of FSC as a mapping function, which projects novel words onto semantic space, is in line other recent attempts to explicitly model a mapping function from word form to lexical meaning (Baayen et al., 2019; Cassani et al., 2020; Chuang et al., 2021; Hendrix & Sun, 2020). Hendrix and Sun (2020) relied on F
A different approach to map form and meaning (Baayen et al., 2019; Cassani et al., 2020; Chuang et al., 2021) uses simple linear mappings to learn a function which takes a form-based representation and projects it onto the same semantic space as context-based semantic vectors, not unlike other approaches in the study of morphology (Marelli & Baroni, 2015; Marelli et al., 2017). In this line of work, word form has so far been encoded using orthographic or phonological n-grams as well as audio features, whereas semantic representations have typically consisted of sparse distributed vectors derived from linguistic context using error-driven learning (Rescorla & Wagner, 1972). These measures have been primarily used to investigate the influence of form-meaning mappings in lexical decision tasks, both visual and auditory, with only a small-scale study that targeted acquisition phenomena (Cassani et al., 2020). In this work, semantic vectors derived from pseudowords were analysed to show that they entertain predictable relations with other semantic representations at the level of lexical categories (Monaghan et al., 2012), suggesting that a direct form-meaning mapping could help children form expectations about the syntactic role as well as the possible referent of a novel word (Fitneva et al., 2009). However, whereas these approaches seek to find an optimal solution to form-meaning mappings for the entire lexicon, FSC measures only target systematic relations within local neighbourhoods in form and meaning and rely on an analogical principle. Future work should focus on whether the algorithmic differences underlying different approaches to (and measures of) form-meaning systematicity bear relevance to psycholinguistic phenomena.
Next to discussing how the measures we tested can capture mechanisms relevant to word learning, it is important to discuss in more detail why form-meaning systematicity might help acquisition in the first place. We started by considering evidence that words in denser semantic as well as phonological neighbourhoods tend to be learned earlier (Fourtassi et al., 2020; Jones & Brandt, 2019). This suggests that words which are more entrenched in the language network at the level of form and meaning are easier to learn, in line with evidence by Hills and colleagues (2009) that the structure of the language network itself drives acquisition. Our analyses however show that the effect of form-meaning systematicity on AoA cannot be reduced to neighbourhood density in form and meaning. What could the advantage that form-meaning systematicity brings? Several studies have documented that children learn more iconic and systematic words earlier and more easily (Brand et al., 2018; Imai et al., 2008; Imai & Kita, 2014; Kantartzis et al., 2011, 2019; Laing, 2019; Lockwood et al., 2016; Maurer et al., 2006; Monaghan et al., 2011, 2012; Motamedi et al., 2020; Nygaard et al., 2009; Perniss & Vigliocco, 2014), with one study also showing that children are particularly sensitive to sound-symbolic patterns in language EEG (Kovic et al., 2010). This evidence aligns with the theory by Gasser (2004) that systematicity helps learning by making it possible to leverage analogies in form to infer something about meaning. Iconic words, and especially onomatopoeias (Laing, 2019), may be learned earlier also because of the resemblance between word form and referent (Dingemanse et al., 2015; Nielsen & Dingemanse, 2020), offering a first source of information to figure out that words are used referentially in natural languages (Harnad, 1990). Our results suggest another possible advantage: Words with stronger form-meaning systematicity may occupy a privileged place in the language network, by which they have stronger relations with other words when considering both form and meaning. Crucially, it is not just a matter of occupying denser neighbourhoods, but of sharing similar relations across modalities. Our results call for more studies on the structure of the language network which consider cross-modal mappings, and its potential role in facilitating language learning.
Furthermore, we have shown that systematicity and iconicity are both reliable predictors of AoA and that they relate differently to SND; more iconic words tend to have lower SND, while the opposite was reported for systematicity, suggesting that they tap into different aspects of non-arbitrariness. At a fundamental level, however, FSC and iconicity both rely on analogies; iconicity draws attention on similarities between word form and perception, with words reproducing some physical property of their referent. Systematicity, on the contrary, draws attention on analogies between word form and lexical semantics. It may thus be the case that the risk of confusing two iconic words in context is higher when the two words are similarly iconic (which entails they have similar forms and mimic reality in a similar way), but lower when the two words are similarly systematic, since the situational context may help disambiguate them (hence the different relation with SND). These are however speculations at present and more work is needed to properly characterise the effects of iconicity and systematicity. Moreover, and more importantly, while iconicity can facilitate language learning from the very start of language acquisition, systematicity can only work after a few words are known. Therefore, we hypothesise that FSC and iconicity ultimately rely on a similar process whereby learners notice a reliable correspondence across two domains and use this to refine their intuitions about lexical meaning. First, this process would work on the observation that a word which resembles a referent in the world tends to denote that referent. Then, when some words are known, it could be extended to noticing that words that sound similar tend to refer to somewhat similar meanings. Future work should investigate the time course of the effect of both iconicity and systematicity on language learning: our study offers a promising way to conceptualise their impact on AoA and an effective tool to quantify systematicity, which has been validated in several studies on a variety of psycholinguistic phenomena.
Following existing work on non-arbitrariness (Monaghan et al., 2014; Perry et al., 2015), it could be further hypothesised that indeed later acquired words increase the degree of arbitrariness in the lexicon. Our work also suggests that systematicity computed over smaller vocabularies predicts AoA just as well or better than when computed on a larger vocabulary, confirming that larger vocabularies may feature less systematicity. Some studies also suggest that the role of systematicity is modulated by lexical knowledge (Brand et al., 2018; Monaghan et al., 2011). When few words are known, systematicity may actually offer cues to the specific semantic content of an individual word, whereas with larger vocabularies, systematicity may be more useful to derive a word’s category (lexical or semantic). Longitudinal computational analyses which leverage productive form-meaning mapping functions (Amenta et al., 2017; Baayen et al., 2019; Cassani et al., 2020; Chuang et al., 2021; Hendrix & Sun, 2020; Marelli et al., 2015) to map form onto meaning on the basis of the available lexicon can thus be useful to illuminate these dynamics further, by studying the semantic representations derived from word forms and how they relate with the available representations in semantic space at any given time. The present article shows that FSC measures hold promise in this regard since they have a strong unique relation with AoA, and they have been shown to work with words and pseudowords alike (Hendrix & Sun, 2020).
While our work focused on establishing whether FSC measures influenced AoA, our results only apply to English. This choice was primarily motivated by the large availability of rich secondary data on which to perform our computational analyses and the possibility of relying upon a large sample size. However, the effects we have investigated are not hypothesised to be language-specific (Blasi et al., 2016): Other languages have larger shares of iconic words in their vocabulary than English, e.g., Japanese. Therefore, replicating our study on a larger pool of typologically different languages will likely illuminate the relation between form-meaning systematicity and acquisition further.
Finally, our work did not provide conclusive evidence about the relation between systematicity, SND, and PND and how they may or may not interact in language learning. Interactions that proved significant in the main analysis did not prove reliable when using non-linear models or when predicting subjective AoA norms. More work should thus go into investigating these relations, formulating more precise and stringent hypotheses, which can build on the evidence we provide, particularly about the different nature of iconicity and systematicity, which we have previously discussed, and how this may bear relevance when analysing the relation between different forms of non-arbitrariness and other lexical properties.
In conclusion, our study showed that computational measures of form-meaning systematicity that rely on the coherence of local neighbourhoods in form and meaning account for a significant portion of variance in AoA. In detail, words with higher form-meaning coherence tend to be learned earlier. Our results thus show that English words for which it is easier to derive coherent semantic impressions from similar words tend to be learned earlier, suggesting a place in word learning for an analogical process which bridges word form and meaning, such that children could guesstimate the meaning of a novel word by relying on the meaning of known similar sounding words. Crucially, this effect is not reducible to words being highly entrenched in the language network when considering form and meaning alone. Moreover, the measures of form-meaning systematicity we used can be seamlessly extended to actually generate distributed semantic representations from form alone, allowing to study the role of form-meaning systematicity during development using large datasets and a data-driven approach.
Supplemental Material
sj-pdf-1-qjp-10.1177_17470218211053472 – Supplemental material for Not just form, not just meaning: Words with consistent form-meaning mappings are learned earlier
Supplemental material, sj-pdf-1-qjp-10.1177_17470218211053472 for Not just form, not just meaning: Words with consistent form-meaning mappings are learned earlier by Giovanni Cassani and Niklas Limacher in Quarterly Journal of Experimental Psychology
Footnotes
Acknowledgements
The authors are grateful to Marco Marelli, Emmanuel Keeulers, and Peter Hendrix for the useful feedback during the conceptualisation of the work. They also thank Yu-Ying Chuang, Manuel Perea, and an anonymous reviewer for useful comments and improvement points on an earlier version of this work.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
