Learning to predict: Second language perception of reduced multi-word sequences

Abstract

The cognitive entrenchment of frequent sequences comes as ‘chunking’ (holistic storage) and as ‘procedure strengthening’ (predicting elements in a sequence). A growing body of research shows effects of entrenchment of multi-word sequences in the native language, which is learned and shaped continuously and intuitively. But how do they affect second language (L2) speakers, whose language acquisition is more analytic but who nonetheless also learn through usage? The present study tests advanced English learners’ receptive processing of multi-word sequences with a word-monitoring experiment. Recognition of to in the construction V to V_inf was tested for full and reduced forms ([tʊ] vs. [ɾə]), conditioned by the general frequency of the V-to sequence and the transitional probability (TP) of to given the verb (V > to). The results are compared with those previously obtained from native speakers. Results show that recognition profits from surface frequency, but not from TP. Reduced forms delay recognition, but this is mitigated in high-frequency sequences. Unlike native speakers, advanced learners do not exhibit a chunking effect of high-frequency reduced forms, and no facilitating effect of TP. We attribute these findings to learners’ lesser experience with spontaneous speech and phonetic reduction. They recognize reduced forms less easily, show weaker entrenchment of holistic representations, and do not draw on the full range of probabilistic cues available to native speakers.

Keywords

chunking frequency effects multi-word sequences phonetic reduction second language processing entrenchment

I Introduction

According to usage-based models of language, speakers’ grammatical competence develops through experience with linguistic forms and structures (Bybee, 2010; Diessel, 2019; Ellis, 2013, 2019; Goldberg, 2006; Tomasello, 2000, 2005). Language learning is ‘a predominantly inductive and experience-driven process’ (Wolter and Gyllstad, 2013: 452), in which the frequency of linguistic items and their combinations play a central role. From this perspective, any language acquisition results from ‘an accumulation of statistical probabilities and abstraction of regularities out of previous construction encounters’ (Supasiraprapa, 2019: 988). Thus, the mind builds up its knowledge of linguistic structures by forming dynamic associations between signs or constructions. These can achieve different degrees of cognitive entrenchment (Blumenthal-Dramé, 2012, 2018; Langacker, 1987). A high degree of entrenchment increases the probability that a particular linguistic item will be (re)used in the future, as well as reinforcing the associations in which that linguistic item takes part (see Balog, 2023; Diessel, 2019; Schmid, 2015, 2020).

Support for the usage-based perspective comes from studies demonstrating that language users are sensitive to frequency effects across different levels of the system: pronunciation variants (Brand and Ernestus, 2018; Bürki et al., 2011; Connine and Pinnow, 2006), syllables, morphemes, words (Alvarez et al., 2001; Reichle and Perfetti, 2003), phrasal and clausal structures (Arnon and Snider, 2010; Bannard and Matthews, 2008; Jurafsky et al., 2001; McConnell and Blumenthal-Dramé, 2021; Reali and Christiansen, 2007). At the phrasal level, for instance, Bannard and Matthews (2008) and Arnon and Snider (2010) show that children and adult first language (L1) speakers, respectively, process frequent multiword sequences faster than low-frequency ones.

A number of studies have suggested a similar role for frequency and intuitive statistics in the learning of a second language (Diependaele et al., 2013; Ellis, 2019; Ellis et al., 2016; Hernández et al., 2016; Ortega, 2013; Sonbul, 2015; Supasiraprapa, 2018, 2019; Wolter and Gyllstad, 2013). On the other hand, it has been argued that frequency information is not as central, or that it might be processed differently, in L2 learning because of nonnative speakers’ more analytical/non-formulaic approach to the target language (Wray, 2002), or because of the meddling of automatized routines shaped in the L1 (Ellis, 2006: 164). Yet, proficient learners have shown a notable degree of reliance on frequency information in language perception (see Durrant and Schmitt, 2010; Siyanova-Chanturia et al., 2011; Supasiraprapa, 2019; Wolter and Gyllstad, 2013; Yamashita and Jiang, 2010).

The present study re-examines these issues with regard to the entrenchment of multi-word sequences and processing of reduced forms. While multi-word frequency effects have been found in second language use and perception (see Section I.1), the possible routes of entrenchment (as either ‘procedure strengthening’ / sequential knowledge or ‘chunking’ / holistic access; see Section I.2) have rarely been discussed explicitly. Therefore, we address the following questions:

To what extent do second language learners form holistic units from frequent compositional sequences?

Do they rely on probabilistic information in receptive processing?

How do frequency-based expectations affect their recognition of reduced forms, given that reduction generally poses a problem even to advanced learners (Ernestus et al., 2017)?

We investigate the interplay of frequency information and reduction in the perception of English verb + to-infinitive sequences (V to V_inf, e.g. want to V_inf, pretend to V_inf) with advanced L2 learners (L1 Spanish). The experiment design – a word-monitoring task – is adapted from a previous L1 study (Lorenz and Tizón-Couto, 2019). The results can therefore be directly compared to those from native speakers.

The remainder of this introduction provides a review of entrenchment of multi-word sequences. Section II presents the experiment materials and design, including a list of variables considered and the data analysis procedure. The results are shown in Section III. We discuss the findings and compare them to those from native speakers in Section IV. Section V summarizes the main findings, suggesting that proficient L2 listeners draw on frequency information, but do not avail themselves of the full range of it.

1 Entrenchment of multi-word sequences in L2 learning

Entrenchment as an effect of frequency is known to pertain not only to single items but also to sequences. A deeply entrenched multi-word sequence may be produced or understood without the need to process every element incrementally (see Balog, 2023: 214–218; Blumenthal-Dramé, 2018; Siyanova-Chanturia, 2015), and ‘phrases that are of sufficient frequency can attain independent representation as a way of making processing more efficient’ (Arnon and Snider, 2010: 69). Although the nature of this independent mental representation is complex (and deserves further explanation; see Section I.2); existing research attests to the effects of frequency and entrenchment of multi-word sequences in the native language, which speakers learn and shape continuously and intuitively (see Arnon and Snider, 2010; Blumenthal-Dramé, 2018; Kapatsinski and Radicke, 2009; Reali and Christiansen, 2007; Siyanova-Chanturia et al., 2011; Sosa and MacFarlane, 2002; Tremblay et al., 2011).

Even though the acquisition process of a second language usually differs from that of the native language, there is good evidence that (proficient) L2 learners can also use frequency information to process language beyond the single-word level, as in formulaic sequences and collocations. For instance, a number of studies have confirmed the more efficient processing of formulaic sequences (e.g. take the bull by the horns) by L2 learners: they are processed faster and more accurately than nonformulaic sequences (Conklin and Schmitt, 2008; Jiang and Nekrasova, 2007; Underwood et al., 2004; Yamashita and Jiang, 2010). Similar findings have been reported with (non-formulaic) collocations and multi-word sequences. Durrant and Schmitt (2010) show that proficient learners can memorize lexical items better from collocations (adj-n) than individually, and conclude that learners ‘do not focus their learning entirely on individual words’ (p. 182). In Siyanova-Chanturia et al.’s (2011) eye-tracking study (reading times and fixation count), both native speakers and highly proficient nonnative speakers were able to process frequent three-word binomial phrases (e.g. bride and groom) significantly faster than the less frequent reversed forms (e.g. groom and bride). In contrast, less proficient nonnative speakers exhibited ‘comparable reading speeds for both phrase types’ (p. 5). The authors conclude that proficient L2 speakers are sensitive to the frequency of multi-word sequences, in a similar way as native speakers are. Wolter and Gyllstad (2013), Sonbul (2015), and Wolter and Yamashita (2018) report similar findings for adjective-noun collocations (e.g. middle class, real estate), in addition to an influence of the ‘congruency’ of the collocational pairs (i.e. whether they have a suitable L2 > L1 word-by-word translation). Wolter and Gyllstad (2013: 472) conclude that a ‘unified’ model of L2 processing should assume ‘greater centrality for frequency effects (and better recognition for larger grain sizes) with gains in L2 proficiency’. Effects of collocation strength also extend to recognition of function words, in nonnative speakers as well as native speakers (Baese-Berk et al., 2018). In two similar studies, Hernández et al. (2016) and Supasiraprapa (2019) use phrasal-decision tasks on four-word compositional phrases (e.g. a lot of places, I have to say) with native and non-native speakers. Both report a steady frequency effect across groups and conclude that their results support single-system models of language, in which words and larger sequences are processed by the same cognitive mechanisms.

However, other studies have reported that collocations and multi-word expressions pose a challenge to non-native speakers, and that other factors may impede learners’ sensitivity to frequency. González-Fernández and Schmitt (2015) find only a modest frequency effect for the knowledge of two-word collocations by Spanish learners and highlight the role of language exposure (similarly, Smith, 2021). Fioravanti et al. (2021) suggest that (non-)compositionality plays a greater role than frequency, such that chunking might be limited to non-compositional idioms in L2 learners. In von Stutterheim et al.’s (2021) study on expressions of motion events, the typical event construal in the L1 appears to determine proficient non-native speakers’ choice of constructions, overriding the effect of frequency in the L2.

Overall, there is strong evidence that gradual, frequency-driven entrenchment of multi-word sequences plays a role in L2 learning, especially at higher levels of proficiency. The jury is still out on whether or how this differs from processes in the L1. More specifically, an open question regards entrenchment as ‘chunking’ versus sequential knowledge, as discussed in the next section.

2 Entrenchment of sequences and chunking

Entrenchment of a frequent sequence can be apprehended in two different, though not mutually exclusive, ways. First, what is strengthened in memory may be the string as a whole, i.e. a complete, largely invariant phrase or bigram, such as I don’t know or want to. This view corresponds to the notion of chunking, which leads to a memory representation as a single unit in which internal elements and boundaries are no longer salient or perceptually relevant (see Bybee, 2006; Diessel, 2007; Ellis, 2002a, 2002b; Ellis et al., 2008). Second, a sequence can be entrenched by ‘procedure strengthening’ (Divjak and Caldwell-Harris, 2015: 66–67; Hartsuiker and Moors, 2018). The individual elements are not backgrounded, but rather predictable from each other, due to a tacit knowledge of their frequent co-occurrence.

In both perspectives, entrenchment provides a general processing advantage for the respective sequence: a holistic unit is more easily recognized than an assembled string of items, as is a predictable sequence.

It has been suggested that procedure strengthening and chunking are on a continuum: A frequent sequence is entrenched as a procedure, and this entrenchment eventually leads to an increasingly holistic perception and storage, i.e. chunking (see Blumenthal-Dramé, 2012: 68–69; Langacker, 1987: 59–60). This has been supported by word-monitoring studies which find chunking effects only at very high collocation frequencies (Kapatsinski and Radicke, 2009; Sosa and MacFarlane, 2002). However, given that mental representations are gradient and can be redundant (see Elman, 2009), a stored ‘chunk’ does not preclude recognition of the sequence, and a phrase might even be acquired as a whole before its composition from the parts (see Arnon and Christiansen, 2017: 632; Siyanova-Chanturia, 2015). That the components of frequent sequences remain relevant is shown by Arnon and Cohen Priva (2014), where both n-gram frequency and individual word frequency showed effects on word durations in speech production (see also Konopka and Bock, 2009; Molinaro et al., 2013; Snider and Arnon, 2012). Moreover, individuals differ in their tendency to focus on the elements or the whole (McConnell, 2023; see also Balog, 2023).

The status of a ‘chunk’ need not be strictly a matter of ‘stored vs. computed’ (Caldwell-Harris et al., 2012: 4). It can be defined by ‘global precedence’, by which ‘the configuration as a whole is cognitively more prominent than its individual component parts’ (Blumenthal-Dramé, 2018: 138). A holistic and a compositional access can be activated at the same time, at varying levels. This also shows in our earlier findings (Lorenz and Tizón-Couto, 2019), where a chunking effect depends not just on frequency but also on form (reduction). Phonetic reduction is known to result from entrenchment generally. Yet, its role in L2 processing is still under-researched.

As this summary shows, it is important to ‘directly [consider] the relationship between the parts and the whole’ (Siyanova-Chanturia, 2015: 291), as well as the forms they take. The present study addresses this by means of a word-monitoring experiment with advanced L2 learners of English.

II Method

Our study focuses on the V + to-infinitive (V to V_inf) construction in English. High-frequency types of this construction (e.g. have to V_inf, want to V_inf) are often regarded as chunks and frequently reduced in spontaneous speech (see Krug, 2000; Lorenz and Tizón-Couto, 2024a; Tizón-Couto and Lorenz, 2018). We might assume that the rate and degree of reduction is directly linked to the item’s general frequency (Bybee, 2001: 165–166, 2006; Ellis, 2002b: 331) and to its predictability in context (see Flach, 2020; Levshina and Lorenz, 2022). However, usage of a reduced variant is also contingent on register and style (Berglund, 2000; Levshina and Lorenz, 2022; Lorenz, 2020), as well as morpho-phonological properties and semantic transparency (Lorenz and Tizón-Couto, 2020). Some reduced forms have acquired degrees of conventionality, even to the point of divergence and ‘emancipation’ from the source form (e.g. gonna vs. going to; Lorenz, 2013; Mahler, 2022).

We investigate how chunking and procedure strengthening apply to V to V_inf constructions by testing the recognition of the element to. If the element is predictable due to the frequency of the sequence or its likelihood given the preceding verb, this should result in faster recognition. If, on the other hand, the V-to item is initially perceived as a chunk, recognition of the element will be slowed down. Importantly, we include full and reduced realizations, to see how probabilistic expectation might help non-native listeners cope with reduction.

1 Experiment design and task

The experiment is a replication of the word-monitoring task reported in Lorenz and Tizón-Couto (2019), except with advanced learners of English instead of native speakers. The task consists in listening to recorded sentences and responding to the word to (or noting its absence) as quickly as possible.

The participants were 44 Spanish learners of English living in the region of Galicia (all right-handed and of normal hearing). All had a certified C1–C2 level of proficiency according to the Common European Framework of Reference (CEFR, 2001).

The input material was recorded by a female speaker with a General American accent. The experiment comprises 126 sentences, of which 42 each are target, control and distractor items (for a list of the items, see Appendix 1 in supplemental material). Target items contain a V to V_inf construction – examples (1a) and (1b) – control items contain to in a different context (2), and distractors include no to at all (3). The control items contain the same verbs as the target items, but with a different complement (e.g. prefer and agreed in example (2); compare (1a)–(2a) and (1b)–(2b)), so that participants could not expect a V to V_inf pattern whenever the verb might trigger it.

(1) a. When the monkeys have their party I prefer to leave the house.

b. The elephants all agreed to have their tusks painted.

(2) a. At dinner parties I prefer penguins to monkeys.

b. We all agreed that it was a bad idea to paint the elephants’ tusks.

(3) There is just no point in teaching the crocodiles manners.

Target items come in one of two forms: A full form in which to is fully articulated [tʊ], or a reduced form with to pronounced with an alveolar flap and a schwa [ɾə]. Each participant was assigned to one of two versions of the presentation list, so that they would hear half of the items in the full form and the other half reduced. While one version contained, for example, prefer [tʊ] and remember [ɾə], the other version had prefer [ɾə] and remember [tʊ]. Apart from this, the items were the same.

The experiment was carried out on a laptop computer with OpenSesame (version 3.2.6; Mathôt et al., 2012) with the ‘Expyriment’ backend (Krause and Lindemann, 2014). Participants heard the stimuli on studio over-ear headphones and were asked to hit a marked button on the keyboard as soon as they heard the word to; another button was to be pressed if to was absent from the sentence. The response and reaction time (from onset of to) were recorded. The actual task was preceded by a practice round of four items; stimuli in the main task were presented in random order. To ascertain participants’ continuing attention, short questions about the content of individual sentences were strewn in at varying intervals.

2 Variables

The variables and analysis are built on our previous experiment (Lorenz and Tizón-Couto, 2019: 757–760), with the addition of control variables which particularly relate to L2 acquisition. Our main analysis concerns response times (RTs) on correctly identified items, as a measure of processing effort. RTs were measured from the onset of to, i.e. from the release burst of the plosive. For the analysis, we transformed them to logarithmic values, so as to not overestimate the differences between longer response times (see Baayen, 2008: 30).

Condition (full vs. reduced), frequency and transitional probability (TP) are the test variables. Derived from the ‘Spoken’ section of the Corpus of Contemporary American English (COCA; Davies, 2008– ), frequency was measured as the normalized frequency (tokens per 1 million words) of the given verb form with a to-infinitive complement, and log-transformed for analysis. TP is the relative likelihood [0,1] of a to-infinitive occurring after a particular verb. For instance, have to V_inf has a high frequency (951.4 per million, log = 6.86) but a low TP (0.136, meaning that 13.6% of all instances of have are followed by to V_inf), whereas deign to V_inf is the opposite (frequency 0.03 per million, log = −3.6; TP = 0.941).

We also tested for the following set of control variables.

a Frequency of to-V_inf and backward transitional probability

Phonetic reduction of an item can be conditioned by its collocational frequency with the following word (see Barth and Kapatsinski, 2017; Bell et al., 2009; Gradoville, 2017; Kilbourn-Ceron et al., 2020). A word that often occurs after to might aid the hearer to more quickly detect the element to. This is controlled for by considering both the surface frequencies of to-V_inf sequences and the probability of to given the following word (backward TP), as derived from the spoken section of COCA.

b Verb duration, syllable count and verb form

The duration of the verb in each V-to item was measured, ranging from 182 ms to 590 ms (mean = 350 ms). Since many verbs are monosyllabic, an additional factor considers whether the verb has one syllable or more. The inflectional forms of the Vs reflect the form with the highest surface frequency in the corpus (e.g. trying for the lemma try). To control for the potential influence of inflection, the 42 verbs were coded for present (26), past (12) and progressive (4).

c Merged plosive cluster

In verbs ending in alveolar stops (/d/ or /t/ in need to or forgot to), this sound merges with the initial /t/-sound in to, regardless of its variant (full or reduced). The separation between the verb and to is less marked in these cases (19 out of 42) and might delay recognition.

d Preceding sound

Lenited /t/-sounds are typically disfavored after fricatives in speech production (see Lorenz and Tizón-Couto, 2024a). Because some of our experimental items include this context (10/42), the sound segment preceding /t/ was coded for two levels: fricative (e.g. deserve to) or vowel/nasal (e.g. continue to, began to).

e Control before target and item count

This variable checks whether the control item with a given verb was heard before the target item with the same verb, as this situation might have led to a priming effect.

To control for learning or fatigue effects, the item count during the experiment was considered, both as a control variable and as a by-participant random slope in the statistical model (see Section II.3).

f Gender and age

Participants were asked to provide their details as they were introduced to the experiment. Twenty-seven reported ‘female’, 15 ‘male’ and one ‘other’. Age was taken up by year of birth and ranges from 17 to 54 years (mean = 30; median = 27).

g Proficiency level of participants (certificate and test score)

So as to cross-check the proficiency levels certified by the participants (C1 or C2 in CEFR) before the experiment, they completed a brief comprehension test consisting of two listening tasks. They listened twice to two original recordings of American English extracted from a lecture and a radio programme, and then had to fill gaps of one to four words in 18 sentences (9 per task) about the content of the recordings. The two tasks were designed on the basis of the guidelines for standardized tests for the C1 level, in particular those implemented at Escuela Oficial de Idiomas (Specialized Official Foreign Language Education) within the Spanish educational system. The proficiency level of participants was coded on the basis of both their overall certification and the score they achieved in the listening comprehension test. The latter measures listening proficiency directly, which is especially relevant for the experiment.

h Years of learning English and time abroad

Participants were asked to provide the number of years learning English at school, high school, university or elsewhere. They also listed previous stays in English speaking countries which were longer than one month. This variable was coded as ‘none’, ‘UK’, or ‘America’ (where ‘America’ means any stay in the U.S.A. or Canada).

i Congruency

The native languages of the participants are Spanish and Galician.¹ These two languages might feature either a particle (que, a, de, etc.) between the V and the V_inf in instances corresponding to the English ‘V to V_inf’ construction (e.g. tengo/teño que ver, comenzar/comezar a sentir, tratar de explicar) or an empty slot (zero in quiero/quero comer, necesita parar, etc.). On the basis of the distinction proposed in Yamashita and Jiang (2010) or Wolter and Gyllstad (2013) for two-word collocations, we coded this variable as ‘congruent’ (the corresponding Spanish/Galician sequence includes a particle) or ‘incongruent’ (the Spanish/Galician equivalent does not include a particle).

3 Data analysis

The experiment yielded 3,612 responses in total (1,806 each of target and control items). We define a correct response to a target item as identifying to within 100–3,000 ms of its onset. Earlier or later responses, even if correct, are arguably due to either false recognition or later-stage re-processing after hearing the complete sentence. The resulting accuracy rate on target items, i.e. the rate of correct responses, is 78% (1,412/1,806).²

The analysis of response times concerns only correct responses to target items. After removing individual outliers (by-subject z-score > 2.5; see Baayen and Milin, 2010: 16), the final data set comprises 1,372 data points. We modeled the results by way of a mixed-effects generalized additive model (GAM; Wood, 2017) on the logarithmized response times.³ A GAM can represent correlations without enforcing a linear correspondence between variables. We applied exactly the same modeling procedure as in the native-speaker study, where it is laid out in fuller detail (Lorenz and Tizón-Couto, 2019: 760–761). We employ frequency and TP (transitional probability) as test variables (which must remain in the model), and condition as a moderator variable whose interaction with every other variable is tested (see Jaccard, 2001). Potential control variables are all the others listed in Section II.2 above. In addition, we tested a number of random factors to capture the variability brought in by factors outside of the study design, such as individual differences between participants and idiosyncrasies of particular test items.⁴ The final set of control variables, as well as the random effects structure, were arrived at by backward stepwise variable selection, which starts from including all terms and gradually removes those that do not make a significant contribution to the outcome. This was done first with the random factors in the absence of control variables. The control variables were then put through variable selection in a second stage.

The resulting model includes the control variables test score, preceding sound, plosive cluster (each in interaction with condition), verb duration and backward TP, as well as random effects for subject (intercept), verb (intercept and slope by condition) and item count (slope by subject). The model specification is:⁵

logrt ~ s(logfreq, bs = ‘cr’, by = condition) + s(TP, bs = ‘cr’, by = condition) + s(subject.fac, bs = ‘re’) + s(verb, bs = ‘re’) + s(count_exp_items, subject.fac, bs = ‘re’) + s(condition, verb, bs = ‘re’) + condition + backward_TP + log(verb_duration) + condition * plosive_cluster + condition * prec.sound + condition * test_score

III Results

We first report briefly on accuracy, as incorrect responses are a coarse indicator of recognition problems – more detail and the statistical model for accuracy can be found in Appendix B in supplemental material. Between conditions, the accuracy rate is significantly lower for reduced forms (67%) than full forms (89%). Moreover, accuracy on reduced forms is lower with low-frequency items (see Appendix B in supplemental material). This confirms the expectation that reduced forms are more difficult to recognize, especially in less common, less familiar bigrams.

The core of the study design is the response times (RT) when to was correctly recognized. Overall, reduced items come with greater response latencies, compared to both full and control items (see Figure 1). These differences match those produced by native speakers, although native speakers showed a higher overall accuracy (86%; Lorenz and Tizón-Couto, 2019: 761).

Figure 1.

Beanplot:* Response times to control, full and reduced items (correct responses only).

How are these response latencies affected by bigram frequency, transitional probability, or other variables? The main results – for the test variables frequency and TP – are presented in Figure 2. The graphs show response times as estimated by the model for reduced and full items across the range of values for frequency (left panel) and TP (right panel). The effects of all other model terms are set to their mean.

Figure 2.

Response times to full and reduced items by frequency (left panel) and transitional probability (TP) (right panel).

In both graphs, recognition of reduced items is consistently slower than that of full forms, as expected. Surface frequency (Figure 2, left panel) shows a continuous facilitating effect on full forms, with only slight bends in the curve (left panel, blue line). The effect of frequency with reduced items is more erratic, but also generally such that higher frequency aids recognition, with the strongest effect at the high end of the frequency scale (the steep downward slope between log frequency 5.0 and 7.5). It is in these high-frequency sequences that we might expect delayed responses due to chunking, as was observed with native speakers. Advanced learners show no indication of such a chunking effect, though they clearly are responsive to bigram frequency in terms of procedure strengthening.

The transitional probability of to given the preceding verb has no evident effect on its recognition (Figure 2, right panel). The fluctuation in the curve for full forms shows that there is some variance between items, but the result is inconclusive at best. Generally, reduction delays recognition evenly across the TP range. This is different from the finding with native speakers, where reduced item recognition profited immensely from high TP. Thus, advanced learners seem to lack the sensitivity to this conditional frequency measure that native speakers have.

Five additional variables turned out to affect response times (see Figure 3). Three of these concern acoustic/phonetic properties of the main verb: verb duration, preceding sound (i.e. the speech sound before to) and the presence of a plosive cluster at the word boundary. The other two are the probability of to given the following word (backward TP) and, importantly, the participant’s English proficiency as measured by the comprehension test.

Figure 3.

Response times to full and reduced items by control variables.

Participants with higher test scores respond faster in both conditions, though the effect is stronger with reduced items (Figure 3, lower left panel). Apparently there are large differences even between advanced learners in how effectively they process spoken input and how well they can cope with reduction. However, we do not find any effect of congruency with the L1 pattern, thus no indication of L1 interference in this case (in contrast to other studies, e.g. Wolter and Gyllstad, 2013; Wolter and Yamashita, 2018; Yamashita and Jiang, 2010).

A merged plosive cluster occurs when the verb preceding to ends in a /t/ or /d/ (e.g. in hate to) and the two plosives are realized as one (full: [heɪtʊ], reduced: [heɪɾə]). This seems to make full forms easier to identify, perhaps due to a slightly longer closure, but delays the recognition of reduced items (Figure 3, top right panel). A plausible explanation is that the merged and reduced plosive obscures the word boundary and hence the onset of to. A facilitating effect is observed for longer verb duration (top left panel). A longer verb gives the hearer more time to parse the input and prepare for the next item; this appears to affect full forms in particular, though the interaction term is not part of the final model. Similar effects of verb duration and plosive cluster were also observed in the experiment with native speakers (Lorenz and Tizón-Couto, 2019: 770). This is not the case with preceding sound, where in advanced learners we see delayed recognition of reduced items preceded by a vowel or nasal (e.g. try or happen). These are the environments in which /t/-flapping is most common in (American) English and should therefore be expectable (see Patterson and Connine, 2001; Zue and Laferriere, 1979).

An additional – and surprising – relevant variable is backward TP, the likelihood of to given the following item (Figure 3, bottom right panel). The stimuli were designed to avoid surprises in the slot after to, filling it with common verbs such as give, play, have, be, etc. The range of backward TP is therefore small. Still, a higher backward TP facilitates the recognition of reduced to. This is all the more surprising as the test variable of forward TP showed no such effect.

IV Discussion

We first discuss the results of the word-monitoring study with advanced learners of English in comparison to those gained from native speakers with the same study design (Lorenz and Tizón-Couto, 2019). Then we will relate the findings to what is known about second language processing and comprehension.

First, it is no surprise that learners show a lower accuracy rate and longer response times overall. Processing a spoken input in a foreign language clearly is more difficult than in the native language (see Arnon and Christiansen, 2017; and, for an overview, see Ernestus et al., 2017: 3–4). However, the relative delay and lower accuracy in response to reduced forms is very similar between the cohorts. Reduction generally makes for a less clear acoustic signal, which puts the burden of reconstructing the item on the hearer (see Ernestus, 2014; Ernestus et al., 2002; Lindblom, 1990). This has been shown even for frequently reduced items (Pitt, 2009; Pitt et al., 2011; Ranbom and Connine, 2007), and it holds in our results for advanced learners (this study) as well as native speakers (Lorenz and Tizón-Couto, 2019).

How do advanced learners make use of frequency information, compared to native speakers? To what extent do they form sequential expectations or holistic units from compositional sequences? In Lorenz and Tizón-Couto (2019) we have adopted a view that high-frequency combinations can be entrenched first as sequential information (‘procedure strengthening’) and, second, as holistic storage (‘chunking’); see Section I.2 above, and Divjak and Caldwell-Harris (2015), Siyanova-Chanturia (2015), and Blumenthal-Dramé (2018). We argued that these are not mutually exclusive; reduced forms can make the individual parts less discernible and give prominence to the whole, while full forms make it easier to recognize the items in sequence. Moreover, native speakers identify reduced items more quickly when they are predictable from the immediate context (as measured by transitional probability). This implies a tacit knowledge of conditional probabilities and the ability of ‘predictive processing’, of intuitively forming and updating expectations of what comes next (see Divjak, 2019, chapter 8; Kuperberg and Jaeger, 2016; Pickering and Garrod, 2007). With native speakers, these expectations come to bear when facing a reduced input, that is, as a ‘helping hand’ when the acoustic signal is weakened (Huettig and Mani, 2016; for a recent discussion, see McConnell and Blumenthal-Dramé, 2021). In our present findings, advanced learners are sensitive to frequency information, too, and can reach for its ‘helping hand’. They clearly profit from higher bigram frequencies, which we interpret as an entrenchment effect in terms of procedure strengthening. However, this is limited to surface frequency: the more complex cue from transitional probability seems to require more experience to enable reliable predictions (see also Grüter and Rohde, 2021).

We also do not observe a chunking effect with high-frequency bigrams. This might be explained in two ways: first, if chunking only sets in at very high frequencies, most learners may simply not have had the necessary amount of exposure, especially to reduced forms; second, learners might be trained to parse structures in the target language in a more analytical, word-by-word fashion (see Wray, 2002: 205–206). Moreover, the task of the present experiment rather discourages holistic processing, so the effect of chunking will only show when it indeed overrides sequential prediction. Other studies that show collocation frequency effects in learners have evoked chunking as an explanation but do not distinguish it from procedure strengthening (e.g. Durrant and Schmitt, 2010; Sonbul, 2015).

Given that transitional probability has no interpretable effect in either condition, we could conclude that learners base their expectations solely on surface frequency and do not draw on any more complex probabilistic information. However, higher backward transitional probability does facilitate recognition. This indicates that learners are not really insensitive to conditional probabilities, though they differ from native speakers. A native-speaker production study by Bell et al. (2009) suggests that reduction of high-frequency function words (such as to) is contingent on the previous word, especially when reduction affects the onset (as in the reduced variant [ɾə] that we used). This was matched in the effect of (forward) TP in the word-monitoring study with native speakers (Lorenz and Tizón-Couto, 2019: 772). The learners’ responses seem to rather correspond to mid-frequency function words, which are affected by their predictability given the following word (Bell et al., 2009: 102). Thus, the difference could be explained by learners’ generally lower exposure to the language. It is probably reinforced by their longer response times: their reaction extends past the following word, which consequently has a greater influence than with faster reactions. An additional factor might be that in English teaching, verbs are often presented with their to-infinitive, so that to V becomes an overtly learned pattern, while V to is much less so.

Another difference from native speaker responses is the delayed recognition of reduced to when it follows a vowel or nasal. These are typical environments for /t/-reduction, though both production and perception of reduced forms vary with lexical frequency (Patterson and Connine, 2001; Pinnow et al., 2017; Pitt et al., 2011). The observed effect may then also be interpreted as a result of learners’ lower frequency of exposure, especially to casual speech. They will be less familiar with the reduced form and therefore less likely to expect it even in its typical environments; but it is these environments where it is less clearly marked off from the preceding phone. In this respect, it is also noteworthy that proficiency in listening comprehension makes a strong difference especially on the recognition of reduced forms, even though all participants had a very high level of English overall. This confirms that coping with reduction is a particularly demanding aspect of speech perception, and is not easily acquired. It remains an open question to what extent this depends on reduction patterns in the L1; for example, vowel reduction to schwa is relatively rare in Spanish/Galician (our participants’ L1). Speakers of languages in which it is more common (e.g. Portuguese, French, German) might be able to profit from this in the L2.

Our findings dovetail with usage-based accounts of second language acquisition, which take input frequencies as a crucial element of developing the L2 inventory (e.g. Ellis, 2002a, 2015; Ellis and Wulff, 2020; Eskildsen, 2009; Wulff, 2019). Second language learners do not only learn by the book but also, like native speakers, keep tally of commonly occurring items and combinations of items. Then, how do frequency-based expectations come to bear in coping with reduction? In our results, the clearest and strongest effects are those of V + to bigram frequency. This shows especially at the high end of the frequency range, where the slope is at its steepest (see Figure 2, left panel, above). At the highest frequency, reduced items are recognized almost as quickly as full forms and much faster than mid-frequency reduced items. It is known that learners often rely on constructional prototypes, fixed sequences that serve as templates around which constructional patterns are construed (Eskildsen and Cadierno, 2007; Myles, 2004). While this has usually been reported from less advanced learners’ L2 production, it is plausible that it still affects highly proficient learners’ perception. High-frequency V-to items like going to, want to, have to (with log frequencies around 7 in our data) may well be such formulaic prototypes in a learner’s English inventory, and therefore be much more deeply entrenched than the runners-up in terms of frequency, such as need to and used to (with log frequencies around 5). Still, there seems to be no interference from holistically stored reduced variants (in contrast to the chunking effect found with native speakers). While the frequency effects that we find are better attributed to procedure strengthening than chunking, they do not disprove the idea that L2 learners can also draw on chunks based on frequency and form: at least, the effects of phonological environment (preceding sound and plosive cluster merging) show that they are sensitive to the phonological embedding of reduction. Substantial effects of memorized chunks might depend on a very high level of exposure, particularly to spontaneous spoken language.

V Conclusions

In sum, our word-monitoring experiment adds to the evidence of frequency effects in advanced learners’ speech perception. A high surface frequency of a bigram clearly facilitates recognition and also aids the recovery of a reduced form. We do not, however, observe a chunking effect of high-frequency sequences, where a holistic perception would take precedence over the compositional one. Moreover, learners seem to make little use of predictability from transitional probability in the given task.

This differs from earlier findings with native speakers, where a wider range of frequency effects was observed (facilitation with increasing surface frequency, chunking effect with reduced forms, faster recovery of reductions with high transitional probability). In comparison with other second language studies, our results confirm that learners are sensitive to collocation frequencies (see Conklin and Schmitt, 2008; Siyanova-Chanturia et al., 2011; Wolter and Gyllstad, 2013), and support the notion that learners rely primarily on surface frequency and less on sequential probabilities (see Arnon and Christiansen, 2017; Diependaele et al., 2013; Ellis et al., 2008). Thus, advanced L2 learners profit from frequency information, but do not avail themselves of the full range of it. The ‘helping hand’ of probabilistic knowledge is there, but L2 learners only reach for its little finger.

In our understanding, much of this can be explained by learners’ generally lower exposure to the target language, even at an advanced level of proficiency. Probabilistic information is relatively complex, as it involves not only the frequency of a given sequence but also of potential competitors. For hearers to be able to draw on this kind of information for effective speech perception may require a large amount of experience and a diverse input of spoken language. The way in which second language learners acquire the finer skills of processing that optimize speech perception may be similar or identical to that of native speakers: intuitive, through experience, by a gradient entrenchment of structures and patterns. It is the amount and kind of experience that is different.

Supplemental Material

sj-docx-1-slr-10.1177_02676583241246147 – Supplemental material for Learning to predict: Second language perception of reduced multi-word sequences

Supplemental material, sj-docx-1-slr-10.1177_02676583241246147 for Learning to predict: Second language perception of reduced multi-word sequences by David Tizón-Couto and David Lorenz in Second Language Research

Supplemental Material

sj-docx-2-slr-10.1177_02676583241246147 – Supplemental material for Learning to predict: Second language perception of reduced multi-word sequences

Supplemental material, sj-docx-2-slr-10.1177_02676583241246147 for Learning to predict: Second language perception of reduced multi-word sequences by David Tizón-Couto and David Lorenz in Second Language Research

Footnotes

Acknowledgements

We would like to thank the editorial board of Second Language Research for their scrutiny and valuable suggestions; and three anonymous reviewers for their constructive criticism and insightful comments. We would also like to acknowledge the useful feedback we received from various colleagues at the Fifth Variation and Language Processing Conference (VALP5) in Copenhagen (2021), the Ninth International Conference of the German Cognitive Linguistics Association (DGKL/GCLA-9) in Erfurt (2022), and the Fourteenth International Conference of Experimental Linguistics (ExLing 2023) in Athens. All of these have immensely improved this article; any remaining obscurities are entirely our own responsibility.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The research reported in this article was funded by grant PID2020-118143GA-I00, awarded by MCIN/AEI/10.13039/501100011033/, and Xunta de Galicia (grant ED431C2021/52); support is gratefully acknowledged.

ORCID iDs

David Tizón-Couto

David Lorenz

Open Badges Statement

The datasets and R-scripts used in the present study are published as Lorenz and Tizón-Couto (2024b) in the Tromsø Repository of Language and Linguistics (TROLLing), available at .

Supplemental material

Supplemental material for this article is available online; it includes the list of stimulus sentences (Appendix 1) and the statistical model summaries (Appendix 2).

Notes

References

Alvarez

Carreiras

Taft

(2001) Syllables and morphemes: Contrasting frequency effects in Spanish. Journal of Experimental Psychology: Learning, Memory, and Cognition 27: 545–55.

Arnon

Christiansen

(2017) The role of multiword building blocks in explaining L1–L2 differences. Topics in Cognitive Science 9: 542–51.

Arnon

Cohen Priva

(2014) Time and again: The changing effect of word and multiword frequency on phonetic duration for highly frequent sequences. The Mental Lexicon 9(3): 377–400.

Arnon

Snider

(2010) More than words: Frequency effects for multi-word phrases. Journal of Memory and Language 62: 67–82.

Baayen

(2008) Analyzing Linguistic Data: A Practical Introduction to Statistics Using R. Cambridge: Cambridge University Press.

Baayen

Milin

(2010) Analyzing reaction times. International Journal of Psychological Research 3(2): 12–28.

Baese-Berk

Morrill

Dilley

(2018) Predictability and perception for native and non-native listeners. Linguistics Vanguard 4: 20170022.

Balog

(2023) Entrenchment revisited: Some old and new concepts and their empirical validation. PhD dissertation, Friedrich-Alexander-Universität Erlangen-Nürnberg, Available at: https://open.fau.de/server/api/core/bitstreams/5bc90442-66f3-4ca8-8f5a-f578e65c7f77/content (accessed April 2024).

Bannard

Matthews

(2008) Stored word sequences in language learning: The effect of familiarity on children’s repetition of four-word combinations. Psychological Science 19: 241–48.

10.

Barth

Kapatsinski

(2017) A multimodel inference approach to categorical variant choice: Construction, priming and frequency effects on the choice between full and contracted forms of am, are and is. Corpus Linguistics and Linguistic Theory 13(2): 203–60.

11.

Bell

Brenier

Gregory

, et al. (2009) Predictability effects on durations of content and function words in conversational English. Journal of Memory and Language 60: 92–111.

12.

Berglund

(2000) Gonna and going to in the spoken component of the British National Corpus. In: Mair

Hundt

(eds) Corpus Linguistics and Linguistic Theory: Papers from the Twentieth International Conference on English Language Research on Computerized Corpora (ICAME 20). Amsterdam: Rodopi, pp. 35–49.

13.

Blumenthal-Dramé

(2012) Entrenchment in usage-based theories: What corpus data do and do not reveal about the mind. Berlin: Mouton de Gruyter.

14.

Blumenthal-Dramé

(2018) Entrenchment from a psycholinguistic and neurolinguistic perspective. In: Schmid

(ed.) Entrenchment and the Psychology of Language Learning. Berlin: Mouton de Gruyter, pp. 129–52.

15.

Brand

Ernestus

(2018) Listeners’ processing of a given reduced word pronunciation variant directly reflects their exposure to this variant: Evidence from native listeners and learners of French. The Quarterly Journal of Experimental Psychology 71(5): 1240–59.

16.

Bürki

Alario

Frauenfelder

(2011) Lexical representation of phonological variants: Evidence from pseudohomophone effects in different regiolects. Journal of Memory and Language 64: 424–42.

17.

Bybee

(2001) Phonology and language use. Cambridge: Cambridge University Press.

18.

Bybee

(2006) From usage to grammar: The mind’s response to repetition. Language 82(4): 711–33.

19.

Bybee

(2010) Language, usage and cognition. Cambridge: Cambridge University Press.

20.

Caldwell-Harris

Berant

Edelman

(2012) Entrenchment of phrases with perceptual identification, familiarity ratings, and corpus frequency statistics. In: Divjak

Gries

(eds) Frequency effects in language representation. Berlin: Mouton de Gruyter, pp. 165–94.

21.

Conklin

Schmitt

(2008) Formulaic sequences: Are they processed more quickly than nonformulaic language by native and nonnative speakers? Applied Linguistics 29(1): 72–89.

22.

Connine

Pinnow

(2006) Phonological variation in spoken word recognition: Episodes and abstractions. The Linguistic Review 23(3): 235–45.

23.

Council of Europe (2001) Common European Framework of Reference for Languages: Learning, Teaching, Assessment (CEFR). New York: Cambridge University Press.

24.

Davies

(2008– ) The Corpus of Contemporary American English (COCA): 450 million words, 1990–present. Available at: https://www.english-corpora.org/coca (accessed April 2024).

25.

Diependaele

Lemhöfer

Brysbaert

(2013) The word frequency effect in first- and second-language word recognition: A lexical entrenchment account. Quarterly Journal of Experimental Psychology 66: 843–63.

26.

Diessel

(2007) Frequency effects in language acquisition, language use, and diachronic change. New Ideas in Psychology 25(2): 108–27.

27.

Diessel

(2019) The grammar network: How linguistic structure is shaped by language use. Cambridge: Cambridge University Press.

28.

Divjak

(2019) Frequency in language: Memory, attention and learning. Cambridge: Cambridge University Press.

29.

Divjak

Caldwell-Harris

(2015) Frequency and entrenchment. In: Dąbrowska

Divjak

(eds) Handbook of cognitive linguistics. Berlin: Mouton de Gruyter, pp. 53–75.

30.

Durrant

Schmitt

(2010) Adult learners’ retention of collocations from exposure. Second Language Research 26: 163–88.

31.

Ellis

(2002a) Frequency effects in language processing: A review with implications for theories of implicit and explicit language acquisition. Studies in Second Language Acquisition 24(2): 143–88.

32.

Ellis

(2002b) Reflections on frequency effects in language processing. Studies in Second Language Acquisition 24(2): 297–339.

33.

Ellis

(2006) Selective attention and transfer phenomena in L2 acquisition: Contingency, cue competition, salience, interference, overshadowing, blocking, and perceptual learning. Applied Linguistics 27: 164–94.

34.

Ellis

(2013) Second language acquisition. In: Trousdale

Hoffmann

(eds) Oxford handbook of construction grammar. Oxford: Oxford University Press, pp. 365–78.

35.

Ellis

(2015) Cognitive and social aspects of learning from usage. In: Cadierno

Eskildsen

(eds) Usage-based perspectives on second language learning. Berlin: Mouton De Gruyter, pp. 49–73.

36.

Ellis

(2019) Essentials of a theory of language cognition. Modern Language Journal 103: 39–60.

37.

Ellis

Römer

O’Donnell

(2016) Usage-based approaches to language acquisition and processing: Cognitive and corpus investigations of construction grammar. Malden, MA: Wiley-Blackwell.

38.

Ellis

Simpson-Vlach

Maynard

(2008) Formulaic language in native and second language speakers: Psycholinguistics, corpus linguistics, and TESOL. TESOL Quarterly 42: 375–96.

39.

Ellis

Wulff

(2020) Usage-based approaches to L2 acquisition. In: Van Patten

Keating

Wulff

(eds) Theories in second language acquisition: An introduction. New York: Routledge, pp. 63–82.

40.

Elman

(2009) On the meaning of words and dinosaur bones: Lexical knowledge without a lexicon. Cognitive Science 33: 547–82.

41.

Ernestus

(2014) Acoustic reduction and the roles of abstractions and exemplars in speech processing. Lingua 142: 27–41.

42.

Ernestus

Baayen

Schreuder

(2002) The recognition of reduced word forms. Brain and Language 81: 162–73.

43.

Ernestus

Dikmans

Giezenaar

(2017) Advanced second language learners experience difficulties processing reduced word pronunciation variants. Dutch Journal of Applied Linguistics 6(1): 1–20.

44.

Eskildsen

(2009) Constructing another language: Usage-based linguistics in second language acquisition. Applied Linguistics 30(3): 335–57.

45.

Eskildsen

Cadierno

(2007) Are recurring multi-word expressions really syntactic freezes? Second language acquisition from the perspective of usage-based linguistics. In: Nenonen

Niemi

(eds) Collocations and idioms 1: Papers from the First Nordic Conference on Syntactic Freezes. Joensuu: Joensuu University Press, pp. 86–99.

46.

Fioravanti

Senaldi

MSG

Lenci

, et al. (2021) Lexical fixedness and compositionality in L1 speakers’ and L2 learners’ intuitions about word combinations: Evidence from Italian. Second Language Research 37(2): 291–322.

47.

Flach

(2020) Reduction hypothesis revisited: Frequency or association? In: Sanchez-Stockhammer

Günther

Schmid

(eds) Language in mind and brain: Multimedial proceedings of the workshop held at Ludwig–Maximilian University Munich, December 10–11, 2018. Munich: Open Access LMU, pp. 16–22.

48.

Goldberg

(2006) Constructions at work: The nature of generalization in language. Oxford: Oxford University Press.

49.

González-Fernández

Schmitt

(2015) How much collocation knowledge do L2 learners have? The effects of frequency and amount of exposure. International Journal of Applied Linguistics 166(1): 94–126.

50.

Gradoville

(2017) The cognitive representation of multi-word sequences: A usage-based approach to the reduction of Fortalezense Portuguese para. Lingua 199: 94–116.

51.

Grüter

Rohde

(2021) Limits on expectation-based processing: Use of grammatical aspect for co-reference in L2. Applied Psycholinguistics 42(1): 51–75.

52.

Hartsuiker

Moors

(2018) On the automaticity of language processing. In: Schmid

(ed.) Entrenchment and the psychology of language learning. Berlin: Mouton de Gruyter, pp. 201–26.

53.

Hernández

Costa

Arnon

(2016) More than words: Multiword frequency effects in non-native speakers. Language, Cognition and Neuroscience 31(6): 785–800.

54.

Huettig

Mani

(2016) Is prediction necessary to understand language? Probably not. Language, Cognition and Neuroscience 31(1): 19–31.

55.

Jaccard

(2001) Interaction effects in logistic regression. Thousand Oaks, CA: Sage.

56.

Jiang

Nekrasova

(2007) The processing of formulaic sequences by second language speakers. The Modern Language Journal 91(3): 433–45.

57.

Jurafsky

Bell

Gregory

, et al. (2001) Probabilistic relations between words: Evidence from reduction in lexical production. In: Bybee

Hopper

(eds) Frequency and the emergence of linguistic structure. Amsterdam: John Benjamins, pp. 229–54.

58.

Kampstra

(2008) Beanplot: A boxplot alternative for visual comparison of distributions. Journal of Statistical Software 28: 1–9. Available at: http://www.jstatsoft.org/v28/c01/ (accessed April 2024).

59.

Kapatsinski

Radicke

(2009) Frequency and the emergence of prefabs: Evidence from monitoring. In: Corrigan

Moravcsik

Ouali

, et al. (eds) Formulaic Language: Volume II: Acquisition, loss, psychological reality, functional explanations. Amsterdam: John Benjamins, pp. 499–520.

60.

Kilbourn-Ceron

Clayards

Wagner

(2020) Predictability modulates pronunciation variants through speech planning effects: A case study on coronal stop realizations. Laboratory Phonology 11(1): 5.

61.

Konopka

Bock

(2009) Lexical or syntactic control of sentence formulation? Structural generalizations from idiom production. Cognitive Psychology 58(1): 68–101.

62.

Krause

Lindemann

(2014) Expyriment: A Python library for cognitive and neuroscientific experiments. Behavior Research Methods 46(2): 416–28.

63.

Krug

(2000) Emerging English modals: A corpus-based study of grammaticalization. Berlin: Mouton de Gruyter.

64.

Kuperberg

Jaeger

(2016) What do we mean by prediction in language comprehension? Language, Cognition and Neuroscience 31(1): 32–59.

65.

Langacker

(1987) Foundations of cognitive grammar: Theoretical prerequisites: Volume 1. Stanford, CA: Stanford University Press.

66.

Levshina

Lorenz

(2022) Communicative efficiency and the Principle of No Synonymy: Predictability effects and the variation of want to and wanna. Language and Cognition 14(2): 249–74.

67.

Lindblom

(1990) Explaining phonetic variation: A sketch of the H and H theory. In: Hardcastle

Marchal

(eds) Speech production and speech modelling. Dordrecht: Kluwer Academic, pp. 403–39.

68.

Lorenz

(2013) From reduction to emancipation: Is gonna a word? In: Hasselgård

Ebeling

(eds) Corpus perspectives on patterns of lexis. Amsterdam: John Benjamins, pp. 133–52.

69.

Lorenz

(2020) Converging variations and the emergence of horizontal links: to-contraction in American English. In: Sommerer

Smirnova

(eds) Nodes and networks in diachronic construction grammar. Amsterdam: John Benjamins, pp. 243–74.

70.

Lorenz

Tizón-Couto

(2019) Chunking or predicting: Frequency information and reduction in the perception of multi-word sequences. Cognitive Linguistics 30(4): 751–84.

71.

Lorenz

Tizón-Couto

(2020) Not just frequency, not just modality: Production and perception of English semi-modals. In: Hohaus

Schulze

(eds) Re-assessing modalising expressions: Categories, co-text, and context. Amsterdam: John Benjamins, pp. 79–107.

72.

Lorenz

Tizón-Couto

(2024a) Coalescence and contraction of V-to-V_inf sequences in American English: Evidence from spoken language. Corpus Linguistics and Linguistic Theory 20(1): 1–36.

73.

Lorenz

Tizón-Couto

(2024b) Replication data for ‘Learning to predict: Second language perception of reduced multi-word sequences’. DataverseNO, V1. DOI: 10.18710/TE5ZOG

74.

Mahler

(2022) Emerging modals revisited: Comparing English semi-modals and their contractions in the spoken BNC1994 and BNC2014 corpora. In: Krug

Werner

Schützler

, et al. (eds) Perspectives on contemporary English: Structure, variation, cognition. Berlin: Peter Lang, pp. 13–37.

75.

Mathôt

Schreij

Theeuwes

(2012) OpenSesame: An open-source, graphical experiment builder for the social sciences. Behavior Research Methods 44(2): 314–24.

76.

McConnell

(2023) Individual differences in holistic and compositional language processing. Journal of Cognition 6(1): 29.

77.

McConnell

Blumenthal-Dramé

(2021) Usage-based individual differences in the probabilistic processing of multi-word sequences. Frontiers in Communication 6: 703351.

78.

Molinaro

Canal

Vespignani

, et al. (2013) Are complex function words processed as semantically empty strings? A reading time and ERP study of collocational complex prepositions. Language and Cognitive Processes 28(6): 762–88.

79.

Myles

(2004) From data to theory: The over-representation of linguistic knowledge in SLA. Transactions of the Philological Society 102(2): 139–68.

80.

Ortega

(2013) SLA for the 21st century: Disciplinary progress, transdisciplinary relevance, and the bi/multilingual turn. Language Learning 63: 1–24.

81.

Patterson

Connine

(2001) Variant frequency in flap production: A corpus analysis of variant frequency in American English flap production. Phonetica 58: 254–75.

82.

Pickering

Garrod

(2007) Do people use language production to make predictions during comprehension? Trends in Cognitive Sciences 11(3): 105–10.

83.

Pinnow

Connine

Ranbom

(2017) Processing pronunciation variants: The role of probabilistic knowledge about lexical form and segmental co-occurrence. Journal of Cognitive Psychology 29(4): 393–403.

84.

Pitt

(2009) The strength and time course of lexical activation of pronunciation variants. Journal of Experimental Psychology: Human Perception and Performance 35(3): 896–910.

85.

Pitt

Dilley

Tat

(2011) Exploring the role of exposure frequency in recognizing pronunciation variants. Journal of Phonetics 39(3): 304–11.

86.

R Core Team (2023) R: A language and environment for statistical computing [software]. Vienna: R Foundation for Statistical Computing. Available at: http://www.R-project.org (accessed April 2024).

87.

Ranbom

Connine

(2007) Lexical representation of phonological variation in spoken word recognition. Journal of Memory and Language 57(2): 273–98.

88.

Reali

Christiansen

(2007) Processing of relative clauses is made easier by frequency of occurrence. Journal of Memory and Language 57(1): 1–23.

89.

Reichle

Perfetti

(2003) Morphology in word identification: A word-experience model that accounts for morpheme frequency effects. Scientific Studies of Reading 7: 219–37.

90.

Schmid

(2015) A blueprint of the entrenchment-and-conventionalization model. Yearbook of the German Cognitive Linguistics Association 3(1): 3–26.

91.

Schmid

(2020) The dynamics of the linguistic system: Usage, conventionalization, and entrenchment. Oxford: Oxford University Press.

92.

Siyanova-Chanturia

(2015) On the ‘holistic’ nature of formulaic language. Corpus Linguistics and Linguistic Theory 11(2): 285–301.

93.

Siyanova-Chanturia

Conklin

van Heuven

WJB

(2011) Seeing a phrase ‘time and again’ matters: The role of phrasal frequency in the processing of multiword sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition 37(3): 776–84.

94.

Smith

(2021) Exploring knowledge of transparent and non-transparent multi-word phrases among L2 English learners living in an Anglophone setting. System 101: 102590.

95.

Snider

Arnon

(2012) A unified lexicon and grammar? Compositional and non-compositional phrases in the lexicon. In: Gries

Divjak

(eds) Frequency effects in language. Berlin: Mouton de Gruyter, pp. 127–63.

96.

Sonbul

(2015) Fatal mistake, awful mistake, or extreme mistake? Frequency effects on off-line/on-line collocational processing. Bilingualism: Language and Cognition 18(3): 419–37.

97.

Sosa

MacFarlane

(2002) Evidence for frequency-based constituents in the mental lexicon: Collocations involving the word of. Brain and Language 83: 227–36.

98.

Supasiraprapa

(2018) Prototype effects in first and second language learners: The case of English transitive semantics. Bilingualism: Language and Cognition 21(3): 618–39.

99.

Supasiraprapa

(2019) Frequency effects on first and second language compositional phrase comprehension and production. Applied Psycholinguistics 40(4): 987–1017.

100.

Tizón-Couto

Lorenz

(2018) Realizations and variants of have to: What corpora can tell us about usage-based experience. Corpora 13(3): 371–92.

101.

Tomasello

(2000) First steps toward a usage-based theory of language acquisition. Cognitive Linguistics 11(1/2): 61–82.

102.

Tomasello

(2005) Constructing a language: A usage-based theory of language acquisition. Cambridge, MA: Harvard University Press.

103.

Tremblay

Derwing

Libben

, et al. (2011) Processing advantages of lexical bundles: Evidence from self-paced reading and sentence recall tasks. Language Learning 61: 569–613.

104.

Underwood

Schmitt

Galpin

(2004) The eyes have it: An eye-movement study into the processing of formulaic sequences. In: Schmitt

(ed.) Formulaic sequences. Amsterdam: John Benjamins, pp. 153–72.

105.

von Stutterheim

Lambert

Gerwien

(2021) Limitations on the role of frequency in L2 acquisition. Language and Cognition 13(2): 291–321.

106.

Wickham

(2016) ggplot2: Elegant graphics for data analysis. New York: Springer.

107.

Wolter

Gyllstad

(2013) Frequency of input and L2 collocational processing: A comparison of congruent and incongruent collocations. Studies in Second Language Acquisition 35(3): 451–82.

108.

Wolter

Yamashita

(2018) Word frequency, collocational frequency, L1 congruency, and proficiency in L2 collocational processing: What accounts for L2 performance? Studies in Second Language Acquisition 40(2): 395–416.

109.

Wood

(2017) Generalized Additive Models: An Introduction with R (2nd ed.). Boca Raton, FL: Chapman and Hall/CRC Press.

110.

Wray

(2002) Formulaic language and the lexicon. Cambridge: Cambridge University Press.

111.

Wulff

(2019) Acquisition of formulaic language from a usage-based perspective. In: Siyanova-Chanturia

Pellicer-Sanchez

(eds) Understanding formulaic language: A second language acquisition perspective. New York: Routledge, pp. 19–37.

112.

Yamashita

Jiang

(2010) L1 influence on the acquisition of L2 collocations: Japanese ESL users and EFL learners acquiring English collocations. TESOL Quarterly 44: 647–68.

113.

Zue

Laferriere

(1979) Acoustic study of medial /t, d/ in American English. The Journal of the Acoustical Society of America 66(4): 1039–50.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.02 MB

0.03 MB

Learning to predict: Second language perception of reduced multi-word sequences

Abstract

Keywords

I Introduction

1 Entrenchment of multi-word sequences in L2 learning

2 Entrenchment of sequences and chunking

II Method

1 Experiment design and task

2 Variables

a Frequency of to-Vinf and backward transitional probability

b Verb duration, syllable count and verb form

c Merged plosive cluster

d Preceding sound

e Control before target and item count

f Gender and age

g Proficiency level of participants (certificate and test score)

h Years of learning English and time abroad

i Congruency

3 Data analysis

III Results

IV Discussion

V Conclusions

Supplemental Material

sj-docx-1-slr-10.1177_02676583241246147 – Supplemental material for Learning to predict: Second language perception of reduced multi-word sequences

Supplemental Material

sj-docx-2-slr-10.1177_02676583241246147 – Supplemental material for Learning to predict: Second language perception of reduced multi-word sequences

Footnotes

Acknowledgements

Declaration of Conflicting Interests

Funding

ORCID iDs

Open Badges Statement

Supplemental material

Notes

References

Supplementary Material

a Frequency of to-V_inf and backward transitional probability