Abstract
Aims and objectives:
Bilingual speakers are rarely equally proficient in both languages, making the determination of language dominance a major challenge. For a regional language like Low German, an additional challenge is the absence of a standard variety as a reference. To assess language dominance in both standardized and non-standardized languages, examining oral fluency provides a promising, non-invasive source of information. In this paper, we investigate several variables of speed and breakdown fluency to assess how well they reflect variations in language dominance among bilingual speakers of Low and High German, while also examining the role of the speaking task.
Design:
We recruited 95 bilingual speakers of Low and High German. The participants ranged in age from 15 to 88 years (47 female and 48 male). They completed four different tasks in each language (narrative task, picture-story task, direction-giving task, and reading task).
Data and analysis:
We analyzed seven fluency variables: articulation rate, speech rate, phonation/time ratio, mean length of runs, mean length of silent pauses, duration of silent pauses per minute, and number of silent pauses per minute. Generalized linear mixed models were fitted for these variables with language, dominance score, and gender as fixed effects and age as a covariate.
Findings:
Six of the seven fluency variables were found to be sensitive to variations in language dominance. Mean length of runs, duration of silent pauses per minute, and number of silent pauses per minute yielded the most consistent results across speaking tasks.
Originality:
This study links fluency measures with language dominance in a demographically changing group of speakers of an endangered regional language.
Implications:
We conclude that variables of speed and breakdown fluency are sensitive to variation in language dominance, even for two closely related languages such as Low and High German.
Introduction
Language dominance
Bilingual speakers are rarely completely balanced in their two languages. More often one language is more dominant than the other. Language dominance refers to the relative proficiency level in two languages as well as to the input and use of both languages (Montrul, 2016; Treffers-Daller, 2019). Speakers tend to be more dominant in one of the languages depending on factors such as age of acquisition and frequency of use (Birdsong, 2014; Flege et al., 2002; Grosjean, 1982; Olsson & Sullivan, 2005). An imbalance in the level of dominance between two languages may involve differences in language processing and production that are reflected in differences in fluency, complexity, and accuracy (Birdsong, 2014). These differences may be due to the increased cognitive load of more difficult tasks, especially when speaking the less-familiar language. Grosjean (1980) reports a slower speech rate, longer pauses, shorter mean length of runs and a lower phonation/time ratio for more complex tasks. Daller et al. (2010) found a strong correlation between speaking rate (words per second) and language dominance, with participants speaking more slowly in the less-dominant language. Language dominance is a dynamic construct that varies according to different speaking situations and circumstances and may also change over time (Olsson & Sullivan, 2005). This inherent variability makes assessing language dominance a challenging task.
Fluency
In second language acquisition research, the term “fluency” is commonly used to refer to specific temporal features of spoken language, including speech rate, pausing behavior, and repair phenomena. More broadly, fluency can be interpreted as an overall indicator of language proficiency (Lennon, 1990; Lintunen et al., 2020; Segalowitz, 2010). We use the term in the narrow sense by examining the relationship between language dominance and temporal variables of speech.
Variation in fluency can be described in three dimensions: speed fluency, breakdown fluency, and repair fluency (Skehan, 2003; Tavakoli & Skehan, 2005). Speed fluency refers to how quickly speech is delivered. It is usually assessed by the articulation rate, which is measured by the number of words, syllables, or phonemes per time interval excluding pauses, and by the speech rate, which includes pauses (Cucchiarini et al., 2000, 2002; De Jong et al., 2013, 2015; Di Silvio et al., 2016; Guz, 2015). Breakdown fluency involves silent and filled pauses that interrupt the flow of speech (De Jong et al., 2013, 2015; Di Silvio et al., 2016; Guz, 2015; Housen et al., 2012). Repair fluency is evident in corrections, false starts, and repetitions that occur during speaking (Lahmann et al., 2017). The phonation/time ratio is a particularly sensitive indicator of breakdown fluency, highlighting differences between L1 and L2 speech. Speech rate as well as mean length of runs are complex measures which combine speed and breakdown fluency (cf., e.g., Kormos & Dénes, 2004; Lennon, 1990; Tavakoli et al., 2020).
Furthermore, the type and complexity of speech tasks have been reported to have a significant influence on fluency (cf., e.g., Tavakoli & Wright, 2020). The effects of task complexity are usually explored by modifying task design and implementation features. Notable effects on speed and breakdown fluency have been found when changing the number of elements to process, the interconnectedness of those elements, the amount of preparation time, and the familiarity with the task through practice or repetition (De Jong et al., 2012; Ellis, 2005; Foster & Skehan, 1996; Hunter, 2017; Kovač & Vickov, 2018; Lambert et al., 2017; Levkina & Wright, 2012; Mehnert, 1998; Ortega, 1999; Tavakoli & Skehan, 2005).
Fluency in bilinguals can vary widely, depending on several factors, including age of acquisition, context of acquisition, frequency of use, language proficiency, language processing strategies, and the individual’s motivation and attitudes toward each language (Grosjean, 2013). Bilinguals who have acquired both languages early in life and had consistent exposure to both languages tend to be more fluent in both languages than those who acquired one language later in life (Bialystok, 1999; Daller et al., 2010; Flege et al., 2002). Bilinguals who have a high level of proficiency in both languages are generally more fluent in both languages than those who are less proficient in one of their languages. In addition, the way in which bilinguals process and use their two languages can affect fluency. For example, bilinguals who have developed language processing strategies that enable them to switch easily between the two languages tend to be more fluent in both languages than those who have not developed such strategies (Green, 1998).
To summarize, fluency in bilinguals is a complex and multi-faceted concept that is influenced by a range of factors, including the age of acquisition, the context of acquisition, the frequency of use, and the individual’s motivation and attitudes toward each language. Bilinguals who have developed high proficiency in both languages and who have developed effective language processing strategies tend to be more fluent in both languages.
Low German and High German
Low German is an endangered language spoken in the northern part of Germany with more than 2.5 million active speakers. Furthermore, the closely related Plautdietsch is spoken in linguistic enclaves in the United States, Mexico, Bolivia, Brazil, Paraguay, Russia, Kazakhstan, among others (Louden, 2020). In Germany, Low German is recognized as a regional language and protected by the European Charta for Regional or Minority Languages since 1999 (Council of Europe, 2021).
Low and High German are two closely related languages, with (High) German being the everyday language throughout Germany. High German is highly standardized in grammar, spelling, and pronunciation, however, variation in standard-near registers can be observed in Germany and the neighboring countries with German as official language. In contrast, Low German is a non-standardized language with strong dialectal variation in grammar, vocabulary, and pronunciation. It differs from High German mainly in that it has not undergone the Second Germanic Consonant Shift and that it has a less-complex case system. The proportion of North Germans who consider themselves to have a good or very good command of Low German decreased from 35% in 1984 (Stellmacher, 1987) to 15% in the latest surveys of 2007 (Möller, 2008) and 2015 (Adler et al., 2016), with the highest proportion of proficient speakers being those aged older than 80. Nowadays, Low German is no longer acquired monolingually, but only alongside High German. The use of Low German is largely confined to the private sphere, while High German prevails in public life, particularly in the education system and in more formal contexts. The demographic change in language proficiency suggests a shift in dominance from older to younger bilingual speakers of Low and High German. Among the older speakers, a dominance of Low German or a balanced bilingualism can be expected, while among younger speakers, High German is likely to be the dominant language.
Research question
As Low German is a language without a codified standard variety, established procedures for the standardized assessment of language proficiency based on the grammar, vocabulary, or pronunciation are not available. Against this background, the present paper addresses the question of how well commonly used fluency variables reflect variations in language dominance. For the purposes of the present study, we disregard variables of repair fluency and concentrate on speed and breakdown fluency variables, which can be captured automatically, thus allowing for the analysis of large corpora.
In line with previous research (e.g., Daller et al., 2010; Tavakoli & Wright, 2020), we anticipate that both speed and breakdown fluency decrease in the less-dominant language. In particular, we expect to find a lower articulation rate, a lower speech rate, a lower phonation/time ratio, a shorter mean length of runs, a longer mean length of silent pauses, a longer duration of silent pauses, and a lower number of silent pauses in the less-dominant language. As the surveys mentioned in the previous section suggest, High German dominates over Low German among young speakers, while older speakers can be expected to be more balanced bilinguals. Therefore, we expect a strong correlation between speaker age and language dominance which is why we include age as a co-variate in our statistic models. We also include gender as a factor for exploratory purposes, although we have no specific expectations about the possible effects of gender on the relationship between language dominance and fluency.
Since the type of speech task can also have an influence on the cognitive load associated with language use, we include four different task types for exploratory purposes: a narrative task, a picture-story task, a direction-giving task, and a reading task, the last three at different levels of difficulty.
Method
Subjects
We recruited 95 participants (47 female and 48 male), aged between 15 and 88 years at the time of recording. All participants grew up in the municipality of Krummhörn in East Frisia, in the northwest of the federal state of Lower Saxony in Germany, and were native speakers of East Frisian Low German (LG) and standard High German (HG). The municipality of Krummhörn is renowned for having one of the highest numbers of LG speakers (Strybny, 2009) and the local dialect is documented in detail in a study by Reershemius (2004). Four of the participants had to be excluded from the investigation due to technical errors while recording. Six additional participants were excluded because it was later discovered that they did not meet the requirement of having been raised in Krummhörn. The remaining 85 participants (43 female and 42 male) were between 15 and 88 years of age. All participants volunteered to take part in this study and had given their written informed consent prior to the recording. They were compensated for their time with a small amount of money. The general study design was approved by the ethics committee of the University of Oldenburg.
Dominance score
All participants completed a questionnaire containing a sociodemographic part as well as questions about their age of language acquisition, language use, and self-attributed language proficiency for LG as well as HG. Methods for assessing language dominance include self-report questionnaires (e.g., Flege et al., 2002; Poarch et al., 2019), oral proficiency tests (Daller et al., 2010), cognitive tasks, or a combination thereof (e.g., Olsson & Sullivan, 2005). An example for a self-report questionnaire is the Bilingual Language Profile (BLP) by Birdsong et al. (2012) which includes four components: language history, language use, language proficiency, and language attitudes. Each answer option is linked to a numeric value which is used to calculate a language dominance score. Marian et al. (2007) developed the Language Experience and Proficiency Questionnaire (LEAP-Q) based on standardized language tests and self-reported measures to get individual language profiles for bilingual and multilingual speakers.
The questionnaire used in this study consisted of 55 questions, which resulted from a combination and adaptation of Adler et al. (2016), Birdsong et al. (2012) and Poarch et al. (2019) (see Supplemental Appendix). According to their self-assessment, all speakers scored at least a 4 in both languages on a 7-point scale of language competence, where 1 stands for gar nicht gut (“not at all”) and 7 for sehr gut (“very well”), with an overall average of 6.2 for LG and 6.6 for HG. Based on their answers to the questions on age of acquisition, frequency of use, and language proficiency, 1 we calculated separate language scores for LG and HG for each participant. For statistical analysis, a dominance score was calculated by subtracting each individual’s HG score from their LG score, resulting in positive values for LG-dominant speakers and negative values for HG-dominant speakers.
Tasks
The tasks were organized in two blocks, one block per language. The order of the languages was counter-balanced across participants. In each block, the participants were asked to complete different versions of the same tasks to avoid repetition effects between the language blocks. We prepared two different versions of each task for both LG and HG to avoid version-specific effects, and ensured an even distribution of versions across recordings, taking into account participants’ age and gender. All LG tasks and instructions had previously been translated from HG by a local speaker. We trained a group of bilingual Krummhörn residents to conduct the experiment and interview the speakers, with technical assistance from the first and second author. This group consisted of eight people who were divided into four groups of two. They were female, aged between 60 and 65 years, and had grown up and lived in Krummhörn all their lives. Each interview was led by one of these four teams. Before each recording, they decided who would give instructions in which language, and then they spoke only the chosen language throughout the recording. 2
The participants completed four tasks in each language, with two different levels of difficulty for the last three: a narrative task (“= Monologue”), a picture-story task (= “Story”), a direction-giving task (= “Route”), and a reading task (= “Reading”). In the first task (Monologue), our participants were asked questions to elicit 2–3 minutes of monologue-like spontaneous speech. The questions included topics such as their daily lives, hobbies, interests, life in Krummhörn, and attitudes toward the local dialect. The remaining three tasks, each with two difficulty levels, were recorded in descending order: “level 2” (higher difficulty) first, “level 1” (lower difficulty) second. In the second task (Story), participants were asked to retell a story depicted in a comic strip consisting of six pictures. The comic strips were taken from the “Father and Son” series by Plauen (2016). At the higher difficulty level, they had to start their story immediately. At the lower level, they had 30 seconds to prepare. In the third task (Route), the participants were asked to perform a map task. The recording assistant described a scenario in which she had to visit two different places in Pewsum (the largest village in Krummhörn) and needed help choosing a route to her destination. The participants were given a map of Pewsum with an outlined route. At the higher difficulty level, the map did not include any landmarks or street names. At the lower level, the map included landmarks and street names. In the fourth task (Reading), the participants were asked to read aloud a fable by Aesop (either “The Northwind and the Sun” or “The Raven and the Fox”). At the higher difficulty level, they had to start reading immediately without any preparation time. At the lower level, they read the same story a second time.
Recording procedure and acoustic analysis
All speech samples were recorded with a portable digital recorder (Tascam DR-100 MKIII) and a head-mounted microphone (DPA 4066). 3 The recordings were digitized at a sampling rate of 48 kHz and with 24 bits/sample quantization and were later downsampled to 16 kHz for further analysis. The recordings took place in the speaker’s homes. Our aim was to build a comfortable and natural atmosphere for our participants. The rooms were carefully checked for background noises.
The recordings were cleaned up by excluding all instruction parts and analyzed using the syllable nuclei-script by De Jong et al. (2021) in Praat (Boersma & Weenink, 2020). To evaluate the accuracy of the script, 16 recordings were manually annotated. These recordings were equally distributed over language, age, gender, and task type and otherwise chosen randomly. We calculated the differences between the number of syllables counted by the script and by hand and calculated the error rate per 100 syllables. This resulted in an average error rate of 12.96 per 100 syllables. Although slightly higher than typical inter-rater discrepancies, the error rate shows no systematic bias or correlation with our variables and is therefore not considered a threat to the validity of our findings. To find the optimal setting for silent pauses, we tested various script settings using this procedure and found that the default settings with a minimum pause length of 200 ms yielded the best results. Fragmented intonation phrases, word repetitions, and filled pauses were not excluded. However, they only occurred relatively rarely in our corpus.
For the purpose of comparison, we examined seven common variables of speed and breakdown fluency, although they are not entirely independent of one another: articulation rate, speech rate, phonation/time ratio, mean length of runs, mean length of silent pauses, duration of silent pauses per minute, and number of silent pauses per minute (cf., e.g., Cucchiarini et al., 2002; Kormos & Dénes, 2004). All variables were either obtained directly from the script or derived from its output.
Articulation rate was calculated by dividing the total number of syllables by the total speaking time (excluding silent pauses) and is expressed in syllables per second. We also measured articulation rate in phones per second for a subset of 20 speakers evenly distributed across gender and age to account for possible differences in the syllable structure between LG and HG. Since we got very similar results for both measures, we chose the syllable as the unit of measurement to increase comparability with other studies. Speech rate was obtained by dividing the total number of syllables by total speaking time (including silent pauses). Phonation/time ratio is the total speaking time excluding silent pauses divided by total speaking time including silent pause time, which was multiplied by 100 to get a percentage value. Mean length of runs is the mean number of syllables between two silent pauses, which was calculated by dividing the total number of syllables by the total number of runs. Mean length of silent pauses is the total silent pause time divided by the number of silent pauses. Duration of silent pauses per minute is the total duration of silent pauses divided by the total speaking time including silent pauses divided by 60. Finally, the number of silent pauses per minute is the number of silent pauses divided by the total speaking time including silent pauses divided by 60.
Statistical analysis
As we were interested in the individual sensitivity of each fluency variable, we fitted separate generalized linear mixed models using the glmmTMB package (Magnusson et al., 2023) in R (R-Core-Team, 2022) for Articulation Rate, Speech Rate, Phonation/Time Ratio, Mean Length of Runs, Mean Length of Silent Pauses, Duration of Silent Pauses per Minute and Number of Silent Pauses per Minute as dependent variables. As fixed effects, we included
For each dependent variable, we reduced the full model step by step to select the model with the best fit using the AIC (Akaike information criterion) score as the selection criterion. Based on Matuschek et al. (2017), the models were reduced using the following procedure: First we excluded the intercept for
Post hoc analyses of the interactions were performed to examine the difference between slopes for both levels of
As indicated in the Research question section, we expect lower Articulation Rate, Speech Rate, Mean Length of Runs, and Phonation/Time Ratio in the less-dominant language, as well as higher Mean Length of Silent Pauses, Duration of Silent Pauses per Minute, and Number of Silent Pauses per Minute. However, we also expect that these differences will be greater the more dominant the speakers are in HG, which should be reflected in an interaction between the
Results
Before examining the relationship between language dominance and fluency, we will first analyze the relationship between the speakers’ age and their language scores for LG and HG, which were used to calculate
Age and language dominance
Figure 1 shows the language scores for LG and HG as a function of

Regression lines for individual global language scores for LG and HG and AGE as a predictor variable.
Fluency variables
Figure 2 shows regression lines for LG and HG for the seven fluency variables analyzed, broken down by task. All graphs, except those showing results for the Reading task and for most tasks of the dependent variable Mean Length of Silent Pauses, show crossing regression lines for LG and HG, with lower values for LG (black lines) than for HG (blue lines) in the HG-dominant speakers for Articulation Rate, Speech Rate, Phonation/Time Ratio, Mean Length of Runs, and with higher values for LG in the HG-dominant speakers for Duration of Silent Pauses per Minute and Number of Silent Pauses per Minute, a pattern that tends to reverse as the dominance scores increase. For the Reading task, Figure 2 shows lower values for LG than for HG for the first four variables and lower values for HG than for LG for the remaining three variables, with little or no variation across the range of

Regression lines for Articulation Rate, Speech Rate, Phonation/Time Ratio, Mean Length of Runs, Mean Length of Silent Pauses, Duration of Silent Pauses per Minute, and Number of Silent Pauses per Minute with DOMINANCE SCORE as a predictor variable.
Table 1 lists main effects of
Assessing model fit (χ²) with
Corrected p levels for monologue: * < .0125, ** < .0025, *** < .00025; corrected p levels for Story, Route, and Reading: * < .0083, ** < .0016, *** < .00016.
The contrast analyses in Table 2 demonstrate for the interaction effects a significant difference between the slopes of the regression lines for LG and HG, suggesting that for all fluency variables except Mean Length of Silent Pauses, the lower the dominance score of the bilingual speakers, the less fluent they were in LG compared to HG. Due to the correction factor, contrasts for Phonation/Time Ratio and Duration of Silent Pauses per Minute do not reach the 5% level of significance. Nevertheless, they do show a tendency in the direction that was expected.
Contrasts between LG and HG slopes for interactions involving
Selected distributions for analysis: Gaussian distribution: Articulation Rate, Speech Rate Phonation/Time Ratio, and Number of Silent Pauses per Minute; gamma distribution: Mean Length of Runs and Duration of Silent Pauses per Minute.
Corrected p levels for Monologue: * < .0125, ** < .0025, *** < .00025; for Story, Route, and Reading: * < .0083, ** < .0016, *** < .00016.
Discussion
This study aimed to investigate how well individual fluency variables reflect variations in language dominance among balanced and non-balanced bilingual speakers of LG and HG. In line with previous studies, we expected to find a decrease in speed and breakdown fluency in the less-dominant language. We also expected that the more balanced the speakers were, the smaller the difference between the two languages. In addition, we expected language dominance to vary with age, with younger speakers more likely to be dominant in HG and dominance becoming more balanced with increasing age.
We observed a strong correlation between language dominance and age, which is consistent with the ongoing dominance shift reported for the LG speaking population in Germany by Möller (2008) and Adler et al. (2016). To assess speed fluency, we examined articulation rate and speech rate. While the first variable reflects the speed of articulatory movements, the latter is a complex variable that combines speed and breakdown fluency by including pauses. In line with previous research, the participants showed both a lower articulation rate and a lower speech rate in the less-dominant language, but only in the Story task (Daller et al., 2010; Di Silvio et al., 2016; Guz, 2015). We conclude that both variables of speed fluency might be useful predictors of language dominance. However, since both variables show significant interactions between
For breakdown fluency, almost all measures show significant interactions between
Since the use of a non-dominant language is usually associated with increased cognitive load, the variation found in this study can be interpreted as the effect of different levels of cognitive load when using the two languages. However, we observed a relatively large influence of the task type on the fluency variables, which we did not expect to this extent. While the Story task seems to show very consistent results, the Monologue and Route task seem less predictable in their outcome. While only two variables show significant results for the Monologue task (Mean Length of Runs and Number of Silent Pauses per Minute), for the Route task, we found three significant interactions. It is also striking that none of the fluency variables varied depending on language dominance in the reading task. Instead, we found only a main effect of
Varying the difficulty level in the Story, Route, and Reading task led to no or inconclusive results. The effects were not reported in the previous section as the results raise the suspicion that the manipulations were either not large enough or not appropriate to be effective in inducing systematic variation in cognitive load (cf. Frank et al., 2023 for a similar conclusion regarding the analysis of vowel spaces). We therefore included task difficulty as a random factor rather than as a predictor variable in our regression models. The unexpected outcome of the manipulation of difficulty levels may also be due to the way our tasks were approached. In case of the Route task, for example, where participants were given maps labeled with street names and names of landmarks in the easier version of the task, many participants chose to reduce their vocabulary output to a minimum by only naming the street names along the indicated route, resulting in very short utterances. On the other hand, in the more difficult version, participants used a wide range of different ways to solve the task. Some participants simply said “left” and “right” without actually describing the route, while others either recalled landmarks from memory or told the interviewer who lived in each house they were passing during their route description. For future studies, we recommend placing greater emphasis on the premise that the interviewer must be able to reach the intended destination on the base of a careful route description. Otherwise, the necessary cognitive load varies significantly in both languages, making it difficult to relate observed variations in fluency to variations in language dominance. In the case of the Story task, unexpected effects of manipulations of the difficulty level arose from the fact that in the easier version of the task, in which additional preparation time was granted, participants may have used the extra time for planning the overall structure of the narrative or for planning individual utterances at the stages of conceptualization, grammatical encoding, and morpho-phonological encoding in the sense of Levelt’s model of speech production (Levelt, 1999). This strategy may have increased the complexity of the narrative or of individual utterances rather than the fluency of speech. Such a behavior would be consistent with Skehan’s Limited Attentional Capacity model (Skehan, 2015; Skehan & Foster, 2001), according to which cognitive resources can be variably allocated to different aspects of performance (see also Peters et al., 2023, 2024).
In summary, this study shows that most of the included variables of speed and breakdown fluency reflect variation in language dominance for Low and High German, two closely related languages that are very similar in terms of grammar, vocabulary, and pronunciation, making them particularly suitable for the comparative study of fluency in bilingual speakers. In our study, Mean Length of Runs, which reflects aspects of both speed and breakdown fluency, and the breakdown fluency variables Duration of Silent Pauses per Minute and Number of Silent Pauses per Minute yielded the most consistent results across speaking tasks, with the exception of the Reading task, which showed only a main effect of
Supplemental Material
sj-docx-1-ijb-10.1177_13670069251353090 – Supplemental material for On the relationship between language dominance and fluency in bilingual speakers of Low and High German
Supplemental material, sj-docx-1-ijb-10.1177_13670069251353090 for On the relationship between language dominance and fluency in bilingual speakers of Low and High German by Tio Rohloff, Marina Frank and Jörg Peters in International Journal of Bilingualism
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by the German Research Foundation (DFG), grant number PE 793/3-1. We would like to thank Heinz Richter for his support in translating the instructions and tasks into the regional dialect of Low German spoken in Krummhörn as well as the Low German instructors and participants for making this research possible.
Supplemental material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
