Abstract
Aims and Objectives:
Translation equivalents intuitively seem to overlap in meaning. Moreover, the models of the bilingual lexicon often represent the meaning shared between two translations as a holistic node in the semantic network. However, research on semantic representation and processing questions this holistic approach. For instance, abstract words are assumed to be more language-dependent, while concrete words’ meanings are seen as more consistent cross-linguistically. The non-cognate translation priming paradigm offers an ideal methodological setting to study semantic overlap (proxied by concreteness) between translations. Priming effects between non-cognate translation equivalents are assumed to emerge due to spreading activation at the semantic level. Hence, a larger semantic overlap between translation prime-target pairs should lead to larger priming effects. Nevertheless, the evidence from previous translation priming studies investigating concreteness displays a blurry picture, potentially reflecting a shared limitation: their relatively small sample sizes. We overcame this problem by analysing the largest translation priming dataset to date.
Methodology:
Two hundred Spanish–English highly proficient bilinguals were tested in a bidirectional translation priming experiment employing 314 non-cognate translation equivalents differing in concreteness.
Data and analysis:
We analysed response times and error rates employing conservative (generalized) linear mixed-effects models.
Findings:
The results showed that concrete translation pairs elicited larger priming effects than abstract ones, evidencing differences in semantic representation between concrete and abstract words. Importantly, the influence of concreteness appeared only in the forward translation direction, suggesting language experience-related differences in meaning representation.
Originality:
The present study analysed the largest dataset in the translation priming literature to date, employing a conservative statistical approach to shed light on the effects of concreteness on translation priming.
Implications:
Our study spotlights the complexity and non-holistic nature of the bilingual semantic representation of concrete and abstract words. The present findings call for more research to help the current models of the bilingual lexicon implement more nuanced semantic representations.
Keywords
Introduction
Word meanings map conceptualizations of real-world entities to lexical forms – arbitrary or not (e.g., sound-symbolic words). Theoretical models of the lexicon assume that semantic information is stored at a different level than other types of information (sub-lexical, lexical, and grammatical). However, a question emerges when considering the case of bilingualism: How do bilinguals represent meanings for words that we conventionally refer to as translation equivalents? Is semantic representation shared or separate for these words? The present study investigates the degree of semantic alignment between translation equivalents. To do so, we assess the extent of semantically mediated translation priming in concrete and abstract translation pairs.
Naturally, we tacitly assume that translation equivalents overlap in semantics across languages. That is, we expect ‘love’ and
A way to approach the study of translation semantic overlap is by exploring word concreteness. Monolingual research suggests that abstract words have more distributed and inconsistent meanings and thus depend more on their linguistic context for interpretation than concrete words do (e.g., Crutch & Warrington, 2005; Hoffman et al., 2013; Pexman et al., 2013). If so, this may result in fewer semantic features shared between abstract translation equivalents (i.e., an overall lower degree of semantic overlap).
This study reanalysed data from Chaouch-Orozco et al. (2022), which tested 200 highly proficient Spanish–English bilinguals in a translation priming paradigm. The stimuli consisted of 314 translation equivalents that varied in concreteness. Anticipating our results, the data suggest that abstract translation equivalents are less aligned semantically than concrete translation pairs. Our analyses also indicate that, despite extensive experience with their second language (L2), our participants’ representations of L2 concrete words are not as semantically rich as those of their first language (L1) words.
Exploring the bilingual semantic system with translation priming
Translation priming experiments have been a valuable tool to study the inner workings of the bilingual lexicon. In this paradigm, prime and target in the related condition are translation equivalents (e.g.,
The priming literature with non-cognate translation equivalents – translation pairs with no overlap at the form level – is abundant. Because these pairs are taken to be connected exclusively at the semantic level, we can expect non-cognate translation priming effects to happen mostly – or at least in part – through semantic mediation (Xia & Andrews, 2015; see also Jiang & Forster, 2001 for an alternative account based on episodic links), making these a good testing ground for the study of bilingual semantic processing. With other factors minimized, priming is assumed to be larger when semantic activation is more robust. Crucially, this should be a function of the degree of semantic overlap between prime and target: with higher overlap, more of the activation reaching the prime can spread over to the target, priming it more.
Importantly, not all models of the bilingual lexicon assume different degrees of semantic alignment among pairs of translation equivalents. The following section discusses two of the most prominent models of bilingual lexical-semantic organization: the Distributed Feature Model (de Groot, 1992; van Hell & De Groot, 1998) and Multilink (Dijkstra et al., 2019).
Models of bilingual semantic representation and processing
The Distributed Feature Model (DFM) and Multilink align in assuming that all nodes within the lexicon are fully interconnected. However, the two proposals contrast in how the semantic system operates, being the DFM distributed in nature, while, Multilink, holistic (we also refer the reader to Dong et al., 2005, for a more developmentally nuanced proposal; and Finkbeiner et al., 2004, for a distributional alternative with a slightly different focus).
For the DFM, meanings are not represented by single units but by activation patterns and connection weights across semantic features within the network. For example, the meaning of ‘dog’ is not a single semantic unit but a co-activation of more primitive features (i.e., an animal, four-legged, able/prone to bark, and so on). This has consequences for both within- and cross-language lexical processing. Simply put, the larger the overlap in features between two words (both sub-lexical and semantic), the faster the processing. Consequently, the DFM can account for things like cognate status effects, whereby cognate translations, which share features at all levels, are processed faster, or concreteness effects, whereby abstract word pairs, which bear less feature overlap, are processed more slowly.
Multilink is a computational, connectionist model with a tradition in the study of the bilingual lexicon, as it is the continuation of the Bilingual Interactive Activation (BIA) and BIA+ models (Dijkstra & Van Heuven, 2002; van Heuven et al., 1998). Multilink offers a comprehensive account of bilingual word recognition and production, incorporating some of the tenets of another prominent theory, the Revised Hierarchical Model (Kroll et al., 2010; Kroll & Stewart, 1994). In Multilink’s view, differences in L1 and L2 word processing are mostly accounted for by the
We should note here that Dijkstra and colleagues have acknowledged from the start the aspiration for Multilink to incorporate a more nuanced semantic system in future instantiations, potentially embracing distributed representations (Dijkstra et al., 2019, p. 5). However, little is known about how meaning is represented in the bilingual lexicon, and more research is needed to better understand how bilinguals juggle (potential) non-holistic semantic representations for translation equivalents (De Deyne et al., in press). In an effort to increase our understanding of bilingual semantic representation, the present analysis employs word concreteness as a candidate to proxy semantic overlap between translation equivalents. The following section reviews some studies that have attempted to explore bilingual semantic memory by manipulating this factor.
The study of translation equivalence through concreteness effects
As we said earlier, abstract meanings have been claimed to depend more on the linguistic context of each language, making abstract words and their meanings more language-specific (e.g., van Hell & De Groot, 1998). In addition, abstract meanings may be more dispersed and consist of a lower number of semantic features (e.g., de Groot, 1989). As such, concrete translation pairs may overlap semantically more than abstract ones (van Hell & De Groot, 1998). These claims have found support in different studies with bilingual populations. For instance, van Hell and de Groot (1998) asked Dutch-English unbalanced bilinguals to produce within- and between-language associates to word cues in both Dutch and English. Their results showed that subjects elicited more translation equivalents as associates when words were concrete, suggesting that concrete translations share a larger number of semantic features. In Tokowicz et al.’s (2002) Dutch–English translation norms, participants gave higher similarity ratings to concrete translations than to abstract ones, indicating that concrete meanings are more similar across languages.
In the realm of translation priming studies, Schoonbaert et al. (2009) tested Dutch–English bilinguals in two priming experiments (1 and 2) manipulating translation direction (L1 to L2 and L2 to L1) and stimulus onset asynchrony (SOA; the time elapsed between the onset of the prime and the onset of the target) (100 milliseconds vs 250 milliseconds). Note that SOA is one of the central manipulations in priming experiments; longer SOAs give more time for primes to be processed and activate targets. Schoonbaert et al. (2009) employed a masking paradigm. Masked presentations (Forster & Davis, 1984) are an important implementation of the traditional priming paradigm. In this type of presentation, primes are presented after a forward mask (e.g., #######) and, occasionally, before a backward, post-prime mask too. This paradigm is believed to tap into more automatic/early processes by making a very briefly presented prime even less salient (Forster et al., 2003).
In Schoonbaert et al. (2009), the prime duration was set at 50 milliseconds in both conditions. However, in the long-SOA condition, a 50-millisecond post-prime blank screen and a 150-millisecond backward mask were presented, which meant that the onset of the prime and the onset of the target were separated by 300 milliseconds. In the short-SOA condition, the blank screen was dropped, and the backward mask was reduced to 50 milliseconds, leaving an SOA of 100 milliseconds. The authors reported a trend towards larger priming for concrete as compared to abstract words. In another study, Ferré et al. (2017) tested highly proficient Spanish–English bilinguals in two masked translation priming lexical decision tasks that crossed SOA (50 milliseconds vs 100 milliseconds) and concreteness manipulations. In Experiment 1, only a 50-millisecond prime appeared before the target. In Experiment 2, SOA was extended to 100 milliseconds by introducing a 50-millisecond post-prime backward mask. Overall, Ferré and colleagues observed larger priming for concrete words, but only in Experiment 2, suggesting that the 50 milliseconds SOA of Experiment 1 may not be enough time for a complete semantic activation. Chen et al. (2014) reported a lack of concreteness effects on masked translation priming in both translation directions when testing Chinese-English bilinguals with a 50-millisecond prime and a 150-millisecond backward mask (i.e., a 200 milliseconds SOA).
While all the studies discussed earlier employed masked priming, two studies have used overt priming. This point is critical because longer prime presentations may be needed when inspecting semantic processes (see Altarriba & Basnight-Brown, 2007, for a review) such as concreteness. Jin’s (1990) Experiment 1 tested Korean–English bilinguals in an overt (i.e., unmasked) translation priming lexical decision task with 150-millisecond primes. Participants were tested in both translation directions. Jin’s (1990) results showed larger priming effects for concrete than abstract words. Crucially, this difference was larger in the L1–L2 direction than in the L2–L1, backward one. Finally, Smith et al. (2019) tested Hebrew–English bilinguals in an overt translation priming lexical decision with a 200-millisecond SOA (150-millisecond prime plus 50-millisecond post-prime blank screen) in both translation directions. Contributing to the overall unsettled nature of these phenomena, their data showed no concreteness effects on priming.
At least five studies have thus attempted to explore the role of semantic overlap (proxied by concreteness) on translation priming, with mixed findings. At this point, it is necessary to consider potential reasons for the disparities between these results. First, the different bilingual experiences of the subjects in previous studies may account for at least some of the divergencies in the observed priming patterns. However, the use of different measures and scales to assess the participants’ L1/L2 experience makes comparisons difficult. Second, prime duration and SOA could also be thought to modulate the presence of concreteness effects. However, the evidence is too mixed for us to be able to draw conclusions in this regard. Third, although non-significant, the trending results in Schoonbaert et al. (2009) may indicate that, regardless of prime duration and SOA, an effect could be better captured and characterized if larger datasets were available.
Sample size and its connected issues are a common problem in translation priming studies, and in chronometric research more generally. First, because response times are noisy measures, effect sizes in this type of study tend to be small (e.g., Brysbaert, 2019). Second, creating non-cognate translation stimuli while controlling for factors that have been reported to affect processing speed (e.g., number of letters, frequency, orthographic neighbourhood, and many others) is particularly challenging – especially in languages sharing many cognate words – which makes it difficult to raise the number of observations by having more stimuli. Brysbaert and Stevens (2018) recently suggested that a minimum of 1,600 observations are necessary to interpret differences of about 15 milliseconds. Importantly, none of the four studies discussed above are even close to that number – the most extensive dataset having less than 800 observations per condition. To answer recent calls for better statistical practices, we need larger, more statistically reliable datasets to better understand the role of concreteness as a proxy of semantic overlap between translation equivalents.
The present study aims to make a significant contribution in this regard. We re-analysed a translation priming dataset from Chaouch-Orozco et al. (2022). Their study followed up on a previous investigation on the role of language experience on translation priming effects (Chaouch-Orozco et al., 2021), with a more comprehensive scope. Chaouch-Orozco et al. (2022) tested 200 highly proficient Spanish–English unbalanced bilinguals, using 314 non-cognate translation equivalent pairs, providing the largest dataset in the translation priming literature to date. Their study solely focused on the role of L1/L2 experience on priming, which was manipulated through immersion. Importantly for our purposes, their word stimuli were obtained from a continuum of concreteness values, but this factor was only used as a covariate in their analyses and failed to show a significant modulation of the interaction between immersion and priming effects. Given the large sample size, Chaouch-Orozco et al.’s (2022) dataset has the potential to overcome some of the limitations of previous studies, making it a valuable source of information on the role of concreteness in bilingual lexicon-semantic representation and processing.
The present study provides a systematic analysis of concreteness in this dataset, where we expected larger priming effects in concrete trials due to increased activation at the semantic level. This would be in line with the previous literature suggesting that concrete translation pairs are more semantically aligned than abstract ones.
Method
Participants
Two hundred highly proficient, late sequential Spanish–English bilinguals (164 females and 36 males) were tested in two lexical decision experiments, one for each translation direction (see Table 1 for participant’s characteristics). The main focus of Chaouch-Orozco et al. (2022) was to investigate how immersion in a second language affected priming effects. For this purpose, participants were divided into two equally large groups manipulating this factor (i.e., immersed vs non-immersed). Thus, half of the participants were living in Spain and the other half in the United Kingdom (mean length of immersion: 6 years; range: 1–21). Importantly, L2 proficiency, measured with the LexTale test (Lemhöfer & Broersma, 2012), was controlled across participants, being all subjects highly proficient in their L2 (
Participant characteristics.
Materials
In all, 314 non-cognate translation equivalents were used to create the critical stimuli (see the complete list in Chaouch-Orozco et al., 2022; see Table 2 for stimuli characteristics) and were employed in each translation direction. Since translation pairs were extracted from a continuum of frequency and concreteness values, these two factors were introduced as covariates in the original analysis. None of them modulated the effect of immersion on priming effects, so they were not further discussed. Across the two languages, words were matched for concreteness, frequency, length, and orthographic neighbourhood. Concreteness for each translation equivalent pair was established with the following procedure. First, each English word within each pair was given a concreteness value from the ratings by Brysbaert et al. (2014; range: 1.04–5). Note that, because these were translation equivalent pairs and to avoid employing different measures, only the value from the English word in each pair was used. Next, two different methods for categorizing each pair as concrete or abstract were employed. First, following Reilly and Desai (2017), the stimuli were divided into thirds, classifying as concrete and abstract only the thirds with the highest and lowest values, respectively. The second method consisted of dividing the words into concrete and abstract with a median split. Two different models were fitted for each of these variables obtained from the two approaches. The results showed minimal differences between the two treatments. Consequently, the median-split method was chosen because it allowed to keep the whole set of stimuli (compared to giving up a third of the observations). Word frequencies were obtained from the SUBTLEX corpora (British English: Spanish: Cuetos et al., 2011; van Heuven et al., 2014). Another 314 pseudowords in each language were created with the software
Stimuli characteristics.
Procedure
The experiments were presented online with
Each trial consisted of a fixation point (500 milliseconds), followed by the prime (200 milliseconds), immediately followed by the target, which remained on the screen until the participants made the lexical decision. They were asked to respond as fast and as accurately as possible. The experiments were divided into blocks of 40 trials, and participants could rest between them.
Data analysis
The code for the present study can be found in Supplemental Material; the data can be found in the first author’s OSF repository (https://doi.org/10.17605/OSF.IO/4H63C). Both the accuracy (error rate) and the response time (RT) analyses considered only trials involving word stimuli. RT analyses were run only on correct-response trials. Following Baayen and Milin (2010), RTs below 200 milliseconds and above 5,000 milliseconds were removed from the analysis. We employed inverse-Gaussian transformations on the RTs after confirming they offered the best fit through visual (quantile-quantile plots) and Shapiro–Wilk tests (inverse Gaussian:
We employed (generalized) linear mixed-effects models (Baayen et al., 2008) in
The maximal models included main effects and interactions of interest as fixed effects (Brauer & Curtin, 2018). Thus, the model for the main analysis included language (i.e., target language; English vs Spanish), prime type (related vs control), and concreteness (i.e., concrete vs abstract) and their interactions as fixed effects. Subject, prime, and target were included as random intercepts (Feldman et al., 2015), whereas any predictor and interaction that varied within subject, prime, and target were included as a random slope. Full-CRI structures were specified with random intercepts by each grouping factor for subjects, primes, and targets. The full specification of the maximal and non-converging models can be found in the OSF repository linked above.
Results
Response times
Table 3 summarizes RTs and error rates in all conditions (see the summary of the final model in Supplemental Material). The final model revealed the main effects of language (β = –0.05,
Mean response times (RTs, in milliseconds; standard errors), error rates (%), and priming effects (in milliseconds).
To further inspect this finding and to discard any potential confounds, we ran independent analysis were group (immersed vs non-immersed), prime frequency, and target frequency were included as co-variates. In both analyses including group and prime frequency, the same three-way significant interaction was obtained (β = 0.03,
Accuracy analysis
Correct or incorrect responses were dummy-coded (1 or 0, respectively). We used generalized linear mixed-effects models with a binomial family to fit the error data. Significant effects were obtained only for the main effect of language (
Discussion
This study investigated whether the degree of semantic overlap between translation equivalents predicts non-cognate translation priming effects. To address this question, we proxied semantic overlap with word concreteness. Concrete and abstract meanings have been argued to differ in their semantic robustness and uniformity across languages – concrete meanings are assumed to be richer and more consistent overall. In line with our predictions, concrete translation equivalents yielded larger priming effects than abstract pairs, indicating stronger semantic activation from prime to target in these pairs, potentially due to a larger semantic overlap. However, this effect was only obtained in the L1 Spanish–L2 English direction.
Our study adds to the few previous investigations on how concreteness impacts translation priming. Overall, this significant effect of concreteness aligns with Ferré et al.’s (2017) and, particularly, Jin’s (1990) findings. Our study replicates Jin’s first report on the asymmetric impact of concreteness on translation priming across translation directions. It remains an open question, however, why the effect did not appear in other studies (Chen et al., 2014; Schoonbaert et al., 2009; Smith et al., 2019). In support of the present data, our experiment uses the largest dataset to date, with 15,000 observations per condition, dramatically increasing the number employed in previous studies.
At this point, we should note that observing a concreteness effect with non-cognate stimuli – irrespective of the target language – suggests that abstract translation equivalents are less aligned semantically than concrete translation pairs and that the bilingual semantic system is non-localist in nature. If translation equivalents shared equally overlapping semantic nodes, we should not have obtained the current differences between concrete and abstract priming effects. Thus, our results support the predictions of the Distributed Feature Model and challenge those of localist accounts such as Multilink, highlighting the need for these to implement more nuanced semantic systems. Importantly, however, the concreteness effect in our data is asymmetric with respect to translation direction. This language dependency suggests the existence of persistent differences in the way bilinguals represent the meaning of concrete and abstract L1/L2 words. Notably, the DFM is not suited to explain this asymmetry, as it remains agnostic regarding potential cross-language differences in the robustness of meanings brought about by bilingual experience. To accommodate our findings, we resort to the Dual Coding Theory (DCT; Paivio, 1991, 2010 see Pavlenko, 2012, for a similar account).
According to the DCT, concrete and abstract words are represented qualitatively differently. While both concrete and abstract meanings operate within a linguistic system, activation of concrete words also spreads into the sensorimotor system, which encodes non-verbal encounters with the real-world entities these words’ meanings represent. This way, the DCT provides a rationale for the idea of concrete meanings being richer than abstract ones: Extra sensorimotor information would make these representations more complex.
Crucially, non-verbal experiences may be language-dependent in late sequential bilinguals like those in the present study (Jared et al., 2013, p. 388; Pan et al., 2021). The reason lies in the quantitatively and qualitatively different experiences these bilinguals have with their first and second languages. Contrary to the acquisition of the first language, their L2 is – usually – learned in formal contexts (i.e., classrooms) and is considerably deprived of real-life experiences (e.g., Pavlenko, 2017). As such, this type of bilingual would be much more exposed to linguistic than sensory information in their second language. Due to this divergent non-verbal experience in the two languages, L1 concrete words would be semantically richer than L2 concrete words for late sequential bilinguals. In the case of abstract words, however, differences would not be that pronounced given that both L1 and L2 abstract words primarily rely on linguistic information to carve out their meanings – precisely the type of information that is just as abundant in formal L2 learning contexts. Following this line of thought, we argue that the asymmetry of our concreteness effect emerged because the less detailed semantic specifications of the L2 concrete primes could not activate the semantically richer L1 targets to the same extent as the L1 primes did with the L2 targets. Likewise, the comparable size of priming for the abstract translation pairs in both directions would be explained by a similar integration/representation of L1 and L2 abstract meanings in the bilingual lexicon.
In light of the above, we conclude by noting that it would be worth addressing the interactions between concreteness and the multifaceted nature of bilingual experience (e.g., different linguistic profiles and language combinations) in future investigations. We strongly believe that this line of research has the potential to be a fruitful approach to better understand the extent to which translation equivalents overlap in meaning and how that affects their processing.
Supplemental Material
sj-pdf-1-ijb-10.1177_13670069221146641 – Supplemental material for Are translation equivalents really equivalent? Evidence from concreteness effects in translation priming
Supplemental material, sj-pdf-1-ijb-10.1177_13670069221146641 for Are translation equivalents really equivalent? Evidence from concreteness effects in translation priming by Adel Chaouch-Orozco, Jorge González Alonso, Jon Andoni Duñabeitia and Jason Rothman in International Journal of Bilingualism
Supplemental Material
sj-pdf-2-ijb-10.1177_13670069221146641 – Supplemental material for Are translation equivalents really equivalent? Evidence from concreteness effects in translation priming
Supplemental material, sj-pdf-2-ijb-10.1177_13670069221146641 for Are translation equivalents really equivalent? Evidence from concreteness effects in translation priming by Adel Chaouch-Orozco, Jorge González Alonso, Jon Andoni Duñabeitia and Jason Rothman in International Journal of Bilingualism
Supplemental Material
sj-rmd-3-ijb-10.1177_13670069221146641 – Supplemental material for Are translation equivalents really equivalent? Evidence from concreteness effects in translation priming
Supplemental material, sj-rmd-3-ijb-10.1177_13670069221146641 for Are translation equivalents really equivalent? Evidence from concreteness effects in translation priming by Adel Chaouch-Orozco, Jorge González Alonso, Jon Andoni Duñabeitia and Jason Rothman in International Journal of Bilingualism
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Partial funding for the empirical work comes from the Language Learning Dissertation Grant and a PhD studentship from the University of Reading (ref GS18-009) awarded to A.C.-O., the Helping grant project from the Tromsø Forskningsstiftelse (TFS), 2019-2023 (J.G.A. and J.R.) Project PID2021-126884NB-I00 from the Spanish Government. (J.A.D. and J.G.A.), Project 2021-T1/HUM-20037 from the Comunidad de Madrid (J.G.A.) and Project LESS (code 326487) from the Norwegian Research Council (J.G.A.)
Supplemental material
Supplemental material for this article is available online.
Notes
Author biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
