Abstract
The purpose of this cluster randomized group study was to investigate the effect of multitiered, dual-language instruction on children’s oral language skills, including vocabulary, narrative retell, receptive and expressive language, and listening comprehension. The participants were 3- to 5-year-old children (n = 81) who were learning English and whose home language was Spanish. Across the school year, classroom teachers in the treatment group delivered large-group lessons in English to the whole class twice per week. For a Tier 2 intervention, the teachers delivered small-group lessons 4 days a week, alternating the language of intervention daily (first Spanish, then English). Group posttest differences were statistically significant, with moderate to large effect sizes favoring the treatment group on all the English proximal measures and on three of the four Spanish proximal measures. Treatment group advantages were observed on Spanish and English norm-referenced standardized measures of language (except vocabulary) and a distal measure of language comprehension.
Reading comprehension and academic achievement are dependent on oral language skills (Catts, Fey, Tomblin, & Zhang, 2002; Gough & Tunmer, 1986; Griffin, Hemphill, Camp, & Wolf, 2004; Storch & Whitehurst, 2002). Although interventions to promote code-related skills have proliferated, interventions to systematically teach oral language and its components, such as vocabulary, narratives, listening comprehension, and use of complex sentences (Cain & Oakhill, 2011; Elleman, Lindo, Morphy, & Compton, 2009; Mehta, Foorman, Branum-Martin, & Taylor, 2005; Verhoeven & van Leeuwe, 2008), remain largely unavailable to early-childhood educators (Zucker, Cabell, Justice, Pentimonti, & Kaderavek, 2013). Spanish-speaking children entering English-only elementary schools are in particular need of effective interventions that are strategically and intensely designed to prepare them for the academic language demands of school (Castro, Páez, Dickinson, & Frede, 2011). The purpose of this study was to examine the effect of an innovative instructional model designed specifically for young dual-language learners on children’s oral language skills preparatory to their entrance into kindergarten.
The Oral Language and Literacy Connection
Oral language is a unique and meaningful indicator of academic success (Barton-Hulsey, Sevcik, & Romski, 2017; Catts, Nielsen, Bridges, & Liu, 2016; Chaney, 1998; Clarke, Snowling, Truelove, & Hulme, 2010; Larney, 2002). Specifically, vocabulary (Bleses, Makransky, Dale, Højen, & Ari, 2016; National Institute of Child Health and Human Development, 2000), narrative ability (Griffin, et al., 2004), listening comprehension (Catts, Adlof, & Ellis Weismer, 2006), and the use of complex sentences (Craig, Connor, & Washington, 2003) are key contributors to reading comprehension. Limited reading comprehension can be the direct result of limited academic English oral language skills (Cain, Lemmon, & Oakhill, 2004; Catts et al., 2006; Dickinson, McCabe, Anastasopoulos, Peisner-Feinberg, & Poe, 2003; Hammer, Lawrence, & Miccio, 2007). Many young children with typical language-learning ability may not produce or understand language on par with academic expectations for a variety of reasons, including economic, cultural, and linguistic diversity (National Center for Education Statistics, 2017). The idea that children with language differences must wait until their language difficulties evolve into reading difficulty and poor academic performance to receive special, individualized help is problematic because with early identification and intervention, their difficulties may be prevented (Catts, 1993; Catts et al., 2006).
With the adoption of higher language and reading standards across states, expectations of what children are to understand and produce linguistically in school have likewise increased. Young children who have typical language-learning abilities but who are far behind their peers in English language development, for whatever reason, have few options. The outdated dichotomous system of general and special education cannot fully meet the needs of children with typically developing language skills who are learning English. More research is needed to develop effective models of instruction that are strategically designed to facilitate and hasten the acquisition of English (Vaughn et al., 2006).
Multitiered Systems of Support
One model that may have utility for promoting English language acquisition before children experience academic failure, is multitiered system of supports (MTSS). The idea of providing special services to children who are not performing as expected, irrespective of ability status, is not new. In 2004, the reauthorized IDEA (Individuals with Disabilities Education Act) clearly outlined the concept of response to intervention that has been shaped into the contemporary framework of MTSS. In general, MTSS is a framework for identifying children with emerging difficulties so that timely differentiated and preventative instruction can be dispensed according to individual children’s needs. As a conceptual basis for early identification and prevention (Fuchs & Deshler, 2007), MTSS is a paradigmatic model, not a formula, method, or procedure. Therefore, there are many effective ways to actualize the chief MTSS attributes, which are (a) multiple tiers of instruction and intervention, (b) students who need more support transition to more intense arrangements of intervention, (c) interventions are intensified by adjusting the duration and frequency of intervention, and the expertness of the interventionist, (d) educators other than classroom teachers assist in the delivery of targeted and intensive interventions, and (e) tiered placement is determined irrespective of special education classification (Marston, 2005).
MTSS has several advantages over the traditional general-special education dichotomy. Perhaps the greatest is that rather than focusing on what caused the delays, MTSS delivers supplemental intervention to all who need it, not just those with the appropriate diagnosis. Despite the success of MTSS for early reading intervention, language has been neglected. If the goal is to ensure all children receive what they need to succeed in school, then more systematic language intervention should be considered for children with language differences. In the traditional system, children who receive language supports experience no intermediate step such as Tier 2 intervention. There is no strategy for eliminating environmental confounds to language delays and no way to prevent language-related disabilities. Students go straight from classroom instruction to special education, and that pathway is only available to students who have a disability. Nonetheless, a multitiered approach for language, one that affords an intermediate, preventative step, is possible, especially in early childhood (Carta & Young, 2019; Durán & Wackerle-Hollman, 2019; Greenwood et al., 2013).
Dual-Language Approach to Intervention
Recent recommendations for creating powerful interventions for Spanish-speaking English learners include incorporating children’s first language (L1) to facilitate development of their second language (L2; Baker, 2000; Barnett, Yarosz, Thomas, Jung, & Blanco, 2007; Castro, Garcia, & Markos, 2013; Collier & Thomas, 2017; Coltrane, 2003; MacSwan & Rolstad, 2005; Restrepo, Morgan, & Thompson, 2013). Those who receive sustained dual-language instruction tend to be two to three years ahead of those who receive English-only instruction in terms of academic performance (Mahoney, MacSwan, & Thompson, 2005; Rolstad, Mahoney, & Glass, 2005). Collier and Thomas (2017) argued that sustained L1 and L2 instruction engages sociocultural, linguistic, cognitive, and academic processes that lead to high academic achievement in children’s L2. Furthermore, they posit that when schools provide strong dual-language programs, children from low–socioeconomic backgrounds overcome the negative effects of poverty. Such sentiments are echoed in the recent National Academies of Sciences, Engineering, and Medicine (2017) report on promoting educational success of children learning English, to include recommendations for incorporating children’s L1 and involving families in the promotion and retention of their home language.
The possibility of skills learned in one language transferring with minimal direct teaching to another language helps explain the facilitative effects seen in dual-language instruction research (Méndez, Crais, Castro, & Kainz, 2015; Miller, Heilmann, Nockerts, Iglesias, Fabiano, & Francis, 2006; Proctor, Carlo, August, & Snow, 2005; Restrepo et al., 2013; Rolstad et al., 2005). That is, when children receive strategic language instruction in L1, it is possible that their knowledge and skills transfer to L2, and vice versa in some cases (Marian & Kaushanskaya, 2007). It is theorized that cross-language interactions will occur across structures that have a similar, underlying cognitive schema (MacWhinney, 1999). Schemas are the mental organization of prior experiences (Anderson & Pearson, 1984), and such schemas can be expressed through narration (Stein & Glenn, 1979). Narrative organization is very similar across English and Spanish, which implies that the narrative schemas for the two languages are similar. This underlying similarity suggests that narrative structure will have linguistic reciprocity between L1 and L2 (and vice versa). For example, Petersen, Thompsen, Guiberson, and Spencer (2016) found that the effects of an L2 intervention targeting narrative and linguistic structures transferred to typically developing children’s L1. In vocabulary programs, this transfer is evidenced by the faster acquisition of concepts from L1 instruction to L2 than when children receive the instruction only in L2 (English in the case of the United States; Perozzi, 1985; Perozzi & Chavez Sanchez, 1992). Moreover, Miller et al. (2006) found that sentence complexity and story structure in L1 at school entry predicted academic achievement in L2 in Spanish-English dual-language learners. These studies, correlational and causal, indicate that one language can facilitate the acquisition of another language and that the stronger the child’s L1, the greater their acquisition in L2.
The Current Study
This study represents an early efficacy pilot study to determine the promise of a multitiered dual-language curriculum for a large-scale efficacy trial. As such, it was particularly important to understand the extent to which measures of narrative, vocabulary, language comprehension, and general language abilities could be affected. Therefore, we addressed the following research questions:
Because the curriculum was new, the extent to which preschool teachers perceived it to be feasible in their classrooms was not known. Feasibility of an intervention can depend on how well teachers like it, its contextual fit to the school system, how well teachers understand it and how to deliver the lessons, and the extent to which teachers can make reasonable modifications. Therefore, we also examined the curriculum’s feasibility in a secondary research question:
Method
Setting and Participants
This study was conducted in Head Start preschool classrooms in a southwestern state. During the spring prior to the commencement of the study, the first author gave a presentation regarding the study to administrators of two Head Start grantees (one urban and one rural). Once the administrators volunteered to allow their centers to participate, the first and second authors visited each center to speak directly with teachers about the study. Head Start teachers who were interested in participating signed an informed consent form and completed a demographic survey. When school started at the beginning of August the next year, the research team collected parental permission for children to participate. Using the parent-completed forms at their sites, the teachers identified children from Spanish-speaking homes. All children for whom Spanish was one of the languages spoken at home were invited to participate.
Teachers/Classrooms
In total, 25 classrooms were included in this study. The classrooms were randomly assigned to treatment and control groups at the completion of the consenting and screening process, resulting in 12 classrooms in the treatment group and 13 in the control group. One lead teacher and one teaching assistant provided instruction to 18 to 20 children (3-, 4-, and 5-year-olds) in each classroom. Although efforts were made to recruit classrooms that had at least one teacher or teaching assistant who spoke Spanish fluently, given the available workforce and frequent turnover, three of the treatment classrooms and five of the control classrooms were without a Spanish-speaking teacher or teaching assistant. Children in 18 (nine in treatment and nine in control) of the classrooms attended preschool Monday through Thursday. In the remaining seven classrooms, children attended 5 days a week. All the teachers reported using Creative Curriculum (Dodge, Colker, & Heroman, 2002) as their core curriculum, which was complemented by Teaching Strategies Gold (Heroman, Tabors, & Teaching Strategies, Inc., 2010). The Head Start programs completed Classroom Assessment Scoring System (CLASS; Pianta, La Paro, & Hamre, 2008) observations of all of their teachers during September or October of the school year. These data are reported, along with additional information about the teachers and classrooms, in Table 1.
Teacher and Classroom Characteristics
Children
During the recruitment phase, the research team went to each center during drop-off or pickup times and met with the parents or guardians of the children. The researchers explained the study to the parents in their preferred language (Spanish or English). Consent was obtained from the parents of 144 children of ages 3 to 5 years who were exposed to Spanish at home. Once signed consents were obtained, the research team administered screening measures to assess the children’s language skills in English and Spanish. Screening involved the use of the Expressive Vocabulary (EV) subtest of Clinical Evaluations Language Fundamentals–Preschool (CELF-P; Semel, Wiig, & Secord, 2004; Wiig, Secord, & Semel, 2009), a norm-referenced test of language, and the Narrative Language Measures (NLM) Listening retell subtest of the CUBED Assessment (Petersen & Spencer, 2016).
The goal for participant recruitment was to identify Spanish-speaking children who did not perform according to age expectations on English measures, indicating that they may benefit from a Tier 2 oral language intervention. To select participants, we conducted a multistep process. First, we examined the children’s English NLM Listening retell scores, and any child who earned a retell score of 8 or higher in English was excluded. A retell score of 8 presupposes the use of key story grammar features and places a preschool student above the 20th percentile based on normative data from 281 preschool students across the United States (Petersen & Spencer, 2016). Second, children who earned an English retell score of 0 to 7 but scored within the normal range on the English EV subtest of the CELF-P were also excluded. In other words, scores within age expectations for English on either screening measure disqualified children from being participants. Therefore, children who displayed low English skills and low, moderate, or high Spanish language skills were included as participants. The screening process resulted in 43 children recruited for 12 treatment classrooms and 40 children in 13 control classrooms. Shortly after the pretesting, 2 children from the control group moved away from the area, which resulted in 38 children in the control group.
In five of the 12 treatment classrooms, more than three children qualified to be research participants (i.e., they could potentially benefit from Tier 2 intervention). However, the teachers were not able to feasibly provide the Tier 2 intervention to more than one group every day. Therefore, only three children in each class were able to receive the intended multitiered instruction, and the rest of the children (
The parents completed a brief survey to report demographic information about their children. Child characteristics are shown in Table 2. The parents also reported their highest level of education and annual family income. Only 7% of the treatment group’s parents had attended college, with two of them having earned a college degree, and 8% of the control group’s parents had attended college, with none having earned a college degree. Only 26% of the treatment group parents and 21% of the control group parents reported the family’s annual income to be more than $22,000.
Child Characteristics
Research Assistants
Research assistants (RAs) were responsible for all screening, data collection, and supporting teachers as they implemented the intervention. The RAs visited each classroom once or twice a week to check in with the teachers and teaching assistants and to conduct fidelity observations. The first author completed rigorous training with the RAs prior to their participation in the study. Because they were all needed to observe fidelity, support teachers’ delivery of the intervention, and collect data, they were not blind to assignment.
Video Manual and Training
We created a video manual to explain the rationale and teaching procedures needed to deliver the multitiered language curriculum. The video manual consisted of 13 short (5–15 minutes) modules that covered the active ingredients of the program, its materials, and guidelines for delivering lessons. During a full-day group training prior to the beginning of the school year, the modules were played one by one for the teachers, teaching assistants, and directors from the treatment group. Each teacher and teaching assistant practiced teaching a lesson to the other attendees. Question-and-answer sessions were interspersed throughout the day to address any questions or concerns. In addition to the training, the teachers were given their own flash drives with the video manual so that they could review any module at any time throughout the year. Once they began using the curriculum, the RAs spent 1 to 2 weeks coaching the Head Start teachers and teaching assistants until they felt comfortable delivering the lessons independently.
Research Design and General Procedures
Because the 81 child participants were nested within classrooms, a cluster-randomized group study design was employed to investigate the effect of multitiered dual-language instruction on the children’s language skills. After the children were screened and included as research participants, the RAs completed the pretesting (September). The intervention consisted of three units of instruction (Units A–C), with each unit lasting 8 to 10 weeks. Throughout the school year, the children in both control and intervention classrooms were administered several proximal and distal measures to examine the extent to which the multitiered curriculum affected important child outcomes. Dependent variables included narrative retells, receptive vocabulary, listening comprehension, and general oral language abilities (e.g., understanding and use of grammar). Posttesting was completed at the end of the study (April/May); however, the proximal measures (e.g., receptive vocabulary and narrative retells) were repeated four times across the year to ensure that the participants’ skills were assessed both before and after each of the three units of instruction. The Head Start teachers and teaching assistants completed all of the intervention components by integrating them within the routine of their classroom, although each teacher decided how and when to implement each component.
All the research activities, including assessments and intervention, took place in Head Start classrooms. In an effort to minimize noise and distractions, the RAs conducted the assessments with individual children during scheduled activities that were moderately quiet (e.g., drop-off and pickup times, as the children finished snack time, and when the class was at circle time). Although there were a large number of assessments that were administered to the children individually and repeatedly, all of the assessments were extremely brief (most were under 5 minutes), and only one assessment was completed at a time.
Multitiered Dual-Language Narrative Curriculum
The multitiered dual-language narrative curriculum is called Puente de Cuentos (“Bridge Made of Stories”). It features 36 English stories (three units of 12 stories each), with 36 corresponding Spanish stories. Each story was written to include two target vocabulary words (e.g.,
Stories served as the basis for language instruction in small-group and large-group arrangements. Lessons were scripted for teachers and adhered to a consistent format across the three units. During each lesson, the teacher or teaching assistant read the featured story and then guided the children through a series of activities designed to help them learn the meaning of target words and to retell the stories. Some activities required children to respond together as a group to increase active responding, whereas other activities required children to respond individually. When individual children retold the featured stories, they were prompted (and supported) to use all of the story grammar elements, the target vocabulary words, and complex sentences (e.g., with coordinating and subordinating conjunctions).
The Head Start teachers and teaching assistants worked together to determine how they would deliver the components of Puente de Cuentos. All the children in the classrooms participated in the large-group activities, but the research participants received small-group lessons in addition to the large-group lessons as their Tier 2 intervention. A typical implementation consisted of two English large-group lessons, two Spanish small-group lessons, and two English small-group lessons each week. Spanish small-group lessons preceded English small-group lessons to facilitate cross-language transfer. In the three treatment classrooms that did not have a Spanish-speaking teacher or teaching assistant, the children only received English large- and small-group lessons, each twice a week. In addition to the explicit, teacher-led instruction, the teachers embedded several child-directed extension activities throughout their daily routine.
Parents of the children who qualified for the Tier 2 Puente de Cuentos intervention in the classroom received a set of family engagement activities in Spanish. Each activity featured one of the 72 stories from the Puente de Cuentos curriculum and listed questions and suggestions on how to help their children to retell the story and to use the target words in Spanish.
The control group was considered a “business as usual” condition. The center directors reported that teachers used small-group instruction to differentiate individual students but most consistently delivered instruction in large groups. Because the teachers did not have access to a Spanish curriculum or a systematic Spanish program, instruction was completed in English with occasional directions or explanations in Spanish (if the teacher spoke Spanish).
Proximal Measures and Data Collection
Narrative Language Measures Listening
The NLM Listening is a subtest of the CUBED Assessment (Petersen & Spencer, 2016). To collect retell language samples in English and Spanish using NLM Listening, the RAs read a brief story to a child, and the child retold the story. The RAs scored the children’s retells in real time, giving points for each story grammar element and indicators of complex language use (e.g., subordinating conjunctions such as
Receptive Picture Vocabulary Assessment
The researcher-designed receptive picture vocabulary assessment measured the children’s mastery of the Spanish and English words targeted in the Puente de Cuentos curriculum. The children were shown four different black-and-white line drawings and asked to point to the target word.
Distal Measures and Data Collection
Assessment of Story Comprehension
The Assessment of Story Comprehension (ASC; Spencer & Goldstein, 2019) is a narrative-based, criterion-referenced assessment for preschoolers. It is only in English. During its administration, the RAs read a short story to a child, then asked a series of factual and inferential questions. Examiners wrote the children’s answers word for word on record forms and rated each answer for correctness and clarity on a 0 to 2 or 0 to 3 scale, yielding a total of 17 possible points. Six parallel forms were administered, three at pre-intervention (September) and three at postintervention (April/May). The highest score was used for analysis. Because the ASC stories are significantly different from the Puente de Cuentos stories and children answer factual and inferential questions instead of retelling stories, it is considered a distal measure of language comprehension.
Clinical Evaluation of Language Fundamentals: Preschool
The CELF-P in English and Spanish (Semel et al., 2004; Wiig et al., 2009) includes three language subtests that measure general oral language proficiency. The Sentence Structure (SS) subtest requires children to point to pictures corresponding to a spoken sentence. The Word Structure (WS) subtest requires an expressive response that examines children’s grammatical abilities. In the EV subtest, children label pictures of objects and actions. The EV subtests of the English and Spanish versions were used for screening, but participants who qualified for Tier 2 intervention also completed the SS and WS subtests in English and Spanish as part of pretesting. Raw scores were calculated and used in the analysis.
Feasibility Measures and Data Collection
Usage Rating Profile–Intervention
At the end of the intervention phase, the classroom teachers and teaching assistants completed the Usage Rating Profile–Intervention (URP-I; Chafouleas, Briesch, Riley-Tillman, & McCoach, 2009). The URP-I consists of 35 questions, each with 6-point Likert scale responses regarding four intervention dimensions: acceptability, understanding, feasibility, and system support. Because each dimension has a different number of items, we converted scores to percent so they can be interpreted.
Fidelity Checklists
The RAs monitored the fidelity of the Puente de Cuentos lessons. During each observation, an RA completed a fidelity checklist that documented adherence (12 items), responsiveness (3 items), and quality (9 items) of the intervention (Dane & Schneider, 1998). The RAs recorded the fidelity of 21% of large-group lessons, 21% of Spanish small-group lessons, and 17% of English small-group lessons. To yield a percent fidelity, the number of items completed as intended or with high quality was divided by the total number of items on the checklist and multiplied by 100.
Intervention Logs
To capture information about the extent to which the children received the intended dose, the researchers provided intervention and attendance logs for each classroom. Dose for each type of teacher-directed lesson (i.e., large-group English, small-group Spanish, or small-group English) was recorded, as well as how many extension activities were completed and for which words and concepts.
Implementation Survey
At the end of the school year, the Head Start teachers completed a short survey. This consisted of nine researcher-generated questions that probed the teachers’ perceptions about the modifications completed and needed, planned sustainment, and contextual fit of the Puente de Cuentos curriculum in Head Start settings. Responses were rated using a Likert-type scale of 1 to 5.
Results
Descriptive statistics for the focal measures are shown in Table 3. Less than 1% of the scores were missing overall (18/2754 = 0.0065, or 0.65%), and all available data were used in the multilevel model results that follow. Detailed descriptions of the analyses are available in the online supplemental material.
Descriptive Statistics for Pretest, Posttest, and Adjusted Posttest Scores by Treatment Group
Adjusted posttest means have been adjusted for group differences on the pretest and were used in conducting the analyses of covariance.
Proximal Child Outcomes
We evaluated baseline equivalence across the treatment and control groups on pretest measures. As shown in Table 4, tests of pretest differences on these measures were nonsignificant (
Unconditional Pretest ICCs and Tests of Baseline Equivalence (Random-Intercept ANOVAs a ), with Hedges’ g Effect Sizes With Small-Sample Adjustment
Solutions for S NLM, E Vocab B, S Vocab B, E WS CELF, S SS CELF, and S EV CELF are equivalent to general linear model–based ANOVAs, as the between-class random-intercept variance component estimate was 0 (or near 0).
NLM Listening English and Spanish
On the English and Spanish NLM posttests, tests of the estimated difference between groups on the adjusted means in the random-intercept analyses of covariance showed statistically significant differences in favor of the treatment group (see Table 5). The 95% confidence intervals, although somewhat wide given the pilot study sample size, support the estimated positive effects for the treatment group. On the English NLM, the effect size was large (
Unconditional Posttest ICCs and Tests of Postintervention Differences in Adjusted Means (Random-Intercept ANCOVAs a ), with Hedges’ g Effect Sizes and Improvement Indexes
Solutions for E NLM, S NLM, and E SS CELF are equivalent to general linear model–based analyses of variance, as the between-class random-intercept variance component estimate was 0 (or near 0).
Adjusted posttest means were adjusted for group differences on the pretest and used in conducting the ANCOVAs.
Receptive Picture Vocabulary Assessment
With the exception of the posttest for Spanish Unit B, tests of the estimated difference between groups on the adjusted posttest means for English and Spanish vocabulary were statistically significant, favoring the treatment group (see Table 5). Effect sizes for these five measures (English Vocabulary A, B, and C; Spanish Vocabulary A and C) were moderate (
Distal Child Outcomes
As shown in Table 4, tests of pretest differences on distal measures were not significant with small to moderate effect sizes (
Assessment of Story Comprehension
The random-intercept analysis of covariance on the ASC adjusted posttest means was statistically significant, with a moderate effect size (
Clinical Evaluation of Language Fundamentals—Preschool
Results for adjusted posttest differences between the treatment and control groups differed across the CELF-P SS, WS, and EV subtests, but were very consistent for subtests across English and Spanish. The treatment group clearly outperformed the control group on SS, evidenced by statistically significant differences, moderate effect sizes (
Differences in adjusted posttest means were not statistically significant for WS in either language, but effect sizes approached moderate (
Feasibility
Usage Rating Profile–Intervention
Mean percentage for each dimension of the URP-I is displayed in Figure 1. Higher scores in acceptability, understanding, and feasibility suggest that the intervention was perceived as useful and doable. The teachers and teaching assistants reported Puente de Cuentos to be more acceptable than feasible, although the scores for both characteristics were moderately high. The teachers also reported having a good understanding of the curriculum. For systems support, the teachers reported lower scores than for the other dimensions, but because of the nature of the scale, higher scores were not necessarily desired.

Teachers’ mean ratings on the Usage Rating Profile–Intervention.
Fidelity
After the Head Start teachers and teaching assistants felt comfortable delivering the lessons (1–2 weeks), the RAs began assessing their intervention fidelity using the fidelity checklists. The teachers and teaching assistants demonstrated consistently high fidelity to the Puente de Cuentos procedures. For the small-group lessons in Spanish, the mean fidelity scores were 97%, 96%, and 98% for Units A, B, and C, respectively. For the small-group English lessons, they were 97%, 96%, and 97%, respectively. For the large-group lessons, fidelity was slightly lower, with mean fidelity scores of 91%, 97%, and 94% for the respective units.
Intervention Logs
Based on a review of the intervention logs the teachers completed, very few teacher-directed lessons were omitted, with the exception of small-group lessons in Spanish in the three treatment classrooms without a Spanish-speaking teacher. All planned lessons had been implemented by the middle of May. The small-group intervention portion of the log revealed that all the research participants were present for at least 85% of the Tier 2 lessons intended for them. Moreover, 90% or more of the target words and concepts were addressed through extension activities in all the treatment classrooms.
Implementation Survey
Mean ratings of all the teachers and teaching assistants who completed the implementation survey are displayed in Table 6. Overall, they reported that they made few modifications during the study, but some had plans to make more. Most of the teachers planned to continue using Puente de Cuentos after the study. The mean ratings indicate that there is a reasonable contextual fit between the intervention and their values, students, and setting.
Implementation Survey Results
Discussion
The importance of building oral language skills is clear as there is a strong link between oral language and reading comprehension (Cain et al., 2004; Chaney, 1998; Clarke et al., 2010; Dickinson & Tabors, 2001; Larney, 2002). Vocabulary and narrative skills are particularly important areas to develop early so that children can benefit more from subsequent instruction and comprehension of what is read to them and what they read (Cain & Oakhill, 2011; Elleman et al., 2009; Mehta et al., 2005; Verhoeven & van Leeuwe, 2008). If oral language instructional efforts can incorporate children’s L1 and produce meaningful improvements in English, there is an added benefit of helping cultivate a bilingual and biliterate society (Collier & Thomas, 2017). The purpose of this early-stage efficacy study was to examine the extent to which multitiered dual-language instruction improved children’s Spanish and English language skills on proximal and distal measures of vocabulary, narrative retells, language comprehension, and general language abilities.
Proximal Measures of Vocabulary and Narrative Retell
Consistent with prior English, oral narrative–based language intervention studies that have focused on proximal outcomes (e.g., Spencer, Petersen, & Adams, 2015; Spencer, Petersen, Slocum, & Allen, 2015; Spencer, Weddle, Petersen, & Adams, 2017), we found statistically significant effects for narrative retells in English. Narrative retelling was the most salient instructional activity in the Puente de Cuentos instruction, with all the large-group and half of the small-group lessons based on English stories. The teachers supported the children’s practice of each model story, English vocabulary, and English language complexity through retelling activities in every lesson. Only half of the small-group lessons featured Spanish story retelling, which may account for the differences in the effect sizes for English (
Improvements on the researcher-made receptive vocabulary assessment were statistically significant for all three units of English words, and significant for two of the three units in Spanish. All effect sizes were considered educationally meaningful (
The meaningful improvements in Spanish receptive vocabulary suggest that the combined dose of small-group Spanish lessons in the classroom and family engagement activities was sufficient to help the children learn the words in Spanish. In previous studies, we found little evidence of improvement on the Spanish receptive vocabulary assessment but adequate evidence for improved English vocabulary (Spencer, Moran, Petersen, Thompson, & Restrepo, 2018; Spencer, Petersen, Restrepo, Thompson, & Gutierrez Arvizu, 2019). In these studies, children received English instruction in large groups, in small groups, and through extensions (e.g., storybook reading and child-directed center activities); however, Spanish instruction was only delivered in small groups twice a week for 20 minutes. In the current study, the children’s families received a set of family engagement activities that aligned with all the lessons, and they were only in Spanish. We speculate that by boosting the children’s exposure to the Spanish vocabulary through the family engagement activities, a better English-Spanish instructional balance was created. Another difference between previous studies and the current study was that all the families viewed a video module that showed them how to use the family engagement activities to facilitate storytelling, encourage the use of the target words, and help their children answer questions about the stories. Because we did not isolate the effect of the family engagement activities, this supposition will require replication and more rigorous investigation in the future.
Distal Measures of Language Comprehension and General Language Skills
The chain of logic for building vocabulary and narrative skills is that, if truly successful, improvements will also be detected on language-related measures that do not closely match the intervention. If children’s language comprehension can be improved before they enter kindergarten, there is a chance that their future reading comprehension will also benefit. Although this was not investigated experimentally in this study, other research suggests that language outcomes mediate the effects of language intervention on reading comprehension for students in primary grades (Bowyer-Crane et al., 2008; Clarke et al., 2010; Language and Reading Research Consortium, Jiang, & Logan, 2019). It is the same logic that underpins early-childhood intervention aimed at enhancing language comprehension. The ASC is a standardized, criterion-referenced assessment tool that uses stories and comprehension questions to assess children’s language comprehension skills, similar to the common reading comprehension tasks in elementary grades. The ASC stories are longer and more complex than the stories featured in the multitiered curriculum and were strategically designed to capture inferential comprehending. At pretest, the children’s scores were extremely low, indicating that they were unable to answer questions about a story. This could mean that they did not understand the stories or that they understood the stories but had insufficient expressive language to respond to the questions. At posttest, the children in the treatment group showed small, but important, gains over the children in the control group. Although we can be confident that the multitiered dual-language instruction was responsible for the observed gains, there is substantially more room for growth, as the children in the treatment condition had mean posttest ASC scores of 4.24 out of a total of 17 possible points. Given that the ASC is a distal measure and the children were not directly trained in the intervention to answer factual and inferential questions, this is considered a meaningful outcome with a moderate effect size (
As further evidence of promise, the multitiered dual-language instruction had a statistically significant impact on the treatment group’s scores on the CELF-P SS subtest in English and Spanish. Although not statistically significant, group differences on the English and Spanish WS subtest were meaningful, with moderate effect sizes. This pattern of responding corresponds to developmental expectations. SS is a receptive task in which children point to the picture that corresponds to the sentence the examiner says, while WS requires children to produce a grammatically complex phrase or sentence. It is reasonable that children learn to understand a second language before they are able to speak it. It is possible that stronger effects would be seen if children received 2 years of focused dual-language instruction.
The children in the treatment group did no better than the children in the control group on the EV subtest of the CELF-P in English or in Spanish. Again, this is a consistent and expected pattern. Both groups made equivalent gains from pretest to posttest in English and no gains in Spanish. Although the intervention targeted a large number of words, none of them are on the EV subtests. The EV subtests feature words that are commonly learned in preschools, such as
Feasibility
The URP-I data suggest that the Puente de Cuentos curriculum was generally acceptable to the teachers and teaching assistants. It was easy to understand and regarded as feasible in their setting. This is further evidenced by the high fidelity of lesson delivery and completion of all of the planned lessons before the end of the year. The dimension of systems support of the URP-I cannot be interpreted as easily because high scores suggest that the teacher is unable to implement the intervention without the help of others and low scores indicate that they can easily implement the intervention on their own. A multitiered instructional system necessitates a fair amount of help and teamwork. For example, administrative support is needed so that schedules can be adapted and lesson plans modified to better fit a multitiered delivery. Moreover, the teachers often divided the lesson delivery responsibilities with the teaching assistant, which is an acceptable use of personnel and resources.
The implementation survey revealed that the teachers implemented the program as it was designed, although some of them made changes to the duration of lessons and the materials and activities used. The ability to modify a research-based practice has been associated with sustainability of that practice (Klinger, Vaughn, Hughes, & Arguelles, 1999), so midrange scores (2.00–4.00) on the implementation survey may indicate that teachers feel empowered and knowledgeable about how to adapt Puente de Cuentos for their classrooms. Multitiered instructional systems may pose paradigmatic shifts for early-childhood educators. Likewise, not all early-childhood professionals value teacher-directed instruction. We attribute the high contextual fit scores (4.08–4.83) to the balance of short explicit instruction sessions with child-directed activities in Puente de Cuentos.
Contributions to MTSS in Early Childhood
The implementation of MTSS across early-childhood settings has been limited, and multitiered systems of language support have rarely been attempted or reported in the research literature. This is one of the first studies to report on the efficacy and feasibility of a dual-language multitiered curriculum for preschool children. The promise of MTSS transcends special education and extends services to any and all students who may need extra support. Thus, through MTSS, students who are not meeting English language expectations due to various external factors are eligible to receive the language support they need. Dual-language instruction has been shown to have an equal or stronger impact on academic performance as compared with English-only approaches (Collier & Thomas, 2017; Collins, 2014; Mahoney et al., 2005; Rolstad et al., 2005), with the added benefit of sociocultural, socioeconomic, linguistic, and cognitive gains (Collier & Thomas, 2017). The implementation of a dual-language multitiered system of support merges two powerful, evidence-based approaches. With a tiered system in place that provides special services to all students in need of additional support, and a focus on both L1 and L2, there is a real possibility of meaningful change and, for the first time, significant improvement in reading outcomes for dual-language learners.
Limitations and Future Directions
Despite the valuable contributions this study makes to the literature on dual-language interventions and to the literature on MTSS in early-childhood contexts, there are a number of limitations and points to consider for future research. First, because this was a pilot early-efficacy study, we were limited by our financial resources. These limitations reduced the number of classrooms that could realistically be managed and our ability to monitor conditions in the control classrooms. The small sample may be responsible for the lack of statistical significance found for Spanish receptive vocabulary Unit B and for the WS subtest in English and Spanish. It is possible that significance will be observed when a larger, fully powered efficacy trial can be completed. A second limitation is also related to resources. We were unable to mask the classrooms’ assignment to conditions because all of the RAs were needed to collect pre- and posttest data and observe the teachers for fidelity. With greater financial resources, a second group of data collectors can remain blind to condition.
A number of limitations were related to the dose of the intervention. Several children assigned to the treatment group did not receive the full intervention for various reasons. First, we were unable to be more selective about the classrooms we recruited to participate. Three of the treatment classrooms did not have a Spanish-speaking teacher or teaching assistant. This meant that nine children in the treatment group received multitiered English language instruction instead of dual-language instruction. There were also more control classrooms without a Spanish-speaking teacher than treatment classrooms. Second, in five of the treatment classrooms, more than three children qualified to be research participants. Because the teachers did not have the time to conduct more than one small-group intervention every day, they selected three children for Tier 2 intervention, and the rest (
Although not necessarily weaknesses of the current study, there are a few recommendations that future research in this area can address. The extent to which Spanish instruction benefited the children should be examined in future research. We did not attempt to isolate the effect of the Spanish components or examine cross-language transfer directly, but future researchers should plan for a systematic and rigorous analysis of the value added of using children’s L1 in multitiered dual-language instruction. Likewise, the impact of the small-group instruction on top of the large-group instruction is assumed to have added benefit. However, this should be examined empirically, comparing different variations and possible configurations of the Puente de Cuentos curriculum.
Supplemental Material
Final_Online_Appendix_Material_for_AERA_Open – Supplemental material for Early Efficacy of Multitiered Dual-Language Instruction: Promoting Preschoolers’ Spanish and English Oral Language
Supplemental material, Final_Online_Appendix_Material_for_AERA_Open for Early Efficacy of Multitiered Dual-Language Instruction: Promoting Preschoolers’ Spanish and English Oral Language by Trina D. Spencer, Meghan Moran, Marilyn S. Thompson, Douglas B. Petersen and M. Adelaida Restrepo in AERA Open
Footnotes
Acknowledgements
The research reported here was supported by the Institute of Education Sciences, U.S. Department of Education, through Grant R305A140093 for $1,481,960. The opinions expressed are those of the authors and do not represent the views of the Institute or the U.S. Department of Education.
Authors
TRINA D. SPENCER is an associate professor at the Rightpath Research & Innovation Center at the University of South Florida. Her research interests include multitiered system of supports, curriculum and instructional design, and early-childhood language and literacy.
MEGHAN MORAN is an instructor in the Department of Applied Linguistics at Northern Arizona University. Her areas of interest include teaching English as a second language.
MARILYN S. THOMPSON is a professor in the School of Social and Family Dynamics at Arizona State University. Her areas of expertise include quantitative data analysis and children’s academic development.
DOUGLAS B. PETERSEN is an associate professor in the Department of Communication Disorders at Brigham Young University. His areas of interest include assessment development, dynamic assessment, and dual-language learners.
M. ADELAIDA RESTREPO is a professor of speech and hearing science at Arizona State University. She researches bilingualism and bilingual language and literacy interventions.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
