Effects of a Video Featuring Connected Speech Instruction on EFL Undergraduates in Taiwan

Abstract

This study aimed to improve the English speaking ability of Taiwanese college freshmen through a video featuring connected speech instruction. Forty-eight students from a private university in northern Taiwan participated in the study, which lasted for 7 weeks. Pre- and post-tests were used to assess their speaking performance in terms of connected speech before and after the experimental treatment. Entry and exit questionnaires were also used to investigate students’ learning attitudes. The results show that such instruction significantly improved the learners’ connected speech skills. The questionnaire results were also positive, showing significantly enhanced attitudes to learning English speaking. It is hoped that the results may offer language teachers some insights into the practice of video-aided learning in English speech classes, particularly its efficacy for connected speech.

Keywords

Video instruction, speaking competence, connected speech

Introduction

Over the past few decades, multimedia technology has rapidly grown in popularity in the field of foreign language teaching because of its numerous advantages (Abrams, 2002; Al-Jarf, 2004; Chun, 2016; H. C. Huang, 2015; Kubler, 2018; Meskill & Anthony, 2005; Salaberry, 2001; Warschauer, 2000). Of all the educational technologies, video-enhanced instruction has been one of the most widely discussed in both research and educational settings (Chun, 2016; Davis & Vincent, 2019; C. Lai et al., 2018; Y. J. Lin & Wang, 2018). This may be because videos often contain authentic aids (Mayora, 2009), rich audio-visual information (e.g., sound effects, facial expressions, body language, and other visual clues), and are full of cultural references (Canning-Wilson & Wallace, 2000; Galbraith & Rodriguez, 2018; Ketcha, 2019; Y. J. Lin & Wang, 2018). The visual and special nature of videos can easily convert complicated concepts into straightforward ideas for learners to retain, making video-based materials a strong reinforcement of second language acquisition (L.-F. Lin, 2010, 2011).

The multiple benefits of video-based instruction are in line with Bax’s (2011) view of the normalization of technology in language education, which refers to the stage where educational technologies are so commonly applied in language learning settings that users are unaware of their roles as effective elements in the learning process. Specifically, Bax (2011) proposed a neo-Vygotskian perspective on pedagogical technology in language classrooms, recognizing the complicated interaction between technology and the classroom’s social activities. While Vygotsky’s communicative theory helps educators study how computers assist in learning, a neo-Vygotskian perspective emphasizes learning and cognitive development (Bax, 2011). As a broad approach, it includes “cultural psychology, sociocognitive-developmental theory and sociohistorical theory” (Bax, 2011, p. 6), highlighting the fact that learning is a social communicative process, not an individualized one (Mercer & Fisher, 1997). Indeed, a learning context such as the one described above is judged to have particular potential to develop EFL learners’ language skills, such as speaking (L.-F. Lin, 2010, 2011; Watkins & Wilkins, 2011). Many previous studies (Hişmanoğlu, 2006; C. K. Hsu, 2015; Y. H. Lai, 2010; Mayora, 2009; Sun & Yang, 2015; Weyers, 1999; Wu et al., 2017) have also generated evidence in favor of using video technology to teach speaking skills, indicating that it can help learners by enhancing self-confidence, motivation, long-term listening comprehension, vocabulary learning, and pronunciation skills.

Despite the strong endorsement of the use of educational technology, its effects have not yet been fully exhausted in contexts such as Taiwan (Gu et al., 2021), particularly in terms of students’ English speaking proficiency in general and their specific performance in connected speech. To begin with, Taiwan is an EFL milieu with limited exposure to English. It is thus much more difficult to acquire adequate speaking competence than it is for those in an ESL milieu or whose mother tongue has a language system similar to English. As noted by Gilakjani (2011), the lack of motivation, limited exposure to the target language, insufficient emphasis by language teachers on pronunciation, and mother-tongue interference with English sounds and rules are four common obstacles that EFL learners may encounter in an EFL learning classroom. With regard to this, video-based instruction may help create authentic, linguistic input to improve Taiwanese EFL students’ speech development. It would seem especially helpful to look at whether or not a video featuring connected speech instruction may be effective in a Taiwanese setting. Connected speech, as Field (2003) defines it, confers intelligibility which features the acoustic content of the oral message recognized by interlocutors. This is often seen in the speech of native English speakers, who generally talk fast and without breaks, full of connected speech features such as contraction, intrusion, elision, assimilation, and weak forms (Brown, 1990; Cauldwell, 2002; Field, 2003). However, as Y. H. Lai (2010) and Liang (2015) found, Chinese-speaking EFL learners, like Taiwanese, often had difficulties in pronouncing such sounds, particularly when tense and lax vowels were involved, due to the fact that they were not in the Chinese language system. This finding was later supported in the paper by Wong et al. (2019). 
They submitted that difficulties in connected speech usually stem from the articulatory variations between one’s mother tongue and the target foreign language. Unfortunately, for the purpose of demonstrating clear and comprehensible speech, Taiwanese language teachers tend to articulate every single English word at the cost of connected speech. In such circumstances, most EFL learners may “have considerable difficulty in understanding what is being said” by native speakers in a real-life context, as Brown (1990, p. 6) warned. This stark reality is what Taiwanese EFL students have been wrestling with (D.-C. Hsu, 2015). Therefore, it is proposed here that authentic connected speech must be taught in Taiwan’s EFL classrooms, and the effects of using videos featuring this specific speaking skill are worth investigation.

Connected Speech and Speaking Ability

Many researchers have perceived that speaking is a complex and remarkably challenging cognitive skill. It requires various mechanisms to operate simultaneously. One area that is particularly difficult for learners to master is connected speech. According to Weinstein (2001), connected speech occurs while one is speaking at a natural speed, in which impromptu pronunciation is altered by adjacent words or sounds. Rosa (2002) indicates that connected speech, such as reduced forms, is common in spoken English, and can be identified in all registers regardless of the speech rate. Brown and Kondo-Brown (2006) describe connected speech as the result of “the continuous chains in normal spoken language and conversation as compared with the typical linguistic analysis of individual phonemes analyzed in isolation” (p. 284). They note that connected speech exists in “all levels of speech” (p. 5), from formal conversations to small talk.

The importance of connected speech is nowadays given increasing weight in the teaching of pronunciation and has led to other relevant issues being addressed in various pronunciation textbooks (e.g., Hagen, 2000; Weinstein, 2001). In spoken language, phonological processes such as reduction, elision, assimilation, and contraction are the four main pillars in constructing connected speech. According to Griffee (1995, p. 28), “connected speech is the natural way we speak, linking together and emphasizing certain words, rather than each word standing alone.” Regardless of how it actually works in native speakers’ speech, in English courses, connected speech may be uncommon for teachers to utter or for students to hear in their audio materials, which focus on providing comprehensible speech. “Many learners are accustomed to hearing a very careful, clear pronunciation of words, such as native speakers might use when talking very emphatically or saying words in isolation” (Rixon, 1986, p. 38). Therefore, this emphasis could result in students lacking the knowledge of connected speech and cause frustration in conversations with native speakers.

Instruction in connected speech has been well recognized as an effective way to help learners comprehend rapid speech better (J. D. Brown & Hilferty, 2006; Celce-Murcia et al., 1996; Matsuzawa, 2006). If a non-native speaker talks word by word, without linking, his or her speech may sound fragmented and unnatural and could exhaust the listener (H. Brown, 2001; Celce-Murcia et al., 1996). Constant practice of the essential features of connected speech in the target language is said to help non-native learners attain more native-like pronunciation and more understandable speech (Brown & Kondo-Brown, 2006). Hence, instruction in the features of connected speech will not only raise language learners’ awareness of the existence of these features but also help them advance their ability to use connected speech. As J. D. Brown (2006) asserts, it is crucial for learners to accommodate their registers and styles to the target language. To attain greater mastery of connected speech and a more native-like delivery, it is vital to understand and know how to use the features of speech.

English teachers may consider using video-based materials to enhance Taiwanese students’ speaking abilities. However, in the specific EFL context of Taiwan, no empirical studies have emerged on teaching connected speech by means of video-based treatments. In fact, connected speech remains an under-investigated area in Taiwan’s academia. The two most relevant Taiwanese studies about connected speech instruction over the last decade are Kuo et al. (2013) and D.-C. Hsu (2015), but neither of them records the use of video-based materials. Kuo et al. empirically examined the performance of three groups of college students: one taught with explicit connected speech-focused instruction, another with stress-focused instruction, and a third with no prosodic treatment. Their results showed that those who received connected speech instruction outperformed the other groups in rhythm. Similarly, D.-C. Hsu investigated the effects of connected speech instruction on the listening and speaking performance of junior high school students. His results show that the participants who were taught about connected speech improved their listening abilities more than did their counterparts who received no treatment of this kind. Both studies confirm the value of connected speech instruction for Taiwanese EFL students, but more studies are needed to shed light on the field, especially those employing video-based treatment. The present study thus aimed to contribute to the field by addressing the pedagogical effects of watching videos featuring connected speech instruction for Taiwanese EFL college students.

Research Questions

Research Question 1: Does video instruction enhance Taiwanese college students’ attitudes to learning English speaking and connected speech?

Research Question 2: Does video instruction improve Taiwanese college students’ speaking performance in terms of connected speech?

The Present Study

Participants

Recruited for the teaching experiment was a convenience sample from two intact Freshmen English Lab courses, which aimed at fostering students’ general English listening and speaking abilities, including pronunciation. Both classes were taught by the same teacher (one of the researchers) and met for 2 hr per week for 18 weeks. Initially, 93 Taiwanese freshmen consented to the experiment, agreeing to attend the treatment, take the relevant tests, and fill out questionnaires. Nevertheless, the data of only 48 students who completed all the requirements were included in the final analysis, 28 of them from the Information and Library Science department and 20 from the Spanish department. Those who failed to complete a pre-test, a post-test, or a questionnaire had their data excluded. The remaining participants consisted of 21 females and 27 males, ranging in age from 18 to 20 years, with an average age of around 19. They had learned English for about 10 years before the experiment. The level of their general English proficiency was somewhere between low-intermediate and intermediate (i.e., about CEFR A2-B1).

Treatment

A 7-week video experiment was applied in this study. While Table 1 summarizes each week’s topic, content, and the clip used, details of the experiment are elaborated below. First, the present researchers carefully selected suitable online clips that matched the topic of each week. All the clips were sourced from YouTube, one of the most popular online, free video platforms with modern students and educators (Gu et al., 2021; C. K. Hsu, 2015; Mayora, 2009; Sun & Yang, 2015). The clips chosen met a series of criteria. For example, they had to present the main features of connected speech for each week. The correctness of the content and material presented in each clip was also examined before use, to ensure that the pronunciation and connected speech presented in the video were accurate and clear. Next, the speaking speed of the clips was made appropriate for the target participants. In addition, to keep the attention of the participants, the selected clips were short.

Table 1.

The Teaching Unit of the Corresponding Weeks.

Week	Teaching content	Note
1	Unit 1: General Features Clip 1: Connected Speech & Linking (https://www.youtube.com/watch?app=desktop&v=gAHUTKm_1n0)	Week 1 focuses on the general features of connected speech, such as catenation and elision.
2	Unit 2: Connecting Past Tense –ed Clip 2: [t], [d] or [Id]? \| “-ed” Past Tense (https://www.youtube.com/watch?v=j32SurxnE4s&feature=youtu.be)	Week 2 addresses the pronunciation of the past tense –ed and then shows students how to link the various examples of –ed with the next words if they started with a vowel.
3 & 4	Unit 3: Three Sounds of Plural “s” Clip 3: 3 Sounds of the Plural “s” in English: [s], [z], or [ɪz] (https://www.youtube.com/watch?v=cWNW3-4Wpao&feature=youtu.be)	Weeks 3 and 4 talk about the different sounds of plural “s” and how students should link [s], [z] or [ɪz] with the next words if they started with a vowel.
5 & 6	Unit 4: [th] and [s] Sounds Clip 4: English Pronunciation [Th] & [s] (https://www.engvid.com/english-pronunciation-th-s/)	Weeks 5 and 6 discuss “th” and “s” sounds. Students also learn about linking them with the vowels that follow.
7 & 8	Practice & Review Clip 5: Master Spoken English—Connected Speech—Linking Practice (https://youtu.be/BEAMncAMOIU) YouTube Clip 6: Connected Speech & Linking: Overview (https://youtu.be/PpilegRXjRw) (Note. This particular clip has been removed because the associated YouTube account of the clip’s owner was terminated and is therefore no longer available.)	Weeks 7 and 8 are spent on exercises to enhance students’ pronunciation and speaking skills. This is also a review of the connected speech and linking mechanisms learned over the past weeks.

The clips finalized for the treatment contained various rules of connected speech for each corresponding week. Week 1 was about the general features of connected speech. At the same stage, the students were given an overview to show how sounds are linked in English. They were also taught how words ending in vowels or consonants were linked to the following word. Clip 1 shown in Week 1 featured these ideas. In Week 2, the participants learned through the teacher and Clip 2 about connecting past tense –ed, which can be pronounced as [t], [d], or [ɪd], to the word that follows. In Weeks 3 and 4, the students were first taught about the three sounds of the plural “s” in English: [s], [z], or [ɪz]. They then learned the way to connect these sounds to the words that follow them. Clip 3 featured these rules. In Weeks 5 and 6, the participants were shown Clip 4, which covered features of pronunciation that could easily be neglected by some Taiwanese students: the pronunciation of tapping sounds and the difference between the pronunciations of [th] and [s]. They also learned about linking these sounds, where appropriate, with the words that follow them. In the final two weeks (7 and 8), the teacher and the videos (Clips 5 and 6) helped the students to practice the points that they had learned about linking and connected speech. A review of the overall points also took place in the last two weeks.

In showing these videos in the classroom, the teacher adopted five elements of educational practice (Bax, 2011): access, participation and interaction, expert scaffolding, expert modeling, and challenge and contradiction. To be specific, in the access step, the teacher first offered the students some examples of pronunciation and connected speech that they might have heard of in their past learning experience, so as to help them recall their prior knowledge. Next, in the participation and interaction step, they were given warm-up questions about the target sounds and pronunciation rules and discussed them with their classmates. When they moved on to the expert scaffolding process, the teacher served as the expert who scaffolded their learning by checking, assisting them to practice, and giving feedback. Meanwhile, expert modeling through YouTube clips also provided expert examples. In this step, the students attentively watched twice, or three times if the students requested it, and carefully imitated the sounds to themselves. They were then asked to repeat what they had heard and given an explanation to understand the target pronunciations and connected speech. The teacher offered feedback and comment where appropriate to ensure that the students accurately followed the expert models. Afterwards, they practiced with their classmates by reading aloud the sounds that they had just learned and reciting relevant given texts. In the challenge and contradiction step, when a student was reading aloud, his or her group members listened attentively to check whether his or her connected speech was appropriate, and gave feedback. Each student in the same group took turns until every group member had finished practicing. Before the class dispersed, the teacher asked for volunteers or randomly picked one student from each group to read aloud a new text that contained the target connected speech, so as to check the students’ final speaking performance. 
The teacher then made an overall final comment on the students’ speaking.

Data Collection Procedure and Instruments

This section starts by describing the procedure of implementing the instruments and then describes their content and quality in detail. To begin with, after consenting to the study, the participants first finished a questionnaire about their learning attitude on entry. They then completed a reading aloud pre-test (Pre-test A) which had two texts. Then, the teacher commenced the 7-week experimental treatment. When the treatment was concluded, the same learning attitude questionnaire was administered again as an exit questionnaire. Then, the participants completed two different post-tests. One was Post-test A, which had exactly the same texts as Pre-test A had. The other was Post-test B; it had two different texts from those of Pre-test A. The reasons for and details of the design of the instruments are addressed below.

Questionnaire

The self-developed questionnaire (see Appendix A), designed with a 5-point Likert-type scale, has 20 items that examined both students’ general attitudes to English speaking/learning and their specific perceptions of learning pronunciation and connected speech. These were investigated together because, as discussed in the introduction, Chinese-speaking EFL learners often find pronouncing such sounds as connected speech challenging (Y. H. Lai, 2010; Liang, 2015; Wong et al., 2019), which is likely to affect their receptivity to learning English speaking in general. A questionnaire that investigated how the participants liked learning about English speaking in general and pronunciation and connected speech in particular should thus best reflect the pedagogical effects of a video featuring connected speech on student speakers’ learning attitudes as a whole.

To safeguard the quality of the questionnaire, an exploratory factor analysis (EFA) was conducted. This was done by means of a pilot study that involved 88 other participants, a sample size that complies with the suggested ratio for piloting a questionnaire (Cattell, 1978), namely, at least three participants per questionnaire item. As Table 2 shows, the KMO (Kaiser–Meyer–Olkin) value was high (KMO = .934) and Bartlett’s Test of Sphericity was significant (χ² = 1948.509, p < .001), indicating that the data collected from the pilot study were suitable for factor analysis. The EFA results further show that the questionnaire as a whole was highly valid (78.36% of variance explained) and reliable (Cronbach’s α = .97). It also had three underlying components, all of good quality: self-efficacy in speaking English (Items 1–12) (27.89% of variance explained, Cronbach’s α = .91); learning preference (Items 13–16) (26.53% of variance explained, Cronbach’s α = .90); and learning motivation to speak English (Items 17–20) (23.93% of variance explained, Cronbach’s α = .87). The final version was first administered before the treatment as an entry questionnaire and then again as an exit questionnaire.

Table 2.

KMO Value and Bartlett’s Test.

KMO measure of sampling adequacy		.934
Bartlett’s Test of Sphericity	Approx. χ²	1948.509
	df	190
	Sig.	.000

Note. KMO = Kaiser–Meyer–Olkin.
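The reliability coefficients reported above for the questionnaire can be illustrated with a minimal sketch of Cronbach’s α. This is not the authors’ SPSS procedure; the function name `cronbach_alpha` and the small response matrix are hypothetical, and the sketch assumes the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of total scores), computed with sample variances.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(item) for item in zip(*responses)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical 5-point Likert answers from four respondents to three items
# (illustrative only; not data from the study).
pilot = [
    [4, 5, 4],
    [3, 3, 4],
    [5, 5, 5],
    [2, 3, 2],
]
print(round(cronbach_alpha(pilot), 2))
```

Values closer to 1 indicate that the items vary together, which is how the coefficients of .87 to .97 reported for the questionnaire and its subscales would be read.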

GEPT Reading Aloud sections

The GEPT (General English Proficiency Test) is a five-level criterion-referenced testing system offered in Taiwan. The function of the GEPT is to assess EFL learners’ proficiency in general English, with the aim of advocating the practice of lifelong learning and fostering the use of the communicative approach in the field of English learning and teaching. The intermediate level of GEPT, the level of basic English communicators who can handle most conversations on everyday topics (D. Huang, 2017), was used in this study in view of the participants’ proficiency (i.e., between low-intermediate and intermediate levels). Note that only the Reading Aloud section of the GEPT intermediate level speaking tests was applied in this study given that its goal was to assess the participants’ ability to reproduce connected speech. Participants were requested to finish reading all the tests (i.e., Pre-test A, Post-test A, and Post-test B) (see Appendix B) within 2 min. Each test had two short Reading Aloud sections from two different GEPT intermediate level speaking tests. Pre-test A and Post-test A shared exactly the same texts, so as to carefully assess whether the participants had improved their speaking in terms of the same connected speech contained in them. In addition to Post-test A, the participants also took Post-test B, which contained totally different texts from the pre-test, so as to further examine whether they could effectively apply what they learned to different texts.

Raters and Rating Criteria

Two raters were involved in assessing the student speakers’ performance. One was the teacher of the course, and the other was an experienced college lecturer who also taught English speaking and listening at the same experimental site. Before assessment, they had consulted an expert in the field and a native speaker of English regarding the words in the test paragraphs that would naturally be connected in speech. The two consultants marked all the connected speech in the tests, thus providing rating criteria for the raters to follow. For example, in Pre-test A, “spent two” and “at the end” were marked as geminations of /t/, “last evening” as an elision of /t/, and “at a” and “get a” as catenation. Following the above criteria, the raters then marked five randomly selected participants’ audio files of the tests together so as to calibrate their rating. Afterwards, they started marking student speakers’ performance on all the tests. The accuracy rate (between 0% and 100%) of the connected speech by each student was finally calculated for data analysis. It should also be noted that an intra-class correlation (ICC) analysis was performed; the result showed a statistically significant agreement between the raters with respect to their accuracy rates for the tests (ICC = .936, p < .001). Finally, their marking was averaged for data analysis.
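The scoring step described above can be sketched as follows. The source does not spell out the exact computation, so this sketch assumes that each rater’s accuracy rate is the share of consultant-marked linking sites that the student produced correctly, and that the two raters’ rates are then averaged. The function names and the example counts are hypothetical.

```python
def accuracy_rate(correct_links, marked_links):
    """Share of marked linking sites produced correctly (0.0 to 1.0)."""
    if marked_links <= 0:
        raise ValueError("marked_links must be positive")
    return correct_links / marked_links


def averaged_accuracy(correct_rater_1, correct_rater_2, marked_links):
    """Mean of the two raters' accuracy rates for one student on one test."""
    rate_1 = accuracy_rate(correct_rater_1, marked_links)
    rate_2 = accuracy_rate(correct_rater_2, marked_links)
    return (rate_1 + rate_2) / 2


# Hypothetical example: 10 marked linking sites; rater 1 credits 3, rater 2 credits 2.
score = averaged_accuracy(3, 2, 10)
print(f"{score:.0%}")
```

Averaging the two raters in this way yields one accuracy rate per student per test, which is the unit entered into the paired t-tests.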

Data Analysis

To address the study’s objective, the data collected were quantitatively analyzed using IBM SPSS Statistics 23. First, a set of paired-sample t-tests was performed on the participants’ scores on the entry and exit questionnaires. This helps to answer Research Question 1 about whether or not the participants improved their learning attitudes to English speaking. Then, another set of paired-sample t-tests was performed on the participants’ scores for the speaking tests. Bar charts were also created where appropriate to present the results of the descriptive analysis, with the aim of illustrating any improvement between the speaking tests. These results together help to answer Research Question 2, which asked whether the participants made significant gains in their performance of connected speech after completing the experiment. Finally, because SPSS does not generate effect sizes for t-tests by default, the present researchers calculated the effect size d for all the t-test results themselves, using the formula d = (M₁ – M₂)/SD (Plonsky & Oswald, 2014) to determine the magnitude of the effects observed. A d value of .40 indicates a small effect, .70 a medium one, and 1.00 a large one (Plonsky & Oswald, 2014).
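As a rough illustration of the effect-size step, the sketch below computes d from summary statistics. The source does not state which SD enters the formula; the sketch assumes the pooled standard deviation of the two score sets (the square root of the mean of the two variances), a choice that reproduces the d = .401 reported for the overall questionnaire. The function names, and the reading of the .40, .70, and 1.00 benchmarks as lower bounds, are illustrative assumptions rather than part of the original analysis.

```python
import math


def cohens_d(mean_1, sd_1, mean_2, sd_2):
    """Effect size d = (M2 - M1) / SD, with SD assumed to be the pooled SD."""
    pooled_sd = math.sqrt((sd_1 ** 2 + sd_2 ** 2) / 2)
    return (mean_2 - mean_1) / pooled_sd


def effect_label(d):
    """Label |d| against the .40 / .70 / 1.00 benchmarks (treated as lower bounds)."""
    size = abs(d)
    if size >= 1.00:
        return "large"
    if size >= 0.70:
        return "medium"
    if size >= 0.40:
        return "small"
    return "below small"


# Overall questionnaire: entry M = 3.21, SD = .63; exit M = 3.44, SD = .51.
d = cohens_d(3.21, 0.63, 3.44, 0.51)
print(round(d, 3), effect_label(d))  # 0.401 small
```

Working from summary statistics keeps the sketch independent of the raw scores, which are not available in the text.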

Results

This section first presents the results of the overall questionnaires and each questionnaire dimension. It then reports on the results of the connected speech pre- and post-tests.

Results of the Questionnaire as a Whole

A paired sample t-test was conducted to compare the scores of the entry and exit questionnaires and check whether the participants had developed different learning attitudes to connected speech after the 7-week experiment. Overall, Table 3 shows a statistically significant difference, t(47) = −3.22, p = .002, between the entry scores (M = 3.21, SD = .63) and the exit scores (M = 3.44, SD = .51), with a small effect (d = .401). This means that the implementation of the 7-week video instruction significantly increased the participants’ positive perception of learning spoken English.

Table 3.

Paired Sample t-Test for the Entry and Exit English Speaking Questionnaires.

Questionnaire	N	M	SD	df	t	p	Effect size (d)
Overall entry scores	48	3.21	.63	47	−3.22	.002	.401
Overall exit scores	48	3.44	.51	47	−3.22	.002	.401

Results of the Questionnaire Subscales

According to Table 4, on the dimension of self-efficacy, there was a statistically significant difference between the entry scores (M = 3.13, SD = .71) and the exit scores (M = 3.38, SD = .52), t(47) = −3.41, p = .001, with a small effect (d = .401). Although no significant difference was found for preference (entry: M = 3.31, SD = .79; exit: M = 3.41, SD = .71), t(47) = −1.29, p = .204, a statistically significant difference, t(47) = −2.80, p = .007, d = .450, was found for motivation (entry: M = 3.38, SD = .57; exit: M = 3.67, SD = .71). Altogether, these results indicate that the experiment helped learners make significant gains in terms of self-efficacy and motivation in speaking English.

Table 4.

Paired t-Tests for the Subscales of the English Speaking Questionnaire.

Subscales	Item	N	M	SD	df	t	p	Effect size (d)
Self-efficacy	Entry	48	3.13	.71	47	−3.41	.001	.401
Self-efficacy	Exit	48	3.38	.52	47	−3.41	.001	.401
Preference	Entry	48	3.31	.79	47	−1.29	.204	.133
Preference	Exit	48	3.41	.71	47	−1.29	.204	.133
Motivation	Entry	48	3.38	.57	47	−2.80	.007	.450
Motivation	Exit	48	3.67	.71	47	−2.80	.007	.450

Results of Pre-Test A and Post-Test A

According to Table 5, in Pre-test A, the participants achieved 24% accuracy in word linking overall. The accuracy rate increased to 40% in Post-test A (a rise of 16 percentage points). This increase was statistically significant: the difference between Pre-test A (M = .24, SD = .16) and Post-test A (M = .39, SD = .20) yielded t(47) = −6.47, p < .001, with a medium-sized effect (d = .828). The results suggest that after the experiment the participants had indeed significantly improved in terms of their overall performance in connected speech.

Table 5.

Paired t-Tests for Overall Word Linking in Pre-Test A and Post-Test A.

Tests	Accuracy rate	N	M	SD	df	t	p	d
Pre-test A	24%	48	.24	.16	47	−6.47	.000	.828
Post-test A	40%	48	.39	.20	47	−6.47	.000	.828

Figure 1 presents a detailed analysis by illustrating the accuracy rates of each type of connected speech taught in this experiment. Notably, the participants showed especially great improvement in the linking of [k] and [a], with an increase of 31 percentage points. However, the participants in this study improved less with regard to linking [f] to sounds such as [a], [o], and [u]; they achieved only 19% accuracy in Pre-test A and 29% in Post-test A, a gain of 10 percentage points.

Figure 1.

Accuracy rate of word linking in Pre-test A and Post-test A.

Results of Pre-Test A and Post-Test B

In this section, only the overlapping linking sounds of Pre-test A and Post-test B were examined to determine whether the students were able to apply the learned connected speech skills in a different speaking task, such as that of Post-test B. The results were positive, with an overall increase of 28 percentage points (Table 6). The paired t-test further showed a statistically significant difference between Pre-test A (M = .19, SD = .15) and Post-test B (M = .47, SD = .23), t(47) = −9.95, p < .001, with a large effect (d = 1.44). This suggests that after the experiment the participants successfully applied their learned connected-speech skills when reading different texts in Post-test B.

Table 6.

Paired t-Tests for the Overlapped Linking Sounds in Pre-Test A and Post-Test B.

Tests	Accuracy rate	N	M	SD	df	t	p	d
Pre-test A	19%	48	.19	.15	47	−9.95	.000	1.44
Post-test B	47%	48	.47	.23	47	−9.95	.000	1.44

Presented in Figure 2 is the specific linking of the same sound pairs (i.e., [t]-[t], [t]-[a], [t]-[i], [s]-[a], and [s]-[o]) examined in both Pre-test A and Post-test B. As shown, the participants made observable improvements in Post-test B, showing that they were capable of applying the learned speaking skills to different texts. With regard to accuracy, a 20% increase was found in omitting the same consonant [t]-[t]; a 67% increase was gained in connecting [t] and [i]; a 41% increase was gained in [s]-[a]; and a 23% increase was gained in [s]-[o]. However, no improvement was made by the participants in the linking of [t] and [a]; in fact, it declined by 10%.

Figure 2.

Accuracy rate of specific linking sounds in Pre-test A and Post-test B.

Finally, this section shows the results of students’ performance by looking at all the linking sounds between the tests. As Table 7 shows, they made an overall improvement of 8 percentage points from Pre-test A to Post-test B. The difference between the tests was statistically significant (Pre-test A: M = .24, SD = .16; Post-test B: M = .32, SD = .18), t(47) = −3.73, p = .001, with a small effect (d = .469). The results further support the effectiveness of video instruction for the learning of connected speech.

Table 7.

Paired t-Tests for All the Linking Sounds Between Pre-Test A and Post-Test B.

Tests	Accuracy rate	N	M	SD	df	t	p	d
Pre-test A	24%	48	.24	.16	47	−3.73	.001	.469
Post-test B	32%	48	.32	.18	47	−3.73	.001	.469
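
The overall gains reported for Tables 6 and 7 can be read in two ways: as a difference between two accuracy rates (percentage points) or as a change relative to the pre-test rate. The short sketch below, which assumes the reported increases are simple differences between the tabulated accuracy rates, shows both readings; the helper name `gain_summary` is hypothetical.

```python
def gain_summary(pre, post):
    """Return (percentage-point gain, relative gain in percent), both rounded."""
    points = (post - pre) * 100          # difference between the two rates
    relative = (post - pre) / pre * 100  # change relative to the pre-test rate
    return round(points), round(relative)


print(gain_summary(0.24, 0.32))  # Table 7 rates: (8, 33)
print(gain_summary(0.19, 0.47))  # Table 6 rates: (28, 147)
```

On this reading, the 8% and 28% figures in the text correspond to percentage-point gains rather than relative gains.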

Discussion and Conclusion

The purpose of the current study was to investigate whether the use of video instruction could constitute an effective way of helping EFL students learn to properly connect words in speaking English. According to the results of the data analyses shown above, progress in articulating connected speech by learners was observed after the 7-week experiment in video instruction. The same positive results were also shown in the participants’ perception of learning spoken English. The findings merit discussion in the field.

First, the results of the questionnaire revealed that students’ perceived self-efficacy and motivation in learning English speaking skills significantly increased after the experiment, which not only indicates that the treatment was effective but also lends support to the pedagogical practice of video-enhanced instruction, as described in previous studies (Galbraith & Rodriguez, 2018; Hişmanoğlu, 2006; Ketcha, 2019; Y. J. Lin & Wang, 2018; Peters & Webb, 2018; Weyers, 1999). In particular, this finding relates to the results obtained by Herron (1994) and C. K. Hsu (2015). The former reported that video materials can help improve comprehension and that students usually consider them more entertaining and enjoyable, which may lead to better information retention. The latter observed improved learning motivation in students who learned through video-aided instruction. Together, these results support Bax’s (2011) notion of the normalization of technology, suggesting that multimedia technology, such as the online clips used in the present study, can enhance learning quality.

Nevertheless, although positive results can be found in students’ perception of self-efficacy and motivation, no significant gain was shown in students’ preference as far as learning English speaking skills was concerned. This may be attributed to the fact that Taiwan offers mostly EFL contexts, so it is difficult to create ample chances for learners to speak or use English outside of the classroom. Such an unpromising environment may have caused students to respond unfavorably to certain subscale questions assessing their preference, such as “I like speaking English” (Item 13) or “I like to seek opportunities to practice my English speaking skills in my everyday life” (Item 15).

In addition, according to previous studies, immersing language learners in multimedia instructional environments is regarded as a highly beneficial learning tactic (Mayer, 2005; Plass & Jones, 2005). The findings of the present study are consistent with this view and further correspond to those of Watkins and Wilkins (2011), whose findings endorse the effectiveness of conducting video instruction in second-language classrooms. In the present study, the participants’ accuracy rates in reading aloud increased significantly after the experiment, which means that they were able to successfully connect more words. This lends support to the findings of Kuo et al. (2013) and D.-C. Hsu (2015), in that connected speech instruction is feasible and can be effective with Taiwanese EFL learners. The present finding further demonstrates that integrating video instruction in pronouncing connected speech into EFL speaking classes can also be pedagogically effective.

In reading aloud, the most frequently achieved linking sound is a consonant link to the same or a similar consonant. In other words, when a word ends in a consonant and the next word starts with the same or a similar consonant, the consonants are linked together and the consonant sound has to be pronounced once only. From what the researchers of the study observed, many Taiwanese students tended to skip the final consonant sounds in words, which made it remarkably easy for them to achieve the “consonant + consonant” linking, such as [t]-[t].

However, it should be pointed out that despite the overall improvement in the other linking sounds, no improvement was found in the linking of [t] and [a]. Judging from the participants’ recordings, this is possibly because some of the participants failed to recognize that certain words should be treated as a chunk when reading aloud, for example, “at a public square,” “get a seat,” and “bought an umbrella” in the tests. This assumption is based on the observation that quite a few of the participants separated the words in these phrases when reading. For example, some uttered “at,” paused briefly, and then read “a public square” in one breath. Likewise, others read “get” and then “a seat.” Still others said “bought” and then “an umbrella.” The reasons why they might tend to identify some chunks more readily than others are unclear, but this may be an interesting line of inquiry for future researchers examining students’ performance of certain connected speech sounds such as [t] and [a] or other similar sounds.

Even so, after the experiment, students gained more confidence in speaking English in general. They were also motivated to attend English classes to learn more about English speaking skills and to imitate proper intonation and pronunciation in speaking English. However, such improved attitudes and learning activity seem to have been confined to class time, because for these students the classroom is the main location for learning English, where they have opportunities to speak English and where they take tests. In light of this, teachers may consider creating opportunities for students to use English in everyday life, which would enable them to apply outside the classroom the skills they have learned in it.

The findings of the current study suggest a positive answer to the first research question, “Does video instruction enhance Taiwanese college students’ attitudes to learning connected speech?” Indeed, video instruction can help students gain self-efficacy and motivation in learning connected speech. The results also indicate a positive answer to the second research question: “Is video instruction an effective way of improving Taiwanese college students’ performance in connected speech?” This suggests that video instruction is an effective way for Taiwanese college students to learn connected speech, and one which has positive effects on improving connected speech competence.

According to Griffee (1995, p. 28), “connected speech is the natural way we speak, linking together and emphasizing certain words, rather than each word standing alone.” Connected speech is considered an integral part of language. However, some language learners may not be aware that connected speech occurs in every language, including their own. Therefore, it is essential for language teachers to address the features of connected speech in their teaching and raise learners’ awareness of its existence in the target language to prepare them to achieve fluency in speaking it. Constantly practicing the essential features of connected speech in the target language is said to help non-native language learners attain more native-like pronunciation and more understandable speech (Brown & Kondo-Brown, 2006).

While the study design itself was valid and enriched the current knowledge of the field, future researchers may consider addressing the following issues to make further contributions. First, student speakers’ long-term ability remains uncertain. Future researchers may add a delayed post-test to evaluate whether students have internalized the knowledge of connected speech that they learned. Second, in addition to the implementation of a questionnaire, conducting interviews to gain in-depth perspectives on the learning of connected speech may shed a different light. Third, it should be acknowledged that the present study had no control group, so its findings should be treated with caution. Future studies may contribute by comparing the effects of a video featuring connected speech instruction with those of a conventional treatment. Similarly, whereas the present study looked at a sample from only one experimental site, future researchers may consider examining participants from diverse settings so as to obtain a more comprehensive view of the pedagogical effects of learning with a video featuring connected speech instruction. Last but not least, it should be noted that in the treatment, the teacher made an overall final comment on her students’ speaking. Although this seems to be a common pedagogical practice in most language classrooms, future researchers may wish to consider the possible effects that this particular teacher behavior may have on student speakers’ improvement.

Finally, the findings provide some suggestions for language teachers who wish to enhance their students’ speaking competence in English, especially in the aspect of connected speech, which serves as one of the fundamental elements in speaking a language. It is advised that teachers integrate video instruction in connected speech into their curriculum design, because video is very entertaining and easily arouses learners’ interest. Instruction in connected speech can not only help learners gain fluency in speaking the target language but also provide them with listening skills and better understanding.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research and/or authorship of this article: This article was written with funding support from Taiwan’s Ministry of Science and Technology (MOST 108-2410-H-032-027; MOST 109-2410-H-032-059; MOST 109-2410-H-032-063).

ORCID iD

Ming Huei Lin

References

Abrams, Z. I. (2002). Surfing to cross-cultural awareness: Using Internet-mediated projects to explore cultural stereotypes. Foreign Language Annals, 35(2), 141–160. https://doi.org/10.1111/j.1944-9720.2002.tb03151.x

Al-Jarf (2004). The effects of Web-based learning on struggling EFL college writers. Foreign Language Annals, 37(1), 49–57. https://doi.org/10.1111/j.1944-9720.2004.tb02172.x

Bax (2011). Normalisation revisited: The effective use of technology in language education. International Journal of Computer-Assisted Language Learning and Teaching (IJCALLT), 1(2), 1–15.

Brown (1990). Listening to spoken English (2nd ed.). Longman.

Brown (2001). Teaching by principles: An interactive approach to language pedagogy (2nd ed.). Longman.

Brown, J. D. (2006). Authentic communication: Whyzit importan’ta teach reduced forms. In Newfields, I. Gledall, M. Kawate-Mierzejewska, Y. Ishida, M. Chapman, & P. Ross (Eds.), Authentic Communication: Proceedings of the 5th Annual JALT Pan-SIG Conference (pp. 13–24). University of Hawai’i at Manoa.

Brown, J. D., & Hilferty (2006). The effectiveness of teaching reduced forms for listening comprehension. In J. D. Brown & Kondo-Brown (Eds.), Perspectives on teaching connected speech to second language speakers (pp. 51–58). National Foreign Language Resource Center, University of Hawai‘i.

Brown, J. D., & Kondo-Brown (Eds.). (2006). Perspectives on teaching connected speech to second language speakers. University of Hawai‘i, National Foreign Language Resource Center.

Canning-Wilson (2000). Practical aspects of using video in the foreign language classroom. The Internet TESL Journal, 6(11). http://iteslj.org/Articles/Canning-Video.html

Cattell, R. B. (1978). The scientific use of factor analysis in behavioral and life sciences. Plenum Press.

Cauldwell (2002). Streaming speech: Listening and pronunciation for advanced learners of English. In Teeler (Ed.), Talking computers (pp. 18–22). International Association of Teachers of English as a Foreign Language.

Celce-Murcia, Brinton, & Goodwin (1996). Teaching pronunciation: A reference for teachers of English to speakers of other languages. Cambridge University Press.

Chun, D. M. (2016). The role of technology in SLA research. Language Learning & Technology, 20(2), 98–115.

Davis, R. O., & Vincent (2019). Sometimes more is better: Agent gestures, procedural knowledge and the foreign language learner. British Journal of Educational Technology, 50, 3252–3263. https://doi.org/10.1111/bjet.12732

Field (2003). Promoting perception: Lexical segmentation in L2 listening. ELT Journal, 57(4), 325–334. https://doi.org/10.1093/elt/57.4.325

Galbraith & Rodriguez (2018). Student engagement and enjoyment of narratives: An empirical study of an authentic music video and a short teaching case. College Teaching, 66(4), 171–180. https://doi.org/10.1080/87567555.2018.1474334

Gilakjani, A. B. (2011). A study on the situation of pronunciation instruction in ESL/EFL classrooms. Journal of Studies in Education, 1(1), 1–15.

Griffee, D. T. (1995). Songs in action. Phoenix.

S.-H., Chang, Y.-S., Lee, J.-Y., & Lin, M. H. (2021). Broadcasting yourself via YouTube: Developing the speech of EFL students. TESOL Journal, 12(1), e00532. https://doi.org/10.1002/tesj.532

Hagen (2000). Sound advice: A basis for listening. Pearson Education.

Herron (1994). An investigation of the effectiveness of using an advance organizer to introduce video in the foreign language classroom. Modern Language Journal, 78, 190–198. https://doi.org/10.1111/j.1540-4781.1994.tb02032.x

Hişmanoğlu (2006). Current perspectives on pronunciation learning and teaching. Journal of Language and Linguistic Studies, 2(1), 101–110.

Hsu, C. K. (2015). Learning motivation and adaptive video caption filtering for EFL learners using handheld devices. ReCALL, 27(1), 84–103. https://doi.org/10.1017/S0958344014000214

Hsu, D.-C. (2015). The effect of explicit linking instruction on Taiwanese EFL junior high school students’ oral performance and phonological awareness of connected speech [Unpublished master’s thesis]. National Chung Cheng University.

Huang (2017, June 24–25). Exploring strategy use in L2 speaking assessment: The case of the GEPT intermediate level. Paper presented at the 19th Academic Forum on English Language Testing in Asia (AFELTA), Taipei. https://www.lttc.ntu.edu.tw/thesis.htm

Huang, H. C. (2015). From web-based readers to voice bloggers: EFL learners’ perspectives. Computer Assisted Language Learning, 28(2), 145–170. https://doi.org/10.1080/09588221.2013.803983

Ketcha, R. T. (2019). Varieties of English in Cameroon audio-visual materials: Cameroon audio-lects: An account of five major English media “audio-lects” in Cameroon. English Today, 35(1), 20–27. https://doi.org/10.1017/S0266078418000019

Kubler, C. C. (2018). Developing course materials for technology-mediated Chinese language learning. Innovation in Language Learning and Teaching, 12(1), 47–55. https://doi.org/10.1080/17501229.2018.1418626

Kuo, F. L., Ting, W. Y., Chiang, H. K., & Pierce (2013). Effectiveness of connected speech-focused instruction and stress-focused instruction on Taiwanese EFL learners’ speech intelligibility. SPECTRUM: NCUE Studies in Language, Literature, Translation, 11, 57–69.

Lai & Lyu (2018). Understanding the nature of learners’ out-of-class language learning experience with technology. Computer Assisted Language Learning, 31(1–2), 114–143. https://doi.org/10.1080/09588221.2017.1391293

Lai, Y. H. (2010). English vowel discrimination and assimilation by Chinese-speaking learners of English. Concentric: Studies in Linguistics, 36(2), 157–182.

Liang (2015). Chinese learners’ pronunciation problems and listening difficulties in English connected speech. Asian Social Science, 11(16), 98–106.

Lin, L.-F. (2010). A video-based CALL program for proficient and less-proficient L2 learners’ comprehension ability, incidental vocabulary acquisition. Educational Media International, 47(3), 199–216. https://doi.org/10.1080/09523987.2010.518812

Lin, L.-F. (2011). The video comprehension strategies of Chinese-speaking university students. Educational Computing Research, 45(3), 297–319. https://doi.org/10.2190/EC.45.3.c

Lin, Y. J., & Wang, H. C. (2018). Using enhanced OER videos to facilitate English L2 learners’ multicultural competence. Computers & Education, 125, 74–85. https://doi.org/10.1016/j.compedu.2018.06.005

Matsuzawa (2006). Comprehension of English reduced forms by Japanese business people and the effectiveness of instruction. In J. D. Brown & Kondo-Brown (Eds.), Perspectives on teaching connected speech to second language speakers (pp. 59–66). National Foreign Language Resource Center, University of Hawai‘i. https://doi.org/10.1017/CBO9780511816819.004

Mayer, R. E. (2005). Cognitive theory of multimedia learning. In R. E. Mayer (Ed.), The Cambridge handbook of multimedia learning (pp. 31–48). Cambridge University Press.

Mayora, C. A. (2009). Using YouTube to encourage authentic writing in EFL classrooms. TESL Reporter, 42(1), 1–12.

Mercer & Fisher (1997). The importance of talk. In Wegerif & Scrimshaw (Eds.), Computers and talk in the primary classroom (pp. 13–21). Multilingual Matters.

Meskill & Anthony (2005). Foreign language learning with CMC: Forms of online instructional discourse in a hybrid Russian class. System, 33(1), 89–105. https://doi.org/10.1016/j.system.2005.01.001

Peters & Webb (2018). Incidental vocabulary acquisition through viewing L2 television and factors that affect learning. Studies in Second Language Acquisition, 40(3), 551–577. https://doi.org/10.1017/S0272263117000407

Plass, J. L., & Jones, L. C. (2005). Multimedia learning in second language acquisition. In Mayer (Ed.), The Cambridge handbook of multimedia learning (pp. 467–488). Cambridge University Press. https://doi.org/10.1017/CBO9780511816819.030

Plonsky & Oswald, F. L. (2014). How big is “big”? Interpreting effect sizes in L2 research. Language Learning, 64(4), 878–912.

Rixon (1986). Developing listening skills. Macmillan.

Rosa (2002). Don’cha know? A survey of ESL teachers’ perspectives on reduced forms instruction. University of Hawai’i Second Language Studies Paper, 21(1), 49–78.

Salaberry (2001). The use of technology for second language learning and teaching: A retrospective. Modern Language Journal, 85(1), 41–56. https://doi.org/10.1111/0026-7902.00096

Sun, Y. C., & Yang, F. Y. (2015). I help, therefore, I learn: Service learning on Web 2.0 in an EFL speaking class. Computer Assisted Language Learning, 28(3), 202–219.

Warschauer (2000). Electronic literacies: Language, culture, and power in online education. Lawrence Erlbaum.

Watkins & Wilkins (2011). Using YouTube in the EFL classroom. Language Education in Asia, 2(1), 113–119. https://doi.org/10.5746/LEiA/11/V2/I1/A09/Watkins_Wilkins

Weinstein (2001). Whaddaya say? Guided practice in relaxed speech. Longman.

Weyers (1999). The effect of authentic video on communicative competence. Modern Language Journal, 83(3), 339–353. https://doi.org/10.1111/0026-7902.00026

Wong, S. W., Dealey, Leung, V. W., & Mok, P. P. (2019). Production of English connected speech processes: An assessment of Cantonese ESL learners’ difficulties obtaining native-like speech. The Language Learning Journal. Advance online publication. https://doi.org/10.1080/09571736.2019.1642372

W. C. V., Hsieh, J. S. C., & Yang, J. C. (2017). Creating an online learning community in a flipped classroom to enhance EFL learners’ oral proficiency. Journal of Educational Technology & Society, 20(2), 142–157.