Abstract
At present, intelligent oral English training systems mostly perform discrimination directly on input speech, an approach with a high misjudgment rate that cannot deliver targeted pronunciation correction. This study addresses two issues in oral English training: insufficient generalization of speech recognition under stress interference (e.g., a WER of 13.38% on the LibriSpeech test set) and low accuracy of speech-to-animation synchronization. It proposes a novel diarticulation model that quantifies the coarticulation effect through consonant-vowel and vowel-vowel visual weight functions and optimizes mouth-shape parameters through a dynamic feedback mechanism. Experimental results show a significant improvement in recognition performance, with an F1 score of 0.91 and a character error rate (CER) of 6–8.5% on the domain-matched dataset. The pronunciation-correction effect is also strong: the vowel error rate falls by 43% (23.1% → 13.2%) and consonant-linking accuracy improves by 38.5%. In addition, synchronization delay is reduced by 15.3% relative to PhoneBERT (p < 0.001), with a DTW distance of only 0.18. The system thus mitigates the speech-recognition generalization problem through data-adaptive training and dual pronunciation modeling, while its synchronization accuracy and real-time performance (48 FPS) reach a practical level, providing a reliable technical solution for intelligent oral training.
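The synchronization quality reported above is scored with a DTW distance between parameter trajectories. As a minimal illustrative sketch (not the paper's implementation), the standard DTW recurrence over two per-frame mouth-shape parameter sequences looks like this; the sequence values and function name are assumptions for illustration:

```python
import numpy as np

def dtw_distance(a, b):
    """Classic dynamic-time-warping distance between two 1-D
    sequences (e.g. per-frame mouth-opening parameters)."""
    n, m = len(a), len(b)
    # D[i, j] = minimal accumulated cost aligning a[:i] with b[:j]
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i, j] = cost + min(D[i - 1, j],      # insertion
                                 D[i, j - 1],      # deletion
                                 D[i - 1, j - 1])  # match
    return float(D[n, m])

# A pure time-shift is absorbed by the warping path, so the distance stays 0.
ref = [0.0, 0.2, 0.8, 0.4, 0.1]
shifted = [0.0, 0.0, 0.2, 0.8, 0.4, 0.1]
print(dtw_distance(ref, ref))      # 0.0
print(dtw_distance(ref, shifted))  # 0.0
```

A small DTW distance between the synthesized and reference trajectories therefore indicates that the animation tracks the audio closely even under local timing jitter.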
