Abstract
Traditional research on preschool language development often fails to capture the complex nonlinear relationships and high-dimensional characteristics of language growth, leading to low prediction accuracy and poor cross-cultural applicability. This paper introduces a novel BERT (Bidirectional Encoder Representations from Transformers)-based model for predicting preschool language development and evaluates its cross-cultural effectiveness. Text data from preschool children’s language datasets spanning multiple cultural backgrounds are collected, cleaned, and preprocessed into suitable training samples, with particular attention to the distinctive grammatical structures and cultural expressions of each language to ensure compatibility with the model. The BERT model encodes the processed text, using its bidirectional self-attention mechanism to capture contextual information and generate the deep feature representations needed to characterize preschool language development; these representations combine grammatical and semantic features and serve as inputs to the subsequent prediction step. The pre-trained BERT model is fine-tuned with the Adam optimizer to improve prediction accuracy, and cross-validation together with hyperparameter tuning further strengthens its performance. Culturally specific annotations and vocabularies are incorporated so that the model predicts language development effectively across different regions. Experimental results show that the BERT model achieves an MAE (Mean Absolute Error) between 0.20 and 0.25, an MSE (Mean Squared Error) between 0.05 and 0.08, and an average R² of 0.84 across English, Chinese, Spanish, and Japanese. These results demonstrate the model’s high accuracy and strong cross-cultural stability in predicting preschool language development.
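To make the described pipeline concrete, the sketch below shows, in outline, how a pre-trained BERT encoder might be fine-tuned with the Adam optimizer to regress a continuous language-development score from transcribed child utterances. It is a minimal illustration under stated assumptions, not the authors' implementation: the multilingual checkpoint name, field names, sample utterances, and score scale are all hypothetical choices introduced for the example.

```python
# Minimal sketch (not the authors' code): fine-tune a pre-trained BERT encoder
# with Adam to regress a continuous language-development score from a transcript.
# Checkpoint name, sample texts, and score values are illustrative assumptions.
import torch
from torch import nn
from transformers import BertTokenizer, BertModel

class BertDevelopmentRegressor(nn.Module):
    def __init__(self, pretrained="bert-base-multilingual-cased"):
        super().__init__()
        self.encoder = BertModel.from_pretrained(pretrained)        # bidirectional self-attention encoder
        self.head = nn.Linear(self.encoder.config.hidden_size, 1)   # single continuous score

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        pooled = out.last_hidden_state[:, 0]          # [CLS] token as the deep feature representation
        return self.head(pooled).squeeze(-1)

tokenizer = BertTokenizer.from_pretrained("bert-base-multilingual-cased")
model = BertDevelopmentRegressor()
optimizer = torch.optim.Adam(model.parameters(), lr=2e-5)
loss_fn = nn.MSELoss()

# One illustrative training step on a toy batch of transcribed utterances.
texts = ["doggy go park", "quiero más leche por favor"]   # hypothetical child utterances
scores = torch.tensor([0.42, 0.61])                       # hypothetical development scores in [0, 1]
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

model.train()
pred = model(batch["input_ids"], batch["attention_mask"])
loss = loss_fn(pred, scores)
loss.backward()
optimizer.step()
optimizer.zero_grad()
```

In practice, the fine-tuning step would run over the full multilingual training set with cross-validation and hyperparameter tuning, as described in the abstract, rather than on a single toy batch.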
