Abstract
The proliferation of online hate speech, particularly against women, has become a pressing problem in digital communication, especially in low-resource languages such as Malayalam and in code-mixed settings where English is blended with other languages. This work examines the efficacy of synthetic data augmentation techniques, namely Machine Translation (MT), Masked Language Modeling (MLM), and Few-Shot Learning (FSL), in improving hate speech detection in Malayalam-English (Manglish) social media text. We apply these three methods to enhance transformer-based models such as mBERT, BERT, and IndicBERT. Our experiments show substantial gains in classification performance: for example, mBERT achieved an F1-score of 86.42% with augmented data, compared with 81.24% using real data alone. Explainability analysis with LIME indicates that accurate detection depends on contextual cues rather than offensive words in isolation. Synthetic data also improves fairness by reducing both false positives and false negatives, and improves generalization by exposing models to a wider range of code-mixed expressions. While effective, the approach has limitations: it may not transfer readily to other code-mixed languages or domains, and synthetic data generation raises ethical concerns. The results have practical implications for deploying fairness-aware, transparent, and robust hate speech detection systems on multilingual social media platforms. To our knowledge, this is the first study to investigate synergistic synthetic data augmentation for code-mixed hate speech detection with the goal of reducing online harassment of women.
Keywords
