Abstract
Sarcasm detection is becoming increasingly important owing to applications such as online brand management, online market research, human-machine interaction and the detection of cyberbullying. To improve the performance of sarcasm detection on text, a novel model is proposed that applies transformer-style parallel attention heads at multiple levels of granularity over a Bidirectional Gated Recurrent Unit (Bi-GRU). Entropy regularization is applied to overcome attention collapse, distributing attention more evenly and mitigating its tendency to over-emphasize particular tokens. Early stopping and learning-rate scheduling are used at different stages of implementation and training to optimize performance and resource utilization. The model is evaluated on two datasets using four different sets of embeddings, namely Global Vectors (GloVe), FastText, Bidirectional Encoder Representations from Transformers (BERT), and an ensemble embedding obtained by concatenating the aforementioned embeddings and reducing the result with Latent Semantic Analysis (LSA). It records maximum accuracies of 93.64% and 81.61% on News Headlines and SARC respectively, with FastText outperforming the other embeddings. The stronger results with FastText demonstrate the model's ability to learn well from simpler representations compared with the more sophisticated, context-dependent patterns captured by the resource-intensive BERT. Attention heatmaps are plotted to illustrate the interpretability of the model. To validate the model's effectiveness in real-world settings, a robustness analysis on the News Headlines dataset is performed using three adversarial testing techniques, namely synonym replacement, word dropout and character swap, yielding accuracies of 86.12%, 84.16% and 83.21% respectively.
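The core architecture described above can be illustrated with a short sketch. The following is a minimal, illustrative implementation, not the authors' code: transformer-style parallel attention heads over a Bi-GRU encoder, with an entropy term on the attention weights to discourage attention collapse. It assumes PyTorch, and the layer sizes, head count, class count and regularization weight are hypothetical placeholders rather than the paper's reported settings.

```python
# Minimal sketch: parallel attention heads over a Bi-GRU, with an entropy
# term on the attention distributions (all hyperparameters are assumptions).
import torch
import torch.nn as nn
import torch.nn.functional as F

class BiGRUMultiHeadAttention(nn.Module):
    def __init__(self, embed_dim=300, hidden_dim=128, num_heads=4, num_classes=2):
        super().__init__()
        self.bigru = nn.GRU(embed_dim, hidden_dim,
                            batch_first=True, bidirectional=True)
        # One learned query vector per head, attending over the Bi-GRU states.
        self.queries = nn.Parameter(torch.randn(num_heads, 2 * hidden_dim))
        self.classifier = nn.Linear(num_heads * 2 * hidden_dim, num_classes)

    def forward(self, x):
        # x: (batch, seq_len, embed_dim) pre-computed word embeddings
        # (e.g. GloVe, FastText or BERT vectors, as in the paper).
        states, _ = self.bigru(x)                      # (B, T, 2H)
        scores = torch.einsum('btd,hd->bht', states, self.queries)
        attn = F.softmax(scores, dim=-1)               # (B, heads, T)
        # Entropy of each head's attention distribution; rewarding it in the
        # loss spreads attention instead of collapsing onto a few tokens.
        entropy = -(attn * torch.log(attn + 1e-9)).sum(-1).mean()
        context = torch.einsum('bht,btd->bhd', attn, states)
        logits = self.classifier(context.flatten(1))
        return logits, entropy

# Training objective sketch: cross-entropy minus a small entropy bonus
# (the weight 0.01 is an assumed hyperparameter, not taken from the paper).
# loss = F.cross_entropy(logits, labels) - 0.01 * entropy
```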
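The three adversarial perturbations used in the robustness analysis can likewise be sketched. In this illustrative version, synonym replacement draws candidates from NLTK's WordNet and every perturbation rate is an assumption; the paper's exact procedures and rates may differ.

```python
# Minimal sketches of the three perturbations named in the abstract:
# synonym replacement, word dropout and character swap.
import random
from nltk.corpus import wordnet  # requires: nltk.download('wordnet')

def synonym_replacement(tokens, rate=0.1):
    out = list(tokens)
    for i, tok in enumerate(out):
        syns = {l.name().replace('_', ' ')
                for s in wordnet.synsets(tok) for l in s.lemmas()} - {tok}
        if syns and random.random() < rate:
            out[i] = random.choice(sorted(syns))
    return out

def word_dropout(tokens, rate=0.1):
    kept = [t for t in tokens if random.random() >= rate]
    return kept or list(tokens)  # never return an empty sentence

def character_swap(tokens, rate=0.1):
    out = []
    for tok in tokens:
        if len(tok) > 3 and random.random() < rate:
            i = random.randrange(1, len(tok) - 2)  # swap two inner characters
            tok = tok[:i] + tok[i + 1] + tok[i] + tok[i + 2:]
        out.append(tok)
    return out
```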
