Abstract
Current social media platforms face two major issues: declining credibility due to the proliferation of artificial intelligence–generated content (AIGC) and contamination by human-generated implicitly toxic content. We propose an end-to-end deep learning method for identifying both AIGC and implicitly toxic content. The method integrates semantic features with long-distance textual dependency features, thereby improving recognition accuracy. We constructed two datasets comprising 78,798 messages across 54 topics collected from social media platforms. Experimental results demonstrate that the proposed method identifies 98.25% of AIGC and 98.06% of human-generated implicitly toxic content. In identifying AIGC, its accuracy is 1.12% and 13.89% higher than that of ERNIE and GPTZero, respectively; in identifying human-generated implicitly toxic content, it outperforms BERT and XGBoost by 1.01% and 5.69%, respectively. Finally, we conducted an interpretability analysis of the model using the SHAP method to understand how it identifies AIGC and human-generated implicitly toxic content.
