Abstract
Detecting mental illness from short social media posts is challenging because these texts are often brief, fragmented, and lack explicit descriptions of the user’s mental state. Prior studies using encoder-based models such as BERT show promise but struggle when key contextual information is missing. To address this, we propose a method that augments posts with interpretive sentences generated by MentaLLaMA-chat, a generative model specialized in mental health, and fine-tunes BERT on the augmented dataset. We curated 1,525 Japanese posts containing the word “mental” (in katakana) from X (formerly Twitter) and manually annotated them according to Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) criteria, labeling 557 posts as positive and 968 as negative. Our method improved recall by 2.4 percentage points over models trained on the original posts alone, while maintaining comparable accuracy and precision. Shapley Additive Explanations (SHAP) analysis revealed that tokens introduced by the interpretive sentences—including both negative and positive expressions—enhanced the model’s ability to identify mental-distress posts. These results demonstrate that generative-model-based text augmentation effectively provides additional context, enabling more accurate detection of mental illness indicators in short, ambiguous social media posts.