Graph Data Augmentation for Graph Convolutional Networks Learning in Robust Mental Disorder Prediction with Limited and Noisy Labels

Abstract

Graph neural networks (GNNs) have shown impressive performance in a variety of biomedical tasks owing to their powerful graph representation capabilities. Despite this success, the data noise and data scarcity commonly encountered in real psychiatric disease prediction can degrade both the training and the predictions of graph learning models. At present, no existing work offers a satisfactory solution. Data augmentation, which lets limited data yield value comparable to a larger dataset without substantially increasing the amount of data, is considered a practical way to address noisy and scarce data. In this work, we propose a graph data augmentation method for handling noisy data and data scarcity in mental illness prediction. To mitigate the negative effects of label noise, we use edge predictors to optimize the graph topology by strengthening links between highly similar nodes and removing erroneous, noisy edges, and we improve model robustness by adding adversarial perturbations in the feature space. In addition, a confident self-checking mechanism yields accurate pseudolabels, which provide more supervision during model training and further reduce the effect of label noise. Extensive experiments on two real multimodal mental illness datasets show that the proposed approach achieves better performance. Ablation studies assess the effectiveness of each component. The experimental results validate the effectiveness and scalability of our framework for population-based disease prediction, even under challenging conditions of data noise and sparsity. The implementation code is publicly available at: https://github.com/jiachengpan98/GDA-GCN.
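As a rough illustration of two ideas named in the abstract, the sketch below shows threshold-based edge refinement and confidence-based pseudolabel selection in plain Python. It is not the authors' implementation: the function names `refine_edges` and `confident_pseudolabels`, the threshold values, and the toy inputs are all hypothetical, and the abstract does not specify how the edge predictor or the self-checking mechanism actually works. The adversarial feature perturbation step is not covered here.

```python
from typing import Dict, Sequence, Set, Tuple

Edge = Tuple[int, int]


def refine_edges(edges: Set[Edge], edge_prob: Dict[Edge, float],
                 add_thresh: float = 0.8, drop_thresh: float = 0.2) -> Set[Edge]:
    """Drop existing edges scored below drop_thresh and add non-edges
    scored above add_thresh. Thresholds are illustrative assumptions."""
    refined: Set[Edge] = set()
    for edge, p in edge_prob.items():
        if edge in edges:
            if p >= drop_thresh:   # keep edges the predictor still supports
                refined.add(edge)
        elif p > add_thresh:       # add confidently predicted missing links
            refined.add(edge)
    # Edges that the predictor did not score are left unchanged.
    refined |= {e for e in edges if e not in edge_prob}
    return refined


def confident_pseudolabels(probs: Sequence[Sequence[float]], labeled: Set[int],
                           conf_thresh: float = 0.9) -> Dict[int, int]:
    """Return {node: predicted class} for unlabeled nodes whose highest
    class probability reaches conf_thresh (an assumed cutoff)."""
    pseudo: Dict[int, int] = {}
    for node, p in enumerate(probs):
        if node in labeled:
            continue
        best = max(range(len(p)), key=lambda c: p[c])
        if p[best] >= conf_thresh:
            pseudo[node] = best
    return pseudo


if __name__ == "__main__":
    # Toy population graph: nodes are subjects, edges link similar subjects.
    edges = {(0, 1), (1, 2)}
    edge_prob = {(0, 1): 0.95, (1, 2): 0.05, (0, 2): 0.90}
    print(sorted(refine_edges(edges, edge_prob)))  # [(0, 1), (0, 2)]

    # Toy class probabilities for four subjects; subject 0 is already labeled.
    probs = [[0.60, 0.40], [0.97, 0.03], [0.08, 0.92], [0.55, 0.45]]
    print(confident_pseudolabels(probs, labeled={0}))  # {1: 0, 2: 1}
```

In this toy run, the low-scoring edge (1, 2) is removed, the high-scoring missing link (0, 2) is added, and only the two unlabeled subjects with confident predictions receive pseudolabels.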

Keywords

data augmentation, disease prediction, fMRI, graph neural network, pseudolabeling

