Sage Journals: Discover world-class research

Abstract

Parkinson's disease (PD) is a neurodegenerative disorder of the brain that primarily affects motor function. Clinical challenges associated with this condition include accurately diagnosing patients in the early stages of the disease and predicting how the condition will progress. This project aims to enhance PD detection by integrating feature selection and classification using supervised learning techniques. Two publicly available datasets—the speech and PD classification datasets—are utilized to evaluate model performance across diverse features. The proposed work employs class balancing through the Synthetic Minority Oversampling Technique (SMOTE) to address the issue of class imbalance in this highly unbalanced dataset. Subsequently, the Relief algorithm is used for feature selection to identify the most relevant predictors. An ensemble of models is applied using the RF-XGBoost-KNN classifiers due to their superior accuracy compared to other classifier combinations. The RF-XGBoost-KNN model stack achieved classification accuracies of 94.56% and 93.53% for the PD speech dataset and Parkinson's Disease Classification Dataset, respectively, demonstrating its potential as a robust tool for early and accurate PD diagnosis.

Keywords

machine learning relief random forest classifier classification feature selection

Get full access to this article

View all access options for this article.

References

Gao

, Sun H, Wang T, et al. Model-based and model-free machine learning techniques for diagnostic prediction and classification of clinical outcomes in Parkinson’s disease. Sci Rep 2018; 8. Springer Science and Business Media LLC. DOI: https://doi.org/10.1038/s41598-018-24783-4.

Pahuja

Nagabhushan

. A comparative study of existing machine learning approaches for Parkinson’s disease detection. IETE J Res 2018; 67: 4–14. Informa UK Limited.

Yaman

Ertam

Tuncer

. Automated Parkinson’s disease recognition based on statistical pooling method using acoustic features. Med Hypotheses 2020; 135: 109483. Elsevier BV.

Karaman

Çakın

Alhudhaif

, et al. Robust automated Parkinson disease detection based on voice signals with transfer learning. Expert Syst Appl 2021; 178: 115013. Elsevier BV.

Solana-Lavalle

Galán-Hernández

J-C

Rosas-Romero

. Automatic Parkinson disease detection at early stages as a pre-diagnosis tool by using classifiers and a small set of vocal features. Biocybern Biomed Eng 2020; 40: 505–516. Elsevier BV.

Kumar

Rekha

. A dense network approach with Gaussian optimizer for cardiovascular disease prediction. New Gener Comput 2023; 41: 859–878.

Kumar

Rekha

. An improved hawks optimizer based learning algorithms for cardiovascular disease prediction. Biomed Signal Process Control 2023; 81: 104442.

Saranya

Karthikeyan

Kumar

, et al. DenseNet-ABiLSTM: revolutionizing multiclass arrhythmia detection and classification using hybrid deep learning approach leveraging PPG signals. Int J Comput Intell Syst 2025; 18. DOI: https://doi.org/10.1007/s44196-025-00765-z.

West

Soltaninejad

Cheng

. Assessing the capability of deep-learning models in Parkinson’s disease diagnosis. In: Lecture notes in computer science. Springer International Publishing, 2020, pp. 237–247. DOI: https://doi.org/10.1007/978-3-030-54407-2_20.

10.

Abdulhay

Arunkumar

Narasimhan

, et al. Gait and tremor investigation using machine learning techniques for the diagnosis of Parkinson disease. Future Gener Comput Syst 2018; 83: 366–373. Elsevier BV.

11.

Shivangi

Johri Tripathi

. Parkinson disease detection using deep neural networks. In: 2019 twelfth international conference on contemporary computing (IC3), Noida, India, 2019, pp. 1–4. DOI: https://doi.org/10.1109/IC3.2019.8844941.

12.

Soumaya

Drissi Taoufiq

Benayad

, et al. The detection of Parkinson disease using the genetic algorithm and SVM classifier. Appl Acoust 2021; 171: 107528. Elsevier BV.

13.

Younis Thanoun

Yaseen

. A comparative study of Parkinson disease diagnosis in machine learning. In: 2020 The 4th international conference on advances in artificial intelligence. ACM, Oct. 09. 2020. DOI: https://doi.org/10.1145/3441417.3441425.

14.

Saad

Zaarour

Guerin

, et al. Detection of freezing of gait for Parkinson’s disease patients with multi-sensor device and Gaussian neural networks. Int J Mach Learn Cybern 2015; 8: 941–954. Springer Science and Business Media LC.

15.

Mirelman

, Ben Or Frank M, Melamed M, et al. Detecting sensitive mobility features for Parkinson’s disease stages via machine learning. Mov Disord 2021; 36: 2144–2155. Wiley.

16.

Karapinar Senturk

. Early diagnosis of Parkinson’s disease using machine learning algorithms. Med Hypotheses 2020; 138: 109603. Elsevier BV.

17.

Wroge

Özkanca

Demiroglu

, et al. Parkinson’s disease diagnosis using machine learning and voice. In: 2018 IEEE signal processing in medicine and biology symposium (SPMB), Philadelphia, PA, USA, 2018, pp. 1–7. DOI: https://doi.org/10.1109/SPMB.2018.8615607.

18.

Ali

Zhu

Golilarz

, et al. Reliable Parkinson’s disease detection by analyzing handwritten drawings: construction of an unbiased cascaded learning system based on feature selection and adaptive boosting model. IEEE Access 2019; 7: 116480–116489.

19.

Lamba

Gulati

Jain

, et al. A speech-based hybrid decision support system for early detection of Parkinson’s disease. Arab J Sci Eng 2022; 48: 2247–2260. Springer Science and Business Media LC.

20.

Moon

, Song HJ, Sharma VD, et al. Classification of Parkinson’s disease and essential tremor based on balance and gait characteristics from wearable motion sensors via machine learning techniques: a data-driven approach. J Neuroeng Rehabil 2020; 17. Springer Science and Business Media LLC. DOI: https://doi.org/10.1186/s12984-020-00756-5.

21.

Bernardo

, Quezada A, Munoz R,

et al. Handwritten pattern recognition for early Parkinson’s disease diagnosis. Pattern Recognit Lett 2019; 125: 78–84. Elsevier BV.

22.

Arunachalam

Rekha

. A novel approach for cardiovascular disease prediction using machine learning algorithms. Concurr Comput 2022; 34. DOI: https://doi.org/10.1002/cpe.7027.

23.

Eid

Soudan

Nassif

, et al. Enhancing intrusion detection in IIoT: optimized CNN model with multi-class SMOTE balancing. Neural Comput Appl 2024. Springer Science and Business Media LLC. DOI: https://doi.org/10.1007/s00521-024-09857-x.

24.

Kim

D-K

Chung

. Addressing class imbalances in software defect detection. J Comput Inf Syst 2023; 64: 219–231. Informa UK Limited.

25.

Kuncheva

Faithfull

. PCA feature extraction for change detection in multidimensional unlabeled data. IEEE Trans Neural Netw Learn Syst 2014; 25: 69–80.

26.

Qian

Wang

Hong

, et al. An efficient and adaptive reconstructive homogeneous block-based local tensor robust PCA for feature extraction of hyperspectral images. IEEE J Sel Top Appl Earth Obs Remote Sens 2024; 17: 4392–4407.

27.

Talukder

, Islam MM, Uddin MA,

et al. Machine learning-based network intrusion detection for big and imbalanced data using oversampling, stacking feature embedding and feature extraction. J Big Data 2024; 11. Springer Science and Business Media LLC. DOI: https://doi.org/10.1186/s40537-024-00886-w.

28.

Nogales

Benalcázar

. Analysis and evaluation of feature selection and feature extraction methods. Int J Comput Intell Syst 2023; 16. Springer Science and Business Media LLC. DOI: https://doi.org/10.1007/s44196-023-00319-1.

29.

Shyamala

Brahmananda . Brain tumor classification using optimized and relief-based feature reduction and regression neural network. Biomed Signal Process Control 2023; 86: 105279. Elsevier BV.

30.

Farokhah

Sarno

Fatichah

. Cross-subject channel selection using modified relief and simplified CNN-based deep learning for EEG-based emotion recognition. IEEE Access 2023; 11: 110136–110150.

31.

Talukder

MSH

Akter

. An improved ensemble model of hyper parameter tuned ML algorithms for fetal health prediction. Int J Inf Technol 2023; 16: 1831–1840. Springer Science and Business Media LC.

32.

Bach

Schacht

Chernozhukov

, et al. Hyperparameter Tuning for Causal Inference with Double Machine Learning: A Simulation Study. arXiv. 2024. DOI: https://doi.org/10.48550/ARXIV.2402.04674.

Leveraging relief feature selection and multi-classifier stacking approach for improved Parkinson's disease diagnosis

Abstract

Keywords

Get full access to this article

References