Physiological, Psychological, and Functional Health Determinants of Depressive Symptoms Among the Elderly in India: Evaluation of Classification Performance of XGBoost Models

Abstract

Background:

Depression among the elderly is a growing public health concern, especially in India. This study aimed to investigate the predictive validity of physiological, psychological, and functional health factors in classifying the level of depressive symptoms among the elderly using the extreme gradient boosting (XGBoost) technique. Additionally, we compared the performance of models trained on original and resampled data.

Methods:

This study is entirely based on secondary data analysis of the Longitudinal Aging Study in India wave 1 data. We classified the observations into “high depressive symptom” and “low/no depressive symptom” groups based on the predictors, including physiological, psychological, and functional health factors, along with socio-demographic factors. We developed three models (Models 1, 2, and 3) trained on original, over-sampled, and under-sampled data, respectively. Model performance was evaluated using the metrics of balanced accuracy, sensitivity, specificity, and area under the receiver operating characteristics curve (AUC).

Results:

The study included 26,065 individuals aged 60 and above. Model 3, trained on under-sampled data, demonstrated the best overall performance. It achieved a balanced accuracy of 64%, with a sensitivity of 62.8% and specificity of 65.2%. The AUC for Model 3 was 0.692. Feature importance analysis revealed that life satisfaction, instrumental activities of daily living, mobility, caste, and monthly per capita expenditure quintiles were among the most influential factors in predicting the level of depressive symptoms.

Conclusion:

The XGBoost models demonstrate promise in predicting depressive symptoms among the elderly. These findings suggest that machine learning models can be envisaged for early detection and management of depression, especially in primary care.

Keywords

LASI, ADL, machine learning, healthy aging, mental health

Key Messages:

Machine learning models, especially XGBoost, are effective tools for predicting depression among the elderly.

Factors such as life satisfaction, instrumental activities of daily living, and mobility are key predictors of depressive symptoms, along with socio-demographic variables like caste and income.

We compared the XGBoost models trained on both original and resampled data, and the model trained on under-sampled data showed the highest balanced accuracy and AUC score.

Home to 1.4 billion people, India is on the verge of a demographic shift with a growing elderly population due to increased life expectancy, reduced mortality, and declining fertility.¹ Globally, the elderly population constitutes around 10% of the world’s population and is expected to increase significantly in the coming decades.² As of 2022, India’s elderly population was approximately 10.5% of its total population.³

Depression, a common mental disorder among this population, is observed to affect 13%–34.4% of the elderly population in India, with a higher prevalence in females.⁴ Depression in the elderly arises from a complex interplay of social, psychological, and biological factors.⁵ Physical health problems such as chronic medical conditions, reduced independence in daily life functioning, and negative life events such as the death of a spouse increase the risk of depression.^6,7 Additionally, cognitive decline and diminished mobility, both commonly associated with the aging process, worsen the negative impact of the condition.⁸ Various validated diagnostic tools are available to detect the presence and severity of depression among the elderly, including both interview-administered questionnaires and self-report rating scales.⁹ Despite the popularity and clear advantages of these scales, depression among the elderly is difficult to detect and often goes unrecognized, mainly due to complex symptomatology, poor patient insight, and co-occurring cognitive impairment.¹⁰ Self-report scales rely heavily on the respondent’s ability to comprehend and accurately respond to questions. This may be compromised by factors such as low educational attainment and cognitive impairment, which are particularly relevant in the elderly population. On the other hand, structured interviews require significant time and specialized skills, which limit their practical utility, particularly in resource-constrained settings like many parts of India. There is a critical need for more accurate, efficient, and objective methods to assess depression among the elderly. 
Technological advances in machine learning models have made it possible to analyze large datasets with algorithms trained to predict the likelihood of depression based on various risk factors.^11,12 One promising approach is the use of advanced machine learning techniques, such as extreme gradient boosting (XGBoost) models.¹³ It has an established track record in predicting a range of health outcomes in previous studies, including heart disease, chronic kidney disease, diabetes, and thyroid disorders,^14–17 and more recently, in mental health conditions such as depression.^18–21

This study attempted to investigate the predictive validity of physiological, psychological, and functional health factors along with socio-demographic factors in the prediction of depressive symptoms among the elderly (60 years and above) using the XGBoost algorithm. Additionally, the study compared the performance metrics of different XGBoost models developed using original and resampled data. The XGBoost machine learning model is relatively new in the context of analyzing depression among the elderly in India, offering a sophisticated approach to classification compared to traditional statistical methods.²² XGBoost significantly aids in predicting depressive symptoms by handling large datasets and identifying complex patterns.¹⁹ It also highlights feature importance, providing insights into key factors contributing to depression.²³ With its ability to achieve high-performance metrics like accuracy and precision, XGBoost is robust and reliable.¹³ Additionally, as an ensemble method, it reduces the risk of overfitting by combining multiple weak models to form a strong predictor.¹³ These models offer the potential to overcome the subjective biases and logistical challenges associated with traditional diagnostic methods for depression.²⁴ By examining physiological, psychological, and functional health determinants, the study provides a more holistic view of both subjective and objective factors influencing depression among the elderly in India. By identifying key predictors and evaluating the performance of advanced machine learning models, this study contributes to the ongoing discourse on developing objective and efficient tools for identifying individuals at risk of depression. 
Insights from these models can help reduce healthcare costs by facilitating early and proactive identification of high-risk elderly individuals.²⁵ Integrating machine learning and artificial intelligence tools into electronic health records, primary care settings, or mobile health platforms to analyze patient data would assist in flagging individuals at high risk of depression, allowing for timely interventions that could potentially prevent the onset or progression of the condition. As a result, this approach can reduce healthcare costs and minimize the need for extensive clinical evaluations. Additionally, the study contributes to the existing literature by exploring new dimensions of depression determinants and offering a comparative analysis of advanced machine learning techniques against traditional methods, enriching the current body of research. The findings are particularly relevant to mental health professionals and researchers seeking innovative, more objective, and efficient approaches for early identification of depression risk in the elderly.

Methods

This study is a secondary data analysis of the Longitudinal Aging Study in India (LASI) Wave 1 data.²⁶ The LASI was conducted between April 2017 and December 2018 and provided a comprehensive dataset of over 70,000 older adults aged 45 and above, representing all states and union territories of India. The survey utilized a multi-stage stratified area probability cluster sampling design to ensure representation across diverse geographic and socio-economic strata. Detailed information on the survey design, sampling procedure, and data collection methodologies is available on the data source website.²⁶ The LASI Wave 1 data is freely available in the public domain and accessible upon request.²⁶ We followed the “Strengthening the Reporting of Observational Studies in Epidemiology (STROBE)” cross-sectional reporting guidelines to ensure comprehensive and transparent reporting of our study findings.²⁷ The STROBE checklist is presented in the supplementary file (see Table S1).

Figure 1 outlines the flowchart for the sample selection process in this study. The LASI dataset initially includes 73,396 observations. The study focused on individuals aged 60 years and above, which aligned with the definition of senior citizen given by the “Maintenance and Welfare of Parents and Senior Citizens Act, 2007.”²⁸ However, we excluded those aged 80 and above to ensure a more homogeneous sample, avoiding the distinct health challenges of the oldest-old population. We applied complete-case analysis to handle missing data, excluding any cases with missing values for relevant variables.²⁹ This resulted in a final sample size of 26,065 observations.

Figure 1.

The Flowchart of Participant Selection for the Study.

Predictor and Outcome Variables

The outcome variable in this study is the participants’ depressive symptom status, classified as a binary variable with two groups: the “high depressive symptom” and “low/no depressive symptom” groups. The “high depressive symptom” group refers specifically to participants who screened positive for depressive symptoms; those who screened negative were categorized into the “low/no depressive symptom” group. Depressive symptoms in the LASI were assessed using the Composite International Diagnostic Interview-Short Form (CIDI-SF).^30,31 Developed by the World Health Organization, the CIDI is a structured interview used to assess the presence and severity of various mental health disorders, including depression, anxiety, and substance abuse, based on the criteria outlined in the Diagnostic and Statistical Manual of Mental Disorders (DSM). The CIDI-SF is a shorter version of this instrument, which makes it more suitable for large-scale studies. The CIDI-SF uses a two-step process to screen for depressive symptoms, and it is known to be a useful tool for identifying probable major depressive episodes.^32,33 First, participants are asked, “During the last 12 months, was there ever a time when you felt sad, blue, or depressed for two weeks or more in a row?” If the answer is “yes,” the participant is asked seven more questions about specific symptoms of depression: loss of interest, low energy, loss of appetite, trouble concentrating, feelings of worthlessness, thoughts of death, and trouble falling asleep. Based on these assessments, individuals were classified into “high depressive symptom” and “low/no depressive symptom” groups. A flowchart outlining the assessment process of depressive symptoms is given in the supplementary file (see Figure S1). 
The predictors for classifying the level of depressive symptoms include physiological health (diagnosis of nine chronic conditions), psychological health (life satisfaction and cognitive ability), functional health (activity of daily living (ADL), instrumental ADL (IADL), and mobility difficulties), and socio-demographic factors (gender, monthly per-capita consumption expenditure, age groups, residence, religion, and caste/category). The supplementary file provides details of predictor variables included in the study (see Table S2).

Statistical Analysis

XGBoost is a highly regarded supervised machine learning technique known for its efficiency and scalability.¹³ XGBoost implements the gradient boosting algorithm, where multiple decision tree-based models are generated sequentially. Each model in the sequence is built by evaluating the residuals of the previous model to minimize error. Essentially, XGBoost is a sequential ensemble learning algorithm that enhances predictive power by aggregating multiple weaker models.³⁴
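The sequential, residual-fitting idea described above can be illustrated with a minimal sketch. It is not the XGBoost implementation: it assumes squared-error loss, a single feature, one-split trees ("stumps"), and no regularization, and the toy data and function names (`fit_stump`, `boost`, `mse`) are hypothetical.

```python
# Minimal gradient-boosting sketch (squared error, one feature, one-split stumps).
# Illustrative only: XGBoost adds regularization, second-order gradients, and more.

def fit_stump(xs, residuals):
    """Return a one-split predictor that best fits the residuals (least squares)."""
    best = None
    for threshold in sorted(set(xs))[:-1]:  # drop the max so the right side is never empty
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((r - left_mean) ** 2 for r in left)
               + sum((r - right_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean


def boost(xs, ys, n_rounds=20, learning_rate=0.3):
    """Fit stumps one after another, each to the residuals of the current ensemble."""
    base = sum(ys) / len(ys)
    stumps = []
    predictions = [base] * len(xs)
    for _ in range(n_rounds):
        residuals = [y - p for y, p in zip(ys, predictions)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        predictions = [p + learning_rate * stump(x) for x, p in zip(xs, predictions)]
    return lambda x: base + learning_rate * sum(s(x) for s in stumps)


def mse(model, xs, ys):
    """Mean squared error of a fitted model on (xs, ys)."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


xs = [1, 2, 3, 4, 5, 6, 7, 8]  # toy feature values (hypothetical)
ys = [0, 0, 0, 1, 0, 1, 1, 1]  # toy binary outcome (hypothetical)
one_round = boost(xs, ys, n_rounds=1)
many_rounds = boost(xs, ys, n_rounds=20)
```

On this toy data the training error of `many_rounds` is lower than that of `one_round`, which is the sense in which aggregating weak models strengthens the predictor.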

In this study, we investigated the predictive validity of physiological, psychological, and functional health factors, along with socio-demographic factors, in identifying the elderly with high depression risk using the XGBoost algorithm. The dataset was split 80:20 into training and testing sets. To address the class imbalance in the training data, we employed resampling methods, including over-sampling and under-sampling.³⁵ In the over-sampling approach, observations from the smaller “high depressive symptom” group were duplicated to match the size of the larger “low/no depressive symptom” group. Conversely, the under-sampling technique involved randomly reducing the “low/no depressive symptom” group to equal the size of the “high depressive symptom” group. The distribution of the “high depressive symptom” and “low/no depressive symptom” groups in the balanced and original samples is presented in the supplementary file (see Table S3). We built three XGBoost models using the original and resampled training data: Model 1 used the original data, Model 2 used over-sampled data, and Model 3 used under-sampled data. Resampling was intended to keep the model from becoming biased toward the majority class (the “low/no depressive symptom” group in this case), allowing it to learn patterns from both classes more effectively and thereby improving classification performance, especially for the minority class. The testing set was used to measure the performance of all models. Additionally, we compared the performance metrics of the three models in the classification of participants into “high depressive symptom” and “low/no depressive symptom” groups.
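The two resampling strategies can be sketched as follows. This is a minimal illustration, not the study's R script: it assumes random duplication with replacement for over-sampling and random selection without replacement for under-sampling, and the function names, seed, and toy data are hypothetical. As in the study, resampling would be applied to the training split only.

```python
import random


def random_oversample(rows, labels, minority, seed=0):
    """Duplicate randomly chosen minority rows until both classes have equal size."""
    rng = random.Random(seed)
    minority_idx = [i for i, y in enumerate(labels) if y == minority]
    majority_idx = [i for i, y in enumerate(labels) if y != minority]
    n_extra = len(majority_idx) - len(minority_idx)
    extra = [rng.choice(minority_idx) for _ in range(n_extra)]
    keep = majority_idx + minority_idx + extra
    return [rows[i] for i in keep], [labels[i] for i in keep]


def random_undersample(rows, labels, minority, seed=0):
    """Randomly drop majority rows until both classes have equal size."""
    rng = random.Random(seed)
    minority_idx = [i for i, y in enumerate(labels) if y == minority]
    majority_idx = [i for i, y in enumerate(labels) if y != minority]
    kept_majority = rng.sample(majority_idx, len(minority_idx))
    keep = kept_majority + minority_idx
    return [rows[i] for i in keep], [labels[i] for i in keep]


# Toy training set (hypothetical): 5 "high" cases (1) and 45 "low/no" cases (0).
train_rows = list(range(50))
train_labels = [1] * 5 + [0] * 45

over_rows, over_labels = random_oversample(train_rows, train_labels, minority=1)
under_rows, under_labels = random_undersample(train_rows, train_labels, minority=1)
```

Over-sampling keeps every original observation and enlarges the training set, whereas under-sampling discards most of the majority class, so the resulting training sets differ greatly in size.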

We conducted hyperparameter tuning on the training data, both original and resampled, to obtain the optimal set of parameters for the XGBoost models.³⁶ Within the training set, a 10-fold cross-validation procedure was used to fine-tune and optimize the algorithm. The details of the parameters tuned and the resultant optimal values are provided in the supplementary file (see Table S4). The classification of participants into “high depressive symptom” and “low/no depressive symptom” groups was evaluated using confusion matrices, which are 2 × 2 tables summarizing model results.³⁷ These matrices include components such as “True Positives” (correctly identified as “high depressive symptom” group), “False Positives” (incorrectly identified as “high depressive symptom” group), “True Negatives” (correctly identified as “low/no depressive symptom”), and “False Negatives” (incorrectly identified as “low/no depressive symptom”). From these, we reported accuracy, no information rate (NIR), sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and balanced accuracy.³⁸ The detailed definition of these evaluation metrics is provided in the supplementary file (see Table S5). We also plotted the receiver operating characteristic (ROC) curve for each model to compare performance, showing the relationship between sensitivity and 1 − specificity.³⁹ The curve closest to the top-left corner indicates the best balance between sensitivity and specificity. We also reported the values of the area under the ROC curve (AUC). 
The ROC curve with the highest AUC value indicates the model with the best overall performance in terms of distinguishing between the “high depressive symptom” and “low/no depressive symptom” classes.⁴⁰ To assess each predictor’s contribution to model accuracy, we calculated feature importance.⁴¹ This metric indicates the degree to which a variable enhances accuracy, with higher scores given to features that improve classification accuracy or reduce error.
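For intuition about the AUC, the sketch below uses its rank-based interpretation: the probability that a randomly chosen “high depressive symptom” case receives a higher predicted score than a randomly chosen “low/no depressive symptom” case, with ties counted as half. The scores and labels are hypothetical toy values, and this pairwise computation is only one of several equivalent ways to obtain the AUC.

```python
def roc_auc(scores, labels):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy predicted probabilities and true classes (hypothetical).
scores = [0.9, 0.8, 0.35, 0.6, 0.4, 0.2]
labels = [1, 1, 1, 0, 0, 0]
auc = roc_auc(scores, labels)  # 7 of the 9 positive-negative pairs are ordered correctly
```

A value of 0.5 corresponds to scores that carry no ranking information, and 1.0 to perfect separation of the two groups.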

The R package “xgboost” (version 1.5.2.1) was used to develop and validate the XGBoost models.¹³ The R-script for the statistical analysis is available on GitHub.⁴² The data visualizations of the ROC curve and feature importance of different XGBoost models were done using Microsoft Excel 2019.

Results

Table 1 presents the socio-demographic, physiological, psychological, and functional health characteristics of the study participants across the depressive symptom status groups. The total sample size was 26,065, with 6.5% (n = 1,684) of individuals identified with high levels of depressive symptoms. Among the participants, 66.1% (n = 17,241) resided in rural areas. The distribution across the Monthly Per-Capita Expenditure (MPCE) quintiles revealed that the highest proportion of participants (20.7%; n = 5,408) was in the poorest quintile, while the lowest proportion (18.7%; n = 4,875) was in the richest quintile. The majority of the participants (68.1%; n = 17,759) were in the 60–69 years age group. Additionally, 74.5% (n = 19,429) of the participants were Hindu, and the remaining 25.5% (n = 6,636) belonged to other religions. In terms of caste, the largest share of participants (39.5%; n = 10,293) was from the OBC category, while the smallest (16.9%; n = 4,411) was from the ST category. A slight majority of the participants were female (51.3%; n = 13,361).

Table 1.

Characteristics of the Study Participants (n = 26065) Across the Depressive Symptom Status Groups.

Characteristics		Total (n = 26065)	Groups
Characteristics		Total (n = 26065) n (%)	“low/no depressive symptom” group (n = 24381, 93.5%) n (%)	“high depressive symptom” group (n = 1684, 6.5%) n (%)
Socio-demographic
Residence
	Rural	17241 (66.1)	15995 (65.6)	1246 (74)
	Urban	8824 (33.9)	8386 (34.4)	438 (26)
MPCE quintile
	Poorest	5408 (20.7)	5043 (20.7)	365 (21.7)
	Poorer	5356 (20.5)	5019 (20.6)	337 (20)
	Middle	5326 (20.4)	5028 (20.6)	298 (17.7)
	Richer	5100 (19.6)	4776 (19.6)	324 (19.2)
	Richest	4875 (18.7)	4515 (18.5)	360 (21.4)
Age group
	60–69 years	17759 (68.1)	16617 (68.2)	1142 (67.8)
	70–79 years	8306 (31.9)	7764 (31.8)	542 (32.2)
Religion
	Hindu	19429 (74.5)	18089 (74.2)	1340 (79.6)
	Others	6636 (25.5)	6292 (25.8)	344 (20.4)
Caste/categories
	General	6941 (26.6)	6491 (26.6)	450 (26.7)
	SC	4420 (17)	4070 (16.7)	350 (20.8)
	ST	4411 (16.9)	4285 (17.6)	126 (7.5)
	OBC	10293 (39.5)	9535 (39.1)	758 (45)
Gender
	Male	12704 (48.7)	11945 (49)	759 (45.1)
	Female	13361 (51.3)	12436 (51)	925 (54.9)
Physiological health
Morbidity
	No chronic conditions	12044 (46.2)	11422 (46.8)	622 (36.9)
	Diagnosed with a single condition	7658 (29.4)	7146 (29.3)	512 (30.4)
	Diagnosed with multiple chronic conditions	6363 (24.4)	5813 (23.8)	550 (32.7)
Psychological and cognitive health
Life satisfaction
	High satisfaction	11875 (45.6)	11373 (46.6)	502 (29.8)
	Medium satisfaction	10744 (41.2)	10049 (41.2)	695 (41.3)
	Low satisfaction	3446 (13.2)	2959 (12.1)	487 (28.9)
Cognitive ability
	Correctly answered three items (day, month, and year)	12783 (49)	12155 (49.9)	628 (37.3)
	Correctly answered only two items	4646 (17.8)	4311 (17.7)	335 (19.9)
	Correctly answered fewer than two items	8636 (33.1)	7915 (32.5)	721 (42.8)
Functional health
Mobility
	No difficulty	7449 (28.6)	7204 (29.5)	245 (14.5)
	Difficulty in 1 to 3 items	7439 (28.5)	7086 (29.1)	353 (21)
	Difficulty in 4 to 6 items	6892 (26.4)	6343 (26)	549 (32.6)
	Difficulty in at least 7 items	4285 (16.4)	3748 (15.4)	537 (31.9)
ADL
	No difficulty	21330 (81.8)	20216 (82.9)	1114 (66.2)
	Difficulty in ADL	4735 (18.2)	4165 (17.1)	570 (33.8)
IADL
	No difficulty	15494 (59.4)	14842 (60.9)	652 (38.7)
	Difficulty in IADL	10571 (40.6)	9539 (39.1)	1032 (61.3)

Of the participants assessed for physiological health status, 46.2% (n = 12,044) reported having no chronic conditions, while 24.4% (n = 6,363) had multiple chronic conditions. In terms of psychological health, 45.6% (n = 11,875) of the participants reported a high level of life satisfaction, while 13.2% (n = 3,446) reported a low level of life satisfaction. Nearly half of the participants (49%; n = 12,783) demonstrated adequate cognitive function by correctly answering all assessment items of cognitive ability. However, 33.1% (n = 8,636) of the participants answered at least two items incorrectly. In terms of functional health, mobility issues were common, with 71.4% (n = 18,616) reporting some difficulty in mobility. Furthermore, 18.2% (n = 4,735) of the participants reported difficulty in ADL, and 40.6% (n = 10,571) reported difficulty in IADL.

Within the “high depressive symptom” group (n = 1,684), 74% (n = 1,246) resided in rural areas. The highest proportion (21.7%; n = 365) belonged to the poorest quintile, and 67.8% (n = 1,142) of the participants were in the 60–69 age group. Individuals who belonged to the OBC category made up 45% (n = 758), while the ST category was the smallest at 7.5% (n = 126). Females made up 54.9% (n = 925) of this group and males 45.1% (n = 759). The physiological health status indicated that 36.9% (n = 622) of the “high depressive symptom” group had no chronic conditions, and 32.7% (n = 550) were diagnosed with multiple chronic conditions. Regarding life satisfaction, 41.3% (n = 695) of the participants in the “high depressive symptom” group reported a medium level of life satisfaction. The largest share (42.8%; n = 721) of the participants in the “high depressive symptom” group correctly answered fewer than two items in the assessment of cognitive ability. Mobility issues were common, with 64.5% (n = 1,086) reporting difficulty in at least 4 out of 9 mobility items. For ADL, 33.8% (n = 570) of the participants reported having difficulty. Additionally, in the “high depressive symptom” group, 61.3% (n = 1,032) of the participants reported difficulties with IADLs.

Table 2 summarizes the confusion matrices of three classification models (Model 1, Model 2, and Model 3), and evaluation metrics were estimated from the components of the confusion matrix for each model.

Table 2.

Confusion Matrices of the Models (Models 1, 2 and 3).

Models			Actual Values
Models			“high depressive symptom” group	“low/no depressive symptom” group
Confusion Matrix 1
Model 1 (XGBoost model developed using original data)	Predicted values	“high depressive symptom” group	0	2
		“high depressive symptom” group	True Positive	False Positive
		“low/no depressive symptom” group	336	4874
		“low/no depressive symptom” group	False Negative	True Negative
Confusion Matrix 2
Model 2 (XGBoost model developed using over-sampled data)	Predicted values	“high depressive symptom” group	48	537
		“high depressive symptom” group	True Positive	False Positive
		“low/no depressive symptom” group	288	4339
		“low/no depressive symptom” group	False Negative	True Negative
Confusion Matrix 3
Model 3 (XGBoost model developed using under-sampled data)	Predicted values	“high depressive symptom” group	211	1696
		“high depressive symptom” group	True Positive	False Positive
		“low/no depressive symptom” group	125	3180
		“low/no depressive symptom” group	False Negative	True Negative

Table 3 outlines the predictive performance metrics, including accuracy, NIR, sensitivity, specificity, PPV, NPV, and balanced accuracy for these three models. Model 1, trained on the original data, registered high accuracy (93.5%, 95% confidence interval (CI): [92.8%–94.2%]) and specificity (99.96%). However, the model’s sensitivity was effectively zero: it failed to detect any true cases in the “high depressive symptom” group. This imbalance in the model’s performance is further highlighted by the balanced accuracy of 50%, indicating that, at the chosen classification threshold, the model’s ability to differentiate between individuals having high and low/no depressive symptoms is equivalent to random guessing. The ROC curve for Model 1 shows an AUC value of 0.69 (see Figure 2).

Table 3.

Predictive Performance Metrics of the XGBoost Models.

Evaluation Metrics	Model 1 (Original data)	Model 2 (Over-sampled)	Model 3 (Under-sampled)
Accuracy (95% CI)	0.935 (0.928–0.942)	0.842 (0.832–0.852)	0.651 (0.638–0.664)
NIR	0.936	0.936	0.936
Sensitivity	<0.001	0.143	0.628
Specificity	0.999	0.89	0.652
PPV	<0.001	0.082	0.11
NPV	0.936	0.938	0.962
Balanced accuracy	0.5	0.516	0.64

The XGBoost model trained on the over-sampled data shows some improvement in sensitivity (14.3%) and balanced accuracy (51.6%) compared to Model 1. However, the model’s accuracy (84.2%, 95% CI: [83.2%–85.2%]) is lower than the NIR (93.6%). While the model has high specificity (89%) and NPV (93.8%), the low sensitivity (14.3%) and PPV (8.2%) indicate that Model 2 struggles to correctly identify and predict participants in the “high depressive symptom” group. Overall, the over-sampling approach improved the model’s ability to detect true positive cases. The ROC curve for Model 2 shows an AUC value of 0.583, lower than that of Model 1 (see Figure 2).

The XGBoost model trained on the under-sampled data shows a substantial improvement in sensitivity (62.8%) and balanced accuracy (64%) compared to Models 1 and 2. However, the overall accuracy (65.1%, 95% CI: [63.8%–66.4%]) is lower than the NIR (93.6%). While the model has improved sensitivity and NPV (96.2%), it has a relatively low specificity (65.2%) and PPV (11.1%). Overall, the under-sampling approach enhanced the model’s sensitivity and balanced accuracy, making it more effective at detecting true positive cases. The ROC curve for Model 3 shows an AUC value of 0.692, which is higher than that of both Models 1 and 2. Overall, Model 3 offers a better classification based on the AUC value and balanced accuracy, giving the best trade-off between sensitivity and specificity among the three models (see Figure 2).
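The Model 3 metrics can be recomputed directly from its confusion-matrix counts in Table 2 (211 true positives, 1,696 false positives, 125 false negatives, and 3,180 true negatives). The sketch below uses the standard definitions of these metrics; the function name is hypothetical, and the NIR is taken here as the share of the larger class in the test set.

```python
def classification_metrics(tp, fp, fn, tn):
    """Standard metrics derived from the four cells of a 2 x 2 confusion matrix."""
    total = tp + fp + fn + tn
    sensitivity = tp / (tp + fn)  # share of actual positives that were detected
    specificity = tn / (tn + fp)  # share of actual negatives that were detected
    return {
        "accuracy": (tp + tn) / total,
        "nir": max(tp + fn, tn + fp) / total,  # accuracy of always predicting the larger class
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "balanced_accuracy": (sensitivity + specificity) / 2,
    }


# Model 3 counts as reported in Table 2.
model3 = classification_metrics(tp=211, fp=1696, fn=125, tn=3180)
for name, value in model3.items():
    print(f"{name}: {value:.3f}")
```

Rounded to three decimals, this gives a sensitivity of 0.628, a specificity of 0.652, and a balanced accuracy of 0.640, matching Table 3. It also shows why accuracy alone is misleading here: always predicting the majority class would already score about 0.936.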

Figure 2.

Comparison of ROC Curves of the XGBoost Models.

Figure 3 plots the predictors by their feature importance scores in the different XGBoost models. Life satisfaction (16.4%), IADL (14.9%), mobility (13.9%), caste (11.6%), and the MPCE quintile (10.1%) contributed most to the predictive power of Model 1, indicating that they are important in identifying participants in the “high depressive symptom” group. Conversely, the bottom features, which have the lowest importance scores, include age (3.1%), place of residence (3.5%), religion (3.8%), gender (4.3%), and ADL (5%). These features play a relatively minor role in Model 1 in identifying participants in the “high depressive symptom” group. The socio-demographic factors (combination of age groups, residence, MPCE quintile, religion, caste, and gender) recorded the highest feature importance of 36.4% in Model 1, followed by the functional factors (combination of mobility, ADL, and IADL) with a feature importance of 33.8%. Physiological factors recorded the lowest feature importance, 6.7%, in Model 1.

Figure 3.

Feature Importance Scores of Predictors in Different XGBoost Models of the Study.

The top features in Model 2 are the MPCE quintile (13.4%), caste (12.1%), life satisfaction (12.0%), mobility (11.7%), and morbidity (10.0%). In contrast, the bottom features, with lower importance scores, include ADL (3.9%), religion (4.1%), residence (4.4%), sex (5.3%), and age (5.4%). The socio-demographic factors recorded the highest feature importance of 44.7% in Model 2, followed by the functional factors with a feature importance of 24.9%. Physiological factors recorded the lowest feature importance of 9.97% in Model 2. The most significant features in Model 3 are IADL (22.6%), life satisfaction (18.4%), caste (14.0%), and mobility (13.5%). On the other hand, the least important features include age (1.3%), sex (2.0%), residence (2.2%), and religion (3.3%). The functional factors recorded the highest feature importance of 42.2% in Model 3, followed by the socio-demographic factors with a feature importance of 30.4%. Physiological factors recorded the lowest feature importance, 5.6%, in Model 3.

Overall, the variables such as life satisfaction, IADL, mobility, caste, and MPCE quintiles were consistently ranked among the top features across all three models, with an average feature importance of 15.6%, 15.6%, 13%, 12.6%, and 10.3%, respectively. In contrast, the average feature importance across the models for features including age (3.3%), residence (3.3%), religion (3.7%), and sex (3.9%) was less than 5%. The socio-demographic factors registered the highest average feature importance score of 37.2%, followed by functional and psychological factors with feature importance of 33.6% and 21.8%, respectively.

Physiological features have the lowest average importance score of 7.4%, indicating they contribute the least to the predictive accuracy across the models.
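The cross-model averages can be checked with simple arithmetic. The sketch below assumes the averages are unweighted means of the per-model importance scores reported above (in the order Model 1, Model 2, Model 3); the variable names are hypothetical.

```python
# Per-model feature importance (%) as reported in the text: Model 1, Model 2, Model 3.
importance_by_feature = {
    "life satisfaction": [16.4, 12.0, 18.4],
    "mobility": [13.9, 11.7, 13.5],
    "caste": [11.6, 12.1, 14.0],
}
importance_by_domain = {
    "socio-demographic": [36.4, 44.7, 30.4],
    "functional": [33.8, 24.9, 42.2],
    "physiological": [6.7, 9.97, 5.6],
}


def mean_importance(table):
    """Unweighted mean across the three models, rounded to one decimal place."""
    return {name: round(sum(values) / len(values), 1) for name, values in table.items()}


feature_means = mean_importance(importance_by_feature)
domain_means = mean_importance(importance_by_domain)
```

Under this assumption the means come out at 15.6% for life satisfaction, 13.0% for mobility, and 12.6% for caste, and at 37.2%, 33.6%, and 7.4% for the socio-demographic, functional, and physiological domains, in line with the figures reported above.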

Discussion

Our study investigated the physiological, psychological, and functional health determinants of depressive symptoms among the elderly (60–79 years age group) in India, using XGBoost for classification. Out of the three models we built, Model 3 (based on the under-sampled data) offered better classification based on AUC and balanced accuracy. The top predictors, accounting for over 67% of feature importance, were life satisfaction, IADL, mobility, caste, and MPCE quintiles. Age group, residence, religion, and sex collectively contributed 14.3% to feature importance across the models.

Among the psychological factors, “life satisfaction,” with a feature importance of 15.6%, emerged as a key predictor of depressive symptoms. This corroborates prior work highlighting life satisfaction’s role in predicting depression among the adult population.⁴³ The study aligns with findings that lower life satisfaction correlates with a higher risk of depression.^44,45 Given the pivotal role of life satisfaction in subjective well-being and quality of life, the study suggests that improving life satisfaction could serve as a preventive measure for depressive episodes.⁴⁶ Additionally, existing research has identified the mediating and moderating effects of “social participation” in the relationship between life satisfaction and depression.⁴⁷ Based on these findings and the results of this study, it is essential to focus on improving subjective well-being and quality of life by promoting and enhancing social connectedness among the elderly, which may in turn help prevent depression.

Among the functional health factors, IADL and mobility were important predictors of depressive symptoms, with feature importances of 15.6% and 13%, respectively. The study found that 61.3% of participants in the “high depressive symptom” group reported difficulty with IADL, aligning with past research linking impaired IADL to depressive symptoms.^48,49 Elderly adults with mobility limitations have an unfavorable trajectory in terms of depressive symptoms compared to those without mobility limitations.⁵⁰ In this study, we observed that 85.5% of the participants in the “high depressive symptom” group had some difficulty with mobility. The stress process theory suggests that such dysfunction may impede the fulfillment of social roles, possibly leading to depression.⁵¹ Additionally, memory, perceived stress, purpose in life, and resilience mediate the relationship between depression and independence in the IADL.^49,52 Enhancing psychological support to reduce perceived stress may help prevent functional decline and thereby reduce the incidence of depression among the elderly.

Among the socio-demographic factors, caste and MPCE quintiles had notable feature importances of 12.6% and 10.3%, respectively. The caste system is a social stratification based on hereditary social roles and occupations. Historically, it has restricted access to resources and opportunities for lower castes in India.⁵³ There is a paucity of literature on the relationship between caste and mental health. We observed a higher prevalence of depressive symptoms among participants belonging to scheduled castes and other backward classes than among those in the general and scheduled tribe categories. Previous research conducted in India and Nepal, both of which have caste systems, similarly observed a higher prevalence of mental health disorders among individuals from lower castes than among those in the general category.^54,55 Notably, we observed a comparatively lower prevalence of depressive symptoms among the tribal population. This aligns with the previous literature on the elderly of the Chakhesang tribe from Nagaland, India, which reported a depression prevalence of 0.2% and 2.1% among male and female participants, respectively.⁵⁶ This trend may be attributable to the higher physical activity, healthy diet and lifestyle, and engagement in spiritual and cultural activities of the tribal population.^57,58 Factors such as poverty, lack of social support, substance use, and stress mediate the relationship between caste and depression.⁵⁹ Another major socio-demographic factor associated with depressive symptoms is the MPCE quintile. Previous studies have linked higher income levels to fewer depressive symptoms.^60,61 However, our study found that depressive symptoms were highly prevalent among both the poorest (21.7%) and richest (21.4%) quintiles.
The mental health burden among the poorest quintile can be explained by the mechanisms of the “social causation pathway.”⁶² Stressful or unfavorable financial circumstances can contribute to poor living conditions, malnutrition, unhealthy lifestyles, and reduced social capital, all of which are potential risk factors for depression.⁶³ This study also observed that depressive symptoms were not limited to the poor but were similarly common among the affluent. Greater awareness of mental health issues and better access to mental health services are potential reasons for the higher prevalence of depression reported in the richest quintile. Furthermore, “diseases of affluence” such as obesity, overweight, and reduced physical activity are associated with depression.⁶⁴ Additionally, a competitive social environment and social isolation may contribute to an increased prevalence of depression among the elderly population in the richest quintile.

We trained XGBoost models on the original and resampled data. Model 3, trained on the under-sampled data, was the best model when the importance of identifying the “high depressive symptoms” group relative to the “low/no depressive symptoms” group is taken into account. However, the overall accuracy of this model was lower than the model performance reported in the previous literature. Sharma and Verbeke (2020) predicted depression among the general population from biomarkers using XGBoost, achieving an accuracy of 97.3% with their best model.¹⁹ Handing et al. (2022) used a random forest machine learning technique to predict depression among middle-aged and older adults in Europe, yielding accuracies between 76% and 82.4%.⁶⁵ The limited number of predictors in this study may have contributed to the model’s lower accuracy.
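For readers less familiar with the metric, balanced accuracy is the mean of sensitivity and specificity. The following minimal Python sketch reproduces the Model 3 value from the sensitivity and specificity reported in this study, and then uses purely hypothetical confusion-matrix counts (not LASI data) to illustrate why plain accuracy can look favorable on imbalanced classes. The function names are illustrative and are not taken from the study's code.

```python
def sensitivity(tp: int, fn: int) -> float:
    """True positive rate: share of actual positives that were detected."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """True negative rate: share of actual negatives correctly rejected."""
    return tn / (tn + fp)


def balanced_accuracy(sens: float, spec: float) -> float:
    """Mean of sensitivity and specificity."""
    return (sens + spec) / 2.0


# Values reported for Model 3 in this study.
model3_bal_acc = balanced_accuracy(0.628, 0.652)
print(f"Model 3 balanced accuracy: {model3_bal_acc:.3f}")

# Hypothetical counts (an assumption for illustration only): a classifier
# that labels every one of 1,000 people as "low/no depressive symptoms"
# when 100 of them actually have high depressive symptoms.
tp, fn, tn, fp = 0, 100, 900, 0
plain_acc = (tp + tn) / (tp + fn + tn + fp)
trivial_bal_acc = balanced_accuracy(sensitivity(tp, fn), specificity(tn, fp))
print(f"Plain accuracy: {plain_acc:.2f}, balanced accuracy: {trivial_bal_acc:.2f}")
```

In the hypothetical case, plain accuracy is high even though no person with high depressive symptoms is identified, which is why balanced accuracy is the more informative summary when the classes are unequal in size.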

This study has several strengths. XGBoost is a widely used machine learning technique with a proven record in past literature.^13–16,18,20,65 To the best of the authors’ knowledge, limited research has explored the application of XGBoost in predicting depressive symptoms among the elderly population. Furthermore, the study trained XGBoost models on three different datasets: original, over-sampled, and under-sampled data; this approach, along with the use of a large dataset, allowed for a robust analysis of depression. We addressed depressive symptoms from both objective and subjective perspectives by incorporating a range of social, physical, psychological, and functional variables. Both objective and subjective measures were among the top predictors of depressive symptoms in this study.
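The difference between the three training sets can be sketched as follows. This is an illustration under stated assumptions rather than the study's implementation: it assumes simple random resampling (the specific resampling algorithm is not restated in this section), uses a hypothetical helper named `resample`, and runs on toy labels instead of LASI data.

```python
import random
from collections import Counter


def resample(rows, labels, mode, seed=0):
    """Randomly balance classes.

    mode="under": shrink every class to the size of the smallest class.
    mode="over":  grow every class to the size of the largest class by
                  drawing duplicates with replacement.
    """
    if mode not in ("under", "over"):
        raise ValueError("mode must be 'under' or 'over'")
    rng = random.Random(seed)

    # Group rows by class label.
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)

    sizes = [len(members) for members in by_class.values()]
    target = min(sizes) if mode == "under" else max(sizes)

    out_rows, out_labels = [], []
    for label, members in by_class.items():
        if len(members) > target:
            # Under-sampling: keep a random subset without replacement.
            chosen = rng.sample(members, target)
        else:
            # Over-sampling: keep all rows, then add random duplicates.
            extra = rng.choices(members, k=target - len(members))
            chosen = members + extra
        out_rows.extend(chosen)
        out_labels.extend([label] * len(chosen))
    return out_rows, out_labels


# Toy data (hypothetical): 8 "low/no" (0) and 2 "high" (1) observations.
rows = list(range(10))
labels = [0] * 8 + [1] * 2

_, under_labels = resample(rows, labels, "under")
_, over_labels = resample(rows, labels, "over")
print("original:     ", Counter(labels))
print("under-sampled:", Counter(under_labels))
print("over-sampled: ", Counter(over_labels))
```

Under-sampling discards majority-class observations, so it trades training-set size for class balance, whereas over-sampling keeps every observation but repeats minority-class rows.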

Despite these strengths, our study has some limitations. The assessment used in the study is based on symptom screening, which may not equate to a clinical diagnosis of depressive disorder. Therefore, readers should exercise caution when interpreting and generalizing the conclusions. Since the model showed moderate accuracy, sensitivity, and specificity in identifying participants with depressive symptoms, caution should be exercised when applying it in practical mental health screening settings. Moreover, to comprehensively assess the predictive validity of the physiological, psychological, functional, and socio-demographic factors in classifying the level of depressive symptoms, multiple machine learning methods, such as random forest and support vector machine classification, need to be employed. While the XGBoost model performed well, it is essential to validate its applicability across different populations and settings. Future research should also explore the integration of other machine learning models and hybrid approaches to enhance classification performance. Only 12 variables across the different domains were included in this study, and more are needed for robust prediction. We did not apply a formal selection procedure when choosing the variables for the XGBoost models. While the use of a large, nationally representative dataset supports the generalizability of our results within India, the findings may not be directly applicable to other countries or cultural contexts due to variations in demographic, socio-economic, and health characteristics.

Additionally, since we used complete-case analysis, the exclusion of participants with missing data may restrict the generalizability of the findings to populations with similar missing-data patterns. The cross-sectional nature of the data limits causal inferences between the identified determinants and depressive symptoms.

Our study’s findings have important implications for public health policy and interventions. To prevent depression among the elderly, it is crucial to enhance subjective well-being and quality of life by fostering social connectedness. Addressing broader issues like poverty, lack of social support, and stressful life events through a coordinated, multisectoral approach is essential. Policymakers should focus on reducing income inequality via progressive taxation and universal basic income. The identification of key determinants of depressive symptoms among the elderly can inform the development of targeted prevention and treatment programs. For instance, addressing chronic health conditions through integrated care models that combine physical and mental health services can potentially reduce depression rates. Additionally, enhancing life satisfaction and cognitive health through social support and cognitive training programs could mitigate the risk of depression.

Conclusion

This study points to life satisfaction, functional health, and social factors as key determinants of depressive symptoms among the elderly in India. Furthermore, the XGBoost model proved to be a valuable tool in classifying the level of depressive symptoms, offering the potential for its application in mental health screening programs among the elderly. Continued research and policy efforts are essential to tackle the growing challenge of depression in this demographic.

Supplemental Material

Supplemental material for this article is available online.

Footnotes

Data Availability

The study used data from LASI Wave 1, which is freely available in the public domain and accessible upon request.²⁶

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Declaration Regarding the Use of Generative AI

None used.

Declarations of Statements

The manuscript being submitted has not been published, simultaneously submitted, or already accepted for publication elsewhere.

The manuscript has been read and approved by all the authors, the requirements for authorship as stated earlier in this document have been met, and each author believes that the manuscript represents honest work.

The manuscript, to the best of the authors’ knowledge, does not infringe upon any copyright or property right of any third party.

Ethics Approval

This study is based on secondary data derived from LASI Wave 1 (2017–2018), which received ethical approval from the Indian Council of Medical Research, Ethics Committee. As the data are publicly available and do not involve direct interaction with participants or human subjects, no additional ethics approval was required.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

Informed Consent

The LASI Wave 1 data are freely available in the public domain, and the agencies responsible for conducting the field survey ensured that respondents provided informed consent prior to participation. Additionally, the privacy of participants was safeguarded throughout the data collection process.²⁶

Statement on Prior Presentations

This study has not been presented at any conferences or meetings.

References

1. Agarwal, Lubet, Mitgang, et al. Population aging in India: Facts, issues, and options. In: Poot and Roskruge (eds) Population change and impacts in Asia and the Pacific. Singapore: Springer Singapore, pp. 289–311.

2. World Bank. Population ages 65 and above (% of total population) | Data, https://data.worldbank.org/indicator/SP.POP.65UP.TO.ZS?end= (accessed 31 July 2024).

3. International Institute for Population Sciences & United Nations Population Fund. India Ageing Report 2023, Caring for Our Elders: Institutional Responses. New Delhi: United Nations Population Fund, 2023.

4. Pilania, Yadav, Bairwa, et al. Prevalence of depression among the elderly (60 years and above) population in India, 1997–2016: A systematic review and meta-analysis. BMC Public Health, 2019; 19: 832.

5. Remes, Mendes and Templeton. Biological, psychological, and social determinants of depression: A review of recent literature. Brain Sci, 11. Epub ahead of print December 2021. DOI: 10.3390/brainsci11121633.

6. Sikorski, Luppa, Heser, et al. The role of spousal loss in the development of depressive symptoms in the elderly - Implications for diagnostic systems. J Affect Disord, 2014; 161: 97–103.

7. Srivastava, Debnath, Shri, et al. The association of widowhood and living alone with depression among older adults in India. Sci Rep, 2021; 11: 21641.

8. Hudon, Escudier, De Roy, et al. Behavioral and psychological symptoms that predict cognitive decline or impairment in cognitively normal middle-aged or older adults: A meta-analysis. Neuropsychol Rev, 2020; 30: 558–579.

9. Cusin, Yang, Yeung, et al. Rating scales for depression. In: Baer and Blais (eds) Handbook of clinical rating scales and assessment in psychiatry and mental health. Totowa, NJ: Humana Press, 2010, pp. 7–35.

10. Devita, De Salvo, Ravelli, et al. Recognizing depression in the elderly: Practical guidance and challenges for clinical management. Neuropsychiatr Dis Treat, 2022; 18: 2867–2880.

11. Deo. Machine learning in medicine. Circulation, 2015; 132: 1920–1930.

12. Obermeyer and Emanuel. Predicting the future — Big data, machine learning, and clinical medicine. N Engl J Med, 2016; 375: 1216–1219.

13. Chen and Guestrin. XGBoost: A scalable tree boosting system. Proc 22nd ACM SIGKDD Int Conf Knowl Discov Data Min, https://api.semanticscholar.org/CorpusID:4650265 (2016).

14. Budholiya, Shrivastava and Sharma. An optimized XGBoost-based diagnostic system for effective prediction of heart disease. J King Saud Univ - Comput Inf Sci, 2022; 34: 4514–4523.

15. Raihan, Khan MA-M, Kee S-H, et al. Detection of the chronic kidney disease using XGBoost classifier and explaining the influence of the attributes on the model using SHAP. Sci Rep, 2023; 13: 6263.

16. Gündoğdu. Efficient prediction of early-stage diabetes using XGBoost classifier with random forest feature selection technique. Multimed Tools Appl, 2023; 82: 34163–34181.

17. Dalal, Lilhore, Faujdar, et al. Enhancing thyroid disease prediction with improved XGBoost model and bias management techniques. Multimed Tools Appl. Epub ahead of print 2024. DOI: 10.1007/s11042-024-19713-8.

18. Zulkefli, Diah, Ismail, et al. Web-based mental health predicting system using K-Nearest Neighbors and XGBoost Algorithms. In: Badioze Zaman H, Robinson, Smeaton, et al. (eds) Advances in visual informatics. Singapore: Springer Nature Singapore, 2024, pp. 381–396.

19. Sharma and Verbeke WJMI. Improving Diagnosis of depression with XGBOOST machine learning model and a large biomarkers Dutch dataset (n = 11,081). Front Big Data, 3. Epub ahead of print 2020. DOI: 10.3389/fdata.2020.00015.

20. Chung and Teo. Single classifier vs. ensemble machine learning approaches for mental health prediction. Brain Informatics, 2023; 10: 1.

21. Moore and Bell. XGBoost, A novel explainable AI technique, in the prediction of myocardial infarction: A UK biobank cohort study. Clin Med Insights Cardiol, 2022; 16: 11795468221133612.

22. Nickson, Meyer, Walasek, et al. Prediction and diagnosis of depression using machine learning with electronic health records data: A systematic review. BMC Med Inform Decis Mak, 2023; 23: 271.

23. Shi, Wong, Li MZ-F, et al. A feature learning approach based on XGBoost for driving assessment and risk prediction. Accid Anal Prev, 2019; 129: 170–179.

24. Smith, Renshaw and Bilello. The diagnosis of depression: Current and emerging methods. Compr Psychiatry, 2013; 54: 1–6.

25. Esteva, Robicquet, Ramsundar, et al. A guide to deep learning in healthcare. Nat Med, 2019; 25: 24–29.

26. Longitudinal Ageing Study in India (LASI) | International Institute for Population Sciences (IIPS), https://www.iipsindia.ac.in/lasi (accessed 29 July 2024).

27. von Elm, Altman, Egger, et al. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Statement: Guidelines for Reporting Observational Studies. Ann Intern Med, 2007; 147: 573–577.

28. Issac, Ramesh, Reddy, et al. Maintenance and welfare of parents and Senior Citizens Act 2007: A critical appraisal. Indian J Psychol Med, 2021; 43: S107–S112.

29. Graham. Missing data analysis: Making it work in the real world. Annu Rev Psychol, 2009; 60: 549–576.

30. Sunderland, Andrews, Slade, et al. Measuring the level of diagnostic concordance and discordance between modules of the CIDI-short form and the CIDI-auto 2.1. Soc Psychiatry Psychiatr Epidemiol, 2011; 46: 775–785.

31. Kessler, Andrews, Mroczek, et al. The World Health Organization Composite International Diagnostic Interview short-form (CIDI-SF). Int J Methods Psychiatr Res, 1998; 7: 171–185.

32. Perianayagam, Prina, Selvamani, et al. Sub-national patterns and correlates of depression among adults aged 45 years and older: Findings from wave 1 of the Longitudinal Ageing Study in India. Lancet Psychiatry, 2022; 9: 645–659.

33. Aalto-Setälä, Haarasilta, Marttunen, et al. Major depressive episode among young adults: CIDI-SF versus SCAN consensus diagnoses. Psychol Med, 2002; 32: 1309–1314.

34. Sarker. Machine learning: Algorithms, real-world applications and research directions. SN Comput Sci, 2021; 2: 160.

35. Yang, Khorshidi and Aickelin. A review on over-sampling techniques in classification of multi-class imbalanced datasets: Insights for medical problems. Front Digit Heal, 6. Epub ahead of print 2024. DOI: 10.3389/fdgth.2024.1430245.

36. Bartz-Beielstein, Chandrasekaran and Rehbach. Case Study II: Tuning of gradient boosting (xgboost). In: Bartz, Bartz-Beielstein, Zaefferer, et al. (eds) Hyperparameter tuning for machine and deep learning with R: A practical guide. Singapore: Springer Nature Singapore, 2023, pp. 221–234.

37. Ting. Confusion matrix. In: Sammut and Webb (eds) Encyclopedia of machine learning. Boston, MA: Springer US, 2011, p. 209.

38. Rainio, Teuho and Klén. Evaluation metrics and statistical tests for machine learning. Sci Rep, 2024; 14: 6086.

39. Martínez-Camblor, Corral, Rey, et al. Receiver operating characteristic curve generalization for non-monotone relationships. Stat Methods Med Res, 2017; 26: 113–123.

40. Hajian-Tilaki. Receiver Operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Casp J Intern Med, 2013; 4: 627–635.

41. Musolf, Holzinger, Malley, et al. What makes a good prediction? Feature importance and beginning to open the black box of machine learning in genetics. Hum Genet, 2022; 141: 1515–1528.

42. Junaid. XGBoost-for-predicting-Depression-among-elderly-in-India. GitHub, https://github.com/junaidkp727/XGBoost-for-predicting-Depression-among-Elderly-in-India.git (2024, accessed 12 August 2024).

43. Gigantesco, Fagnani, Toccaceli, et al. The Relationship between satisfaction with life and depression symptoms by gender. Front Psychiatry, 10. Epub ahead of print 2019. DOI: 10.3389/fpsyt.2019.00419.

44. Bramhankar, Kundu, Pandey, et al. An assessment of self-rated life satisfaction and its correlates with physical, mental, and social health status among older adults in India. Sci Rep, 2023; 13: 9117.

45. Yoo, Chang and Kim. Prevalence and predictive factors of depression in community-dwelling older adults in South Korea. Res Theory Nurs Pract, 2016; 30: 200–211.

46. Joshanloo. Longitudinal relations between depressive symptoms and life satisfaction over 15 years. Appl Res Qual Life, 2022; 17: 3115–3130.

47. Gallagher, Daynes-Kearney, Bowman-Grangel, et al. Life satisfaction, social participation and symptoms of depression in young adult carers: Evidence from 21 European countries. Int J Adolesc Youth, 2022; 27: 60–71.

48. […] and Yang. Investigating the link between IADL and Depressive symptoms in older adults: A Cross-sectional serial mediation model. Clin Gerontol, 2023; 46: 844–859.

49. Tai, Tsai, Lin, et al. Depressive symptoms and daily living dependence in older adults with type 2 diabetes mellitus: The mediating role of positive and negative perceived stress. BMC Psychiatry, 2024; 24: 14.

50. […], Feng, Wu, et al. Longitudinal trajectories of depressive symptoms: The role of multimorbidity, mobility, and subjective memory. BMC Geriatr, 2023; 23: 22.

51. Pearlin, Lieberman, Menaghan, et al. The stress process. J Health Soc Behav, 1981; 22: 337–356.

52. Hua and Wang. Memory performance mediates the relationship between depression and independence in instrumental activities of daily living among community-dwelling older adults: Evidence from the China family panel study. Geriatr Nurs (Minneap), 2023; 50: 1–6.

53. Deshmukh, Sharma, Prasad, et al. Contemporary meaning of caste discrimination in Indian universities: A constructivist grounded theory. High Educ. Epub ahead of print 2024. DOI: 10.1007/s10734-024-01180-7.

54. French. Dalits and mental health: Investigating perceptions, stigma and barriers to support in Kathmandu, Nepal. J Glob Heal Reports, https://api.semanticscholar.org/CorpusID:216492600 (2020).

55. Kiang, Folmar and Gentry. “Untouchable”? Social status, identity, and mental health among adolescents in Nepal. J Adolesc Res, 2018; 35: 248–273.

56. Kham and Langstieh. Depression and related factors among the elderly Chakhesang population. Indian J Gerontol, 2018; 32: 7–20.

57. Rashmi, Srivastava, Muhammad, et al. Indigenous population and major depressive disorder in later life: A study based on the data from longitudinal ageing study in India. BMC Public Health, 2022; 22: 2258.

58. Ka’apu and Burnette. A culturally Informed systematic review of mental health disparities among adult indigenous men and women of the USA: What is known? Br J Soc Work, 2019; 49: 880–898.

59. Kohrt, Speckman, Kunz, et al. Culture in psychiatric epidemiology: Using ethnography and multiple mediator models to assess the relationship of caste with depression and anxiety in Nepal. Ann Hum Biol, 2009; 36: 261–280.

60. Shields-Zeeman and Smit. The impact of income on mental health. Lancet Public Heal, 2022; 7: e486–e487.

61. Hinata, Kabasawa, Watanabe, et al. Education, household income, and depressive symptoms in middle-aged and older Japanese adults. BMC Public Health, 2021; 21: 2120.

62. Guan, Guariglia, Moore, et al. Financial stress and depression in adults: A systematic review. PLoS One, 2022; 17: e0264041.

63. Lund. Poverty and mental health: A review of practice and policies, https://api.semanticscholar.org/CorpusID:54016398 (2012).

64. Hidaka. Depression as a disease of modernity: Explanations for increasing prevalence. J Affect Disord, 2012; 140: 205–214.

65. Handing, Strobl, Jiao, et al. Predictors of depression among middle-aged and older men and women in Europe: A machine learning approach. Lancet Reg Heal Eur, 2022; 18: 100391.
