Abstract
Objective
With the increasing global burden of cancer, there is a growing need for innovative strategies to improve oncology care. Health-related quality of life (HRQoL) is an outcome measure for assessing the overall wellbeing of patients with cancer. We used machine learning to predict HRQoL and to identify key factors that can inform patient-centered cancer care.
Methods
We conducted a cross-sectional study enrolling patients diagnosed with lung, breast, or colorectal cancer across two provinces in China. We collected data on demographics, clinical characteristics, and patient-centered features. HRQoL was assessed using the widely accepted EQ-5D-5L instrument in cancer care. We trained and evaluated seven machine learning models. SHapley Additive exPlanations (SHAP) analysis was employed to assess feature importance.
Results
Data from 924 patients with cancer were available. The random forest and extreme gradient boosting models had superior predictive performance. Positive SHAP values were primarily observed in patients with early-stage cancer and those enrolled in Urban Employees Basic Medical Insurance. Negative SHAP values were mainly associated with longer duration of chronic comorbidities, colorectal cancer, and ongoing chemotherapy. Age and time since cancer diagnosis exhibited bidirectional impacts.
Conclusions
Our study demonstrates the potential of machine learning models to predict HRQoL in patients with cancer. We identified key predictors of patient HRQoL, like duration of chronic comorbidities, early-stage cancer diagnosis, age, and health insurance coverage. Our findings would facilitate early identification of patients with lower HRQoL and promote the provision of patient-centered oncology care.
Introduction
Health-related quality of life (HRQoL) is a multidimensional concept that encompasses physical, emotional, social, and functional wellbeing, and it is increasingly recognized as a critical outcome in oncology care. Cancer remains a leading cause of morbidity and mortality worldwide, with a significant impact on the HRQoL of patients. 1 In China, the situation is particularly concerning, with a rising incidence of cancer and an increasing burden of chronic comorbidities. This dual challenge underscores the urgent need for effective strategies to monitor and improve the wellbeing of patients with cancer.2,3
HRQoL is increasingly recognized as a critical outcome in oncology research and routine care. It serves as a key predictor of treatment adherence, symptom burden, and overall survival. 4 Furthermore, it is considered as an essential measure of overall wellbeing that directly informs healthcare planning and health service delivery. 5 Previous studies using multivariate analysis showed that the HRQoL of patients with cancer might be influenced by several factors, including the type and stage of cancer, treatment modalities, and demographic characteristics.6–8 However, these traditional methods often have limitations. They typically assume linear relationships between variables and struggle with the complex and nonlinear nature of the factors influencing HRQoL. 9 Furthermore, they are better at identifying broad associations across a population than at making accurate predictions for individual patients. 10 As oncology care becomes more personalized, predicting HRQoL is therefore necessitate for improving patient-centered outcomes.
Recent advancements in machine learning and deep learning have revolutionized the field of health outcomes research by providing digital health tools to analyze complex datasets and predict health outcomes.11–14 Machine learning is particularly suitable for HRQoL prediction since it can handle the complex, nonlinear interactions of clinical, demographic, and patient-report data to generate individualized predictions. This approach could provide individualized predictions essential for personalized care planning. However, the development of interpretable machine learning models capable of predicting HRQoL across multiple cancer types remains limited. This gap underscores the urgent need for robust, patient-centered predictive models to support personalized cancer care.
To address this gap, our study aims to develop, assess, and interpret machine learning models to predict HRQoL in patients with cancer. We integrate a diverse set of patient-centered features and focus on the most common cancer types in China, including lung, breast, and colorectal cancer. 15 Our study not only aligns with the mission of oncology care but also offers actionable advice on improving patient outcomes. Our findings will provide an actionable, and data-driven framework for the early identification of patients at risk for impaired wellbeing.
Methods
Study design and sampling
The study was designed and reported according to the STROBE statement for observational studies (Supplemental material 1). 16 The cross-sectional design was chosen to meet our primary objective of identifying factors associated with HRQoL at a specific time point. This approach is methodologically supported by prior research employing similar data with machine learning to predict health-related outcomes.4,17,18
Our study consecutively enrolled patients aged 18 years or older who had been diagnosed with one of the three most common types of cancer in China: lung, breast, or colorectal cancer. Patients were excluded if they found it difficult to understand the survey items. All participants were required to participate in public health insurance programs, as these programs have been a cornerstone of China's healthcare system over the past decades. 19
The patients were recruited from five hospitals in Jiangsu province (i.e. Affiliated Hospital of Nantong University, the People's Hospital of Rugao, Jiangsu Provincial Hospital, Affiliated Dongtai Hospital of Nantong University, and Suqian People's Hospital) and one hospital in Hebei province (i.e. Shijiazhuang Traditional Chinese Medicine Hospital). Patients were mainly enrolled from Department of Oncology, Department of Chemotherapy, and Department of Radiotherapy in sampling hospitals.
Machine learning methods are not always immune to sample size requirements. For binary outcomes, the technique needs more than 10 times as many events for each predictor. 20 We followed the events-per-variable rule. We had 30 original features and at least 300 samples would be required. To ensure the representativeness of patients, the sample size was balanced within each sampling hospital.
Data collection
Data were collected through structured questionnaires and medical record reviews. The questionnaires mainly consisted of patient demographic information, clinical characteristics, patient-centered items, and assessment of HRQoL (Supplemental material 2). We conducted a pilot survey on 10 patients to test the feasibility and refine the methodology of our study. The pilot survey helped us to improve the clarity of the questionnaires and identify problems in the data collection process, allowing us to make necessary adjustments before the formal survey. Our formal survey was conducted from 2 March to 31 June 2023.
Questionnaires were printed for ease of reading and filling out. They were administered through one-on-one, face-to-face interviews conducted by 14 medical interns and 20 physicians. We developed survey manuals and distributed them to interviewers. We conducted either in-person or online training for interviewers to ensure consistency in data collection. Clinical characteristics were filled in by interviewers based on patients’ electronic medical records.
Patients had the right to refuse to participate in the survey at any time. In order to ensure data completeness, each questionnaire is required to be reviewed by the interviewer immediately after completion. For any patient who had trouble understanding the questionnaire, the interviewer explained each item until the patient was able to understand its meaning. All patients were fully informed of the survey and signed informed consent forms. We provided each patient with a towel as a token of appreciation after completing the survey.
Outcome variables
The HRQoL in patients was assessed using the EQ-5D-5L instrument, which is a reliable and generic tool to describe health status across diverse populations and settings including cancer.21–25 We used the official version of the EQ-5D-5L questionnaire exclusively for academic research, as approved by the EuroQol Group. The EQ-5D-5L comprises five dimensions: mobility, self-care, usual activities, pain/discomfort, and anxiety/depression. Utility scores were generated using Chinese-specific scoring algorithms based on the time tradeoff technique. 26 The health utility value was anchored between 0 (death) and 1 (full health), with higher scores meaning better HRQoL.
Statistical analysis
Baseline characteristics of the patients were analyzed according to their HRQoL, stratified into two subgroups: higher and lower. Descriptive statistics were computed for each subgroup. The normality of continuous variables was evaluated using the Shapiro–Wilk test. Variables that followed a normal distribution were reported as mean ± standard deviation and differences between subgroups were tested using independent samples t-tests. In contrast, variables that did not follow a normal distribution were presented as medians with interquartile ranges, and differences were analyzed using the Wilcoxon rank-sum test. Categorical variables were presented as numbers (percentages) and were compared using the Pearson's chi-square test.
A two-tailed p-value of less than 0.05 was considered statistically significant for conventional analyses. However, in the context of machine learning, a p-value of less than 0.1 could be considered as the threshold for statistical significance. 27 Data analysis was performed using SPSS software (version 21.0).
Model development
Data preprocessing for models was conducted by Python software (version 3.9.19). We performed basic data cleaning, including range checks, plausibility checks, and handling of missing values. Variables with low levels of missingness (<5%) were imputed using the median for continuous variables and the mode for categorical variables, while features with extensive missingness were excluded. Multicollinearity for independent variables was assessed via variance inflation factors (Supplemental material 3).
Numerical features were normalized to ensure consistency across scales, while categorical variables were encoded using one-hot encoding. Feature selection was performed by recursive feature elimination with cross-validation, with a random forest classifier serving as the base estimator. This technique is relatively robust to moderate noise and correlations among predictors. The process iteratively removed the least important features based on model-derived importance scores. Each candidate feature subset was evaluated by five-fold stratified cross-validation, with the final subset selected to maximize the mean cross-validated area under the receiver operating characteristic curve (AUC) while maintaining model simplicity. Clinical relevance was also considered to ensure applicability in cancer care.
We extended our analysis to encompass patient preferences, which, though often underestimated in HRQoL research, are hypothesized to impact patient satisfaction and treatment outcomes.28,29 Therefore, our study incorporated a comprehensive set of features, including demographic characteristics (gender, age, education, occupation, marital status, monthly household income, and type of health insurance), clinical characteristics (diagnosis and stage of cancer, time since diagnosis, types and duration of comorbid chronic diseases, and current cancer therapies), and patient preferences (willingness to receive new anticancer drugs, preferences for routes of drug administration, and satisfaction with health insurance).
We employed seven machine learning models to predict HRQoL: random forest (RF), support vector machine (SVM), K-nearest neighbor (KNN), gradient boosting (GBoost), adaptive boosting (AdaBoost), extreme gradient boosting (XGB), and multilayer perceptron (MLP). These models were selected for their diverse strengths. RF integrates multiple decision trees to enhance accuracy. SVM offers strong generalizability with minimal data requirements; KNN provides fast training and robustness to outliers. 30 GBoost and AdaBoost combine weak learners into strong predictive models, with AdaBoost iteratively refining weights. 31 XGB is renowned for its performance in classification and regression tasks. MLP utilizes deep learning for multilayer feature extraction and classification. 32
In our study, the dataset was divided into 70% for training, 10% for validation, and 20% for testing. For each of the seven machine learning models, we ran a grid search over a defined range of key hyperparameters (Supplemental material 4). The optimal hyperparameter values are shown in Supplemental material 5, and all other parameters were kept at the default settings in scikit-learn.
Given the lack of a universally acknowledged threshold for predicting HRQoL in patients with cancer, we reviewed prior research and found that the average health utilities ranged from 0.62 to 0.75.33–36 We adopted the 0.70 cutoff because it lies at the lower end of the clinically meaningful range, allowing us to separate patients with clearly impaired HRQoL from others while preserving a sufficient sample size in each group (Supplemental material 6).
Since the two HRQoL groups showed moderate imbalance, we used synthetic minority over-sampling technique to reduce the effect of unequal class sizes before model training (Supplemental material 7). Our primary aim was to investigate general HRQoL determinants in patients with cancer, as key predictors are shared across cancer types. Pooling cohorts also improved statistical power.
The predictive performance of the seven models was evaluated using accuracy, AUC, precision, recall, and F1-score, providing a comprehensive assessment of their ability to predict HRQoL. To evaluate whether insurance variables would introduce policy-driven bias, we conducted a sensitivity analysis by removing the type of insurance features and retrained the models.
To assess the impact of each feature on HRQoL predictions, we performed SHapley Additive exPlanations (SHAP) analysis using the best-performing model. The SHAP value was calculated to observe the influence of each feature on the model's prediction. Positive SHAP values represent features that enhance prediction, while negative values represent features that reduce predictive power. To further explore nonlinear effects and potential interactions, we generated SHAP dependence plots for key predictors. We also generated an SHAP waterfall plot for a representative participant to illustrate local feature contributions. Model explanations derived from SHAP analyses were reviewed by two clinical experts to ensure clinical consistency and relevance to patient care.
Results
Demographic and clinical characteristics
A total of 949 patients participated in our survey, of which 25 were excluded due to incomplete data. As a result, data from 924 patients were available for analysis. The EQ-5D-5L demonstrated good internal consistency with a Cronbach's alpha of 0.876 (Supplemental material 8). Table 1 presented characteristics of the patients, stratified by HRQoL. Participants had an average age of 64 years, ranging from 28 to 93 years. Significant differences were observed between subgroups for key variables, such as age, monthly household income, types of public health insurance, types of cancer, cancer stage, time since cancer diagnosis, and comorbid chronic diseases.
Characteristics of patients with cancer.
*In year 2023, 1000 CNY approximately equals to 141.3 US Dollars; HRQoL: health-related quality of life; NA: not applicable; UEBMI: Urban Employee Basic Medical Insurance; URRBMI: Urban and Rural Residents Basic Medical Insurance.
Model performance
Table 2 presents the evaluation results of the model performance across training, validation, and testing datasets. Accuracy measures the proportion of correct predictions made by a model. The test accuracy of the models ranged from 0.71 to 0.82, which indicated a moderate to high level of predictive performance. Notably, the RF and XGB models had the highest accuracy, demonstrating superior predictive power compared to the other models.
Summary of the model performance for predicting HRQoL.
ACC: accuracy; AdaBoost: adaptive boosting; AUC: area under the receiver operating characteristic curve; GBoost: gradient boosting; KNN: K-nearest neighbor; MLP: multilayer perceptron; RF: random forest; SVM: support vector machine; XGB: extreme gradient boosting.
ROC curves for each model, depicted in Figure 1 (panels A, B, and C), visually capture the tradeoff between sensitivity and specificity at various threshold settings. AUC score shows how well a machine learning model can separate different groups. In the validation set, the AUC values of the RF model and XGB model were close to 0.8, which indicated reliable discriminative capabilities.

Performance of the prediction model. (a) ROC curves of the train model, (b) ROC curves of the validation model, (c) ROC curves of the test model, (d) comparison of precision-recall curves on the test set.
The precision-recall curves (Figure 1(d)) further illustrate the models’ balanced performance between precision and recall. The probability distributions of models are shown in Figure 2. Given their consistent high performance across multiple metrics and ability to distinguish positive and negative classes effectively, the RF and XGB models were selected for further analysis in our study. The sensitivity analysis shows that the model performance remained stable (Supplemental material 9) and our main findings are not driven by insurance-related features.

Probability distributions of predication model.
Feature importance
Figure 3(a) displays the relationship between features and their average SHAP values of the XGB model. It indicates the mean absolute SHAP values for each feature across all predictions. Positive SHAP values, indicative of features that enhance HRQoL, were observed in patients with early-stage cancer (stage 0 or 1) and those enrolled in UEBMI. The type of health insurance had a more substantial impact on HRQoL than satisfaction with insurance reimbursement. Conversely, negative SHAP values, which predict a decrease in HRQoL, were associated with longer duration of chronic comorbidities, colorectal cancer, and ongoing chemotherapy. Chronic respiratory diseases also emerged as detrimental to HRQoL. In addition, age, time since cancer diagnosis, and preferences for new drugs exhibited bidirectional impacts on HRQoL.

SHAP analysis of the XGB model for predicting HRQoL. (a) Summary plot for SHAP values. Each point represents a single prediction. The position along the x-axis was the SHAP value, which means the feature's contribution to the prediction. The color gradient (typically from blue to red) stands for the value of the feature. (b) Bar plot of feature importance. The length of each bar corresponds to the average impact of each feature across all predictions. HRQoL: health-related quality of life; SHAP: SHapley Additive exPlanations; XGB: extreme gradient boosting
Figure 3(b) provides an in-depth analysis of feature contributions for each prediction, ranking features by their importance. The most significant predictors of HRQoL included the duration of chronic comorbidities, early-stage cancer (stages 0 and 1), age, UEBMI enrollment, and time since cancer diagnosis.
SHAP analysis of the RF model yielded results highly consistent with the XGB model (Supplemental material 10). Both models identified the similar core predictors like duration of comorbid chronic diseases, age, early-stage cancer, time since cancer diagnosis, and type of insurance. Minor variations in feature ranking are attributable to their differing algorithmic structures (bagging vs. boosting). The consistency confirms the key predictors and their impact on HRQoL are robust across tree-based ensembles.
Based on the SHAP dependence plots for main predictors, age had a curvilinear effect, with HRQoL highest in middle age (Figure 4). A longer duration of chronic diseases showed a strong negative linear association. Time since cancer diagnosis had its strongest negative effect soon after diagnosis, which diminishes over time. Preference for new drugs had a weakly positive association. The SHAP waterfall plot for an individual patient is shown in Supplemental material 11. A long duration of comorbid chronic diseases strongly pulls the prediction toward lower HRQoL, while not being in stages 0 to 1 pushes the prediction toward lower HRQoL.

SHAP dependence plot for key predictors in the model. (a) Age; (b) duration of comorbid chronic diseases; (c) time since cancer diagnosis; and (d) preference for new drugs. Notes: In each panel, the x-axis shows the original value of the predictor and the y-axis its SHAP value. The vertical color bar on the right indicates the values of the interaction feature used to color the points. SHAP: SHapley Additive exPlanations.
Discussion
Our study employs a comprehensive set of machine learning models to predict HRQoL in patients with cancer. This approach enables a robust evaluation of different algorithms and their predictive capabilities. Our findings demonstrated that the RF and XGB models achieved the highest predictive performance among seven machine learning models. The superior performance of RF and XGB models in our study coincides with their advantages in other health-related prediction tasks.37,38 Given the capability to deal with complex feature interactions and resist overfitting,31,39,40 these models are ideally suited for integration into clinical practice to dynamically predict HRQoL.
The SHAP analysis revealed the influence of various features on HRQoL.41,42 Several key predictors were identified, like the duration of chronic comorbidities, early-stage cancer (stage 0 or 1), age, enrollment in UEBMI, and time since cancer diagnosis. Specifically, longer duration of chronic comorbidities had a significant negative impact on HRQoL. This finding is consistent with studies that underline the heavy burden of multimorbidity in cancer care.43,44 In our study, we also observed that chronic respiratory diseases would deteriorate HRQoL of patients with cancer. Patients with chronic respiratory diseases often experience symptoms such as shortness of breath, coughing, and wheezing. Such symptoms can limit physical activity and contribute to feelings of fatigue and anxiety.45–47 Therefore, integrated management strategies for chronic comorbidities are essential to improve HRQoL in patients with cancer.
Our study found that patients with early-stage cancer had significantly higher HRQoL compared to those with more advanced stages. This finding emphasizes the importance of early detection and aligns with China's national cancer control priorities. The Healthy China 2030 blueprint calls on strengthening early screening programs to shift the focus from late-stage treatment to disease prevention and early intervention. Early-stage cancers are generally less aggressive. The treatments are often less invasive and more effective, leading to better HRQoL outcomes. 48 Timely interventions not only improve survival rates but also enhance the overall wellbeing of patients.
Our findings also revealed that patients enrolled in the UEBMI reported higher HRQoL compared to those in the URRBMI. This reveals the system level determinants of patient wellbeing. Variations in insurance coverage and reimbursement policies can affect patient access to cancer care and subsequently HRQoL. The disparity is likely due to the more comprehensive coverage, superior health services, and reduced financial burden provided by UEBMI.37,49 Over the past two decades, China has made significant advancements toward universal health coverage. However, there are still challenges in ensuring equitable access to healthcare service coverage and high-quality care for patients with cancer. 50 Future policy initiatives should concentrate on addressing these disparities to ensure equal access to high-quality cancer care.
We found that age exerts a bidirectional effect on HRQoL, with some older patients maintaining or even achieving higher HRQoL, while others experience a decline. Although older patients with cancer might have better HRQoL in terms of social functioning, role performance, and financial stability, they often face problems in physical and cognitive functioning. 51 Notably, older adults undergoing curative treatments could have comparable or superior HRQoL to their younger counterparts. 52 Similar variability in our study was observed regarding duration since cancer diagnosis and patient preferences for new drugs. Care plans can be adjusted according to regular HRQoL assessments. Staying responsive to patient needs can ensure that interventions remain effective and patient-centered.
The findings of our study hold implications for oncology care, emphasizing the need for personalized care plans for patients with cancer, especially those with chronic comorbidities. The care team are well-positioned to identify patients at risk of lower HRQoL based on factors such as the duration of chronic conditions, advanced cancer stages, and age-related physical or cognitive problems.
To enhance patient outcomes, healthcare providers can promote regular cancer screenings and educate patients on the benefits of early diagnosis. 53 They can also coordinate care for patients with multiple chronic conditions. These approaches operationalize the integrated service delivery model central to the Universal Health Coverage, ensuring comprehensively treatment plans that could manage cancer and comorbidities effectively.
Age-appropriate interventions, such as geriatric assessments and supportive care, are crucial for meeting the unique needs of older patients with cancer. Moreover, by providing clear information and emotional support, they can help patients make informed decisions that align with their preferences, ultimately improving their overall wellbeing. 54
Our study has several notable strengths that enhance its validity and applicability. First, we reveal the potential of machine learning, a powerful digital health tool, to play a key role in personalizing oncology care. We are one of the first to apply machine learning models to predict HRQoL in patients with cancer using a comprehensive set of patient-centered features. Multiple machine learning models (RF, XGB, etc.) allow for more precise identification of patients at risk of lower HRQoL, enabling targeted interventions.
Second, our study uniquely examines a wide range of patient-centered variables. This approach provides understanding of factors influencing HRQoL, which has not been extensively explored in previous research. By identifying these variables as key predictors of HRQoL, our study generates new evidence for developing personalized care plans that address the multifaceted needs of patients with cancer. Recent study also showed that patients’ views could affect how they accept digital decision-support tools. 55 Therefore, patient-centered insights should be integrated in future model development and implementation strategies.
Third, we found the differential impact of different health insurance programs on HRQoL. This novel result informs policymakers of the critical role of equitable insurance reimbursement policies in mitigating the financial strain on patients with cancer, thereby enhancing their HRQoL.
Fourth, our study demonstrates the bidirectional effect of age on HRQoL. By highlighting age-related challenges in physical and cognitive functioning, our study supports the development of age-appropriate interventions and geriatric assessments, thus enhancing the quality of care for older patients with cancer.
Finally, we utilized the EQ-5D-5L instrument, a validated and well-known tool, to assess HRQoL. The generic EQ-5D questionnaire was the most widely used instrument in cancer research,56–58 thus ensuring the credibility and applicability of our findings.
Despite notable strengths, our study has several limitations. First, the cross-sectional design, while common and widely accepted in health prediction research, restricts the ability to establish causality between predictors and HRQoL. To address this, future research should use longitudinal designs to explore the causal relationships over time. Second, the model was developed and validated on data from specific regions. The lack of external validation on independent, multicenter datasets or via a temporal split limits the assessment of model generalizability. Third, the recruitment of participants from two provinces may introduce potential selection bias due to differences in socioeconomic characteristics or patient populations. Future research should enroll participants from broader geographic regions to mitigate this limitation. Fourth, the dichotomized HRQoL outcome was used in our study. Although dichotomization facilitates decision making, it represents a methodological compromise. Continuous regression would increase statistical power and yield more precise estimates. Finally, like other survey-based HRQoL assessments, our findings may be susceptible to self-report bias.
Conclusions
In conclusion, our study demonstrates the potential of machine learning models, particularly RF and XGB, to predict HRQoL in patients with cancer using patient-centered features. Key factors were identified for improving HRQoL, such as chronic comorbidity management, early-stage cancer diagnosis, and equitable health insurance coverage. Our findings offer valuable suggestions for digital health and oncology care. Healthcare providers can refer to these insights to develop personalized care plans, promote early cancer detection, advocate for equitable health insurance coverage, manage chronic comorbidities, alleviate age-related functional declines, and continuously monitor HRQoL.
To translate our findings into practice, we propose developing a clinical decision support tool such as a web-based risk calculator to provide real-time HRQoL estimates. By integrating electronic health record data, the tool would enable routine HRQoL collection and provide clear risk profiles for a patient's HRQoL. Together, these efforts will improve patient outcomes and contribute to a more patient-centered healthcare system.
Supplemental Material
sj-pdf-1-dhj-10.1177_20552076261430462 - Supplemental material for Predicting health-related quality of life in patients with cancer using machine learning: A step toward personalized oncology care
Supplemental material, sj-pdf-1-dhj-10.1177_20552076261430462 for Predicting health-related quality of life in patients with cancer using machine learning: A step toward personalized oncology care by Jingyu Chen, Jiaran Chen, Ruiting Shen, Shuchen Ji, Guohua Wang, Xingyun Geng and Jinsong Geng in DIGITAL HEALTH
Footnotes
Acknowledgements
We acknowledge the contributions made by our interviewers who did one-on-one, face-to-face interviews with the patients. We are grateful to the patients for their efforts and time. When designing this study, Jinsong Geng was a research fellow at the Fellowship in Health Policy and Insurance Research, Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Healthcare Institute.
Ethics considerations
Our study, including the patient consent process, has been approved by the Medical Ethics Committee at Nantong University (Ethical Approval-2021069, Ethical Approval-2021070) and conforms to the ethical guidelines of the Declaration of Helsinki.
Consent to participate
Informed, written consent was obtained from all individual participants in the study.
Consent for publication
Not applicable.
Author contributions
GJ and GX led the study design. CJR contributed to the literature search and qualitative analysis. GJ and CJY took part in the design of the questionnaire and the interpretation of the data. GJ, JS, and WG contributed to implementing the survey. CJY, SR, and GX performed the statistical analysis. CJY wrote the manuscript, and prepared tables and figures. GJ and GX provided comments on the manuscript. All authors reviewed the manuscript.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the National Natural Science Foundation of China (grant no.72374113), and Postgraduate Research and Practice Innovation Program of Jiangsu Province (KYCX25_3783, KYCX24-3568). The funders provided financial support for the conduct of the study. The funders had no role in the design, implementation, data collection and statistical analysis, data interpretation, or writing of the manuscript.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data availability
Original datasets will be available upon reasonable request to the corresponding author.
Guarantor
Jinsong Geng.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
