Abstract
We compare multiple machine learning algorithms and develop models to predict future hospitalization among Home- and Community-Based Services (HCBS) Users. Furthermore, we calculate feature importance, the score of input variables based on their importance to predict the outcome, to identify the most relevant variables to predict hospitalization. We use the 2012 national Medicaid Analytic eXtract data and Medicare Provider Analysis and Review data. Predicting any hospitalization, Random Forest appears to be the most robust approach, though XGBoost achieved similar predictive performance. While the importance of features varies by algorithm, chronic conditions, previous hospitalizations, as well as use of services for ambulance, personal care, and durable medical equipment were generally found to be important predictors of hospitalization. Utilizing prediction models to identify those who are prone to hospitalization could be useful in developing early interventions to improve outcomes among HCBS users.
Get full access to this article
View all access options for this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
