Abstract
Epidemiological studies frequently utilize hierarchical data structures, such as regional variations. Central obesity, commonly measured by waist circumference, is a key predictor of cardiovascular disease, one of the leading global causes of mortality. Intra-cluster correlations arising from lifestyle disparities and unequal access to healthy resources necessitate statistical models that account for both between- and within-cluster variability. Linear Mixed-Effects Models are often used for this purpose, but they may fall short in capturing nonlinearities and complex interactions among predictors. To address these limitations, tree-based extensions such as the Mixed-Effects Regression Tree and Mixed-Effects Random Forest have been introduced. MERF integrates Random Forests into the mixed-effects framework to enhance flexibility and predictive power. This study evaluates and compares the performance of LMM, MERT, and MERF in modeling waist circumference as a proxy for central obesity, using Indonesia's 2018 Basic Health Survey for West Java Province. Two modeling strategies were applied: one using full province-wide data, and another stratifying the regions into three risk categories (high, moderate, and low) based on proximity to the Special Capital Region of Jakarta. Moreover, separate models were developed for each gender to examine any difference in the contributing or influential variables for waist circumference between males and females. All models included regency/city as a random effect. Results show that MERF outperforms LMM in predictive accuracy and performs comparably to MERT, highlighting its potential for improving the modeling of clustered health survey data. Linear-based approaches, such as LMM and simple tree-based models, specifically exhibited better predictive performance than MERF for the male model. The findings support the use of mixed-effects machine learning approaches in enhancing the quality of official health statistics and informing targeted public health interventions.
Keywords
Get full access to this article
View all access options for this article.
