Abstract
In this article, relationship between respondents’ height and occurrence of diabetes has been investigated. This study uses Bangladesh Demographic and Health Survey (BDHS) 2011 data collected from an observational study. Considering height (tall/normal/short) based on percentiles separately for men and women, logistic regression model was fitted to the propensity score (PS)-adjusted weighted data. No significant relationship between respondents’ height and diabetes was observed. We also found that the occurrence of diabetes significantly varies with respect to sex, education level, wealth index, body mass index (BMI), and region/division. As, in general, women are shorter than men by nature, we strongly argue that height categories should be defined separately whenever estimation of the effect of height on some response is of interest.
Keywords
Introduction
Controlling non-communicable diseases (NCDs) is one of the major health and development challenges of the 21st century, particularly in the low- and middle-income countries. 1 The burden of NCDs on social, economic and public health is becoming devastating. Among all deaths due to NCDs, 4% of those are attributable to diabetes. 1 Although this death rate is low as compared with the deaths due to other NCDs, the prevalence of diabetes is increasing globally, particularly in the low- and middle-income countries. 1 In total, 18.8% of the total 382 million patients with diabetes are from South-East Asia (SEA) region, and International Diabetic Federation (IDF) projects about 70% increase in the prevalence in SEA region by 2035. 2 The prevalence of adult diabetes in Bangladesh is 9.4% in 2014. 1
Although diabetes is not as fatal as other NCDs, it can go unnoticed and undiagnosed for years and increase the risk of developing cardiovascular disease, kidney failure, blindness, and lower-limb amputation. It is alarming that out of 382 million diabetes cases, 46% people are undiagnosed and progressing toward complications unaware. 2 Patients with type 1 diabetes and gestational diabetes (appears during pregnancy) need insulin therapy to survive. Therefore, this 46% undiagnosed patients with diabetes are mainly type 2 diabetic. If the risk factors of type 2 diabetes can be identified and controlled, burden of diabetes as well as other life-threatening diseases caused by diabetes can be reduced.
Recent studies have investigated whether height can be thought of as a potential determinant for the occurrence of diabetes.3-9 These studies, however, concluded with different contradictory evidences. An inverse association between height and diabetes was found in the literature,5,7-9 whereas a positive association was reported in Wang et al. 6 However, sex-wise, an inverse association, ie, among women 5 and among men, 10 was also claimed. In contrast, the study in Lorenzo et al 11 could not establish any significant relationship between height and the occurrence of diabetes. These varying conclusions are mainly due to 2 reasons: different data from different geographical areas and use of different statistical models. Because of varying conclusion, the effect of height on diabetes is still a mystery.
Note that most of the above-mentioned findings are based on different observational studies. Drawing conclusion based on any observational study requires careful attention. This is because, unlike experimental study, observational study lacks randomization, and hence, bias naturally arises due to strong association among treatment (such as height) and other covariates while investigating treatment-outcome (such as occurrence of diabetes) association. For example, when association is examined between individuals’ height (short/normal/tall) and occurrence of diabetes, a strong association between height and sex puts a question mark about the association observed between height and diabetes. Under observational study design, propensity score (PS) analysis is a useful and widely used tool for finding association between treatment and outcome when association exists among treatment and other background characteristics. This article aims to use PS analysis to investigate the association between height and diabetes and to determine potential determinants of type 2 diabetes.
Methods
Randomized controlled trials (RCTs) are considered to be the ideal setup for estimating treatment effects. However, because of the complexity involved with such a design, observational studies are common in practice, where the experimenter has no control either on the background characteristics or on the random allocation of treatments. Under this setup, subjects choose their own treatments, and the background characteristics of subjects may be different among the treated and untreated groups. Therefore, use of observational studies to estimate causal treatment effect is not feasible unless confounding factors are controlled, at least to some desired level.
A very practical and widely used tool, known as PS, for the adjustment of differences in background characteristics was first proposed in Rosenbaum and Rubin.12–14 PS is a balancing score, which is in fact the probability of treatment assignment conditional on observed background characteristics. Conditional on the PS, the distribution of observed baseline covariates is similar between treated and untreated subjects. The PS allows one to design and analyze an observational study so that it mimics some of the particular characteristics of an RCT.15,16
Four different PS methods are available in the literature: (1) Matching, (2) Stratification, (3) Weighting, and (4) Covariate adjustment.12,13,15 In this article, we use PS-based weighting to determine the effect of individuals’ height on the occurrence of diabetes. This will also allow us to estimate the effects of other factors on the occurrence of diabetes. Although we will be using 3 treatments, ie, 3 height categories, for ease of understanding, we provide an algorithm on PS-based weighting method for 1 treatment case.
Let
Fit a binary logit model with response variable,
Estimate the PS, ie,
For ith subject, calculate the weight,
Weight the data using
When balance is achieved, that is, treatment groups
Note that for multiple treatment setting, instead of binary logit model, a multinomial logit model 19 may be used, and weights may be similarly calculated as in Sicree et al. 3 For details on weighting for multiple treatment case, see the literature.20,21
Data and variables
This study considers a secondary data set, namely, Bangladesh Demographic and Health Survey (BDHS) 2011. It is an observational and nationally representative cross-sectional survey conducted by the collaborative effort of the National Institute of Population Research and Training (Bangladesh), ICF international (USA), and Mitra and Associates (Bangladesh). By taking a 2-stage stratified cluster sampling of households, the BDHS 2011 survey got a random sample of total 83 731 household members from 17 141 households. Note that although data from BDHS 2014 are now available, the information on biomarker indices such as systolic blood pressure (SBP), fasting plasma glucose (FPG), and height and weight measurements are only available in BDHS 2011 survey data. In 2011 survey, biomarker information was collected from a subsample of 8835 individuals. There were 3831 women and 3734 men of age 35 years or older those who were participated for FPG measurement.
To determine the true association between diabetes status and stature of individual, we classify our main response variable into 2 categories based on FPG value. If a participant having FPG value greater than 6.1 mmol/L, then he or she will be considered as a diabetic patient, 22 otherwise he or she will be considered as non-diabetic patient. We consider respondents’ height as the treatment variable and sex (men and women), education level (no education, primary, secondary, and above secondary), wealth index (poor, middle, and rich) based on percentiles of wealth scores, place of residence (urban and rural), division (Barisal, Chittagong, Dhaka, Khulna, Rajshahi, and Rangpurand Sylhet), body mass index (BMI) category (thin [BMI < 18.5], normal [BMI 18.5-24.9], and overweight [BMI > 24.9]) 8 as potential confounders.
Unlike Hoque et al,
8
we consider height with 3 categories. Let
Results
Among the 7565 participants, men and women are almost equal in numbers. It is interesting to see that as education level increases, the percentages of individual decreases. We also found that most of the individuals (67.1%) are from rural area. Dhaka division (17.4%) contains most of the participants, whereas Barisal division (11.4%) contains least participants. More than half of the participants (57.45%) belong to the normal category of BMI. Around 33.3% of participants belong to the each wealth index category because this variable is categorized based on terciles. Moreover, the average height of the participants is noticed to be 1.57 m along with 0.091 m of standard deviation. Respondent’s minimum and maximum heights are reported 1.09 and 1.87 m, respectively. Prevalence of diabetes is 33.3%.
We first fit a binary logit model with diabetes status (diabetic/non-diabetic) as response variable and height, education level, wealth index, place of residence, division, and BMI as covariates. The results are displayed in Table 1. It was found that “Tall” individuals have significantly smaller odds of being diabetic as compared with those “Normal”
Adjusted effects of the explanatory variables on the occurrence of diabetes obtained from logistic regression model.
Abbreviation: CI, confidence interval; ref., reference group; BMI, body mass index.
Next, we try to balance the data using PS-based weights. To do so, we fit a multinomial logit model with height as response and sex, education level, wealth index, place of residence, division and BMI as covariates and calculate weights,
We feel that if a relationship between height and diabetes status is of interest, then height categories must be defined separately for men and women. Accordingly, we define height variable in the similar fashion based on percentiles, but separately for men and women. As a result, height is not affected by sex; however, height is still associated with other background characteristic. See Supplementary Table 3 for bivariate results based on separate cut-offs.
We next weight the data using PSs based on separate cut-offs and check whether balance is achieved. The bivariate results under separate cut-offs for the weighted data are shown in Table 2. It is clear that the data are now well balanced, all the selected background characteristics are apparently independent with the new height based on separate cut-offs. As balance is properly achieved, an association between height and occurrence of diabetes may now be investigated.
Assessment of covariate balance for separate cut-off height based weighted data.
Abbreviation: BMI, body mass index.
Result of a logistic regression model for PS-based weighted (separate cut-offs) data is displayed in Table 3. It is observed that height is not significantly associated with the occurrence of diabetes
Propensity score–adjusted effects of covariates on diabetes status obtained from logistic regression model.
Abbreviation: CI, confidence interval; ref., reference group; BMI, body mass index.
Discussion and Conclusions
The aim of this study is to inspect the association between individuals’ height and occurrence of diabetes. To do so, we use BDHS 2011 data, which is a nationally representative observational study. We are motivated by the fact that there is no clear evidence in the debate of whether height can be thought of as one of the determinants for occurrence of diabetes. We have noticed that earlier researches considered “height” categories based on percentiles of the full sample. We have shown that with such definition of “height” categories, the data become too much imbalanced (Among women, 70% are short and 0.3% are tall), because women are naturally shorter than men, in general. We, therefore, defined “height” in terms of percentiles, but within men and women separately, so that approximately middle 50% are “normal,” lower 25% are “short,” and upper 25% are “tall,” within both men and women. We then conducted a bivariate analysis that showed that although “sex” has no influence on “height” now, height categories still significantly vary with other confounders such as education level or wealth index. To compensate the confounding effect and determine true status of relationship between height and diabetes, we then analyze weighted data, where weights are calculated based on PSs. We did not find any significant relationship between respondents’ height and diabetes. However, we found that among women, in high education group, among wealthy people, among those with
Apart from the ongoing campaigns about the consequences and its prevention of diabetes, special attention should be addressed to the women and to the wealthy people. We have evidence that educated people are wealthier (
Supplemental Material
Supplemental_Material – Supplemental material for Exploring Association Between Individuals’ Stature and Type 2 Diabetes Status: Propensity Score Analysis
Supplemental material, Supplemental_Material for Exploring Association Between Individuals’ Stature and Type 2 Diabetes Status: Propensity Score Analysis by Ashis Talukder, Najiba Akter and Taslim Sazzad Mallick in Environmental Health Insights
Footnotes
Acknowledgements
The authors thank the National Institute of Population Research and Training (NIPORT) (or USAID), Bangladesh, for allowing to use BDHS 2011 data for analysis. The authors thank Professor Wasimul Bari, University of Dhaka, for his advice at different stages of this study.
Author Contributions
AT had the original idea for the study. AT and NA participated in the statistical analysis and helped to draft the manuscript. TSM handled the supervision and prepared the manuscript. All authors read and approved the submitted manuscript.
Declaration of conflicting interests:
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding:
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Availability of Data and Materials
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
