Abstract
Health care no longer focuses solely on patients and increasingly emphasizes regions and their populations. Strategies, such as population management (PM) initiatives, aim to improve population health and well-being by redesigning health care and community services. Hence, insight into population health is needed to tailor interventions and evaluate their effects. This study aims to assess whether population health differs between initiatives and to what extent demographic, personal, and lifestyle factors affect these differences. A population health survey that included the Short Form 12 version 2 (SF12, physical and mental health status), Patient Activation Measure 13 (PAM13), and demographic, personal, and lifestyle factors was administered in 9 Dutch PM initiatives. Potential confounders were determined by comparing these factors between PM initiatives using analyses of variance and chi-square tests. The influence of these potential confounders on the health outcomes was studied using multivariate linear regression. Age, education, origin, employment, body mass index, and smoking were identified as potential confounders for differences found between the 9 PM initiatives. Each had a noteworthy influence on all of the instruments' scores. Not all health differences between PM initiatives were explained, as the SF12 outcomes still differed between PM initiatives once corrected. For the PAM13, the differences were no longer significant. Demographic and lifestyle factors should be included in the evaluation of PM initiatives and population health differences found can be used to tailor initiatives. Other factors beyond health care (eg, air quality) should be considered to further refine the tailoring and evaluation of PM initiatives.
Background
T
Population health as a concept has been widely discussed 6,7 but, despite the definition by Kindig and Stoddart, 8 no consensus exists regarding the meaning of the term. For the purpose of this study, population health is considered to be the health of an entire population in 1 geographic area. Within this understanding of population health, many factors are thought to affect (population) health. 9 –11 A population's education level, for example, has an association with health, but also affects health indirectly through its impact on health behaviors and use of preventive services. 12 Other characteristics that can influence health include sex, age, and ethnicity. 13 Those who wish to assess health in PM initiatives or evaluate the effectiveness of their interventions should consider such variables. 4,14 Health assessments help design interventions for PM initiatives by identifying priority areas and, from an evaluation point of view, controlling for these variables will provide baseline measurements for population health that are more comparable between different initiatives (overt bias). 15
In the Netherlands, the National Monitor Population Management (NMP) was created to gather knowledge regarding the experiences of stakeholders within 9 PM initiatives as well as their performance.
16
These so-called pioneer sites each focus on a defined regional population and aim to achieve the Triple Aim. Even though their goals are similar, they are locally introduced initiatives that operate independent of each other. In most, local municipalities, hospitals, general practitioners, and insurance companies are involved on the board, supplemented by more intervention-specific stakeholders (eg, pharmacies, research institutes). Most of these interventions have the goal of improving prevention services and reducing secondary care use, but each of the pioneer sites has its own distinct set of interventions. More detailed information can be found in online Supplementary Data S1 (Supplementary Data are available online at
In order to gain a better understanding of how to evaluate and tailor PM initiatives, this study aims to assess whether population health differs between initiatives and to what extent this relationship is affected by demographic and lifestyle factors.
Methods
Ethics approval and consent to participate
The Psychological Ethics Committee of the Tilburg University (Psychological Ethical Testing Committee number: EC-2014.39) approved this study.
Study population
The study population consisted of citizens living in 9 Dutch pioneer sites. The NMP provided the baseline data used for analysis, which were gathered between December of 2014 and January of 2015. A survey was sent out in each site to a random sample of 600 insured adults (≥18 years old) who, within the previous year, had not received any surveys from insurance companies and remained registered with the same general physician. Participants were invited by mail to complete the survey, which could be done online or on paper. Participants who did not respond were sent 2 reminders. 17
Survey instruments
Demographic, personal, and lifestyle factors
The survey included demographic characteristics (sex, age, educational level [high/low]), employment, and country of origin [native/non-native]), as well as personal and lifestyle characteristics. These included levels of disability (100% disabled), alcohol use (glasses per week), and smoking status (yes/no). Furthermore, BMI was calculated using the reported height and weight, and health literacy was assessed using Chew et al's Set of Brief Screening Questions. 18
SF-12
The SF-12, a globally used instrument, assesses generic health status using 12 questions that produce a physical component score (PCS) and a mental component score (MCS). 19 The Dutch version was used for this study. 20 Scoring was done using the proprietary software provided by Qualimetrics Inc. (Sacramento, CA), which associates set weights with questions' scores that are converted to a 0–100 scale for both the PCS and MCS. In this scale a higher score means better (physical or mental) health. 19
PAM13
The PAM13 is a scale that is based on a developmental model of activation 21 and can be used to assess self-reported knowledge, motivation, and health management skills. 22 The questions' scores are summed and converted to a 0–100 scale; a higher score is positively associated with related health behaviors. 23 The Dutch version was used. 23
Analysis
All analyses were performed using SPSS 22 (IBM Corporation, Armonk, NY) and R Studio Version 0.99.441 (RStudio Inc., Boston, MA) for Windows.
First, responses to the SF12 and PAM13 were analyzed. Participants who provided a complete data set per instrument were compared regarding demographic, personal, and lifestyle factors with participants having 1 or more missing values for that instrument. If these groups differed, then it could be assumed that missing values were not Missing-Completely-At-Random and therefore would need to be imputed. MICE (multiple imputation by chained equation) would be used to impute the data 24 and further analyses would be performed on both complete cases and imputed data sets. The methodology is discussed in a previous and related article. 25
Second, an overview of demographic, personal, and lifestyle factors (independent variables; eg, age, education, sex, origin, employment, alcohol use) was created for all 9 pioneer sites. This overview also included the Pearson chi-square test for dichotomous variables and analysis of variance (ANOVA) for continuous variables to determine whether there were significant differences between the 9 pioneer sites. Factors with a P value ≤0.10 were considered potential confounders. 26
Third, separate univariate analyses were conducted for each outcome (SF12-PCS, SF12-MCS and PAM13) using regression analyses. The PM initiatives were coded as dummy variables, with one of the pioneer sites (Blauwe Zorg) as the reference group. Model fit was assessed using the F-ratio and a P value of ≤0.05 was considered significant in all regression analyses. 27 Then, the influence of each individual potential confounder was assessed using multivariate linear regression analyses. A variable was defined as a confounder when the average change in the dummy variables compared to the univariate model was more than 10% after adding that particular variable. 26 At each step, starting with the univariate model, the variable that produced the largest average change in the beta of dummy variables (and was more than 10%) was then added to the model. The remaining variables were then added step by step to this model. This process was repeated until there were no variables left. Finally, for comparison, a multiple regression analysis was executed with all potential confounders.
Results
The number of total citizens within the pioneer sites ranged from 106,270 to 646,910 (Table 1). A total of 5400 surveys were sent out, 2491 of which were completed and returned. The response analyses showed that groups with no missing values differed significantly from participants with 1 or more missing values on at least 1 demographic variable for all instruments (see online Supplementary Data S2). This indicates that missing values were not Missing-Completely-At-Random, warranting the use of multiple imputation. Subsequent analyses were therefore performed on both complete cases and imputed data sets.
ANOVA, analysis of variance; BMI, body mass index; GZGR, Gezonde Zorg, Gezonde Regio; SF12-MCS, Short Form 12 version 2 Mental Component Score; SF12-PCS, Short Form 12 version 2 Physical Component Score; PAM13, Patient Activation Measure 13; SmZ, Slimmer met Zorg; SSiZ, Samen Sterk in Zorg.
The study sample was 46.1% male, consisted of mostly Dutch natives, and was on average 55.7 years old. Between PM initiatives, there were significant differences in age, education, origin, employment, BMI, and smoking, identifying these factors as potential confounders. Overall, Mijn Zorg was the only initiative that showed consistently below-average scores on all variables. Most PM initiatives scored better on one and worse on other variables. For example, Gezonde Zorg, Gezonde Regio had the highest score on education level, but also had a higher than average unemployment rate. As expected, descriptive results from the imputed data were comparable (see online Supplementary Data S3).
An overall ANOVA showed that outcomes of the SF12-PCS, SF12-MCS, and PAM13 differed significantly between PM initiatives before adding confounders to the model. The more detailed univariate regression model showed that for each instrument, there were 1 or 2 PM initiatives that differed significantly from the reference group (Table 2). For the SF12-PCS, Mijn Zorg was the only significantly lower deviation, while Friesland Voorop and Vitaal Vechtdal scored significantly higher for the SF12-MCS. Vitaal Vechtdal scored significantly lower on the PAM13 as well, which also was the case with the imputed data (see online Supplementary Data S3).
GZGR, Gezonde Zorg, Gezonde Regio; SF12-PCS, Short Form 12 – Physical Component Score; SF12-MCS, Short Form 12 – Mental Component Score; PAM13, Patient Activation Measure 13; SmZ, Slimmer met Zorg; SSiZ, Samen Sterk in Zorg.
P ≤ 0.05
Per instrument, the aforementioned potential confounders were entered in the regression model individually. Extended results of these regression analyses can be seen in an additional file (Supplementary Data S4). Each characteristic had a noteworthy (>10%) influence on each of the instruments and was included in the final analyses. For the SF12-PCS, the largest confounders were age, BMI, and employment. Age also was an important influencer in the SF12-MCS, in addition to origin and smoking status. PAM13 was influenced mainly by education, employment, and smoking status. Somewhat different results were found in the imputed data for PAM13 (Supplementary Data S4). Here, education and employment still had the largest effect on PAM13 scores, but smoking status was no longer a confounder. Origin also was not a confounder, although age and BMI still influenced outcomes.
Table 2 shows the results of the (multivariate) linear regression models including all confounders, each showing a significant F-ratio indicating sufficient model fit. For the SF12-PCS, the Table shows that all PM initiatives had noteworthy changes (≥10%), but no differences changed from significant to not significant or vice versa. Mijn Zorg remained the only significant deviation. In mental health (SF12-MCS), Vitaal Vechtdal initially scored better. However, this difference was nuanced when results were controlled for specific population factors. Friesland Voorop, like Vitaal Vechtdal, also scored better on the SF12-MCS, but the difference remained roughly the same after controlling for confounders. For the PAM13, Vitaal Vechtdal was the only significantly different score before controlling for confounders. After controlling for confounders, the difference was no longer significant, resulting in no significant differences between PM initiatives on PAM13. This result also was seen in the imputed data (see online Supplementary Data S3).
Discussion
This study compared health within 9 Dutch PM initiatives using the SF12-PCS, SF12-MCS, and PAM13 instruments as outcomes and examined to what extent these are affected by demographic, personal, and lifestyle factors. The intention was to provide the needed insight to improve evaluation and tailor the interventions of these initiatives. The included physical (SF12-PCS), mental (SF12-MCS), and self-management (PAM13) constructs showed differences between initiatives before controlling for any population factors. This was mainly because of 1 or 2 outlying PM initiatives. After controlling for confounding factors, differences between PM initiatives were nuanced, but the SF12-PCS and the SF12-MCS still showed significant differences. For PAM13, the only significant difference from the reference group became insignificant. Age and origin as well as education, employment, and smoking had a large influence on differences between initiatives.
The effects of demographic, personal, and lifestyle factors were in line with expectations, as this effect has been seen for various instruments at the individual level, 23,29,30 as well as the population level. 31,32 Nonetheless, although adjusted results provide a clearer image for evaluation purposes, as improving health should be the end goal, unadjusted differences should not be disregarded. The impact variables have on differences in health can be used to tailor interventions for specific populations. 33 The characteristics with the largest impact on outcomes were age, origin, education, employment, and smoking. A number of these factors, including health literacy, BMI, smoking, and alcohol use, can be addressed by (health) interventions. Smoking behavior, for example, can be addressed by implementing smoking bans in schools. 34 In this manner, PM initiatives can focus interventions on variables that are shown to affect the health of their population.
Some reservations must be considered when interpreting these results. Data were collected at a single point in time and it would be of interest to compare changes over time to see the impact of interventions in a particular region. Furthermore, the provided recommended proprietary software was used to calculate the SF12-MCS and SF12-PCS, ensuring the correct calculation of both SF12 component scores. Unfortunately, even though the use of this software is recommended, it prevented the use of imputed data to calculate the sum scores. Results from PAM13 imputed data showed that differences, albeit small, could occur. Response rates in most pioneer sites were ∼45%, which sufficed and is comparable to other surveys, 35 but because of limited information regarding the sample population, it was not possible to assess selection bias in depth.
Finally, even though the confounders studied did explain some differences, not all of the variation in population health between PM initiatives was explained. For example, Friesland Voorop and Vitaal Vechtdal still showed significantly higher scores for the SF12-MCS than the reference group after controlling for confounders. Whether these differences are clinically relevant is difficult to establish. For example, for the SF12, studies consider a range of relevant differences, ranging from 2.5 points and higher. 36,37 This would mean that the differences in this study are not clinically relevant, but it is debatable whether such a hard value can be set for populations. From an evaluation perspective, a baseline measurement should be as equal as possible, as differences after controlling for confounders might indicate the presence of hidden bias. 38 Efforts, such as the research into the “Limburg-factor,” 39 should be made to seek out hidden bias. This can be done, for instance, by examining the literature more deeply, integrating qualitative analyses, or by looking for influencers elsewhere (eg, environmental factors such as air quality and the availability of green space) 40,41 However, many of these factors are currently seen as beyond the scope of responsibility of health care, while having a considerable impact on population health. This emphasizes the importance of wider integration of care domains by PM initiatives to ensure complete coverage of health.
Conclusion
At first glance, the 9 populations compared in this study differed in physical health (SF12-PCS), mental health (SF12-MCS), and self-management capacity (PAM13). However, once the identified confounders were included in the model, these differences became smaller and were no longer significant for self-management capacity. The impact the current confounders have on results can be used to guide future evaluations and tailoring of PM initiatives. Unexplained differences in health between PM initiatives require further investigation.
Footnotes
Author Disclosure Statement
The authors declare that there are no conflicts of interest. The authors received the following financial support: This study was funded under SPR project S/133002 of The National Institute of Public Health and the Environment in the Netherlands. The funder had no role in the design of the study, collection, analysis and interpretation of the data, and writing of the manuscript.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
