Abstract
The study objective was to (1) develop a statistical model that creates a novel patient engagement score (PES) from electronic medical records (EMR) and health claim data, and (2) validate this developed score using health-related outcomes and charges of patients with multiple chronic conditions (MCCs). This study used 2014-16 EMR and health claim data of patients with MCCs from Sanford Health. Patient engagement score was created based on selected patients’ engagement behaviors using Gaussian finite mixture model. The PES was validated using multiple logistic and linear regression analyses to examine the associations between the PES and health-related outcomes, and hospital charges, respectively. Patient engagement score was generated from 5095 patient records and included low, medium, and high levels of patient engagement. The PES was a significant predictor for low-density lipoprotein, emergency department visit, hemoglobin A1c, estimated glomerular filtration rate, hospitalization, and hospital charge. The PES derived from patient behaviors recorded in EMR and health claim data can potentially serve as a patient engagement measure. Further study is needed to refine and validate the newly developed score.
Keywords
Introduction
Controlling health care costs and preventing adverse health outcomes for patients with multiple (2 or more) chronic conditions (MCCs) continue to be a challenge since the management of MCCs are highly influenced by patient behaviors. Recently, patient engagement has become a major focus of health care reform as it has been shown to improve health outcomes and lower health care cost (1 –7). Patient activation measure (PAM) is a validated instrument that has been widely used to measure the level of patient engagement in real-world practice. Higher PAM score indicates greater activation and PAM segments patients into 4 activation levels, including (1) the level that patients are passive and lack confidence, (2) the level that patients become aware, but they are still struggling, (3) the level that patients take action, and (4) the level that patients can maintain behaviors (8). However, routinely assessing PAM is challenging since it requires patients to spend time and respond to the questions (9). In addition, while the level of patient engagement could change over time, especially for chronic conditions (9).
Electronic medical record (EMR) and health claim data have been widely used for several purposes in the US health care. Some EMR and claim data reflect patient engagement behaviors in various health care activities, for example, immunizations, medication refills, and wellness exams. This study intended to explore the use of these behaviors available in the EMR and health claims from a large regional health care provider to determine the level of patient engagement. This might enable health care providers to routinely capture the level of patient engagement without burden to patients. Thus, the objectives of this study were to develop a statistical model that generated a novel patient engagement score (PES) from EMR and health claims of patients with MCCs, and to validate this developed score using health-related outcomes and charges.
Methods
We conducted a cross-sectional study since only the most recent records of the majority of data elements were available. Figure 1 shows the conceptual framework of this study. We built the framework from Andersen’s behavioral model of health service utilization (10 –13). The Andersen’s model suggests that predisposing, enabling, and need factors could influence health-related outcomes (11). Based on various studies (1 –7,14 –16) indicating that patient engagement improved health-related outcomes and lowered health care cost, patient engagement was included in the study framework. Also, previous studies reported that behavioral measures, for example, adherence to medication and self-care behavior, were linked to a high level of patient activation (17 –19). This study proposed to construct a patient engagement measure from these behaviors.

Conceptual framework adapted from Andersen’s model.
Data Collection
This study obtained data through the Sanford Data Collaborative. The data comprised EMR data from Sanford Health, including 44 hospitals, as well as claim data over a 3-year (calendar years 2014-2016) period. All data were de-identified. The data elements included International Classification of Diseases, Tenth Revision, age, gender, race, ethnicity, marital status, primary insurance, primary care provider, tobacco use, smokeless cigarette use, alcohol use, body mass index (BMI), PAM score, number of immunizations, number of wellness exams, number of chronic conditions, systolic blood pressure (SBP), diastolic blood pressure (DBP), high-density lipoprotein (HDL), low-density lipoprotein (LDL), hemoglobin A1c, number of emergency department (ED) visits, number of hospitalizations, estimated glomerular filtration rate (eGFR), number of missing scheduled visits, clinic charges, and hospital charges. Only cumulative values over the 3-year period were available for the number of immunizations, wellness exams, ED visits, hospitalizations, and missing scheduled visits. Late medication refills were determined from 3 variables, including medication name, the number of day supply, and prescription date, using Sanford Health claim data. We assumed that the late medication refill occurred when the number of days supply was shorter than the number of days between 2 consecutive prescription refill dates. Then, the proportion of late refills was calculated for each patient. This study included patients who were aged at least 18 years old and had MCCs with one of the following primary diagnoses: hypertension, hyperlipidemia, ischemic heart disease, heart failure, diabetes, and chronic kidney disease; since these diseases were among highest chronic condition rates with highest rates of MCCs. This resulted in a total of 147 687 individual patient records.
Data Analysis
We conducted descriptive analyses of all data elements for both patients with PAM and PES scores. To explore the potential use of the EMR and health claim data to determine the level of patient engagement, 3 behavioral measures, including the proportion of late medication refills, the number of missing scheduled visits, and the number of immunizations over the 3-year period were purposely selected, since they likely reflected how much patients engaged in their health care. We then developed the PES score by using data from 5095 patients with complete records of these 3 behavioral measures. A finite mixture of Gaussian model was used to group patients based on the behavioral measures into a collection of nonhomogeneous distributions (clusters) (20,21). The model we assumed a G-component Gaussian mixture given by
We used multiple logistic regression analyses to examine the relationship between all outcomes (dependent variable)—SBP, DBP, HDL, LDL, A1c, eGFR (normal vs not normal range), and ED visit and hospitalization (yes vs no) and independent variables—age, gender, race, marital status, primary health insurance, primary care provider, BMI, tobacco use, alcohol use, number of chronic conditions, PAM, or PES score. We used multiple linear regression analyses to examine the relationship between the same independent variables and hospital charges. However, since the hospital charge had a right screwed distribution, we log-transformed the charge before we conducted the analyses. A 2-sided P value of <.05 was considered statistically significant.
Results
Table 1 shows the descriptive statistics of all study variables for patients with PAM scores and patients with PES scores. Only 1442 patient records had PAM scores. The average ages of patients with PAM and PES scores were about 67 and 56 years old, respectively. Approximately 64% and 49% of the patients with PAM and PES scores were female, respectively. More than 90% of the patients were white and non-Hispanic or Latino for both groups. The majority of the patients with PAM scores and the patients with PES scores were married (55.3% and 64.1%, respectively), had primary care providers (94.2% and 94.1%, respectively), and were former or nonsmokers (89.7% and 82.0%, respectively). Although 65.9% of the patients with PAM scores had government-sponsored health insurance, 16.1% of the patients with PES scores had this type of insurance. Less than 40% of the patients with PAM scores, but 52.0% of the patients with PES scores, consumed alcoholic beverages. The average BMI (25.3) of the patients with PES scores were lower than those of the patients with PAM scores (32.3). Over 58% of the patients with PAM scores had the scores at either 3 or 4. Both groups had similar health-related outcomes. The average hospital charges for the patients with PAM scores, and the patients with PES scores were $111 268 and $66 986, respectively.
Patients’ Characteristics, Health Behavior, and Health-Related Outcomes for All Patients, Patients With PAM Scores, and Patients With PES Scores.
Abbreviations: BMI, body mass index; DBP, diastolic blood pressure; ED, emergency department; eGFR, estimated glomerular filtration rate; HDL, high-density lipoprotein; LDL, low-density lipoprotein; PAM, Patient Activation Measure; PES, patient engagement score; SBP, systolic blood pressure; SD, standard deviation.
From the Gaussian model, a 3-component mixture was chosen. The best model based on BIC assumes the covariance matrices associated with the groups are symmetric and equal variance for all 3 variables within a group. By inspection of the associated behavioral characteristics, the groups were labeled as low (1), medium (2), and high levels of engagement (22). The mean number of immunizations was 4.632, 4.980, and 5.245, whereas the mean number of missing scheduled visits was 2.627, 2.548, and 1.100 when the patient engagement was at low, medium, and high levels, respectively. Similarly, the mean proportion of late medication refills was 3.568 when the patient engagement was at low level and 0 when the patient engagement was at medium and high levels.
Tables 2 and 3 show the associations between PAM/PES scores and health-related outcomes and hospital charges, respectively. After controlling for all variables, the PAM score was not a significant factor associated with any of the health-related outcomes, except hospital charges. Patients with the lowest level of PAM score had significantly higher hospital charges (β^[SE] = 0.288 [0.145], P = .046), compared to patients with the highest level of PAM score. The patients with medium PES score had significantly higher odds of having abnormal A1c (odds ratio [OR] [95% CI] = 1.340 [1.110-1.617], P = .002), at least one ED visit over the 3-year period (OR [95% CI] = 1.264 [1.093-1.462], P = .002), at least one hospitalization over the 3-year period (OR [95% CI] = 1.323 [1.134-1.542], P < .001), and had higher hospital charges (β^[SE] = 0.291 [0.060], P < .001) than those patients with the highest PES score. For eGFR, patients with the lowest PES scores had significantly higher odds of having abnormal kidney function (OR [95% CI] = 1.278 [1.064-1.534], P = .009), compared to the patients with the highest PES score. For LDL, while the patients with the lowest PES scores had significantly higher odds of having abnormal LDL (OR [95% CI] = 1.182 [1.005-1.390], P = .043), the patients with medium PES scores had significantly lower odds (OR [95% CI] = 0.818 [0.702-0.955], P = .011), compared to the patients with the highest PES scores.
Multiple Logistic Regression Results for the Association Between Either PAM or PES, and Health-Related Outcomes.
Abbreviations: BMI, body mass index; ED, emergency department; eGFR, estimated glomerular filtration rate; HDL, high-density lipoprotein; LDL, low-density lipoprotein; PAM, patient activation measure; PES, patient engagement score.
a Coded as binary variable (normal vs not normal)
b Reference group.
Linear Regression Results for the Association Between PAM/PES and Hospital Charges.
Abbreviations: BMI, body mass index; PAM, patient activation measure; PES, patient engagement score.
a Logarithmic transformation.
b Reference group.
c Dummy code (PES score 3 is reference group).
Discussions
In general, the patient characteristics, health behavior, and health-related outcomes of the patients with PAM scores, and the patients with PES scores were similar. However, it is noteworthy that the patients with PAM scores were likely younger, female, had nongovernment sponsored health insurance, consumed alcoholic beverages, and had slightly lower BMI. PAM score was a significant predictor for only hospital charges, whereas PES score was a significant predictor for several outcomes, for example, LDL, ED visit, A1c, eGFR, hospitalization, and hospital charges.
Interestingly, the PAM score was a significant predictor for only hospital charges in this study. Several previous studies showed that patient activation, as a proxy of patient engagement, was a significant predictor of health-related outcomes (2,23 –25). Besides the demographic differences, one of the reasons could be that the PAM score was not fully implemented at Sanford Health during the study period. The records showed that only 1% of all patients with MCCs had PAM scores. These results supported the rationale of this study, which was to develop another patient engagement measurement from existing data in order to reduce any patient burnout or PAM survey administration. In addition, the PAM score was criticized for its inability to fully capture the broad concept of patient engagement (26).
Based on 3 real behavioral measures (average percentage of late medication refills, average number of missing scheduled visits, and average number of immunizations) of patient engagement with their health care, PES was developed in this study. We used a χ2 test to examine the association between PAM and PES scores from the same individuals and found no significant relationship (P = .957). However, the test was based on a small sample size (only 90 individual patients) who had both scores. Despite the lack of statistical association, the PES score was a significant predictor for several outcomes, such as LDL, ED visit, A1c, eGFR, hospitalization, and hospital charges. These results showed evidence that the PES score captured the engagement level of the patients with MCCs and their health-related outcomes. To our knowledge, this was the first study that used existing EMR and health claim data to define the level of patient engagement. These results provided an example for other health care settings to generate PES scores for their patients with MCCs and use them to make decisions or recommendations without the need to administer any surveys, which likely cause patient burnout. Furthermore, the PES score can be used to motivate patients to improve their engagement (eg, their medication adherence), to improve patient experience of care, to make decisions, and to make a recommendation that creates an environment promoting and encourage engagement. However, some relationships between the PES scores and those outcomes in this study indicated room for improvement in the current PES score. For instance, only the patients with medium PES scores had significantly higher odds of having abnormal A1c, ED visit, hospitalization, and had higher hospital charge, relative to those with the highest PES scores, while those patients with the lowest PES score did not; which was a counterintuitive finding. Similarly, the findings related to LDL were counterintuitive.
This project suffered from various limitations. First, this study used secondary data from EMR and health claims. Therefore, all study variables were limited. Second, the PES score was based on only 3 behavior variables due to data limitation. For instance, data measuring other dimensions of patient engagement were not available. Having access to other potential behaviors, including patient access to EMR, rescheduling a visit, and patient-led communication (eg, emails via EMR, reminders, and phone calls), would improve the performance of the PES score. Third, although all 3 variables were captured across the study period, most of the health-related outcomes in this study were based on cross-sectional data. Fourth, an inconsistency between the prescribing dates and the number of day supplies was observed. The late medication refill was assumed to occur when the number of day supply was shorter than the number of days between 2 consecutive prescription dates. This assumption could be too restrictive because patients might still possess some medications from their last refills to cover those late few days. In other words, the late medication refills in this study might be overestimating the actual late refills. Fifth, since the PES was based on EMR and health claims, it could not be used to assess patient engagement in real time for new patients. Last, to assess patient engagement without engaging directly with the patient might result in an incomplete picture and a lack of understanding of other factors that could be driving health outcomes.
Conclusions
In our study, the PES score derived from patient behaviors in EMR and health claim data drastically outperformed the PAM score, which was based on patient survey. It was a significant predictor for various health-related outcomes, such as LDL, ED visit, A1c, eGFR, hospitalization, and hospital charge. Further studies are needed to improve the PES score, and to incorporate its use into clinical settings to promote patient engagement and subsequently improve the care management of patients with MCCs.
Footnotes
Author Biographies
Authors’ Note
S.N. contributed to conceptualization; S.N., S.M., G.D., P.D.R., and E.G. contributed to methodology; S.N. and S.M. contributed to Formal analysis and investigation; N.P. and S.N. contributed to writing and original draft preparation; N.P., S.N., S.M., A.S., G.D., E.G., and P.D.R. contributed to writing and review and editing; S.N., E.G., and A.S. contributed to funding acquisition, resources, supervision. The data that support the findings of this study are available from Sanford Health, but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of Sanford Health. Ethical approval to report this case was obtained from institutional review boards at South Dakota State University and Sanford Health approved the study protocol before the study started. This article does not contain any studies with human or animal subjects. Informed consent for patient information to be published in this article was not obtained because secondary data was used for the analysis. Arielle Selya is also affiliated with Pinney Associates, Pittsburgh, PA, USA
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Emily Griese and Ariella Selya received funding from the National Institutes of General Medical Sciences (NIGMS) of the NIH, grant number 1P20GM121341.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The work was supported by Sanford Health (Grant# SA1700421).
