Abstract
Background:
Patients with diabetes rely on blood glucose (BG) monitoring devices to manage their condition. As some self-monitoring devices are becoming more and more accurate, it becomes critical to understand the relationship between system accuracy and clinical outcomes, and the potential benefits of analytical accuracy.
Methods:
We conducted a 30-day in-silico study in type 1 diabetes mellitus (T1DM) patients using continuous subcutaneous insulin infusion (CSII) therapy and a variety of BG meters, using the FDA-approved University of Virginia (UVA)/Padova Type 1 Simulator. We used simulated meter models derived from the published characteristics of 43 commercial meters. By controlling random events in each parallel run, we isolated the differences in clinical performance that are directly associated with the meter characteristics.
Results:
A meter’s systematic bias has a significant and inverse effect on HbA1c (
Conclusions:
Two components of meter accuracy, bias and error, clearly affect clinical outcomes. While error has little effect on HbA1c, it tends to increase episodes of severe hypoglycemia. Meter bias has significant effects on all considered metrics: a positive systemic bias will reduce HbA1c, but increase the number of severe hypoglycemia attacks, TDI use, and number of fingersticks per day.
Glycemic control in patients with type 1 and type 2 diabetes is a life-long effort driven by information. Self-monitoring of blood glucose (SMBG) systems are fundamental in optimizing glycemic control for these patients and as such, blood glucose (BG) meter technology has experienced significant progress and is part of most clinical guidelines for glycemic control. While the system accuracy of BG meters has been widely studied, 1 little is known about the impact of accuracy on clinical outcomes. The maturity of mathematical models representing glucose metabolism in health and diabetes 2 and the increasing acceptance of computer simulation to predict the impact of therapy modification in type 1 diabetes mellitus (T1DM)3,4 have led to regulatory acceptance of modeling and simulation, “for approximation of human glucose/insulin utilization, interstitial sensor performance, and subcutaneous insulin delivery,” which included replacement for animal trials in the preclinical testing of artificial pancreas systems. 3 The University of Virginia (UVA)/Padova simulation platform is now commonly used for testing medical device performance as well as novel treatment strategies in T1DM;5-12 the platform is frequently augmented by new physiological processes.13-17
In this study, we aim to quantify the clinical impact of BG monitoring system accuracy using the latest UVA/Padova simulation platform by (1) constructing models that replicate the characteristics of commercially available BG monitoring systems, (2) modeling type 1 subject behavior related to meals and self-treatment, and (3) constructing a simulation to track T1DM patients over 30 days. Each simulated patient uses each of the available BG monitoring systems in turn, and simulation results are used to estimate clinical outcomes such as HbA1c, severe hypoglycemia, and other clinically relevant parameters. Finally, we create regression models that relate the characteristics of BG monitoring systems to each of the clinical outcomes.
Materials and Methods
Building on the work relating accuracy to clinical outcomes,18-30 we assess the effect of SMBG systems on clinical outcomes. An overview of the process followed in this work is described in Figure 1. (1) Our approach relied on existing literature and publicly available data to model commercially available BG meters as well as self-treatment behavior observed in continuous subcutaneous insulin infusion (CSII) therapy; (2) the resulting models, described in the following sections were incorporated in the UVA/Padova Type 1 Diabetes Simulator,13,31 and used to create a thirty day long simulation where each in-silico subject in the population used in turn each of the available commercial meters; and (3) quality of glucose control observed by each of the subject/meter combination was estimated using standard metrics such as HbA1c, severe hypoglycemia, number of fingersticks used, and an estimate of total daily insulin (TDI). The following sections describe in detail each of the steps.

Project approach to estimating clinical impacts of meter accuracy.
Modeling BG Monitoring Systems
A comprehensive study 1 reported the system accuracy of 43 commercial BG monitoring systems. Accuracy was defined in terms of percentages of measurements within certain tolerances of the reference measurement, split between low and high glucose ranges (<100 mg/dl, ≥100 mg/dl, respectively). The BG monitoring systems are modeled using a Johnson transform of a standard normal distribution where parameters are selected to optimally match the reported measurement percentage errors reported in literature. Specifically, in the low glucose range (<100 mg/dl), the parameters of a Johnson distribution
are estimated so that
Use in Simulation
Simulated measurements given a reference glucose value G can be obtained from a standard normal variate
where

Example simulated glucose readings using our modeling approach.
Modeling Behavior
The behavioral model adopted in this work consisted of the components described in Figure 3, namely (1) a meal behavior component that describes eating amount, times, and correlations; (2) a bolusing behavior component that describes the conditions under which a bolus is self-administered; (3) a fingerstick behavior component describing how frequently subjects fingerstick and under what conditions; and (4) a hypoglycemia self-treatment behavior. Before describing in detail each of these components, we briefly describe the data sources used to design and parameterize our behavioral model.

Description of the elements of the basic behavioral model.
Data Sources
We used archived deidentified data collected during the project funded by National Institutes of Health/National Institute of Diabetes and Digestive and Kidney Diseases grant RO1 DK 085623 (see clinicaltrial.gov for clinical trial registration number NCT01434030). Sixty insulin pump users with type 1 diabetes were recruited and asked to wear a CGM device for a month, simultaneously recording SMBG, CGM, and insulin pump data, as well as information about meals and physical activity. Fifty-six participants completed the data collection and contributed to the database. The demographic characteristics of these subjects were as follows: 21-65 years of age, with a mean (SD) age of 41 (12.2) years; duration of type 1 diabetes of at least 2 years, with a mean duration of 24.1 (11.0) years; use of an insulin pump for at least 6 months, with a mean interval of 10 (5.8) years; and active use of a bolus calculator function. Mean (SD) hemoglobin A1c level was 7.7% (1.2%), 59% were female, the majority (95%) were white, and 50% were employed in professional occupations. The database building protocol was approved by the local Institutional Review Board; study details were previously published. 32 Data collection was designed to interfere minimally and reduce as much as possible burden, distractions or alterations to participants’ typical daily routines and allow them to maintain typical daily behaviors.
Meal Behavior
To account for interrelationships between the timing and amount of consecutive meals that is observed in real life, we adopt bootstrap sampling in the following sense: consider a database of meals where for meal
that is, meals that took place at approximately time

Relationship between consecutive meal times and sizes, using values of
Bolus and Fingerstick Behavior
Fingerstick and bolus behavior is modeled according to Figure 5. The subject responds probabilistically to a triggering event (eg, meal or hyperglycemia), and applies a bolus if necessary according to functional therapy parameters. For example, in the top-left of Figure 5, we illustrate how the in-silico subject responds to persistent hyperglycemia (two consecutive hours of high glucose >220 mg/dl): a fingerstick is applied with a fixed probability, followed by the appropriate correction. A slightly more detailed model was used to model meal bolus behavior, where patients may (randomly) decide to ignore or delay a meal bolus.

Behavioral model of bolus and fingerstick behavior.
Hypoglycemia Treatment
Hypoglycemia is treated according to the following rule. This response approximates behavioral data from the data sets reported above. As observed, the size of hypo treatments will increase as treatments are repeated. Moreover, a lower glycose level will result in an increase in rescue size. More specifically, a low glucose concentration level (<80 mg/dl) will trigger a fingerstick with probability
Rescue Carb Schedule in Grams/kg.
In-Silico Evaluation
A 30-day scenario was designed based on behavioral models described in the previous sections, and used to generate the glycemic response for each 100 in-silico subject of the T1D adult population in the UVA/Padova simulator, using each BG monitoring system modeled in our database, a total of 43 meters, together with an ideal meter (neither bias nor error). Given the natural variability of the process and to improve estimates of our metrics, 30 replicates of each combination of subject-BG monitoring system were simulated, resulting in more than 10 000 simulated patient years. Random seeds were controlled across subjects and meters to minimize variance of the estimates and accurately isolate the effect of meter accuracy on glycemic control. In other words, each subject will experience the same meal sequence, bolus decisions, and other behavioral factors. Thus, differences across scenarios are only due to meter characteristics.
Clinical Outcome Estimates
For each simulation, we computed an estimated HbA1c (using a commonly used linear regression model relating average glucose to HbA1c 33 ) and severe hypoglycemia events (via a model relating LBGI and severe hypoglycemia events; see Table 2 in Kovatchev et al 34 ). We note that severe hypoglycemia, defined as a glucose level such that the patient will require assistance, is not explicitly simulated, but later estimated. In addition, TDI and daily fingerstick use were computed directly from the simulation output.
Metric Estimates Obtained by Averaging Simulation Outputs Across Subjects.
For each combination of subject
Error
Fraction of measurements whose absolute relative difference is >5%. Formally, if we have a collection of
where
Bias
The average difference between measurement and reference, that is,
In Table 2, we report population means for each of these metrics and meters. For example, given a fixed meter
Correspondingly, we denote by
Results
In Table 2 we report all simulation results in terms of HbA1c, severe hypoglycemia, TDI, and fingerstick count. Entries in Table 2 represent the average value of each metric across 30 replicates of 100 subjects in the adult population all using each of the 43 glucose meter models. The table also reports the average bias and errors obtained for the corresponding meter, as observed in the simulation.
Response of HbA1c to Error and Systematic Bias
Meter error has a marginal effect on HbA1c. In fact, as we will show in our next section, a linear regression model shows that the coefficient corresponding to meter error is not significant. On the other hand, the data shows a clear inverse relationship between meter systematic bias and HbA1c (as illustrated in Figure 6). Meters with positive bias (represented as orange-yellow dots), will tend to reduce HbA1c while at the same time increasing the expected number of severe hypoglycemia events.

Relationship between error, bias, HbA1c, and severe hypoglycemia. Each meter is represented by a colored dot. The
Response of Severe Hypoglycemia to Error and Systematic Bias
The effect of error, which was negligible on HbA1c, is significant on severe hypoglycemia: higher error rates are associated with a higher number of severe hypoglycemia. These relationships will be further formalized in the following sections where we show that a simple linear regression model can explain most of the results shown here.
Other Relationships
TDI, as well as fingerstick count, shows similar relationships to severe hypoglycemia, that is, from a reference of 41.8U of TDI for an ideal meter, high error rates as well as positive bias will result in higher daily insulin use. For example, meter 19 with a positive bias of 15.1 mg/dl and an error of 0.7 (on a scale of 0 to 1), increases insulin use to 47.3U per day. A similar relationship is observed for fingerstick counts.
A Model Relating Accuracy and Clinical Outcomes
We propose a linear model
where
Regression Model Parameters Relating Error, Bias, Fingerstick Counts, and per Subject Baseline to Clinical Outcomes.
and the prediction for severe hypoglycemia takes the form
Discussion
Based on our simulations, we could show a clear relationship between the system accuracy of a BG monitoring system and the resulting quality of glucose control. More specifically, a meter with a large error rate will tend to increase glucose variability and therefore episodes of severe hypoglycemia. The BG monitoring systems with large systematic bias, on the other hand, will have a dual and symmetric effect: the effect on HbA1c will be inversely proportional while incidence of severe hypoglycemia will be proportional to systematic bias. A previous retrospective study has similar conclusions, 35 showing for instance that meters with readings consistently higher than reference can significantly increase hypoglycemic coma episodes.
Generalization of these results is somehow limited by the use of models mimicking patient’s behavior during a relatively small scale (n = 55, 1 month) clinical study; while the protocol was designed to minimize contact with participants and perturbations of their treatment behaviors, this group may still have exhibited treatment behaviors that were more compliant than ones observed for the population at large. Measurements frequency in particular may impact our results, particularly the balance between bias and variance of the error. It is important to note that while the observed BGM frequency was high it was equivalent in all simulation, leading to changes in hypoglycemia and HbA1c that were purely driven by BGM error characteristics.
In summary, our results demonstrate that BG monitoring systems compliant with most conditions of the ISO 15197 (2013) standard have only limited impact on HbA1c, SHE, insulin utilization and SMBG frequency, whereas systems not meeting the standards can have significant clinical influence on one or several of these outcomes. For the HbA1c values, the increases could reach approximately 0.4% while the number of annual SHE could increase by up to 1.7 cases per year. In addition, the insulin consumption could increase by up to 5.5 units/day and the number of fingersticks by up to 1.0 tests/day.
Importantly, these are not limited to just one specific SMBG system; rather, they apply to several systems not meeting the most recent version of ISO standard 15197. These findings not only offer important guidance to both clinicians and individuals with diabetes when selecting an appropriate SMBG system but they provide a basis for estimation of the economic impact of SMBG system inaccuracy, which will be presented in a subsequent publication. The relationship between CGM-MARD and the ISO 15197 (2013) standard in nonadjunctive use is also critical. As recent studies have shown, it is difficult to directly relate MARD levels to the ISO 15197:2013 standard. A recent study 36 showed that MARDs of 3.25% and 5.25% are required to achieve ISO standard with probability one.
Conclusion
In this study, we present a new approach to estimate the impact of BG monitoring system accuracy on clinical outcomes. This approach allows us to leverage recent advances in the simulation of the glucose-insulin metabolism as well as new behavioral models of type 1 diabetes patients to assess the clinical impact of inaccurate glucose meters in everyday use. Patients under CSII therapy will receive benefits from increases in accuracy in both CGM and BGM technologies. These effects will be observed independent of the mode of BGM use, for example, insulin-dosing or calibration only.
Although this in-silico study simulated a CSII based population, we can assume that, at least qualitatively, these results can be extended to patients using functional insulin therapy and multiple daily injections (MDI); it is still unclear how much of these results are applicable to older forms of therapy, for example, sliding scale. Our simulation study does not consider temporary basal rate adjustments, so as long as the considered basal insulins display long (~24-hour) time constants (eg, glargine), results are not likely to change. Behavioral models were trained on a combination of MDI and CSII patients, and are representative of this mixture. Further simulation studies would allow to quantify the sensitivity of our results to these behavioral parameters, in particular the response to changes in the number of treatment decisions.
An important limitation in our study is the lack of long-term behavioral adjustments. For instance, it has been observed that patients with T1DM who experience frequent episodes of hypoglycemia will adjust their therapy on their own or with assistance by their physician. Although the results presented here accurately predict short-term clinical outcomes, the long-term effects of such behavioral adaptations need to be better understood. While such behavioral adaptations are commonly observed in clinical practice, the authors are not aware of data sets with sufficient detail to properly model and simulate this behavior.
Footnotes
Abbreviations
BG, blood glucose; CSII, continuous subcutaneous insulin infusion; MDI, multiple daily injections; SMBG, self-monitoring of blood glucose; T1DM, type 1 diabetes mellitus; UVA, University of Virginia.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: KF is a full-time employee of Roche Diabetes Care.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Roche Diabetes Care.
