Abstract
Background:
Despite the risk associated with nocturnal hypoglycemia (NH) there are only a few methods aiming at the prediction of such events based on intermittent blood glucose monitoring data. One of the first methods that potentially can be used for NH prediction is based on the low blood glucose index (LBGI) and suggested, for example, in Accu-Chek® Connect as a hypoglycemia risk indicator. On the other hand, nowadays there are other glucose control indices (GCI), which could be used for NH prediction in the same spirit as LBGI. In the present study we propose a general approach of combining NH predictors constructed from different GCI.
Methods:
The approach is based on a recently developed strategy for aggregating ranking algorithms in machine learning. NH predictors have been calibrated and tested on data extracted from clinical trials, performed in EU FP7-funded project DIAdvisor. Then, to show a portability of the method we have tested it on another dataset that was received from EU Horizon 2020-funded project AMMODIT.
Results:
We exemplify the proposed approach by aggregating NH predictors that have been constructed based on 4 GCI associated with hypoglycemia. Even though these predictors have been preliminary optimized to exhibit better performance on the considered dataset, our aggregation approach allows a further performance improvement. On the dataset, where a portability of the proposed approach has been demonstrated, the aggregating predictor has exhibited the following performance: sensitivity 77%, specificity 83.4%, positive predictive value 80.2%, negative predictive value 80.6%, which is higher than conventionally considered as acceptable.
Conclusion:
The proposed approach shows potential to be used in telemedicine systems for NH prediction.
Keywords
Nocturnal hypoglycemia (NH) is the most feared type of hypoglycemia in patients with type 1 diabetes treated by insulin. Despite the danger of NH, there is still a lack of methods aiming at the prevention of such cases. NH problem is less worrisome for the patients equipped with continuous glucose monitors (CGM), but, according to GBI Research, Diabetes Landscape, Market, Technology and Intellectual Property, May 2012, only about 2-3% of insulin-treated patients use such systems.
On the other hand, intermittent monitoring performed from finger sticks remains the most widely used blood glucose monitoring method (BGM). This type of BGM is marketed at very low prices compared to CGM and provides fairly accurate results of BG concentration. Therefore, it is attractive to develop a method for predicting NH which uses only limited discrete information on blood glucose level during daytime hours.
One of the first methods that aims at the prediction of severe hypoglycemia, and also can potentially be used for prediction of NH, is based on the low blood glucose index (LBGI).1,2 The value of LBGI index cumulates all daily measurements of blood glucose and may provide a risk indicator for NH. For example, in Blood Glucose Index 3 the LBGI-values above 2.5 are interpreted as to be at risk of hypoglycemia.
On the other hand, nowadays there are other glucose control indices (GCI), which could be used for NH-prediction in the same spirit as LBGI. Namely, a particular index, say I, needs to be accompanied with a suitable threshold value
At the same time, as mentioned in that survey, there has been no consensus as to which of the discussed indices is best. Therefore, in the present study we propose a general approach of combining NH predictors constructed from different GCI. The approach is based on a recently developed strategy for aggregating ranking algorithms.5,6 The approach has been tested on 2 different clinical datasets and exhibited a secure level of predictive accuracy outperforming previously known results.
Performance Metrics
In this paper we use the standard metrics for measuring the performance of classifiers, that is, predictors predicting the “yes” or “no” answer. These metrics count the numbers or percentage of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) predictions. In the present context TP and TN mean respectively the cases when NH appearance or absence was correctly predicted. FP means that NH was predicted, but did not occur, and FN means the opposite scenario.
Sensitivity defined as SE = TP/(TP+FN) and Specificity as SP = TN/(TN+FP) measure respectively the proportion of positives and negatives that were correctly identified as such.
The positive and negative predictive values, defined as PPV = TP/(TP+FP) and NPV = TN/(TN+FN), respectively, are the proportions of positive and negative predictions that were true.
For measuring the accuracy of classifiers the so-called f-scores are also used, since they represent a weighted average of the precision (PPV) and recall (SE). In the present study we use the traditional f-measure or balanced f-score (f1 score = 2TP/(2TP+FN+FP)) that is the harmonic mean of precision and recall. In addition, we measure the f2 score = 5TP/(5TP+4FN+FP), which weights recall higher than precision. The importance of the latter one can be explained by the patient’s desire to be sure that the predictions of no hypoglycemia will be correct (SE is more important).
Method Description
NH Prediction From GCI
Let
Glucose control indices
4
considered in this paper are the functions
where
For example, as we mentioned, in Blood Glucose Index
3
a predictor
is suggested, where
Moreover, combining the idea of the quadratic risk function with the observation
7
that LBB measurement
At the same time, it is clear that the construction (1) admits the use of other GCI. Below we recall the definition of GCI that will be used as examples in (1).
Hypoglycemic Index (HI)
where LLTR denotes lower limit of target range BG value, a default value of LLTR is 95 mg/dL.
GRADE Hypoglycemic Index (GHI)
Aggregation of NH Predictors Based on GCI
Let
Assume that we can access clinical records of diabetic patients, which contain historical data such as daily BG measurements
Having an ensemble of NH predictors
Note that NH predictors
where
Following Kriukova et al,
5
we choose the vector
with the matrix
where
From Kriukova et al
5
(see Theorem 10 there) it follows that under rather general assumptions and for sufficiently large number
Thus, to construct NH predictor
Datasets
In the current study we used datasets DIAdvisor and ChildrenData. Both datasets contain BG measurements of patients with type 1 diabetes. Further details are given below.
DIAdvisor
DIAdvisor dataset containing the data of 34 patients with diabetes was collected within the framework of the European FP7-funded project DIAdvisor. The considered subjects have been treated with insulin for at least 12 months before data collection; their ages were between 18 and 65 years, with a BMI< 35 kg/m2. During the study, the CGM-values were sampled every 5-10 min using continuous glucose monitoring (CGM) sensor. These data were not used for a prediction, but allow a detection of NH during the nights. Only confirmed hypoglycemia by finger stick measurements were used to qualify for NH. At the same time, 4 true BG measurements were performed daily in parallel with CGM estimations, and these BG data were used for training and testing the considered NH predictors.
A total number of n = 150 days has been chosen to test the proposed approach. NH has been confirmed in 40 cases.
ChildrenData
This dataset was collected during 3 and a half months in children hospitals of the Kyiv city, Ukraine, according to a protocol that is similar to Whincup and Milner. 7 The access to this dataset has been provided within the framework of the European Horizon 2020-funded MSC-project AMMODIT.
The dataset contains information about 179 children. Each of n = 476 records of this dataset contains 9 BG measurements, which were performed at the following time points of a 24-hour cycle: 08:00, 11:30, 13:30, 16:00, 18:00, 21:00, 00:00, 03:00 and 06:00. The measurements at 00:00, 03:00 and 06:00 were used to identify the occurrence of NH, while the measurements of other time points of the 24-hour cycle form the input for NH prediction. In this dataset the number of records with NH is 222.
Method Testing
To illustrate our aggregation approach, we consider NH predictors of the form (1), which are based on the indices
NH predictors

Scheme of prediction by means of an aggregation of the considered GCI-based NH predictors.
Performance of NH Predictors Based on GCI and Their Aggregator on DIAdvisor Dataset (Values Are Percentages).
In view of possible application in Telemedicine, a desirable feature of a NH predictor is its portability from individual to individual without readjustment. This means that an algorithm, which was constructed with the use of clinical data of one group of patients, can be used by other patients without recalibration and essential loss of prediction performance.
To illustrate that the proposed aggregation of NH predictors potentially allows the above mentioned portability, the predictor
Comparative Performance of Aggregations of Different Ensembles of NH Predictors (Values Are Percentages).
Note that, as has been pointed in Skladnev et al, 9 a NH prevention method may achieve market acceptance at SE of 70% with SP of 65%. Then the performance reported in Table 2 is higher than conventionally considered as acceptable.
Conclusions
We have discussed an extension of the idea1,2 of using GCI for NH prediction. Our results also demonstrate the advantages of aggregating several GCI-based NH predictors instead of choosing a single one. Although the construction of an aggregator requires training, the obtained predictor can further be used without modifications.
It is also worth to note that all our tests have been performed with data acquired from patients with type 1 diabetes, but the predictors considered in the present study are dealing with GCI that can be used for both patients with type 1 and type 2 diabetes patients. Therefore, a similar performance could potentially be expected for patients with type 2 diabetes. At the same time, a challenge is that patients with type 2 diabetes usually perform a lower number of daily BG measurements.
Note that in contrast to DIAdvisor the ChildrenData dataset contains BG measurements at discrete time moments only. Therefore, a validation of hypoglycemia cases on these datasets has been performed similar to Whincup and Milner 7 and Davies 8 by examining BG measurements collected during the night period. Of course, in this way some asymptomatic nocturnal hypos may be missed. Therefore, the results reported above should be considered as a proof of concept only.
Another limitation is that the considered predictors take into account only daily glucose variability. However, it is clear that there are other important factors that need to be considered in understanding development of hypoglycemia during the night, including intensity and duration of daily physical activity, meals, basal and bolus insulin doses. Therefore, further investigations are necessary for more accurate NH prediction.
As a summary, we would like to mention, that, in contrast to majority of publications on blood glucose prediction, the current study uses 2 independently collected clinical datasets for validating prediction performances. The main message of the presented study is that the incorporation of more and more new predictors into the proposed aggregation procedure leads to the improvement of prediction reliability. This conclusion follows from present study (see Table 2) and previous publications.5,6 At the same time, we understand that to bring the proposed tool closer to patients a new clinical trial entirely devoted to its testing would be desirable, and we hope that our research provides a motivation for such a trial. The design of this future trial should shed light on the following issues.
The use of CGM as “gold standard” for NH detection. This is the issue because it is known (see Baysal et al 10 and references therein) that CGM is not very reliable especially during the night and may produce false hypoglycemia scenarios. Hence only confirmed hypoglycemia by finger stick measurement is expected to be considered.
Dependence of prediction reliability on the number and the time of SMBG measurements. Note that here we may check whether this issue can be resolved by aggregating NH-predictors that use different input data.
Actions toward NH-prevention, when NH has been predicted. Of course, an obvious solution would be to take an additional snack followed by one more SMBG measurement reversing the prediction. But the time, at which the above mentioned SMBG measurement should be performed, is an open issue (see the previous issue). Moreover, the amount of additional carbohydrate intake should not drastically increase the mean overnight glucose. Nevertheless the development of decision algorithms following a predicted NH will definitely add value of predictor use to the patient.
Footnotes
Acknowledgements
This work was done while the first author was visiting Johann Radon Institute and he gratefully acknowledges the hospitality at Linz. The visit was supported by EU-Horizon 2020 MSC-RISE project AMMODIT.
Abbreviations
BG, blood glucose; BGM, blood glucose monitoring method; CGM, continuous glucose monitors; FN, false negative; FP, false positive; GCI, glycemic control indices; GRADE, Glycemic Risk Assessment Diabetes Equation; LBB, last before-bed; LBGI, low blood glucose index; LLTR, lower limit of target range BG value; NH, nocturnal hypoglycemia; NPV, negative predictive values; PPV, positive predictive values; ROC, receiver operating characteristic; SE, sensitivity; SP, specificity; TN, true negative; TP, true positive.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors affiliated with Johann Radon Institute gratefully acknowledge the support of the Austrian Science Fund (FWF): projects P25424 and I1669.
