Abstract
Background:
This study demonstrated the novel application of a “machine-intelligent” mathematical structure, combining differential game theory and Lyapunov-based control theory, to the artificial pancreas to handle dynamic uncertainties.
Methods:
Realistic type 1 diabetes (T1D) models from the literature were combined into a composite system. Using a mixture of “black box” simulations and actual data from diabetic medical histories, realistic sets of diabetic time series were constructed for blood glucose (BG), interstitial fluid glucose, infused insulin, meal estimates, and sometimes plasma insulin assays. The problem of underdetermined parameters was side stepped by applying a variant of a genetic algorithm to partial information, whereby multiple candidate-personalized models were constructed and then rigorously tested using further data. These formed a “dynamic envelope” of trajectories in state space, where each trajectory was generated by a hypothesis on the hidden T1D system dynamics. This dynamic envelope was then culled to a reduced form to cover observed dynamic behavior. A machine-intelligent autonomous algorithm then implemented game theory to construct real-time insulin infusion strategies, based on the flow of these trajectories through state space and their interactions with hypoglycemic or near-hyperglycemic states.
Results:
This technique was tested on 2 simulated participants over a total of fifty-five 24-hour days, with no hypoglycemic or hyperglycemic events, despite significant uncertainties from using actual diabetic meal histories with 10-minute warnings. In the main case studies, BG was steered within the desired target set for 99.8% of a 16-hour daily assessment period. Tests confirmed algorithm robustness for ±25% carbohydrate error. For over 99% of the overall 55-day simulation period, either formal controller stability was achieved to the desired target or else the trajectory was within the desired target.
Conclusions:
These results suggest that this is a stable, high-confidence way to generate closed-loop insulin infusion strategies.
In the treatment of type 1 diabetes (T1D), perhaps the most useful innovation short of a cure would be an effective “artificial pancreas” (AP), enabling stable, closed-loop infused insulin in response to both the ongoing dynamics of T1D glucose-insulin homeostasis and the ongoing external perturbations of meals, exercise, sleep, and menstruation, among others. In this article, it is argued that existing medical devices can provide a significantly better quality of blood glucose (BG) control through the exploitation of alternative mathematical methods coupled with the availability of computing resources in cloud and mobile platforms.
The mathematical approach demonstrated here, a combination of differential game theory and Lyapunov stability theory, obtained much of its background theory in robotics. Industrial robots typically have highly nonlinear, coupled dynamics of high dimension and significant dynamic uncertainties, while only a small number of variables are observable in (noise-polluted) time series. Furthermore, robotic systems typically degrade over time due to selective wear in mechanisms, generating idiosyncratic dynamic behavior. From the late 1970s to early 1990s, a group of researchers in robotic controls rejected “classic” identification and controller techniques (model linearization, dynamic optimization) partly because these were incompatible with robust handling of the nonlinear dynamic uncertainties for ongoing control of the system using noise-polluted partial information.
Instead, Lyapunov-based methods were developed for nonlinear identification, nonlinear controllers, and nonlinear Model Reference Adaptive Control (MRAC)—the implementation of which was described compendiously as the Product State Space Technique (PSST)1-8—in a body of robotics research that has more recently enjoyed a revival.9 -13 This provided the genesis of a medical platform technology14,15 demonstrated here in an AP context.
In 2002, Greenwood 14 outlined a “control-to-range” mathematical method for steering the solution trajectories of a biological dynamic system to a specified target range or more complicated set, using incomplete information, despite the existence of significant uncertainties or uncontrolled elements in the system dynamics. This was done using differential game theory, inspired by earlier nonbiological works.16,17 One or more Lyapunov functions were to be used to describe controller objectives in terms of steering solution trajectories to a target set sandwiched between nested contours of a Lyapunov function. Uncertainties or uncontrolled elements were to be ascribed to a hostile player (“Nature”), who would endeavor to prevent those trajectories from colliding with or being captured by the target. A minimax method was described, applied to distinct variables controlled by these mutually hostile players, to generate a control program robust to real-world uncertain dynamic elements.
Important differences exist between this approach and other published applications of robust control methods to the AP problem, which also typically use a minimax approach to uncertainties. Two examples are those of Parker et al 18 and Kovács et al, 19 who have employed modified forms of Sorenson’s 20 model (rather than the usual T1D benchmark model of Dalla Man et al 21 and Magni et al 22 ; Sorenson, 20 Dalla Man et al, 21 and Magni et al 22 represent nonlinear models of T1D glucose-insulin homeostasis); Parker et al 18 applied an implementation of the H∞ control, while Kovács et al 19 applied a Linear Parameter Varying (LPV) model–based control algorithm, itself an extension of linear time-invariant systems.
An important criticism of both examples’ approaches is that they have necessarily involved significant linearization of the model: for instance, Parker et al 18 linearized the model and reduced it to a third-order linear form for controller synthesis, while the method of Kovács et al 19 involved the construction of a polytopic region with the model built up by a linear combination of the linearized models derived in each polytopic point.
A problem with significant linearization of nonlinear glucose-insulin dynamics is that it introduces two layers of potential instability in the control algorithm:
The first layer is due to the discrepancy between the nonlinearized and linearized models, which can be compensated within a specified domain using rigorous simulation testing.
The second layer is fundamental: all mathematical models of glucose-insulin homeostasis and the insulin-based control are themselves only approximations of biological reality. Even in sophisticated models such as that of Magni et al, 22 some processes typically assumed to have a fixed linear structure can be reasonably expected to be, in reality, significantly nonlinear and/or time varying. Examples include the individualized, time-dependent nature of insulin sensitivity in T1D 23 and the nonlinearity of insulin infusion pharmacokinetics (PK) (see Kraegen and Chisholm, 24 who attempted to fit linear parameters and found these values to be dependent on the infusion profile, suggesting “that the linear model is an oversimplification”). Given that specific forms of such unmodeled nonlinearities are typically unknown a priori, it is difficult to be confident in the stability of control laws reliant on linearized models.
This criticism, and associated problems with nonlinear identification and nonlinear state observers, was overcome in the present study through the use of Lyapunov functions and differential game theory, preventing any need for linearizing the original system dynamics. To avoid artificial assumptions, this study also used an actual medical history of a “brittle” volunteer with diabetes code-named WM3, awaiting pancreatic islet transplantation at Westmead Hospital in Sydney, when generating simulated histories for data mining.
Methods
An AP built on the principles of Greenwood 14 has key differences from the status quo.
1. Use of Information
Sophisticated models of T1D insulin-glucose dynamics20,22,25-27 are typically of high dimension, are nonlinear, and have many parameters. In practical AP implementations, only two state variables and two control variables are typically available for time series measurements: BG levels via fingerstick (or similar) and interstitial fluid glucose (ISFG) levels via continuous glucose monitoring (CGM), and infused insulin doses and meal carbohydrate content. Under clinical conditions, additional variables may also be measured, including plasma insulin (PI), glucagon, and C-peptide levels.
Two modeling methods strongly represented in the AP literature are as follows:
Statistical analysis of large data sets to estimate “nominal” T1D parameter values, followed by further refinement (eg, neural networks, Bayesian analysis), and/or
Model reduction, simplifying (by linearizing and/or removing components of the dynamics) until the available information is sufficient to enable the identification and control of a unique estimate.
In contrast, this study used a different approach:
The use of Lyapunov functions enabled solution trajectories of the nonlinear system to be manipulated without requiring an explicit closed form of these solutions to be computed, avoiding the need for simplification.
Differential game theory enabled the system dynamics to be controlled without requiring uniqueness: the problem of controlling an underdetermined dynamic model was translated into the task of manipulating the behavior of a multitude of nonunique candidate solution trajectories. As demonstrated here, using Lyapunov functions, this process is viable and generates information that can be further exploited in decision making.
Discarding the requirements for simplification and uniqueness meant that the information loss associated with model reduction could be avoided. Statistical analysis, followed by refinement, was similarly rejected in favor of the construction of partial dynamic models from available information, interrogating a hypercube constructed from intervals of possible values.
Hence, the biological T1D system has its noiseless dynamics written as a generalized differential equation (the “contingent equation”):
where
Here, the actual biological state
The estimated state
Here,
the hypercube of plausible parametric vectors within the parameter space, where the interval
Parametric Intervals for GA Search Hypercube
Also showing nominal type 1 diabetes (T1D) values from the literature. GA, genetic algorithm; N/A, not applicable.
Identification employed a technique outlined by Greenwood.
15
The PSST typically employs Lyapunov functions to achieve nonlinear MRAC, whereby the dynamics of a complex system (equation 1) are made to converge with the noiseless dynamics (
The previous objections to linearization in the control law also posed an obstacle to formulating an appropriate state observer that would estimate values of unseen variables over time.
The insulin-glucose system was to be assumed significantly nonlinear (including anticipated bounded nonlinearities of an unknown form); hence, linear observers such as the Luenberger observer or Kalman filter were rejected, as were the linearizing assumptions of extended Kalman filters.
The high dimensionality of the model and the individually idiosyncratic nature of T1D also contraindicated the use of Bayesian methods.
Furthermore, the model to be used was not formally observable, given the limited number of available output variables.
Using the technique outlined above, derived from Greenwood, 15 enabled the direct construction of multiple underdetermined nonlinear candidate models using all available information. Each candidate generated an associated solution trajectory; the compact set comprising the volume enclosed by all such trajectories (the “dynamic envelope”) bounded the actual system trajectory by exploiting the convergent dynamics of the PSST. Combined with differential game theory, this dynamic envelope replaced the requirement for a single nonlinear observer.
This alternative technique used a variant of a genetic algorithm (GA). This is a method28-30 of achieving pattern recognition through simulating the processes of evolution at a chromosomal level. Parameter values from across a compact hypercube were inserted into one or more specified models and evolved across generations through a process of selective breeding and mutation until a pattern of observed data was heuristically matched.
Candidate vectors
This variant of a GA expressed chromosomes using the Gray code
31
instead of conventional binary encoding to avoid the formation of Hamming walls.
30
Consequently, the operation of mutation and crossover on the genes of Gray-encoded chromosomes expanded the search domain from an initial compact hypercube
In the AP context, this enabled the exploitation of existing T1D models and direct data mining of individual patients’ medical histories to generate multiple, personalized candidates for T1D dynamics consistent with known analytical models and observable data from that individual.
In contrast with the application of LPV methods to the AP,
19
here, information about stability of the insulin-based control of the full nonlinear model can be obtained directly using the minimax of the Lyapunov derivatives, where the controller is playing against the known dynamic uncertainties remaining in all plausible candidates (
Information about the level of confidence in imposing stable control (in differential game theory, the “strong controllability”) of the underdetermined system is provided by this method by generating a dynamic envelope of possible solution trajectories and striving to pour this envelope down the slope of the Lyapunov function to the target set (here including a desired interval of BG levels), despite intelligent counterstrategies exploiting known residual uncertainties to attempt to prevent this. Perturbations or uncertainties in the dynamics, if contained within this envelope over time, are then also controlled to target.
This method encompasses not only control of the actual uncertainties across
2. Model Analysis and Manipulation
The key problem was extracting an insulin control law from multiple underdetermined T1D models built from a common high-dimensional set of nonlinear differential equations, which fulfilled desired control objectives using incomplete information.
The solution involved Lyapunov functions. These are typically defined to be positive-definite
Model equations (equation 2) operating under a control program or strategy
Equation 4 represents a sufficient condition for stable control of the system. The preferred form of control corresponds with the strongest such form of stability, namely, asymptotic control to target
where
Lyapunov-based control is typically formally suboptimal under conditions of perfect information. However, when controlling a complex nonlinear system under conditions of incomplete information and unknown dynamic aspects, the Lyapunov approach has the following advantages: A controller response sufficient to overcome the effects of bounded uncertain dynamics
for asymptotically stable controls and
for strong controllability, ensuring asymptotically stable controls.

Plot of Lyapunov function showing the descent of solution trajectories towards the target.
This approach was applied to poorly modeled components of the T1D system’s nonlinear dynamics to ensure high-confidence controller stability.
Estimated regions of controllability are readily computable; such estimates are based on sufficient conditions6,7 so they are conservative and high confidence.
Designing a controller law using Lyapunov conditions imposes asymptotic stability on the resulting dynamics within the region of controllability; hence, the strongest available form of stability has been imposed as part of the design process.6,7
Real-time computation is fast (an algebraic descent condition); hence, Lyapunov methods have been studied for possible real-time military use.17,32 This also means that an additional layer of machine-intelligent computation can be introduced to operate within a feasible time, as is done here.
3. Response to Constraints
Significant dynamic constraints affect AP control:
Changes in ISFG values dynamically lag behind their BG counterparts by a time period that varies based on local BG conditions 33 and in ways that may not be entirely described by current models.34 -36
A significant fixed-value CGM sensor lag also exists as an artifact of the ISFG sensors’ filter algorithms.
Elsewhere, synthetic insulin, once infused subcutaneously, continues to act on BG levels for a considerable duration. The value of this duration of action depends on local dynamic conditions as well as the form of insulin being used.
Consequently, this demonstration did not use CGM data for direct prediction of system dynamics, nor did it assume a specific horizon for a past insulin dose. Both issues were instead resolved through the predictive use of a large number of candidate solution trajectories, generated from the model equations using underdetermined evolutionary fitting to the patients’ available medical histories:
Partial system identification via data mining medical histories using a variant of a GA gave multiple candidate values for parameters in self-consistent combinations, including those relating ISFG to BG (and similarly for insulin, assuming consistent use of a single insulin type).
These various values generated multiple trajectories, forming a forward-moving divergent dynamic envelope of the actual system dynamics.
Ongoing CGM data were used to establish recent past values for ISFG and reassess expected recent values for BG, updating the starting point for the dynamic envelope in the recent past every 5 minutes. Thus, CGM data still formed part of a closed-loop AP control but were not used for direct prediction (performed instead by the dynamic envelope). To emphasize the robustness of this approach, in these simulations, the fixed-value CGM sensor lag was taken to be relatively large:
4. Modeling of Meals
Meals represent a major source of uncertainty in BG dynamics, with carbohydrate content difficult to predict accurately for most patients with diabetes. Consequently, the AP literature typically imposes strict constraints on meal carbohydrate content, timing, or both38,39 and views meals as a “disturbance that must be rejected by the control law implemented within the AP algorithm.” 39
This study took a different view: meals are an inherent, if highly uncertain, part of the T1D system dynamics. Their effects are not a disturbance to be rejected but are handled by the control law applying game theory on the dynamic envelope (below).
To demonstrate this, WM3’s medical history provided actual meal data from several months in 2008, which were used in this study:
Meals were irregularly timed and had irregular carbohydrate content.
The algorithm received only 10 minutes’ warning before each meal (this constraint was deliberately harsh to emphasize the algorithm’s robustness to short-warning perturbations; in real-world applications, a longer-duration warning would be feasible).
Estimates of meals’ carbohydrate content were initially assumed accurate, and then robustness studies were performed using ±25% error.
5. Machine-Intelligent Handling of Uncertainty: Game Against Nature
The model dynamics had 2 forms of significant uncertainty:
Uncertainties in the values of parameters and state variables due to underdetermined equations, manifested by the dynamic envelope, and
Uncertainty associated with an inherently uncertain variable (meal carbohydrates), affecting this dynamic envelope as it flowed through state space.
The solution to handling these dynamic uncertainties was to overlay the so-called Game against Nature as an additional layer of machine-intelligent decision making on top of the Lyapunov formulation.11,14,16,17,40
Equation 4 shows the essential criterion for steering a trajectory to a target set despite uncertainty. The Lyapunov-based Game against Nature (L-GaN) goes further, assuming all such (bounded) uncertainty
In the L-GaN implementation demonstrated, the dominant uncertainty was the underdetermined model equations partially identified from available data, leading to a finite set of possible parameter vectors
with the preferred form (from equation 7)
When no single strategy satisfied equations 8 or 9 for all
for some
If possible, all candidate parameter vectors associated with trajectories projecting hypoglycemia from
If not all
Subordinate to the above,
Subordinate to the above,
The T1D system was primarily derived from Magni et al 22 with four modifications:
The unit step function
A dimensional anomaly was found in the BG-ISFG dynamics of Magni et al.
22
This was remedied by inserting an additional term corresponding with
Meal equations were taken from Magni et al
22
and Dalla Man et al.
41
The medical histories of “brittle” patients with diabetes at Westmead Hospital demonstrated significantly higher glycemic sensitivity than predicted by Dalla Man et al,
41
so a correction factor
Clinical T1D typically has vestigial pancreatic insulin secretion, complicating insulin control. To simulate the presence of this tiny insulin signal,
Then, the system equations were as follows (with variables as defined in Table 2):
Variables in the State Vector
The components of
For each participant,
Partial identification was to be achieved through generation of vectors
Apart from
A related question in this study was the impact on stable insulin control of PI assay (non)availability during identification:
PK parameters for subcutaneous insulin infusion were assumed to be well estimated by the insulin manufacturer and available to the clinician. Reassessing existing manufacturers’ estimates of insulin infusion PK using these algorithms is possible but was regarded as a less interesting question than exploring the deeper capabilities of these algorithms in tandem with existing estimates, with and/or without PI time series being available, so accurate infusion PK estimates were assumed.
Consequently, equation 15 was replaced by equation 24:
Were this assumption to be abandoned, the same identification techniques demonstrated here would be repeated for
The literature21,22,24,25,33,41
-44 gave possible
When such intervals existed, an associated search interval
The parametric plausible candidate hypercube
Hence, using the standard “black box” method for simulation testing, vectors of system parameters
S1 used representative T1D values from the above literature, as a “typical” patient with diabetes (basal BG = 179.5 mg/dL).
S2 was modified within the plausible range of diabetic values to reflect more extreme behavior observed in a “brittle” patient with diabetes (in particular, basal BG = 231.7 mg/dL), chosen to be challenging to reduce the desired BG interval [80, 140] mg/dL without either inducing hypoglycemia or experiencing interim hyperglycemia.
Using 61 days’ actual meal and insulin pump data from WM3’s medical history,
The observable time series from these medical histories were given to the identifier to analyze. Identification involved hypothesis formulation and predictive testing on independent data. Up to 11 days’ actual meal data were also used for the controller phase. Throughout identification, the “tracking criterion” was that trajectories must track fasting BG and ISFG measurements to within ±1 mg/dL and PI data, where measured, within ±5 pM.
Results
Computational processes were divided between time-intensive identification, performed offline prior to the controller regime, and the controller, which was demonstrated to operate effectively in real time (computations required in 10 minutes’ simulated time were performed within 10 minutes in real time).
For each participant, the process was as follows.
Stage 1: Initial Identification
This was performed on a single day’s fasting data (September 23) 7.5 hours from midnight to breakfast.
For each participant, two scenarios were studied:
(I): PI assays accompanied each fasting BG measurement in stage 1, and
(No I): no such PI measurements were available.
Running in a parallel-computing architecture using an AMAX Servmax MNL-1185 (AMAX, Fremont, California) high-performance server, equipped with 2 quad-core Intel Xeon E5520 (Intel, Santa Clara, California) Nehalem processors and 3 Nvidia Tesla C1060 (Nvidia, Santa Clara, California) cards, each time, the identifier used 100 chromosomes, initially generated randomly across
This was done using 10 to 15 runs per participant per scenario, generating an analysis of 10 million to 22.5 million (typically 15 million) candidate vectors each. Table 3 lists the numbers of compliant vectors/candidate trajectories derived from fasting data, each corresponding with an initial hypothesis of the system dynamics based on incomplete information. Note that the severely underdetermined nature of the identification process and the nonlinearity and complexity of system and model equations meant that the possibility of large-scale divergent dynamics remained: models constructed on
Number of Hypotheses per Participant and Scenario.
(I), scenario where plasma insulin (PI) assays are available in Stage 1 identification; (No I), scenario where PI assays are unavailable.
Stage 2: First-Pass Fasting Open-Loop Predictive Testing
Using 8 distinct days’ fasting insulin infusion data (6-8 hours daily; July 25 to August 1), hypotheses predicted BG and ISFG trajectories each day, which were compared with “black box” measurements. All compliant stage 1 hypotheses tracked the measurements each day within the tracking criterion.
This was then successfully repeated using another 10 days’ fasting data (6-8 hours daily; August 2-11).
Stage 3: First-Pass Open-Loop Prandial Filter
Insulin infusion and meal time series across September 23 were then supplied to the hypotheses, which predicted BG and ISFG time series for 24 hours.
Given the uncertainty associated with meal dynamics, the key question was whether the dynamic envelope encompassed all known system behavior.
The dynamic envelope was indeed observed to surround all actual “black box” postprandial BG and ISFG measurements (Table 4, left-hand column: BG data shown as squares).
As the hypotheses were generated under fasting conditions, some trajectories diverged significantly from observed postprandial BG behavior while still imposing an envelope upon it. Those diverging too far from observed BG levels were culled as extraneous to reflect the observed meal dynamics reality and reduce the computational burden of the L-GaN. Table 3 shows the number of remaining hypotheses in the “reduced” dynamic envelope, illustrated in Table 4 (left-hand column) for both participants and both scenarios, including the associated tracking/prediction of PI time series (right-hand column). Note that the actual PI time series (dashed blue line) had its waveform successfully predicted as a candidate, when PI data were unavailable.
Construction of the Reduced Dynamic Envelope.
(Left) The reduced dynamic envelope generated from fasting data, with extraneous trajectories removed, shown for blood glucose (BG) (mg/dL) for September 23 with actual measurements superimposed (squares). Note that it enclosed the actual BG dynamics. (Right) Projected plasma insulin (PI) (pM), with actual PI shown (dashed blue line): (I) scenarios achieved prediction of blood insulin from fasting partial information, while (No I) scenarios predicted PI waveform as a function of underdetermined amplitude. (I), scenario where PI assays are available in stage 1 identification; (No I), scenario where PI assays are unavailable.
Stage 4: Second-Pass Open-Loop Prandial Predictive Testing
Remaining hypotheses were given whole-of-day meal and insulin infusion time series for 10 distinct days (August 2-11) and generated predictions for BG and ISFG, which were then compared with “black box” measurements for consistency.
The dynamic envelopes passed all the stages of testing and thus were deemed suitably robust as a predictive tool.
The next part of the study was to demonstrate the L-GaN closed-loop controller applying the triage logic of equations 8 and 10 to the participants under both scenarios. On each of 5 to 11 days (September 23 repeated as a common benchmark; additional days from August 2-11), the closed-loop controller was tasked with steering BG levels to the desired target
The baseline controller sampled BG every 2 hours and ISFG using the ongoing flow of CGM data every 5 minutes and assumed accurate carbohydrate estimates. Control began at 0001 hours each day, with the controller recomputing strategies on a time scale of minutes. To emphasize algorithm stability, each day was treated in isolation, beginning at basal conditions (instead of successor days benefitting from previous successful BG control).
To distinguish between initial high BG conditions and subsequent hyperglycemia, a grace period of 6 hours (midnight to 0600 hours) was permitted when logging nonhypoglycemic BG levels to handle the daily descent from basal conditions (Figure 2). Hypoglycemic events (if any) were to be recorded over the full 24 hours.

Plot of successful Lyapunov-based Game against Nature control of blood glucose, showing controlled descent during the “grace period” (checkered) and controlled meal perturbations.
As shown in Tables 5 and 6, the baseline L-GaN insulin control was successfully established, with no high BG occurring after 0600 hours, or hypoglycemia at any time, over 38 simulated days. In the case of S2, BG fingerstick samples were then relaxed to every 4 hours to assess this effect on performance over 5 additional simulated days.
Results of Baseline L-GaN Control for S1.
WM3’s and S1’s values under WM3’s insulin regime (shaded) are included for comparison. BG, blood glucose; CGM, continuous glucose monitoring); I, scenario where plasma insulin assays are available in Stage 1 identification; L-GaN, Lyapunov-based Game against Nature; (No I), scenario where plasma insulin assays are unavailable. Bold text and numbers pertain to the desired target interval for blood glucose values.
Results of L-GaN Control for S2: BG Sampling Period of 2 Hours or 4 Hours.
S2’s values under WM3’s insulin regime (shaded) are included for comparison. BG, blood glucose; (I), scenario where plasma insulin assays are available in stage 1 identification; L-GaN, Lyapunov-based Game against Nature; (No I), scenario where plasma insulin assays are unavailable. Bold text and numbers pertain to the desired target interval for blood glucose values.
Meal robustness tests were then performed over 6 days per participant under No I conditions:
Fingerstick tests were relaxed to every 4 hours.
In accordance with Patek et al, 39 carbohydrate data communicated to the controller were falsified by ±25% to test robustness to meal uncertainty. To avoid random errors partially canceling, 2 distinct scenarios of daylong systematic errors were tested: systematic +25% and −25% meal errors were applied to all meals across simulation days.
The +25% test (increasing actual carbohydrate content by 25% over the values given the AP; all meals) used a single day (September 23). The −25% test (reducing actual carbohydrate content by 25% below the values given the AP; all meals) was regarded as more important due to the risk of hypoglycemia from excessive insulin, so it used 5 days’ meal data (September 23, August 2-5) to assess.
Successful, stable closed-loop control was achieved on all days (Table 7), despite this additional uncertainty.
Meal Robustness Tests for L-GaN Control: BG Sampling Every 4 Hours.
BG, blood glucose; L-GaN, Lyapunov-based Game against Nature; (No I), scenario where plasma insulin assays are unavailable. Bold text and numbers pertain to the desired target interval for blood glucose values.
Hence, stable closed-loop insulin control was successfully demonstrated under realistic meal conditions for both participants under both scenarios, without hypoglycemia or postprandial hyperglycemia, for a total of 55 days.
Discussion
The simulations showed the following:
Available fasting PI data during identification significantly improved BG control for S1 (Table 5), although it had negligible effects in the analogous S2 baseline study, where the reduced dynamic envelope appeared to be sufficient (Table 6).
Relaxing fingerstick intervals in the S2 (No I) study to 4 hours had a significant effect on BG levels, shifting BG activity upwards (Table 6).
As shown in Table 7, the carbohydrate +25% study for S1 pushed BG out of target 57.3% of the time, although no postprandial high BG levels or hyperglycemia ensued. The uncertainty had a negligible effect in the equivalent study for S2, suggesting that S2’s dynamic envelope (profuse
The reduced dynamic envelopes for both participants were robust against hypoglycemia under the carbohydrate −25% studies in BG levels dropping low gave the antihypoglycemia strategies priority within S1’s dynamic envelope, pushing the subsequent profiles up.
Table 8 illustrates examples of the stable control for both participants, first under the I scenario with accurate meal data and then under the No I carbohydrate −25% scenario, all using WM3’s actual meal history of August 2 (left-hand column) and August 3 (right-hand column).
The absence of any hypoglycemic or postprandial high BG levels is attributed to the controller’s application of triage logic to the reduced dynamic envelope generated by the evolutionary identifier.
Examples of BG Plots (mg/dL): August 2 and 3.
Showing the Lyapunov-based Game against Nature controller achieving stable blood glucose (BG) control despite realistic uncertainties. (I), scenario where plasma insulin assays are available in stage 1 identification; (No I), scenario where plasma insulin assays are unavailable.
A key question for any AP controller for real-world medical deployment is its stability under dynamic perturbations. An advantage of the present technique was that the stability of its Lyapunov controller could be directly analyzed.
Although the controller function
was adequate to test BG stability with respect to the objective of steering the trajectory to the target and retaining it there. Segments of the trajectory within the target T BG were ignored as the objective was already met; however, meal- and insulin-based perturbations enabled the ongoing possibility of target escape or excursions.
Analysis over the 55 days revealed that the controller established strong controllability for asymptotically stable control, the strongest possible form of stable control, as stipulated in equation 7 by 0100 hours every morning, retaining it while steering the trajectory to the target under fasting conditions in preparation for meal-based perturbations that might break stability. Including the daily descent phase from midnight, trajectories were external to the target 21.67% of the overall simulation time. They were under stable control for 99.43% of this period and so suffered formal instability for less than 1% of their overall time external to the target. As shows, the scenario that suffered the most meal-induced breaks in formal Lyapunov stability was the S1 (No I) carbohydrate +25% scenario, where deliberately inputting false meal data caused formal stability to lapse for 19.01% of the 24-hour day, but under this scenario, the algorithm still imposed effective control, preventing all BG excursions from leaving the interval [80, 160] mg/dL.
Conclusions
The key problems confronting stable, high-confidence AP control of BG via insulin infusion are the following:
Realistic T1D models in the literature are significantly underdetermined when using available time series data.
Three dynamic uncertainties dominate closed-loop control: namely, BG-ISFG lag, duration of insulin effect, and meal carbohydrate uncertainty.
Stability of the BG controller algorithm is essential in an uncertain environment.
This study demonstrated that these problems could be solved by resolving them simultaneously: evolving multiple candidate models for a realistic T1D model from partial information and then combining Lyapunov stability theory and differential game theory in the construction of insulin control strategies to handle dynamic uncertainties robustly using machine-intelligent “triage logic.” Rigorous simulation testing suggests that this is a stable, high-confidence way to generate closed-loop insulin infusion strategies, although this needs to be confirmed in a clinical study.
Provided the model represented a good approximation of the T1D system, such that the dynamic discrepancies could be represented by the combination of finitely bounded parametric uncertainties within the
Abbreviations
AP, artificial pancreas; BG, blood glucose; CGM, continuous glucose monitoring; GA, genetic algorithm; I, scenario where plasma insulin assays are available in stage 1 identification; ISFG, interstitial fluid glucose; L-GaN, Lyapunov-based Game against Nature; LPV, Linear Parameter Varying; MRAC, Model Reference Adaptive Control; No I, scenario where plasma insulin assays are unavailable; PI, plasma insulin; PK, pharmacokinetics; PSST, Product State Space Technique; S1, (simulated) participant 1; S2, (simulated) participant 2; T1D, type 1 diabetes; WM3, Westmead 3 (designation of a volunteer with “brittle” T1D at Westmead Hospital, Sydney, who consented to make medical data available).
Footnotes
Acknowledgements
The authors acknowledge the kind assistance of Dr D. Jane Holmes-Walker at Westmead Hospital, Sydney.
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Both Dr Greenwood and Professor Gunton hold shares in Diabetes Neuromathix Pty Ltd, a company established in 2013 as the collaborative vehicle for developing and trialing the Neuromathix artificial pancreas algorithms.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors acknowledge support from a JDRF Innovative Grant (5-2011-477) and a Queensland Government Proof-of-Concept Grant during this project. This work was also partly funded by NeuroTech Research Pty Ltd. Human data was sourced from volunteers at Westmead Hospital, Sydney, through earlier work by the authors funded by the Queensland Government (Medical Devices Financial Incentive Program funding and a Proof-of-Concept Grant) and NeuroTech Research Pty Ltd.
