Sage Journals: Discover world-class research

Abstract

Keywords

Introduction

Systems for continuous glucose monitoring (CGM) and automated insulin delivery (AID) are widely used by patients with diabetes (PwD) these days, with impressive improvements in glucose outcomes and increases in the number of users year by year. The accuracy of CGM systems (which are a critical component of AID systems) has improved a lot in the last 20 years. However, even with the most recent generations of such systems, it is not clear if the accuracy is good in all glucose ranges, especially the low glucose range (<70 mg/dL). Comparison of the accuracy of different CGM systems is hampered by the fact that the study procedures for clinical evaluation of CGM performance are not standardized yet. In this regard, a working group of the International Federation of Clinical Chemistry and Laboratory Medicine (IFCC) has recently recommended a respective procedure.^1,2

In relatively short time intervals, the manufacturers of CGM systems bring new generations of their devices onto the market. These generations differ from each other in several aspects, like easier handling, smaller housings, longer usage time, etc.; however, usually, the quality of glucose measurement is also improved. At least this is what is stated in the marketing announcements for the new products, documented by lower mean absolute relative difference (MARD) numbers in the clinical studies performed for regulatory approval by the manufacturer with manufacturer specific procedures. The analytical performance of (new) CGM systems is evaluated in clinical studies that are required for market approval by the regulatory agencies in the United States and Europe (and other parts of the world). The manufacturer evaluates the performance of their CGM systems during the clinical development process by comparing the measurement results to those measured in parallel with conventional glucose analyzers in capillary or venous blood samples. The study procedures differ between manufacturers and are most probably optimized for the device and algorithm tested.¹ The differences in the measurement results are given as MARD values. An MARD value <10% is regarded as sufficient for reliable diabetes therapy performance and usage in AID systems; however, the MARD is a parameter that has several limitations, in particular the dependency on the study protocol.^3-5

In the past, reports were published in which the performance of a new product (= generation of a CGM system) was compared with the previous generation of a given manufacturer or has tested the same sensor in combination with different algorithms; however, most often, no such “internal” comparisons are published. Reports, in which the performance of a given product from one manufacturer is compared with the product made by another manufacturer, are more frequently performed; however, there are a limited number of respective publications (Table 1). Such “external” comparisons are of high interest for PwD, healthcare professionals, and also for payers, especially as CGM/AID-derived parameters like time in range appear in therapy guidelines. Ideally, such evaluations are performed with the devices attached to the same PwD, to reduce the impact of all factors influencing the observed performance. In addition, the study design should not be optimized to get a good MARD for a specific device, but optimized to the patient need and challenge the CGM to get reliable information for daily use over the whole measurement range including higher rates of change as requested by the IFCC working group._² Another very important topic is the choice of the comparison measurement method and evaluation procedure.⁶ In such studies, the same “generation” of systems should be tested.

Table 1.

Studies Reporting Results From Head-to-Head Comparisons of Different CGM Systems Published in the Last Five Years With Otherwise Healthy PwD.

First author/Title of publication	CGM system	Study funding	Study subjects (N)	MARD (%)
Freckmann et al.⁷ Performance and Usability of Three Systems for Continuous Glucose Monitoring in Direct Comparison	Dexcom G5 Medtronic Guardian Connect Roche CGM	Roche Diabetes Care GmbH	54	10.6 11.6 10.7
Kumagai et al.⁸ Comparison of glucose monitoring between Freestyle Libre Pro and iPro2 in patients with diabetes mellitus	Freestyle Libre Pro Medtronic iPro2	None	10	n.r
Moser et al.⁹ A head-to-head comparison of personal and professional continuous glucose monitoring systems in people with type 1 diabetes: Hypoglycaemia remains the weak spot	Medtronic Enlite 2 + iPro2 Medtronic Enlite 2 + 640G	None	10	19.1 19.0
Denham et al.¹⁰ A Head-to-Head Comparison Study of the First-Day Performance of Two Factory-Calibrated CGM Systems	FreeStyle Libre 2 Dexcom G6	Abbott Diabetes Care	25	13.2 18.5
Fokkert et al.¹¹ Performance of the Eversense versus the Free Style Libre Flash glucose monitor during exercise and normal daily activities in subjects with type 1 diabetes mellitus	Eversense Freestyle Libre	Bas van de Goor foundation	23	17/13 20/12 Exercise/normal daily activity
Jafri et al.¹² A Three-Way Accuracy Comparison of the Dexcom G5, Abbott Freestyle Libre Pro, and Senseonics Eversense Continuous Glucose Monitoring Devices in a Home-Use Study of Subjects with Type 1 Diabetes	Dexcom G5 Freestyle Libre Pro Eversense	None	23	16.3 18.0 14.8
Tsoukas et al.¹³ Accuracy of FreeStyle Libre in Adults with Type 1 Diabetes: The Effect of Sensor Age	Freestyle Libre Dexcom G5	None	14	12.8 12.5
Boscari et al.¹⁴ Comparing the accuracy of transcutaneous sensor and 90-day implantable glucose sensor	Dexcom G5 Eversense	None	11	n.r.
Ji et al.¹⁵ Multicenter Evaluation Study Comparing a New Factory-Calibrated Real-Time Continuous Glucose Monitoring System to Existing Flash Glucose Monitoring System	AiDEX Freestyle Libre	Microtech Medical (Hangzhou) Co. Ltd.	120	9.1 17.1
Link et al.¹⁶ Comparative Accuracy Analysis of a Real-time and an Intermittent-Scanning Continuous Glucose Monitoring System	Dexcom G5 Freestyle Libre	Dexcom Inc.	27	9.5 13.6
Nagl et al.¹⁷ Performance of three different continuous glucose monitoring systems in children with type 1 diabetes during a diabetes summer camp	Freestyle Libre Dexcom G6 Medtronic Enlite 2 + 640G	None	38	n.r.
Pleus et al.¹⁸ Variation of Mean Absolute Relative Differences of Continuous Glucose Monitoring Systems Throughout the Day	Dexcom G5 Freestyle Libre	Ascensia Diabetes Care Holdings AG	24	13.2 12.5
Boscari et al.¹⁹ Implantable and transcutaneous continuous glucose monitoring system: a randomized cross over trial comparing accuracy, efficacy and acceptance	Eversense Dexcom G5	None	16	12.3 13.1
Yeoh et al.²⁰ A head-to-head comparison between Guardian Connect and Freestyle Libre systems and an evaluation of user acceptability of sensors in patients with type 1 diabetes	Medtronic Guardian Connect Freestyle Libre	Alexandra Health Enabling Grant	10	9.7 17.5
Kölle et al.²¹ Performance Assessment of Three Continuous Glucose Monitoring Systems in Adults With Type 1 Diabetes	GlucoRx AiDEX FreeStyle Libre 2 FiberSense System	FIND	30	21.9 9.2 14.7
Lundemose et al.²² Factory-Calibrated Continuous Glucose Monitoring Systems in Type 1 Diabetes: Accuracy during In-Clinic Exercise and Home Use	FreeStyle Libre 2 Dexcom G6 Guardian 4	None	13	17.2 12.6 10.7

CGM: continuous glucose monitoring; PwD: patients with diabetes.

Studies conducted in the hospital or with special patient groups, such as pregnant women, dialysis patients, etc. were excluded.

Such so-called “head-to-head” studies provide helpful insights into the performance of a given CGM system and allow a reasonable interpretation of the results, like “Is the analytical performance of the tested CGM systems truly comparable/different?” In addition, if the study is well-designed, finer distinctions like “Is the analytical performance better across the whole range of glucose values/all daily life situations/over duration of usage/patient groups?” can be made. “Head-to-head” studies can thus diminish the main weakness of MARD values and other outcome metrics in points of comparability and allow for a more objective judgement of CGM system accuracy.

Practical Aspects of Performing Head-to-Head Studies

Clinical studies during which the glucose sensors of 2, 3, or more CGM systems are fixed to the abdomen, arm, or thigh of volunteers are, in principle, conducted in the same way as a “regular” CGM/AID study. However, the study protocol needs to be adjusted to account for the differences in sensor lifetime of different CGM systems. As the accuracy of a given CGM system might change over time, it is important to schedule frequent sampling periods with dynamic glucose excursions at the same point regarding sensor lifetime to ensure objective comparability. Wearing several CGM systems at the same time poses an additional burden on the study subjects (most often PwD), and there is a limitation of how many CGM systems can be attached at the same time. This issue has been more relevant in the past though, since nowadays most CGM systems are approved for several application sites and not only limited to the abdomen and the size of the sensors has also significantly decreased over the years (Figure 1).

Figure 1.

Example of six CGM sensors attached to one subject during a clinical study conducted in 2013.

A clear advantage of this approach is, that all systems measure identical glucose levels and changes in a given subject. It is known that glucose sensors located at different body sites (arm, abdomen, and thigh) provide some differences in measurement results (also differences between left and right arm were reported); however, the application sites labelled by the manufacturer should be adhered to in studies. What will always remain is a certain sensor-to-sensor variability that cannot be excluded and has rather technical than physiological causes.

The term “head-to-head studies” sometimes also refers to studies during which a group of volunteers uses a given CGM system for some time (weeks to months) and subsequently switches to a different system. Alternatively, different devices can be randomized to multiple groups of volunteers. However, in a strict sense, such studies are not head-to-head studies as the measurements are not performed on the same subject at the same point in time.

“Head-to-head” studies can also be performed under outpatient conditions, reflecting more real-world experience. However, the parallel measurement of blood glucose values in quality and quantity in parallel can be hampered under such conditions.

Publications of the Results of Head-to-Head Studies Performed in the Last Five Years

Looking at the past five years, there were a total of 16 studies that compared the performance of more than one CGM device in otherwise healthy PwD (eg, studies conducted with pregnant women or in the hospital were excluded) (Table 1). Until recently, there have been no publications using the most recent generations of CGM devices, eg, FreeStyle Libre 3 and Dexcom G7, though.

One might ask “Why do we not see the publication of such studies more regularly with new CGM/AID systems?” The products are freely available on the market and therefore in principle, everybody who is interested and experienced can perform such studies. However, the conduction of clinical studies is associated with considerable costs. Manufacturers of CGM/AID systems are reluctant to sponsor such studies as they are carrying the risk that the study outcome might not be beneficial for their product, ie, the manufacturer of a given competitor product might benefit more from the study results. In case the manufacturer of a given CGM/AID system does not support study performance, who is willing to sponsor/support such studies?

One option for support would be payers. Healthcare insurances pay a lot of money every year for CGM/AID systems (with massive increases year by year), why do they not invest in a thorough investigation of system performance before paying for them? They rely on the regulatory approval process etc., while knowing that their needs—and that of their customers—(effectiveness, ease of use) differ from those that are in the focus of regulatory agencies (safety). One wonders why such studies are not required by regulatory agencies. In principle, such studies should be performed by independent research institutions to avoid bias; however, this would also mean independent support for such studies.

Looking at the funding sources of the published “head-to-head” studies comparing devices from different manufacturers conducted during the last five years, there was one study funded each by Abbott and Dexcom. Without surprise, the results of both studies were positive in favor of the product of the sponsor of the study. It can be assumed that more studies were performed but results were not published as they were not beneficial for the funder, emphasizing the need for independent funding.

“Head-to-Head” Comparison of Current CGM Systems

In this issue of JDST, a study is published to present the results of an evaluation of the point accuracy between two different current CGM systems (Dexcom G7 vs FreeStyle Libre 3).²³ The results of this multicenter, single-arm, prospective, nonsignificant risk evaluation with 55 PwD with type 1 diabetes (T1D) or type 2 diabetes (T2D) showed lower MARD with the Libre 3 CGM system compared with the G7 system (8.9% vs 13.6%). The authors conclude that the Abbott CGM system is more accurate than the Dexcom system in all metrics evaluated. They also ask for additional head-to-head accuracy studies of competitive CGM systems, using standardized metrics and methodologies. This study was sponsored by Abbott.

“Head-to-Head Study” With AID Systems

Looking at AID systems, all issues raised and discussed above remain, but there is the additional limitation that only one system can really “do the job,” ie, infuse insulin and modify glycemia. This limitation makes it impossible to conduct true “head-to-head studies” with AID systems even if they are so-called.

At the EASD 2023, the results of the first large-scale head-to-head study with three different AID systems were presented.²⁴ In a center in Barcelona, 132 adults with type 1 diabetes prospectively used an AID system from Diabeloop (DBLG-1), Medtronic (MiniMed 780G), or Tandem (Control-IQ) over 12 months, although this study was not randomized. Excitingly, the results—overall—were not significantly different: HbA1c levels fell by 0.9% after 12 months, from a mean baseline of 7.5% to 6.6%. Time in target range (TiR) increased from 60% to 74%, ie, by 3.4 hours per day. The participants achieved a reduction in time above the range of 2.9 hours per day, from a baseline value of 36% to 24% after 12 months. The time below target also decreased from 3.9% at baseline to 1.9% at 12 months, which corresponds to a decrease in time in hypoglycemia of 13 minutes per day. A breakdown of the results by AID system showed that users of the 780G system achieved the highest TiR (79%), followed by Control-IQ users (76%) and Diabeloop users (69%). All three systems achieved a low time below the TiR: 2.6%, 2%, and 1.4%. The largest reduction in HbA1c was achieved by the Control-IQ users (1.1%), the Diabeloop users by 0.9%, and the 780G users by 0.6%. The question is to what degree the results of this nonrandomized study would have been different if the allocation of patients to the respective AID system had been randomized. The initial HbA1c value of the users of the 780G AID system was already lower at the beginning.

Summary

In summary, up to now “head-to-head” studies with a thoroughly selected study design remain to be the only way to truly compare the accuracy of different CGM systems making them especially valuable. It is encouraging to see the first “head-to-head study” of the most recent generations of CGM systems published in this issue. However, the need for more independent funding for further “head-to-head” studies and for more standardization of study design which would also enhance comparability between different CGM and AID systems remains.

Footnotes

Acknowledgements

The helpful comments of Dr Stephanie Wehrstedt and several other clinical colleagues are fully acknowledged.

Correction (April 2024):

Editorial updated; for further details, please see the Article Note at the end of the article.

Abbreviations

AID, automated insulin delivery; CGM, continuous glucose monitoring; PwD, patients with diabetes

Declaration of Conflicting Interests

The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: G.F. is the general manager and medical director of the Institute for Diabetes Technology (Institut für Diabetes-Technologie Forschungs- und Entwicklungsgesellschaft mbH an der Universität Ulm, Ulm, Germany), which carries out clinical studies, eg, with medical devices for diabetes therapy on its own initiative and on behalf of various companies. G.F./IfDT has received research support, speakers’ honoraria, or consulting fees in the last three years from Abbott, Ascensia, Berlin Chemie, Boydsense, Dexcom, Lilly, Metronom, Medtronic, Menarini, MySugr, Novo Nordisk, PharmaSens, Roche, Sanofi, and Terumo. D.W. is an employee of IfDT. L.H. is a consultant for several companies that are developing novel diagnostic and therapeutic options for diabetes treatment. He is a shareholder of the Profil Institut für Stoffwechselforschung GmbH, Neuss, Germany.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iDs

Guido Freckmann

Delia Waldenmaier

Lutz Heinemann

Article Note

The following updates were made to this article:

In Table 1, the values in the column “MARD (%)” have been updated for the following rows corresponding to the “First Author/Title of Publication” column: Kumagai et al.⁸; Comparison of glucose monitoring between Freestyle Libre Pro and iPro2 in patients with diabetes mellitus; Performance of the Eversense versus the Free Style Libre Flash glucose monitor during exercise and normal daily activities in subjects with type 1 diabetes mellitus; Kölle et al.²¹; Continuous Glucose Monitoring Systems in Adults With Type 1 Diabetes

Reference ‘Hanson K, Kipnes M, Tran H’ earlier mentioned in the text has been numbered as 23 in text and has been added to the reference list.

Earlier reference 23 ‘Gregori ARF, Andújar DS, Flores IC’ has been renumbered as 24 in both text and reference list.

References

Freckmann

Eichenlaub

Waldenmaier

, et al. Clinical performance evaluation of continuous glucose monitoring systems: a scoping review and recommendations for reporting. J Diabetes Sci Technol. 2023;17(6):1506-1526. doi:10.1177/19322968231190941.

Eichenlaub

Pleus

Rothenbühler

, et al. Comparator data characteristics and testing procedures for the clinical performance evaluation of continuous glucose monitoring systems. Diabetes Technol Ther. Epub ahead of print 9 January 2024. doi:10.1089/dia.2023.0465.

Kirchsteiger

Heinemann

Freckmann

, et al. Performance comparison of CGM systems: MARD values are not always a reliable indicator of CGM system accuracy. J Diabetes Sci Technol. 2015;9(5):1030-1040. doi:10.1177/1932296815586013.

Reiterer

Polterauer

Schoemaker

, et al. Significance and reliability of MARD for the accuracy of CGM systems. J Diabetes Sci Technol. 2017;11(1):59-67. doi:10.1177/1932296816662047.

Heinemann

Schoemaker

Schmelzeisen-Redecker

, et al. Benefits and limitations of MARD as a performance parameter for continuous glucose monitoring in the interstitial space. J Diabetes Sci Technol. 2019;21:1932296819855670. doi:10.1177/1932296819855670.

Pleus

Eichenlaub

Gerber

, et al. Improving the bias of comparator methods in analytical performanceassessments through recalibration. J Diabetes Sci Technol. Epub ahead of print 22 October 2022. doi:10.1177/19322968221133107.

Freckmann

Link

Kamecke

Haug

Baumgartner

Weitgasser

Performance and usability of three systems for continuous glucose monitoring in direct comparison. J Diabetes Sci Technol. 2019;13(5):890-898. doi:10.1177/1932296819826965.

Kumagai

Muramatsu

Fujii

, et al. Comparison of glucose monitoring between Freestyle Libre Pro and iPro2 in patients with diabetes mellitus. J Diabetes Investig. 2019;10(3):851-856. doi:10.1111/jdi.12970.

Moser

Pandis

Aberer

, et al. A head-to-head comparison of personal and professional continuous glucose monitoring systems in people with type 1 diabetes: hypoglycaemia remains the weak spot. Diabetes Obes Metab. 2019;21(4):1043-1048. doi:10.1111/dom.13598.

10.

Denham

. A head-to-head comparison study of the first-day performance of two factory-calibrated CGM systems. J Diabetes Sci Technol. 2020;14(2):493-495. doi:10.1177/1932296819895505.

11.

Fokkert

van Dijk

Edens

, et al. Performance of the Eversense versus the Free Style Libre Flash glucose monitor during exercise and normal daily activities in subjects with type 1 diabetes mellitus. BMJ Open Diabetes Res Care. 2020;8(1):93. doi:10.1136/bmjdrc-2020-001193.

12.

Jafri

Balliro

El-Khatib

, et al. A three-way accuracy comparison of the Dexcom g5, Abbott Freestyle Libre Pro, and Senseonics Eversense continuous glucose monitoring devices in a home-use study of subjects with type 1 diabetes. Diabetes Technol Ther. 2020;22(11):846-852. doi:10.1089/dia.2019.0449.

13.

Tsoukas

Rutkowski

El-Fathi

, et al. Accuracy of Freestyle Libre in adults with type 1 diabetes: the effect of sensor age. Diabetes Technol Ther. 2020;22(3):203-207. doi:10.1089/dia.2019.0262.

14.

Boscari

Vettoretti

Amato

AML

, et al. Comparing the accuracy of transcutaneous sensor and 90-day implantable glucose sensor. Nutr Metab Cardiovasc Dis. 2021;31(2):650-657. doi:10.1016/j.numecd.2020.09.006.

15.

Guo

Zhang

Chen

. Multicenter evaluation study comparing a new factory-calibrated real-time continuous glucose monitoring system to existing flash glucose monitoring system. J Diabetes Sci Technol. 2023;17(1):208-213. doi:10.1177/19322968211037991.

16.

Link

Kamecke

Waldenmaier

, et al. Comparative accuracy analysis of a real-time and an intermittent-scanning continuous glucose monitoring system. J Diabetes Sci Technol. 2021;15(2):287-293. doi:10.1177/1932296819895022.

17.

Nagl

Berger

Aberer

, et al. Performance of three different continuous glucose monitoring systems in children with type 1 diabetes during a diabetes summer camp. Pediatr Diabetes. 2021;22(2):271-278. doi:10.1111/pedi.13160.

18.

Pleus

Stuhr

Link

Haug

Freckmann

. Variation of mean absolute relative differences of continuous glucose monitoring systems throughout the day. J Diabetes Sci Technol. 2022;16(3):649-658. doi:10.1177/1932296821992373.

19.

Boscari

Vettoretti

Cavallin

, et al. Implantable and transcutaneous continuous glucose monitoring system: a randomized cross over trial comparing accuracy, efficacy and acceptance. J Endocrinol Invest. 2022;45(1):115-124. doi:10.1007/s40618-021-01624-2.

20.

Yeoh

Png

Khoo

, et al. A head-to-head comparison between Guardian Connect and FreeStyle Libre systems and an evaluation of user acceptability of sensors in patients with type 1 diabetes. Diabetes Metab Res Rev. 2022;38(7):e3560. doi:10.1002/dmrr.3560.

21.

Kolle

Eichenlaub

Mende

, et al. Performance assessment of three continuous glucose monitoring systems in adults with type 1 diabetes. J Diabetes Sci Technol. 2023;12:19322968231159657. doi:10.1177/19322968231159657.

22.

Lundemose

Laugesen

Ranjan

Norgaard

. Factory-calibrated continuous glucose monitoring systems in type 1 diabetes: accuracy during in-clinic exercise and home use. Sensors (Basel). 2023;23:22. doi:10.3390/s23229256.

23.

Hanson

Kipnes

Tran

. Comparison of Point Accuracy Between Two Widely Used Continuous Glucose Monitoring Systems. J Diabetes Sci Technol. 2024. Published online January 8, 2024. doi: 10.1177/19322968231225676.

24.

Gregori

ARF

Andújar

Flores

. The effectiveness of closed-loop systems is maintained after one year of use. Diabetologia. 2023. https://www.easd.org/media-centre/home.html#!resourcegroups/order=primary_event_starts_at&page=1&group=&event_ids=&resourcetype_ids=5&tag_ids=

Head-to-Head Evaluation of Continuous Glucose Monitoring and Automated Insulin Delivery Systems: Why are They not Used More Systematically?

Abstract

Keywords

Introduction

Practical Aspects of Performing Head-to-Head Studies

Publications of the Results of Head-to-Head Studies Performed in the Last Five Years

“Head-to-Head” Comparison of Current CGM Systems

“Head-to-Head Study” With AID Systems

Summary

Footnotes

Acknowledgements

Correction (April 2024):

Abbreviations

Declaration of Conflicting Interests

Funding

ORCID iDs

Article Note

References