Sage Journals: Discover world-class research

Abstract

The article by Macleod and coauthors outlines an accuracy study of two blood glucose monitoring systems (BGMSs) against glucose-oxidase- and hexokinase-based assays showing how the reference/comparison method and inappropriate sample comparisons can affect accuracy conclusions. The dangers of independent institutions producing inappropriate conclusions when the methodology used for product regulatory or registration accuracy requirements is not according to best practice are stressed. The authors highlight several important aspects of a multistep accuracy evaluation protocol. However, it is essential that anyone undertaking or reporting BGMS accuracy studies should have a clear understanding of each and every protocol point, best practice, and how each can influence accuracy conclusions. Claims against regulatory accuracy criteria should be made only if the detailed specified protocol design and analysis is followed.

Keywords

BGMS accuracy blood glucose monitoring system FDA guidance ISO 15197 protocol design reference/comparison glucose method

The article by Macleod and coauthors¹ highlights problems that can be encountered in evaluations of blood glucose meter system (BGMS) accuracy when inappropriate protocols and different reference/comparison methods are used. An ISO 15197:2013/2015 user trial was undertaken with a comparison of blood sample results generated on two BGMSs against a glucose-oxidase-based YSI 2300 reference analyzer and hexokinase-based methods on laboratory Siemens ADVIA 2400 and Abbott Architect C16000 analyzers. Illustrations are provided showing how the reference/comparison method used, and inappropriate sample comparisons can both significantly affect accuracy claims. The authors indicate anecdotally that accuracy evaluations, producing inappropriate accuracy conclusions when methodology used is not according to best practice, are being carried out and that as part of addressing in-vitro diagnostic regulation, notified bodies will conduct their own accuracy assessments of product performance where no requirements on test methodology are provided. This could lead to independent testing institutions developing their own test methods different from best practice resulting in increased variation in blood glucose accuracy measurement. Device assessments of accuracy review as per country specific requirements for product registration could be similarly affected.

A range of articles outline how to assess and present accuracy^2-4 and best practice quality guidelines or recommendations for undertaking and reporting appropriate evaluations of BGMS accuracy are available from several sources.^5,6 Determining accuracy is however a complex process dependent on correct procedural design. Several standards/guidelines^7,8 outline requirements for BGMSs used by lay users performing self-monitoring of blood glucose. Post launch accuracy investigations include the Diabetes Technology Society (DTS) Surveillance program.⁹ Each describes detailed protocols for the assessment of accuracy as well as accuracy criteria. Although each has its own, separate opinion regarding the requirements,¹⁰ valid compliance statements with minimum accuracy criteria should be made only if the specified protocol is used and best practice understood.

Macleod et al indicate that a thorough understanding of reference instrument performance, including imprecision, calibration, and traceability at the time of the evaluation is critical to understanding any potential impact on BGMS bias. This is essential as the ISO 15197:2013/2015 standard⁷ (although system accuracy and user performance sections differ) and FDA guidance on OTC BGMs⁸ allow the use of reference/comparison methods different from the manufacturer’s stated reference instrument that was used to calibrate BGMSs against. It is however important to understand that many other aspects of best practice protocols can also influence accuracy evaluation data.

Although the correct evaluation of BGMS accuracy is complex, many publications have outlined guidelines to ensure this is done correctly. It is essential that deviation from protocol guidelines, inappropriate performance, or systematic bias in the reference/comparison method is not mistaken for inaccuracy in the investigated BGMSs.¹¹ When interpreting system accuracy data, one should be aware that the use of different methods/analyzers and deviations from best practice specific protocols have an influence on the results obtained and hence compliance with accuracy criteria. This becomes increasingly important as minimum accuracy acceptance criteria become tighter.

In addition to highlighting potential issues with the reference/comparison method and correct sample comparison the Macleod et al article incorporates many protocol details important in ensuring valid conclusions. Irrespective of if ISO 15197:2013/2015 or FDA guidance is used, it is essential that all aspects of the specified protocol are followed. Over 20 individual protocol points that must be addressed have been outlined previously and best practice identified.⁶ These are grouped below into categories based on (1) the general protocol, (2) samples and subjects, (3) reference/comparison method, and (4) data analysis, assessment, and presentation. It is essential any accuracy evaluation study or publication addresses and clearly documents each issue.

General Protocol

The type of evaluation and the specific standard/guidelines followed determines the protocol used. This will include for example details of if it is an analytical accuracy or user study, the scope, test conditions such as temperature, humidity, and location, procedural details, inclusion/exclusion based on the system’s instructions for use (IFU) or standard/guideline criteria, and who performs the tests and how. Details of the meters and strips used should also be provided including how they were obtained and stored, the number of different strip lots used together with details of their batch numbers, expiry dates and when the evaluation was performed, and the specific enzyme the strip chemistry is based on.

Accuracy evaluation studies or publications should ensure the following in relation to the general protocol:

- Only appropriate, approved, and full protocols are used to assess BGMS accuracy against specific standard/regulatory criteria.

- There is a clear unambiguous statement that all aspects of the protocol have been followed, supplemented with sufficient individual protocol point details to enable a full understanding and valid interpretation of results and conclusions. Accuracy publications^12-14 in relation to ISO 15197:2013/2015 illustrate appropriate inclusion.

- All aspects of the BGMS IFU are understood and considered in relation to the standard’s protocol guidelines. These include claims, appropriate samples, operation, and inclusion/exclusion criteria.

- Best practice aspects of all standards/guidelines are considered.

Samples and Subjects

Details of the samples and subjects to be used in specific standards/guidelines vary. The number of subjects and samples, handling of samples, the spread and distribution of sample glucose concentrations needed, and the use of specimens in which glucose concentration is altered all vary and should be appropriate when accuracy claims are made against specific criteria. A BGMS’s IFU will state the intended use of fresh capillary samples with any alternative sample type mentioned if appropriate. Any suitable anticoagulant/preservative will also be stated. As shown by Macleod et al, a sample type, such as venous blood, not claimed as suitable for use in the IFU should not be used for accuracy determination.

Accuracy evaluation studies or publications should ensure the following in relation to samples and subjects:

- Fresh capillary blood samples are compared on both the BGMS and the reference/comparison method in line with standard/guidelines, that is, comparing like with like. Comparison of capillary against venous samples is not appropriate and introduces both potential physiological and analytical differences.

- An appropriate number of samples, providing the specified number of data points, are taken from the number and type of individuals identified in the standard/guidelines. Samples satisfy the glucose concentration spread requirements outlined.

- Use of altered specimens is as detailed in the specific standard/guidelines.

- Possible effects due to glycolysis or sample handling are appropriately eliminated or minimized.

Reference/Comparison Method

As demonstrated by Macleod et al the reference/comparison method used in accuracy studies can affect a system’s apparent compliance with standard/guideline criteria. This warrants discussion and understanding of which reference/comparison methods should be used, their performance, acceptance or exclusion of results, and differences between methods.

The use of an appropriate well-managed reference/comparison methods remains of paramount importance in performing accuracy evaluations correctly.¹⁵ Its role becomes even more important with increasingly strict performance requirements for BGMSs. Reference/comparison analyzer errors and systematic measurement differences (biases) between methods that previously may have been considered acceptable may now be consequential and particularly important for comparison against analytical accuracy criteria which are more stringent than clinical accuracy requirements.

Which Reference/Comparison Method

Previous best practice⁶ and Macleod et al¹ required BGMS results to be compared against results generated by the reference/comparison method specified by the manufacturers. ISO 15197:2013/2015 (system accuracy section 6.3.2) however states “A reference measurement procedure that conforms to the traceability requirements of ISO 17511 shall be used to assign the glucose reference values.” FDA guidance quotes “comparator method refers to a laboratory-based glucose measurement method that has been well-validated for precision and accuracy and that is traceable to a higher order reference material and/or method.”

Ideally, a “gold standard” analytical method such as isotope dilution mass spectrometry should be used in accuracy studies or traceability to it shown. However the method is lengthy, is expensive, and has a lower sample throughput than glucose-oxidase- or hexokinase-based laboratory analyzers¹⁶ against which currently available BGMSs are calibrated.^10,17 In recent accuracy evaluations against ISO 15197:2013/2015 criteria 68% of publications used a glucose-oxidase-based comparison method with 61% using a YSI, 42% of studies used a hexokinase-based method with 30% performed on a Cobas instrument. Both methods were used for comparison measurements in 11% of studies. Due to the impact on accuracy evaluations, harmonization of comparison methods and definition of specific requirements should be established in calibration of BGMSs and performance analyses.¹⁷

Performance of the Reference/Comparison Method

Macleod et al correctly indicate it is critical to understand performance of the reference analyzer relating to imprecision, calibration, and traceability such as to National Institute of Standards and Technology (NIST) standards. Performance and traceability of the reference/comparison method should be determined and reported when subject samples were analyzed. Standards/guidelines however differ on statements regarding what is appropriate performance. ISO 15197:2013/2015 and the CLSI guideline¹⁸ describe procedures to verify trueness and precision of the reference method, for example, by using certified reference material.⁵

Reference/comparison method analyzers should be calibrated according to manufacturer’s instructions, and best practice would suggest to ensure traceability a minimum of four NIST glucose standard reference samples spanning the measurement range of the BGMS should be tested and bias estimated with each batch of study samples.⁹ Theoretical mathematical recalibration and adjustment of the comparison method results according to certified NIST targets have been reported.^19,20 While recalibration cannot eliminate imprecision, it could harmonize different laboratory methods with respect to bias.²⁰

ISO 15197:2013/2015 states that any reference method chosen should have adequate performance criteria, and information obtained from the manufacturer in relation to compliance with traceability requirements of ISO 17511 may be used to make this determination. It also states that the trueness and precision of the reference IVD medical device shall be verified during the performance evaluation, although concrete requirements such as allowed deviations are not included. FDA requirements only state that the reference method has to be “well-validated for precision and accuracy.” The CLSI¹⁸ and DTS⁹ guidelines require the reference/comparison method to fulfil specific performance criteria in that bias should not exceed 2.2%, analytical imprecision 2.9%, and total error 6.9%. This should be considered best practice and incorporated and reported in accuracy evaluations and publications. Quite correctly it has been suggested that in consideration of the tight accuracy criteria for BGMSs, the standards should pay much more attention to the definition of clear criteria for the reference method and that they should be stricter than those proposed by national directives for routine laboratory systems.⁵

As highlighted in a previous analysis article,²⁰ full details and data on the trueness and imprecision of the reference/comparison method should be included in accuracy studies and publications. The amount of quantitative information provided in recently published ISO 15197:2013/2015 accuracy studies vary from none, to a simple statement of imprecision CV% at a single glucose concentration, to detailed information on bias and imprecision at each of a range of glucose concentrations spanning the clinical analytical range. Jendrike et al¹⁴ indicate that traceability to NIST SRM 965b was confirmed during the study and that bias and imprecision were ⩽1.0% and ⩽1.3%, respectively, for YSI 2300 STAT Plus and ⩽0.6% and ⩽1.0%, respectively, for Cobas Integra 400 Plus when using manufacturer-provided control materials.

The Macleod et al article states calibration, maintenance and use of the hospital comparison analyzers were per individual hospital protocol and traceability and performance requirements were maintained throughout the study. In addition, trueness and precision of the YSI 2300 glucose analyzer were verified during the test process by daily quality control measurements which followed LifeScan internal standard operating procedures. Although mean bias is shown for different methods further details and values relating to reference/comparison method performance, traceability, calibration, and quality control would have been useful. This is particularly relevant to the YSI comparison method, which although in widespread use, is complex and variable in terms of reported maintenance²¹ calibration and checks,^22,23 traceability,²⁴ target and acceptance values, imprecision, and the exact procedure used for sample analysis.^15,21,25

Acceptance or Exclusion of Reference/Comparison Method Results

Specific requirements on acceptance or exclusion of results vary with individual standards/guidelines. FDA guidelines indicate duplicate measurements with the reference method are optional. ISO 15197:2013/2015 however states at least duplicate reference measurements are mandatory and measurements before and after the BGMS measurements are required to ensure glucose concentration stability. Data exclusion criteria and drift analysis dictate that if samples measurements differ from each other by more than 4 mg/dl for values ⩽100 mg/dl, or by more than 4% for values >100 mg/dl, then the data cannot be used. CLSI guidelines also exclude data/results if there is >4% (4 mg/dl) deviation from certified material. An indication or definition of how close NIST results should be when using specific standards/guidelines would be useful. ISO 15197:2013/2015 excludes samples with interferences such as hematocrit that fall outside the BGMS’s stated range in the IFU. FDA guidance however specifies inclusion of all hematocrit levels with an ideal range stated.

Differences Between Reference/Comparison Methods

In line with previous concerns^15,17,20 Macleod et al highlight recognized differences between reference/comparison methods and demonstrate that results obtained when comparing YSI to other hospital analyzers depend on the glucose level. Systematic bias differences between glucose oxidase and hexokinase reference/comparison methods have been previously described^17,26 with deviations of up to 8% observed, even though traceability of both methods was established.^20,26 Differences in the YSI measured accuracy versus the isotope dilution mass spectrometry reference method have also been reported, as has influence by both hematocrit and plasma water concentration in a glucose concentration dependent manner relative to a hexokinase method. Low hematocrit levels can introduce up to a 15% positive bias and high hematocrit levels up to a –18% bias.²⁷ Differences of approximately 4% between YSI instruments in different locations and selection of YSI analyzers that exhibited the least bias relative to serum target values have been reported.¹⁵

Accuracy evaluation studies or publications should ensure the following in relation to reference/comparison methods:

- An appropriate reference/comparison method in line with standards/guidelines is employed and details of use, calibration, traceability, quality control, and maintenance provided.

- The imprecision and bias of the method at the time of analysis is reported and within guidelines.

- NIST standards are run with each batch of specimens to ensure traceability and bias reported.

- The enzymes used in the BGMS strips and reference/comparison method, and the manufacturer’s stated comparison method stated.

Data Analysis, Assessment, and Presentation

Different requirements for accuracy data analysis, assessment, minimum acceptance criteria, and presentation are outlined in specific standards/guidelines. These include analytical accuracy criteria, graphical and tabular presentation of accuracy and bias, scatter plot and regression analysis, and clinical accuracy error grid analysis.

Accuracy evaluation studies or publications should ensure the following in relation to data analysis, assessment and presentation of results:

- Data are recorded, analyzed, and presented as outlined in the specific standards/ guidelines used.

- Statements on compliance against criteria outlined in standards/guidelines are made only if all aspects of their specific protocol are complied with and reference/comparison performance at the time of analysis was appropriate and acknowledged and all aspects of bias against NIST standards considered.

Possible Differences in Compliance When Using Different Reference/Comparison Methods

Due to differences in bias in glucose oxidase and hexokinase reference/comparison methods, it is acknowledged that although showing a high level of accuracy against ISO 15197:2013/2015, BGMSs calibrated against glucose oxidase or hexokinase would likely show systematically different results.¹⁶ Many commercially available BGMSs have been evaluated using both glucose oxidase and hexokinase reference/comparison methods, one of which would have been the manufacturer’s designated comparison method, and many shown to either meet or not meet analytical accuracy criteria by both methods.^14,17,28 There are however a few examples^13,14,16 where BGMSs satisfied accuracy criteria when using the manufacturers designated comparison method but with use of another reference/comparison method, 2 out of 3 strip lots failed to meet analytical accuracy criteria and were therefore classified as noncompliant despite meeting clinical error grid clinical accuracy criteria.

As stressed by Macleod et al, this reinforces the importance of when interpreting accuracy data, not only should it be ensured that an appropriate study design has been used, but one should also be aware that the use of different reference/comparison methods can influence results.

In conclusion, when BGMS accuracy evaluations are performed and interpreted they should:

1 Ensure and state an appropriate protocol has been used and fully, not partially, complied with and best practice considered

2 Have details of the reference/comparison method used and its performance at the time in terms of imprecision, trueness, bias, and traceability

3 Be aware that the use of different reference/comparison methods can have an influence on results and hence compliance with accuracy criteria

4 Only make claims relating to compliance with minimum accuracy regulatory criteria if the appropriate detailed specified protocol and analysis is followed

Footnotes

Abbreviations

BGMS, blood glucose monitoring system; DTS, Diabetes Technology Society; FDA, Food and Drug Administration; IFU, instructions for use; ISO, International Organization for Standardization; NIST, National Institute of Standards and Technology; YSI, Yellow Springs Instrument.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

References

Macleod

Katz

Cameron

Capillary and venous blood glucose accuracy in blood glucose meters versus reference standards: the impact of study design on accuracy evaluations. J Diabetes Sci Technol. 2018. Advance Online Publication. doi: 10.1177/1932296818790228.

Schnell

Hinzmann

Kulzer

et al . Assessing the analytical performance of systems for self-monitoring of blood glucose: concepts of performance evaluation and definition of metrological key terms. J Diabetes Sci Technol. 2013;7(6):1585-1594.

Simmons

DA.

How should blood glucose meter system analytical performance be assessed?

J Diabetes Sci Technol. 2016;10(1):178-184.

Pleus

Flacke

Sieber

et al . Strengths and limitations of new approaches for graphical presentation of blood glucose monitoring system accuracy data. J Diabetes Sci Technol. 2017;11(6):1226-1230.

Freckmann

Baumstark

Pleus

Do the new FDA guidance documents help improving performance of blood glucose monitoring systems compared with ISO 15197?

J Diabetes Sci Technol. 2017;11(6):1240-1246.

Thorpe

GH.

Assessing the quality of publications evaluating the accuracy of blood glucose monitoring systems. Diabetes Technol Ther. 2013;15(3):253-259.

International Organization for Standardization. In Vitro Diagnostic Test Systems—Requirements for Blood-Glucose Monitoring Systems for Self-Testing in Managing Diabetes Mellitus (ISO 15197:2013). EN ISO 15197. Geneva, Switzerland: International Organization for Standardization; 2015.

Food and Drug Administration. Self-monitoring blood glucose test systems for over-the-counter use— Guidance for industry and food and drug administration staff. Food and Drug Administration; 2016. http://www.fda.gov/downloads/MedicalDevices/DeviceRegulationandGuidance/GuidanceDocuments/UCM380327.pdf.

Klonoff

Lias

Beck

et al . Development of the diabetes technology society blood glucose monitor system surveillance protocol. J Diabetes Sci Technol. 2016;10(3):697-707.

10.

Freckmann

Schmid

Baumstark

et al . Analytical performance requirements for systems for self-monitoring of blood glucose with focus on system accuracy: relevant differences among ISO 15197:2003, ISO 15197:2013, and current FDA recommendations. J Diabetes Sci Technol. 2015;9(4):885-894.

11.

Teodorczyk

Nandagopalan

Maguire

et al . System accuracy of blood glucose monitoring devices according to the current and proposed ISO 15197 standards. J Diabetes Sci Technol. 2013;7(3):795-797.

12.

Pleus

Baumstark

Rittmeyer

et al . Performance of two updated blood glucose monitoring systems: an evaluation following ISO 15197:2013. Curr Med Res Opin. 2016;32(5):847-855.

13.

Link

Schmid

Pleus

et al . System accuracy evaluation of four systems for self-monitoring of blood glucose following ISO 15197 using a glucose oxidase and a hexokinase-based comparison method. J Diabetes Sci Technol. 2015;9(5):1041-1050.

14.

Jendrike

Baumstark

Pleus

et al . Evaluation of 4 blood glucose monitoring systems for self-testing with built-in insulin dose advisor based on ISO15197: 2013: system accuracy and hematocrit influence. Diabetes Technol Ther. 2018;20(4):303-313.

15.

Bailey

Klaff

Wallace

et al . Fundamental importance of reference glucose analyzer accuracy for evaluating the performance of blood glucose monitoring systems (BGMSs). J Diabetes Sci Technol. 2016;10(4):872-875.

16.

Baumstark

Jendrike

Pleus

et al . Evaluation of accuracy of six blood glucose monitoring systems and modeling of possibly related insulin dosing errors. Diabetes Technol Ther. 2017;19(10):580-588.

17.

Freckmann

Link

Schmid

et al . System accuracy evaluation of different blood glucose monitoring systems following ISO 15197:2013 by using two different comparison methods. Diabetes Technol Ther. 2015;17:635-648.

18.

CLSI. Point-of-Care Blood Glucose testing in Acute and Chronic Care Facilities; Approved Guideline—Third Edition. CLSI document POCT12-A3. Wayne, PA: Clinical and Laboratory Standards Institute; 2013.

19.

Littmann

Petersen

ERB

Pussinen

et al . Evaluation of OneTouch Verio®, a new blood glucose self-monitoring system for patients with diabetes. Scand J Clin Lab Invest. 2013;73:286-292.

20.

Rutschmann

. Analysis of “Accuracy and user performance evaluation of a new, wireless-enabled blood glucose monitoring system that links to a smart mobile device.” J Diabetes Sci Technol. 2017;11(4):744-745.

21.

Salacinski

Alford

Drevets

et al . Validity and reliability of a glucometer against industry reference standards. J Diabetes Sci Technol. 2014;8(1):95-99.

22.

Klonoff

Parkes

Kovatchev

et al . Investigation of the accuracy of 18 marketed blood glucose monitors. Diabetes Care. 2018;41(8):1681-1688.

23.

Zijlstra

Heinemann

Fischer

et al . A comprehensive performance evaluation of five blood glucose systems in the hypo-, eu-, and hyperglycemic range. J Diabetes Sci Technol. 2016;10(6):1316-1323.

24.

Dunne

Viggiani

Pardo

et al . Accuracy evaluation of CONTOUR PLUS compared with four blood glucose monitoring systems. Diabetes Therapy. 2015;6:377-388.

25.

Brazg

Hughes

Martin

et al . Clinical evaluation of the FreeStyle Precision Pro system. Clin Chim Acta. 2013;421:243-250.

26.

Freckmann

Schmid

Baumstark

et al . In response to Teodorczyk and coauthors: system accuracy of blood glucose monitoring devices according to the current and proposed ISO 15197 standards. J Diabetes Sci Technol. 2013;7(6):1659-1660.

27.

Lyon

DuBois

Slingerland

et al . Evaluation of the accuracy of the YSI glucose analyzer relative to an IDMS aligned perchloric acid hexokinase reference glucose method. Presentation at: International Hospital Diabetes Meeting; 2017; Atlanta, GA.

28.

Bernstein

Parkes

Goldy

et al . A new test strip technology platform for self-monitoring of blood glucose. J Diabetes Sci Technol. 2013;7(5):1386-1399.

Analysis of “Capillary and Venous Blood Glucose Accuracy in Blood Glucose Meters Versus Reference Standards: The Impact of Study Design on Accuracy Evaluations”

Abstract

Keywords

General Protocol

Samples and Subjects

Reference/Comparison Method

Which Reference/Comparison Method

Performance of the Reference/Comparison Method

Acceptance or Exclusion of Reference/Comparison Method Results

Differences Between Reference/Comparison Methods

Data Analysis, Assessment, and Presentation

Possible Differences in Compliance When Using Different Reference/Comparison Methods

Footnotes

Abbreviations

Declaration of Conflicting Interests

Funding

References