Abstract
In an article in
Arnold et al have presented a thorough study of imprecision components for a point-of-care (POC) hemoglobin A1c (HbA1c) assay. 1 This study provides valuable information. The following comments may help future evaluation studies.
There are 3 purposes for an assay—screening, diagnosis, and monitoring. HbA1c is intended for diagnosis and monitoring but the authors in this study mention only diagnosis in their article. This is confusing—would not this assay also be suitable for monitoring?
It is mentioned that precision and total error were investigated. This is also somewhat confusing since if one has evaluated total error and found it to be acceptable, then the precision must also be acceptable because precision is part of total error.
The problem with estimating total error to determine performance is that total error as calculated by the authors (which is the usual way it is calculated) is a probability metric because it involves multiples of the standard deviation (or coefficient of variation) that correspond to percentages of results. Thus, the authors’ formula provides the total error for 95% of the results. This leaves 5% of the results without being characterized for performance. To be fair, the venerable NGSP (National Glycohemoglobin Standardization Program) standard also requires only a percentage of results (90%) to be acceptable.
2
Why should 5% (this study) or 10% (NGSP) be left out of performance specifications? Since total implies everything, how can
One potential remedy is to create an error grid for HbA1c. If in fact, there are results beyond the currently required total error of 6%, an error grid would have zones that provide the severity of false-positive and false-negative results. And these zones are not constrained to one HbA1c concentration—the zone values can change as a function of the HbA1c concentration. Since an error grid is so widely accepted for glucose meters, it would seem likely to be accepted for HbA1c. One might need different error grids for HbA1c diagnosis vs monitoring. An error grid for HbA1c was provided recently. 3
As mentioned previously, most evaluations use relatively small sample sizes and rarely produce outliers. 4 This does not invalidate the study—rather one can consider the estimated performance to be for typical results. That is, this is the performance one can expect most of the time (eg 95%) with errors caused by imprecision and average bias. Rare errors or failure to obtain a result often have other causes (user error, manufacturing problems, and hardware issues). A source to examine these errors is the FDA MAUDE database (Food and Drug Administration Manufacturer and User Facility Device Experience database). 5 The good news is that only 3 events (erratic values) were found for the authors’ product in the last 2 years. Since it is implied that this POC assay may achieve Clinical Laboratory Improvement Amendments (CLIA) waived status, the number of user errors may increase especially if the number of finger-stick samples increases. The authors have shown that finger-stick samples are less precise than venous samples. As the authors imply, personnel in CLIA waived labs might have more user error. This is another reason to monitor the MAUDE database.
Another aspect of the authors’ study is the combination data from different studies to arrive at a total error estimate. This is an interesting and innovative approach.
Finally, it is also informative to have one or more graphs of the data (none were provided) in spite of the fact that total error was calculated from different studies.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
