Abstract
Intra- and inter-laboratory variability of cerebrospinal fluid (CSF) biomarker analyses remains an important issue. We investigated the clinical-diagnostic impact of CSF biomarker concentration shifts in mild cognitive impairment (MCI) and autopsy-confirmed Alzheimer’s disease (AD) dementia patients. MCI patients (
Keywords
INTRODUCTION
Amyloid-β of 42 amino acids (Aβ1-42), total tau protein (T-tau), and tau phosphorylated at threonine 181 (P-tau181P) are qualified cerebrospinal fluid (CSF) biomarkers to identify subjects with (prodromal) Alzheimer’s disease (AD) [1 –5].
Nowadays, CSF Aβ and tau proteins can reliably be analyzed with several analytical techniques, including but not limited to the classic single-analyte ELISAs or multi-analyte tests based on xMAP technology [6 –11]. The overall clinical accuracy for using above mentioned biomarkers is, besides the population of the selected subjects, affected by the method of measurement, which is in part a combination of assay precision and bias (assay accuracy) (Supplementary Figure 1). Lot-to-lot variation remains an important issue for worldwide integration of cutoff values. Lot-to-lot variability for Aβ1-42 accounted for 22% (ELISA) and 10% (xMAP) of the overall variability [12]. The delivery to the market of assays with low lot-to-lot variability is the accountability of the vendor of the assays [13, 14], while the reliability of the generated concentrations, which is linked to the qualification of the lab and the operators who perform the assays, is the accountability of the end-user [15]. Intra-laboratory variability is also most pronounced for Aβ1-42 as demonstrated by coefficients of variation (CV) in a recent study using ELISA (Aβ1-42: 14.4% , T-tau: 8.5% , P-tau181P: 12.6% ) [14]. The same study reported largest inter-laboratory variability values for P-tau181P (Aβ1-42: 7.3% , T-tau: 6.7% , P-tau181P: 27.6% ) [14], whereas a former study also demonstrated largest inter-laboratory variability values for Aβ1-42 [12].
Intra- and inter-laboratory variability of these CSF biomarker analyses, together with an absolute value assignment for the markers, remains an important issue for their further integration into clinical routine [12 , 15–18]. In order to reduce CSF biomarker variability, several international initiatives to standardize (pre-)analytical steps of CSF biomarker analyses are running [19 –21]. As the clinical relevance of CSF biomarker variability has not yet been investigated in extenso, we have set up an exploratory study to determine the impact of differences in CSF biomarker levels on the diagnostic performance in mild cognitive impairment (MCI) subjects and autopsy-confirmed AD dementia patients. The size of the shifts were selected to reflect anticipated differences in biomarker levels due to (pre-)analytical conditions. A CSF biomarker profile was considered to be pathological if the CSF Aβ1-42 concentration was below the cutoff, in combination with T-tau and/or P-tau181P values above the cutoff (IWG-2 criteria). The present exploratory study should be considered as technology-independent statements.
MATERIALS AND METHODS
Study population
Seventy-two dementia patients with autopsy-confirmed AD, 85 patients with MCI and clinical follow-up, and 100 cognitively healthy controls were included. Patients were recruited through the Memory Clinic of Hospital Network Antwerp, Middelheim, and Hoge Beuken, and through centers referring to the Biobank of the Institute Born-Bunge as described before [1]. Data including gender, age at time of CSF sampling, age at conversion to AD dementia (in case of the MCI group), and age at death (if applicable) were available for the majority of the patients.
The inclusion criteria for the control group were: (1) no neurological or psychiatric antecedents and (2) no organic disease involving the central nervous system following extensive clinical examination. The control group consisted of patients with mechanical low back pain requiring a selective lumbar radiculography (n = 25), patients with disorders of the peripheral nervous system (polyneuropathy, peripheral facial nerve palsy) (n = 10), and patients with subjective complaints in whom disorders of the central and peripheral nervous system were ruled out by means of an extensive clinical work-up (n = 65).
MCI patients were diagnosed applying Petersen’s diagnostic criteria [22], i.e., (1) cognitive complaint, preferably corroborated by an informant; (2) objective cognitive impairment, quantified as performance of more than 1.5 SD below the appropriate mean on the neuropsychological subtests; (3) largely normal general cognitive functioning; (4) essentially intact activities of daily living (basic and instrumental activities of daily living were determined by a clinical interview with the patient and an informant); and (5) not demented. Based on clinical follow-up the MCI group consisted of patients who were still diagnosed as MCI at follow-up (stable MCI group;
AD dementia was diagnosed according to the NINCDS/ADRDA criteria [23]. All neuropathological diagnoses of AD were established according to neuropathological criteria of Montine [24] by the same neuropathologist (JJM) who was blinded for biomarker results. The neuropathological examination was performed on the right hemisphere of the brain as previously described [1, 8].
Part of the study results have been published previously [1]. This study was approved by the ethics committee of UAntwerp, Antwerpen, Belgium. Informed consent was obtained from all patients.
CSF sampling and storage
The samples were collected at Middelheim General Hospital and centers referring to the Biobank of the Institute Born-Bunge according to standard collection protocols as described previously [1]. CSF was obtained by lumbar puncture (LP) at the L3/L4 or L4/L5 interspace. MCI patients underwent LP at baseline as part of their diagnostic work-up. CSF samples were collected in polypropylene vials (Nalgene cat.no.5000–1020 (1.5 mL) and 5000–0050 (4.5 mL)), immediately frozen in liquid nitrogen, and subsequently stored at –75°C until analysis.
CSF analyses and interpretation of CSF biomarker results
CSF levels of Aβ1-42, T-tau, and P-tau181P were determined with commercially available single-analyte ELISA kits (INNOTEST® β-AMYLOID (1-42) , INNOTEST® hTAU-Ag, and INNOTEST® PHOSPHO-TAU (181P) ; Fujirebio Europe). Results of autopsy-confirmed AD patients and controls have been published previously [1], and have been combined with a series of MCI subjects. In principle, the outcome of this study is independent on the method of measurement and can be applied to other immunoassay methods for the same variables. For each assay run, tests were performed as described earlier [1]. The only difference with the formerly published protocol was that the threshold of acceptance for intra-assay variation was decreased to 20% (calculated as (max-min) x 100/average). The laboratory technician was blinded for the clinical and definitive pathological diagnoses when performing the tests. The concentration ranges of the test kits are described in the package inserts (Aβ1-42: 125-2000 pg/mL, T-tau: 75-1200 pg/mL, P-tau181P: 15.6–500 pg/mL). The three CSF biomarker levels were interpreted based on cutoff values, selected as the maximal sum of sensitivity and specificity [25], to discriminate independent sets of autopsy-confirmed AD patients and age-matched cognitively healthy elderly: Aβ1-42 <638.5 pg/mL, T-tau >296.5 pg/mL, P-tau181P >56.5 pg/mL [26]. Consistent with the IWG-2 criteria for AD [27], a CSF biomarker profile was considered to be pathological if the CSF Aβ1-42 value was <638.5 pg/mL, in combination with T-tau values >296.5 pg/mL and/or P-tau181P values >56.5 pg/mL.
Induced biomarker concentration shift exercise
Both increments and decrements (from ±5% to ±40%) were added to the true (=observed) CSF biomarker values of the definite AD, MCI, and control subjects to estimate the impact of an induced concentration shift on the clinical classification. The added concentration shifts of only one of the three CSF biomarkers was investigated. For example, if Aβ1-42 values were increased or decreased, the other two markers were analyzed with their true CSF biomarker values.
The effect of concentration shifts was investigated by recalculating the sensitivity, specificity, and clinical diagnostic accuracy (Supplementary Figure 2) values to discriminate autopsy-confirmed AD from controls, and to discriminate between progressive MCI and stable MCI, using the above mentioned CSF biomarker cutoffs and the CSF biomarker IWG-2 criteria for AD [26, 27]. Results were reported as relative difference with respect to the reference diagnostic sensitivity, and specificity, obtained with the true biomarker concentrations.
Statistical analyses
Statistical analyses were performed using IBM SPSS Statistics 22 and GraphPad Prism 6. To describe and analyze our entire cohort, categorical variables were analyzed with a chi-square test, and percentages were reported. Demographic comparisons were based on one-way Anova tests with Bonferroni Posthoc tests, or in case of no normal distribution, Mann Whitney U tests. Mean values with standard deviations were reported. For all analyses,
RESULTS
Population (Table 1)
There was no significant difference in the proportion of gender in the AD, MCI, and control groups (
CSF analyses and diagnostic performance of CSF biomarkers
AD versus healthy control subjects (Figs. 1 and 2, and Supplementary Tables 1–3)
For the applied context of use, namely the identification of an AD biomarker profile, no major difference in clinical diagnostic performance of the CSF biomarker IWG-2 criteria for AD was observed for a range of induced differences in concentration to one of the three true biomarker values. Indeed, shifts in true analyte concentrations resulted in a deviation of clinical diagnostic accuracy of less than 5% (Fig. 1). Only if Aβ1-42 concentrations were decreased with 40% and 30% shifts, the clinical diagnostic accuracy deviated 6.4% and 5.3% of the reference, respectively (Fig. 1). The smallest effects on clinical diagnostic performance were observed for an induced shift of P-tau181P compared to the other two biomarkers (Fig. 1). The effects of induced concentration shifts are higher when one considers sensitivity or specificity (Fig. 2, Supplementary Tables 1–3).
Progressive MCI versus stable MCI (Figs. 3 and 4, and Supplementary Tables 4–6)
When considering the true values of the three CSF biomarkers, the sensitivity, specificity, and clinical diagnostic accuracy of conversion from MCI to AD dementia within the MCI groups were 68.1% , 79.0% , and 72.9% , respectively. A shift in true analyte concentrations of one of the three biomarkers between –20% and +20% resulted in a deviation of clinical diagnostic accuracy for discriminating progressive from stable MCI of less than 8.2% (Fig. 3). Moreover, the clinical diagnostic performance of induced concentration shifts of T-tau or P-tau181P deviated not more than 2.4% from the reference if ±40% shifts were added to the true CSF biomarker data of T-tau or P-tau181P, whereas the clinical diagnostic accuracy of a shift in Aβ1-42 was lower and deviated up to 15.2% from the reference in the same range of induced shifts. The effects were higher when one considers sensitivity or specificity (Fig. 4, Supplementary Tables 4–6).
Effect of concentration shifts at the individual patient level (Supplementary Table 7)
A change of diagnostic category, based on an AD CSF biomarker profile (abnormal/normal), was investigated for each individual separately. The CSF biomarkers were considered abnormal if suggestive for AD. If a shift in Aβ1-42 level was introduced, we detected the highest number of changes in diagnostic classification based on the CSF biomarker profiles (AD
DISCUSSION
CSF Aβ1-42, T-tau, and P-tau181P are qualified biomarkers for AD that have been integrated in the IWG-2 diagnostic criteria for (prodromal) AD [27]. The U.S. Food and Drug Administration encourages their use for enrichment in clinical trials as exemplified by the recently published letter of support [28]. Intra- and inter-laboratory variability of these CSF biomarker analyses remains an important issue, as are differences in absolute concentrations when results are generated with different immunoassay formats [12]. Several international initiatives were established to standardize [12 , 20] and harmonize (pre-)analytical steps of CSF biomarkers analyses [12 , 31]. The overall difference in concentration is a combination of pre-analytical factors (e.g., sample collection and storage, freeze-thaw cycles), the uncertainties at the level of the assay (e.g., precision of the method) and the qualification of the laboratory (Supplementary Figure 1). The analytical accuracy of the clinical classification is further complicated by the absence of an international reference method [32], although major progress was made recently by using a mass spectrometry method for Aβ1-42 [33, 34]. In order to study the clinical-diagnostic impact of a shift in CSF biomarker concentrations, we evaluated the effect for the individual biomarkers in a group of MCI and autopsy-confirmed AD patients.
One multicenter study using single parameter ELISA kits demonstrated that lot-to-lot variability was a major cause of intra-laboratory variability which was most pronounced for Aβ1-42. Inter-laboratory variability mainly affected P-tau levels [14]. Although the impact of variability on the interpretation of individual CSF biomarkers (normal/abnormal) was pronounced (ranging from 10% to 29% ), a change of diagnostic category based on an AD CSF biomarker profile was limited to 12–16% of the cases [14].
In the present study, we demonstrated that the clinical diagnostic accuracy when using AD CSF biomarker concentration values for the three biomarkers [27] remained largely unchanged and above the 80% threshold when ±5% to ±40% shifts in concentration levels were added to one of the three true biomarker values in autopsy-confirmed AD cases. Due to the fact that several hallmarks are linked to the pathophysiology of AD (e.g., plaques, tangles), it is logical to use combinations of different biomarkers for the neurochemical diagnosis of AD. We therefore opted to use the IWG-2 criteria although this meanwhile implies that alterations in one biomarker might be compensated by unaltered levels of the other biomarkers. The approach to integrate a shift in only one analyte at once is linked to the fact that (1) the influence of the well-known pre-analytical variables for CSF biomarker testing is not identical for all analytes, (2) each immunoassay has been designed with a specific set of components and buffers to perform the assay, (3) the characteristics of the analytes with respect to short term stability, freeze-thawing, or absorption, differ between analytes, while (4) many clinical laboratories still use single-analyte immunoassays for the qualification of the CSF biomarkers. However, the impact of concentration shifts in one biomarker on sensitivity and specificity, for example the use of Aβ1-42 to define a cutoff value to identify the presence of amyloidopathy in the brain, was much higher (Figs. 2 and 4). This means that when a laboratory uses a cutoff established in another laboratory, they could be operating with similar clinical diagnostic accuracy but with markedly different sensitivity and specificity. Hence, a proportion of patients (in our study, for Aβ1-42
Although the running international standardization and harmonization efforts are very valuable, certainly from an analytical point of view, their influence on the clinical diagnostic performance of these CSF biomarkers might be smaller than initially hypothesized. Indeed, applying a concentration shift of ±20% of Aβ1-42 changed the diagnostic biomarker classification (AD patients were no longer identified as such) in 6% of subjects. For T-tau and P-tau181P, respectively 13% and 4% of the AD subjects displayed changed biomarker profile when a shift of ±20% was applied.
The induced variation exercise was performed in very well-characterized (autopsy-confirmed) AD patients, whereas in daily clinical practice, the impact of CSF biomarker shifts might be more important, especially in case of subjects showing biomarker values around the cutoff levels. Therefore, a clinical MCI group was included in our study as well. In this study group, shifts in true biomarker data deviated the clinical diagnostic performance more from the reference than when the comparison was done for the AD dementia patients. In the MCI group, more patients have CSF biomarker values at or around the cutoff values (‘grey zone’ or transition phase) as compared to the AD and control groups. Therefore, in the MCI group as compared to AD and control groups, even a small shift in biomarker levels will lead to a larger number of patients who change from normal to abnormal and vice versa and thus might change from diagnostic category based on the IWG-2 criteria. As such, the impact will be smaller in subjects with a broader difference in values between groups, such as the control subjects and the AD dementia patients. In MCI patients, Aβ1-42 is the most determing factor for clinical diagnostic performance variability. The large effects of induced shifts to CSF Aβ1-42 levels may also be explained by the fact that this biomarker is the most sensitive to the different factors that contribute to the (pre-)analytical variability (Supplementary Figure 1). CSF Aβ1-42 is the most important marker to identify subjects with progressive MCI [5, 36]. Moreover, changes of Aβ1-42 levels can be observed earlier in the time course of the AD continuum than changes of CSF T-tau or P-tau181P levels [37]. One possibility to explore for the future is whether or not the use of the ratios of Aβ isoforms, such as Aβ1-42/Aβ1-40 ratio, might be a more robust biomarker for this type of analysis. By neglecting this ratio the interpretation of the AD CSF biomarkers might lead to false negatives (AD subjects with high Aβ1-42 CSF concentrations) or false positives (non-AD subjects with low Aβ1-42 CSF concentrations). Introducing this ratio will probably overcome this problem by the normalization of Aβ concentrations, and thus improve the diagnostic performance [38, 39]. Another improvement for the neurochemical diagnosis of AD, is the forthcoming automation of the analytical assays [40]. This will further reduce variability when compared with single-parameter ELISA assays described in the Alzheimer’s Association Quality Control (AAQC) program [41].
A potential limitation of this study is the relative young age of the healthy controls, which were not age-matched to the AD subjects. However, the proportion of preclinical AD individuals will be smaller in younger controls than when we would have used an age-matched control group. The CSF samples from the three diagnostic groups have not been analyzed simultaneously, and thus inter-assay variability might be an issue, which is another limitation of this study. A last limitation of this study is that we did not use age-adjusted cutoffs for the three CSF biomarkers.
In conclusion, the clinical diagnostic performance of AD CSF biomarkers remains largely unchanged and clinical diagnostic accuracy deviated less than 8.2% from the reference when shifts ranging between –20% and +20% were added to one of the three CSF biomarkers in MCI and autopsy-confirmed AD patients. Notwithstanding the fact that pre-analytical and analytical parameters can affect the clinical classification, the present exploratory study provides evidence that for a specific context of use, the impact on the clinical accuracy of concentration differences might be lower than originally expected, pointing to the robustness of using CSF biomarkers in a specific clinical context, especially when using a combination of different analytes. Our suggested approach can be used to set boundary conditions for acceptance in shifts of concentrations for biomarkers and reduce ongoing discussions on the difficulty to integrate CSF biomarker analysis in routine clinical practice due to the complexity of CSF collection and/or storage and analytical method (single versus multiple analyte technique).
Footnotes
ACKNOWLEDGMENTS
This research was in part supported by the University Research Fund of the University of Antwerp; the Institute Born-Bunge; the Foundation for Alzheimer Research (SAO-FRA); Neurosearch Antwerp; the Research Foundation - Flanders (FWO); the Agency for Innovation by Science and Technology (IWT); the Interuniversity Attraction Poles (IAP) programme of the Belgian Science Policy Office; the Flemish Government Methusalem excellence program, Belgium; the Flanders Impulse Program on Networks for Dementia Research (VIND). This work is part of the BIOMARKAPD project within the EU Joint Program for Neurodegenerative Disease Research (JPND). This work has received support from the EU/EFPIA Innovative Medicines Initiative Joint Undertaking (EMIF grant ndeg 115372).
