Abstract
Accumulation of amyloid beta (Aβ) is one of the pathological hallmarks of Alzheimer’s disease (AD), which can be visualized using [18F]florbetapir positron emission tomography (PET). The aim of this study was to evaluate various parametric methods and to assess their test-retest (TRT) reliability. Two 90 min dynamic [18F]florbetapir PET scans, including arterial sampling, were acquired (n = 8 AD patient, n = 8 controls). The following parametric methods were used; (reference:cerebellum); Logan and spectral analysis (SA), receptor parametric mapping (RPM), simplified reference tissue model2 (SRTM2), reference Logan (rLogan) and standardized uptake value ratios (SUVr(50–70)). BPND+1, DVR, VT and SUVr were compared with corresponding estimates (VT or DVR) from the plasma input reversible two tissue compartmental (2T4k_VB) model with corresponding TRT values for 90-scan duration. RPM (r2 = 0.92; slope = 0.91), Logan (r2 = 0.95; slope = 0.84) and rLogan (r2 = 0.94; slope = 0.88), and SRTM2 (r2 = 0.91; slope = 0.83), SA (r2 = 0.91; slope = 0.88), SUVr (r2 = 0.84; slope = 1.16) correlated well with their 2T4k_VB counterparts. RPM (controls: 1%, AD: 3%), rLogan (controls: 1%, AD: 3%) and SUVr(50–70) (controls: 3%, AD: 8%) showed an excellent TRT reliability. In conclusion, most parametric methods showed excellent performance for [18F]florbetapir, but RPM and rLogan seem the methods of choice, combining the highest accuracy and best TRT reliability.
Keywords
Introduction
Alzheimer’s disease (AD) is neuropathologically characterized by cortical amyloid beta (Aβ) deposition, which starts to accumulate approximately 10–20 years before clinical symptoms.1,2 Aβ can be visualized using [18F]florbetapir positron emission tomography (PET).3,4 Accurate quantification of Aβ is important for identifying subtle amyloid accumulation, as well as for monitoring disease progression and evaluating (experimental) anti-amyloid disease-modifying therapies.5–7
So far, most studies have used semi-quantitative measures for [18F]florbetapir uptake, such as the standardized uptake value ratio (SUVr). However, SUVr may be biased and sensitive to changes in perfusion, which are common in AD, and therefore making it less suitable for longitudinal measurements, where full quantification may be required.8,9 Recently, it was demonstrated that in vivo kinetics of [18F]florbetapir can best be described by a reversible two tissue compartmental model with fitted blood volume (2T4k_VB). 8 In addition, it has been shown that the simplified reference tissue model-(SRTM) derived binding potential (BPND) provides an accurate measure of [18F]florbetapir specific binding, showing less bias and lower test–retest variability than SUVr. 8
So far, it has not been investigated which parametric imaging method is most optimal for the quantification of [18F]florbetapir. Advantages of parametric images are that these can be used in voxel-by-voxel analyses and to take advantage of the scanner resolution. By contrast, full kinetic modelling and SRTM are non-linear regression-based, and therefore more computationally demanding and more susceptible for noise. [18F]florbetapir is a widely used amyloid-beta radiotracer, and validated parametric imaging methods are important for accurate and robust amyloid-beta quantification, allowing whole brain voxel-based analyses which are important in assessing the efficacy of disease modifying drugs over time. In addition, visual assessment of [18F]florbetapir images using BPND/R1 images might be more reliable compared to SUVr images.10,11 Therefore, the aim of this study was to evaluate the performance of various parametric methods for voxel-by-voxel quantification of [18F]florbetapir kinetics and to assess their test–retest (TRT) repeatability.
Material and methods
Participants
Participants have already been described in a previous study, and existing data were used for the present study. 8 In brief, eight patients with mild to moderate probable AD (MMSE ≥ 19) from the Amsterdam Dementia Cohort were included. Screening included vital signs, physical and neurological examinations, medical history, neuropsychological assessment, laboratory measurements, and brain MRI. In addition, eight healthy controls were recruited through advertisements in newspapers. These controls were in good physical health, experienced no cognitive complaints, and met Research Diagnostic Criteria (RDC) for “never mentally ill.” Controls underwent a comparable screening as AD patients and were only eligible if results of all clinical tests, including brain MRI and neuropsychological assessment, showed no abnormalities. The study was approved by the Medical Ethics Review Committee of the VU University Medical Center and all subjects provided written informed consent, in line with the Helsinki Declaration of 1975 (and 1983 revised) guidelines.
[18F]florbetapir synthesis
[18F]florbetapir (also named Amyvid or [18F]AV45) was synthesized locally in accordance with Avid Radiopharmaceuticals Investigational quality control release criteria.
Data acquisition
Data were acquired using an Ingenuity TF PET/CT scanner (Philips Medical Systems, Best, The Netherlands). Prior to scanning, two cannulas were inserted, one for intravenous [18F]florbetapir administration, the other for arterial sampling. Each subject underwent two [18F]florbetapir PET scans (interval [mean±SD]: 4 ± 2 weeks). Following a low-dose CT for attenuation correction, a 90-min PET emission scan was acquired after a bolus injection of approximately (mean±SD) 294 ± 27 MBq [18F]florbetapir. Arterial blood was sampled continuously at a rate of 5 mL·min−1 for the first 5 min and 2.5 mL·min−1 thereafter, using an online detection system. Continuous withdrawal was interrupted briefly (approximately 10 s) for the collection of seven (at 5, 10, 20, 40, 60, 75 and 90 min post injection) manual blood samples of approximately 8 mL, which were used to estimate plasma-to-whole blood ratios and to measure plasma metabolite fractions. A detailed description of the radiometabolite analyses has been given elsewhere. 8 Satisfactory blood data were available for six controls and eight AD patients; detailed information about missing blood data can be found elsewhere. 8 Dynamic PET acquisition was performed in list mode, and images were reconstructed in 22 frames (1 × 15, 3 × 5, 3 × 10, 4 × 60, 2 × 150, 2 × 300, 7 × 600 s) with a matrix size of 128 × 128×90 voxels, and were subsequently reconstructed using 3D RAMLA (voxel size of 2 × 2×2mm3). During reconstruction, all usual corrections, e.g. for attenuation, scatter, randoms, decay and dead time were performed. For brain tissue segmentation, 3D T1-weighted structural MRI scans (MPRAGE sequence) were acquired using a 3.0 Tesla Signa HDxt MRI (General Electric, Milwaukee, WI, USA).
Image analysis
Structural 3D T1-weighted MRI images were co-registered and superimposed to the PET images. Subsequently, PVElab was used to derive time activity curves (TACs) in anatomically based regions of interest (Hammers brain atlas, n = 68 ROIs). 12 Based on earlier findings, and as reference, the 2T4k_VB model was used to obtain plasma-input-derived distribution volume ratio (DVR), and SRTM was used to derive BPND (using cerebellum grey matter as reference region). In addition, the following plasma input parametric imaging methods were evaluated: Logan, spectral analyses (SA) (both 90 and 60 min), together with the following reference input parametric imaging methods: receptor parametric mapping (RPM), SRTM2, reference Logan (rLogan), multilinear reference tissue model (MRTM) 0, MRTM1, MRTM2, MRTM3A, MRTM3B (all 90 min) and SUVr50-70.13–18 For MRTM implementations, a scan duration of 90 min was used in order to have sufficient data points for fitting the model. For Logan, rLogan, RPM, SRTM2, and spectral analyses, fitting parameters (such as starting times for linear fits and number of basis functions) were optimized with reference to 2T4k_VB and SRTM. Cerebellar grey matter was used as reference region. The following bilateral anatomical regions from the Hammers atlas were excluded from analyses because these either did not consist of (cortical) grey matter tissue or/and are devoid of amyloid pathology under normal conditions: caudate nucleus, nucleus accumbens, putamen, thalamus, pallidum, corpus callosum, ventricles and brainstem.19,20 Finally 52 ROIs remained for image analyses. In addition, [18F]florbetapir SUV50–70 images were read for Aβ pathology by an experienced nuclear medicine physician (BvB) to determine the level of amyloid burden in each participant for descriptive purposes.
Statistical analyses
Statistical analyses were performed using SPSS version 20.0.0 (IBM Corp., Armonk New York, USA). χ2-tests were used for discrete variables, and t-tests for continuous demographic and clinical data. To evaluate the suitability of frequently used reference regions for [18F]florbetapir, 21 t-tests were used to compare 2T4k_VB VT values for cerebellum (1. grey matter [GM], 2. white matter [WM], 3. grey + white matter [GMWM], 4. subcortical WM, 5. brainstem and 6. pons between AD and controls). We first investigated the most optimal parametric imaging method, correlations (explained variance, r2) and slopes (i.e. bias) between 2T4k_VB DVR values and SRTM BPND and various parametric imaging methods (RPM, SRTM2, rLogan, Logan, SA, SUVr50-70 and all MRTM methods) for controls, AD patients and across groups. For correlational analyses, scaling differences between DVR and BPND (DVR = BPND + 1) are adjusted throughout the remainder of the manuscript. To investigate the impact of scan duration, DVR values obtained with RPM, SRTM2, RLogan, Logan and SA using 60 min of data were compared with those of 90 min scan data.
Results
Clinical and demographic data are presented in Table 1. There were no differences in age (controls = 63 ± 4, AD = 67 ± 6) or sex (three males and five females in both groups) between patients with AD and controls (all p > 0.05). Visual assessment of the [18F]florbetapir SUV50–70 images showed that all AD patients showed abnormal amyloid accumulation, whereas none of the controls showed significant cortical [18F]florbetapir uptake (see example Figure 1).
Clinical and demographic data and settings of parametric methods.
Note: Data are presented as mean (SD) or as frequency (percentages).

Examples of several quantitative images of a selection of parametric methods for a typical Alzheimer’s disease subject and a healthy volunteer. If available (RPM, SRTM2), we also presented (in the center white box) the corresponding R1 images reflecting tracer delivery or relative cerebral blood flow.
There were no significant differences between AD and controls with regard to the reference regions 2T4k_VB-derived Vt values (Figure 2; cerebellum GM p = 0.96, cerebellum WM p = 0.21; cerebellum GMWM p = 0.79; brainstem p = 0.12; pons p = 0.16; subcortical WM p = 0.19). Subsequent analyses were performed using cerebellum GM as a reference region because this region showed the least differences between groups.

Boxplot and whisker plots with interquartile ranges for VT values for various reference regions in AD and controls. VT values were based on 2T4k_VB model estimations using an original input function.
Comparisons between parametric values obtained using different parametric methods and 2T4k_VB (including slopes and intercepts) are presented in Table 2. Across groups, RPM DVR values showed the highest correlations and least bias (r2 = 0.95 and slope = 0.92) compared with 2T4k_VB-derived DVR values (Figure 3). In addition, Logan (r2 = 0.95; slope = 0.84), rLogan (r2 = 0.94; slope = 0.88), SRTM2 (r2 = 0.91; slope = 0.83), SUVr50–70 (r2 = 0.92; slope = 0.79) and SA (r2 = 0.91; slope = 0.88) correlated well with 2T4k_VB values. The results remained essentially unchanged when reducing the scanning time from 90 to 60 min (Table 2 and 3; Figure 3) or when performing separately for each diagnostic group, and with adequate tracer delivery (i.e. R1 images) based on RPM and SRTM2 (Figure 1). MRTM models, particularly MRTM1, correlated well with 2T4k_VB values, but generated noisy (visually) parametric images (data not shown). In a different set of analyses, parametric methods were compared with SRTM BPND (Table 3). Across groups, based on both 60- and 90-min data, RPM (Figure 3(b) and (d)) and rLogan provided the most accurate results.
Correlations and test–retest results between 2T4k_VB-derived DVR values and those seen with the tested parametric methods.
Note: Parametric methods in comparison to plasma input-derived 2T4k_VB (VT or DVR values) using 90 min scan data. The following optimized settings were used for each parametric method (RPM= 0.01–0.1, 50 basis functions; SRTM2 = 0.01–0.1, 50 basis functions; rLogan = 30–90 min; Logan = 30–90 min; Spectral analyses = 0.000167–0.008 (start-end), 50 basis functions. Test–retest results were based upon the average variation of all regions of interest.

Correlations between RPM DVR (panel A and C [controls and AD patients respectively]), 2T4k_VB-derived DVR and, RPM BPND (panel B and D) and SRTM BPND for +1both 90- and 60-min scan durations and for both AD patients and controls. Panel E shows correlations between SRTM BPND and SUVr50–70. Different colours reflect different regional estimations of each participant.
Correlations between SRTM-derived BPND and those seen with the tested parametric methods.
Note: Parametric methods compared to SRTM using 90 min scan data. The following optimized settings were used for each parametric method (RPM= 0.01–0.1, 50 basis functions; SRTM2 = 0.01–0.1, 50 basis functions; rLogan = 30–90 min; Logan = 30–90 min; Spectral analyses = 0.000167–0.008 (start-end), 50 basis functions.
Finally, we compared DVR TRT for various parametric methods (Figure 4; Table 2). RPM, SRTM2 and rLogan provided excellent TRT performance for both 60- and 90-min data (TRT < 5%), and MRTM models for 90-min data. Larger TRT variability was found for plasma input-based methods, i.e. Logan and spectral analyses (TRT range; 7–18%).

Boxplot and whisker plots with interquartile ranges for test and retest scans (all grey matter voxels) for AD and controls. For RPM and SRTM2 outcome values were rescaled (DVR= BPnd + 1) for illustration purposes. For rLogan DVR values are shown.
Discussion
In this study, we investigated the performance and TRT of various parametric methods for quantifying [18F]florbetapir uptake in both mild to moderate AD patients and controls. In general, amongst reference tissue parametric methods, most parametric methods showed excellent performance, but RPM and rLogan showed the least bias compared with corresponding 2T4k_VB and SRTM estimates, together with excellent TRT performance. Plasma input parametric methods showed slightly more bias and lower TRT repeatability, with best results obtained for Logan.
We used a number of approaches to evaluate various parametric methods. Firstly, we validated each (plasma input) parametric method against the reversible two tissue compartmental model-(2T4k_VB) derived DVR and SRTM-derived BPND. In order to assess various levels of [18F]florbetapir binding, we performed comparisons across groups as well as for AD and controls separately. Across groups, all parametric methods (particularly RPM, MRTM0, MRTM1 and MRTM3A with r2>0.90 and slopes ∼1.00) corresponded well with relatively low bias relative to 2T4k_VB-derived DVR and SRTM-derived BPND. Parametric methods based on linearization techniques (i.e. rLogan, Logan) (slightly) underestimated [18F]florbetapir binding compared with both 2T4k_VB-derived DVR and SRTM-derived BPND, which is in line with another study using linearization techniques. 22 Of these linearization techniques, rLogan provided the highest accuracy (lowest bias) compared with both 2t4k and SRTM (non-linear). RPM showed the best performance of the basis function approaches (i.e. RPM, SRTM2, SA). In contrast to previous studies, methods fixing the reference k2ʹ parameter (i.e. MRTM2, SRTM2) did not result in better accuracy and higher precision due to lower levels of noise compared with methods in which k2ʹ was not fixed (e.g. RPM).22,23 For our reference tissue methods, we used cerebellum grey matter, because pathological studies have demonstrated that the cerebellum is usually devoid of amyloid pathology in mild to moderate AD.24–26 In agreement with literature, we did not find any significant differences between cerebellar Vt values between AD and controls, which suggests that this region can be used as a valid reference region. In general, accuracy seemed highest in the AD group for most parametric methods, which can be explained by the higher DVR values due to substantial amyloid accumulation in AD patients that are less susceptible to small changes. The present findings are in line with earlier studies on other amyloid tracers ([ 11 C]PiB and [18F]flutemetamol), particularly with respect to the performance of rLogan and RPM.27–29 Although we observed slightly lower correlations and positive bias between SUVr and corresponding 2T4k_VB and SRTM-based DVR values, there was still a good agreement. One explanation is that SUVr is susceptible to (altered) brain perfusion,6,30 which is commonly present in AD. 31 This could affect tracer delivery and kinetics, and could result in bias compared with quantitative methods.
Next, we evaluated TRT performance for [18F]florbetapir, showing larger TRT variability in AD than in controls. This is probably due to the negligible [18F]florbetapir binding in controls, which has also been confirmed by visual readings. 32 TRT variability was comparable for DVR and BPND, but semi-quantitative techniques (SUVr) as well as methods relying on plasma input function (Logan and spectral analyses) showed poorer TRT performance both for AD and controls. These findings are well in line with TRT studies on [ 11 C]PiB, which indicated more variability over time while using semi-quantitative techniques or plasma input models for [18F]florbetapir,6,27 and could be explained by AD-related hypoperfusion for SUVr or relatively noisy estimations when using plasma input-based models.8,31
Finally, we investigated the effects of reducing scanning time from 90 to 60 min. In a previous study, it was shown that reliable SRTM BPND required a minimum of 60 min of data. 8 Consequently, in the present study, no shorter scanning times were investigated. All quantitative parametric methods, except for spectral analyses, Logan and MRTM models, only showed minor changes in BPND or DVR for the shorter scan time, which implies that 60 min is sufficient to obtain reliable and valid [18F]florbetapir binding images. In particular, RPM provided comparable results for 60 and 90 min data with excellent TRT performance. Taken together, a dynamic acquisition of 60 min seems sufficient for RPM-derived R1 and BPND images, although an extension to 70 min can be considered to allow for the generation of SUVr50–70 images (FDA recommended interval for static [18F]florbetapir scans).
In summary, various parametric methods showed excellent performance for [18F]florbetapir, but RPM and rLogan are methods of choice for generating parametric images with excellent TRT performance particularly in AD patients and for reduced scan duration. These findings illustrate reliable ways to accurately quantify amyloid deposition, and are especially relevant for capturing regional changes of amyloid over time, for example for disease modifying therapies and clinical trials.
Footnotes
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Van der Flier received grant support from ZonMW, NWO, EU-FP7, Alzheimer Nederland, CardioVascular Onderzoek Nederland, Stichting Dioraphte, Gieskes-Strijbis Fonds, Boehringer Ingelheim, Piramal Neuroimaging, Roche BV, Janssen Stellar and Combinostics. All funding is paid to the institution.
Acknowledgements
This research was made possible by Avid Radiopharmeuticals Inc., a wholly owned subsidiary of Eli Lilly and Company (NYSE: LLY). Research of the VUmc Alzheimer Center is part of the neurodegeneration research program of the Amsterdam Neuroscience. We would like to acknowledge the participants of the Amsterdam Dementia Cohort and the healthy volunteers for dedicating their time and energy to this study.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Authors’ contributions
Sander CJ Verfaillie: acquiring data, analysing and interpreting data, drafting the manuscript, approving the final content of the manuscript. Sandeep SV Golla: acquiring data, analysing and interpreting data, drafting the manuscript, approving the final content of the manuscript. Chris Van der Weijden: acquiring data, analysing and interpreting data, critically revising the manuscript, approving the final content of the manuscript. Tessa Timmers: acquiring data, analysing and interpreting data, critically revising the manuscript, approving the final content of the manuscript. Hayel Tuncel: acquiring data, analysing and interpreting data, critically revising the manuscript, approving the final content of the manuscript. Robert C Schuit: acquiring data, analysing and interpreting data, critically contributing to the manuscript, approving the final content of the manuscript. Patrick Schober: acquiring data, critically revising the manuscript, approving the final content of the manuscript. Wiesje M van der Flier: contributing to conception and design, enhancing its intellectual content, approving the final content of the manuscript. Albert D Windhorst: contributing to conception and design, enhancing its intellectual content, approving the final content of the manuscript. Adriaan A Lammertsma: contributing to conception and design, analysing and interpreting data, drafting the manuscript and enhancing its intellectual content, approving the final content of the manuscript. Bart NM van Berckel: contributing to conception and design, analysing and interpreting data, drafting the manuscript and enhancing its intellectual content, approving the final content of the manuscript. Ronald Boellaard: contributing to conception and design, analysing and interpreting data, drafting the manuscript and enhancing its intellectual content, approving the final content of the manuscript. Boellaard is the principal investigator of this study.
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Verfaillie, Golla, Timmers, Tuncel, Schuit, Schober, Windhorst, Lammerstma, Boellaard and van Berckel report no conflict of interest.
