Abstract
Dynamic positron emission tomography (PET) data are usually quantified using compartment models (CMs) or derived graphical approaches. Often, however, CMs either do not properly describe the tracer kinetics, or are not identifiable, leading to nonphysiologic estimates of the tracer binding. The PET data are modeled as the convolution of the metabolite-corrected input function and the tracer impulse response function (IRF) in the tissue. Using nonparametric deconvolution methods, it is possible to obtain model-free estimates of the IRF, from which functionals related to tracer volume of distribution and binding may be computed, but this approach has rarely been applied in PET. Here, we apply nonparametric deconvolution using singular value decomposition to simulated and test–retest clinical PET data with four reversible tracers well characterized by CMs ([11C]CUMI-101, [11C]DASB, [11C]PE2I, and [11C]WAY-100635), and systematically compare reproducibility, reliability, and identifiability of various IRF-derived functionals with that of traditional CMs outcomes. Results show that nonparametric deconvolution, completely free of any model assumptions, allows for estimates of tracer volume of distribution and binding that are very close to the estimates obtained with CMs and, in some cases, show better test–retest performance than CMs outcomes.
INTRODUCTION
Positron emission tomography (PET) is a nuclear imaging technology that uses radioactively labeled molecules (tracers) that are injected into the body and bind, although not exclusively, to a specific target to quantify in vivo biological processes, such as blood flow, metabolism, and the distribution of proteins in the brain and the body. PET is an invaluable tool to establish the genetic and cellular basis of diseases.
Fully quantitative PET allows estimation of outcome measures such as, in the case of a reversible tracer, the total volume of distribution (VT) of the injected tracer and a series of binding potentials (BPs) between tracer and target, to indirectly determine the available amount of the target. 1 These BPs are in various ways related to the density of available target (Bmax) through the affinity of the tracer for the available binding sites (1/KD). 1
Quantification of these outcome measures relies on describing the PET signal associated with the tracer in the tissue (i.e., the time activity curve, TAC) as the convolution between the concentration of free (not protein bound) unmetabolized tracer in the arterial plasma (i.e., the metabolite-corrected input function) and a function specific to the tissue, the impulse response function (IRF). 2 The metabolite-corrected input function represents how much of the injected tracer is actually available to bind with the target. IRF is estimated from measures of the input and the tissue PET signal, and would not otherwise be known.
Traditionally, PET quantification methods have used parametric approaches, which require specifying a certain parametric form for the IRF (e.g., a mathematical function derived from a physiologic model of the system, or that approximates the system behavior). This approach therefore relies on fairly strong assumptions, which are not always substantiated in reality, and a priori knowledge of the system under investigation. Traditional PET parametric analysis quantifies these outcome measures by interpreting the PET signal in the brain tissue using compartment models (CMs) to describe the system IRF (the kinetics) of the tracer, in particular a reversible or irreversible two-tissue compartment model (2TCM). 1 The tracer signal in the brain tissue is modeled as the sum of two components: part of the tracer is specifically bound to the target of interest (specific binding); part is nonspecifically bound to macromolecules other than the target, or free in the tissue water (nonspecific binding).1,2 Identification of this model involves estimating four rate constants (K1, k2, k3, k4), which describe how quickly the tracer transits from ‘being in the plasma’, to ‘being free to bind in the tissue’, to ‘being specifically bound to the target’.1,2 However, these constants are not always all identifiable, and as a result directly estimating the specific binding component in the model can often be unstable. Therefore, full quantification of this model also requires identifying a reference region (RR) in the brain that is completely devoid of the target.1,2 In a proper (i.e., measurable, valid, reliable, independent of treatment effects and groups) RR devoid of the target of interest, the tracer can only be nonspecifically bound to macromolecules other than the target,1,2 and its kinetics can be described with a one-tissue compartment model 1 (1TCM) regulated by only two rate constants (K1′ and k2′). The tracer volume of distribution in such a region is assumed to represent (and used to estimate) a common nonspecific binding across the brain, and helps isolate the component that is only specifically bound to the target. Often, however, CMs or graphical approaches derived from CMs 2 either do not properly describe the tracer kinetics, or are not identifiable, leading to nonphysiologic estimates of the VT and of the binding to the target.
On the other hand, nonparametric deconvolution approaches do not impose any specific formulation or model for the IRF, and only rely on very loose requirements, often matched in the case of physiologic systems, for the IRF (e.g., degree of smoothness; monotonicity). Nonparametric deconvolution approaches, therefore, allow obtaining model-free estimates of the IRF, from which functionals related to tracer VT and BPs can be computed. Nevertheless, they have been rarely adopted in PET. O'Sullivan et al 3 proposed the use of nonparametric residue analysis and demonstrated it with simulations and data using the irreversible tracer [ 18 F]FDG. Only recently, Jiang et al 4 proposed a nonparametric approach based on functional principal component analysis and demonstrate it with simulations and data with the tracer [ 11 C]-diprenorphine.
Several approaches have been developed that fall between these two extremes.5–7 Generally, these involve expressing the IRF in terms of basis functions that, although derived from the principles of linear time-invariant systems and based on CMs, 8 do not impose any predefined compartmental structure. Similar to what we propose using nonparametric deconvolution approaches, these basis function approaches allow for calculation of functionals that can be computed from the IRF as PET outcome measures.
Here, we propose the use of nonparametric deconvolution via singular value decomposition (SVD)9,10 to compute functionals of the IRF of PET reversible tracers that are associated with tracer VT and BPs, and extensively compare accuracy, reproducibility, reliability, and identifiability of these different functionals with traditional outcome measures derived using CMs. SVD is an easy-to-implement nonparametric deconvolution approach that is widely used, for example, in the quantification of Dynamic Susceptibility Contrast-Magnetic Resonance Imaging data9,10 and has been used before to obtain regularized least-square solutions.7,11–13 We show relative advantages and disadvantages of using outcome measures derived from either nonparametric deconvolution or traditional parametric approaches, using simulated data and four test–retest clinical datasets with the tracers [ 11 C]CUMI-101 (CUMI), [ 11 C]DASB (DASB), [ 11 C]PE2I (PE2I), and [ 11 C]WAY-100635 (WAY).
MATERIALS AND METHODS
SUBJECTS
Four previously acquired test–retest datasets were considered: 12 scans with CUMI, 14 20 with DASB, 15 14 with PE2I, 16 and 10 with WAY. 17 The studies were performed in accordance with the Declaration of Helsinki and The Institutional Review Boards of Columbia University Medical Center and New York State Psychiatric Institute approved the protocol. Subjects gave written informed consent after receiving an explanation of the study.
Positron Emission Tomography Protocol
Tracers preparation, emission data acquisition and reconstruction, and determination and fit of the metabolite-corrected input function were obtained as previously described.14–17
Image Analysis
Images were analyzed using Matlab Release 2013b (The Mathworks, MA, USA) with extensions to FLIRT version 5.2 (ref. 18) and BET version 1.2 (ref. 19). Motion correction was applied. An RR and several anatomic target regions of interest (ROIs) were traced based on brain atlases and published reports:20,21 13 ROIs for CUMI (hippocampus, entorhinal cortex, insula, posterior parahippocampal gyrus, temporal lobe, amygdala, medial prefrontal cortex, cingulate, orbital, dorsolateral prefrontal cortex, parietal, raphe, occipital lobe; RR: cerebellar gray matter); 6 ROIs for DASB (midbrain, anterior cingulate, amygdala, dorsal putamen, hippocampus, thalamus; RR: cerebellar gray matter); 4 ROIs for PE2I (dorsal caudate, dorsal putamen, ventral striatum; RR: cerebellum); and 13 ROIs for WAY (raphe; anterior cingulate; amygdala; cingulate, dorsolateral prefrontal cortex, hippocampus, insula, medial prefrontal cortex, occipital lobe, orbital, parietal, posterior parahippocampal gyrus, temporal lobe; RR: cerebellar white matter). Selecting these ROIs restricted the analysis to regions for which the binding is expected to be relatively high (with the exception of the RR).
Simulated Data
In addition to the clinical test–retest data sets, we considered simulations designed to generate data that imitate characteristics of real data in the case of 1TCM and 2TCM, respectively.
For the 1TCM, we used the metabolite-corrected input function measured in one of the considered DASB scans, a tracer whose kinetics is well described by a 1TCM, 15 and set the K1 and k2 values in six selected regions (cerebellar gray matter, temporal lobe, hippocampus, dorsal caudate, amygdala, and ventral striatum) based on existing data on DASB healthy control subjects. 15 For the 2TCM, we used the metabolite-corrected input function measured in one of the considered CUMI scans, a tracer whose kinetics is well described by a 2TCM, 14 and set the K1 to k4 values in five selected regions (cerebellar gray matter, hippocampus, temporal lobe, occipital lobe, and cingulate) based on existing data on CUMI healthy control subjects. 14
In both cases, we simulated Gaussian noise with zero mean and standard deviation (SD) calculated as SD(ti) = α. σ/w(ti), where w(ti) is the root square of the duration of the time frame during which the PET data corresponding to time ti has been acquired, and σ indicates the level of noise that we derived from the weighted residual sum of square values obtained by fitting DASB and CUMI scans, respectively, with CMs. Increasing α allows us to study the various estimation techniques in situations with proportionately more noise than what we encounter in our real-data situations. We simulated five noise conditions setting α equal to 1, 2.5, 5, 7.5, and 10, respectively. In all cases and noise conditions, we simulated 1,000 TACs for each of the selected regions.
The complete list of simulated parameter values and noise levels is reported in the Supplementary Material.
Interpretation of Positron Emission Tomography Data
The PET data are commonly modeled as a convolution of the metabolite-corrected input function and the tracer IRF in the tissue:
2
Outcome Measures
Expressing (1) as in (5), and considering nonparametric inference for the residue, gives rise to a deconvolution problem, providing a novel approach to the analysis of dynamic PET data that is not reliant on specific compartmental modeling assumptions.
3
However, once the IRF has been nonparametrically estimated via deconvolution, it remains to establish which are the outcomes that can be actually computed from the IRF, and how they compare with respect to traditional outcomes used in PET. The first outcome measure usually computed in PET for a reversible tracer is VT, whose formulation in terms of the 2TCM K1 to k4 values is:
2
While the use of the first quantile of the residence was suggested for the irreversible tracer [
18
F]FDG,
3
we propose here to further generalize and optimize this nonparametric formulation of VT for reversible tracers, by considering not only the first quantile of the residence density (i.e., R−1(1/4)), but also a large number of quantiles (i.e., R−1(n), for n = N1, ⃛, NQ), and then averaging over all of them:
Singular Value Decomposition
Potentially, any of many nonparametric deconvolution approaches can be used to estimate the IRF starting from known metabolite-corrected input function and tracer TACs in the brain tissue. O'Sullivan et al
3
propose a technique based on regularized cubic B-spline approximation of the residence time distribution, but do not claim any optimality of the B-spline estimators. Jiang et al
4
use functional principal component analysis. Here, we propose the use of SVD. Specifically, if the tissue concentration CT(t) of the tracer in response to a metabolite-corrected input function CP(t) is given by Equation (5), then assuming that the metabolite-corrected input function and the tracer TACs in the brain tissue are measured at equally spaced time points t1, t2, ⃛, tN, with Δt as the space between time points of measurement, Equation (5) can be discretized, assuming that over short time intervals Δt, the residue function and metabolite-corrected input function are constant in time:
In matrix form, (12) is expressed as:
Equation (13) can be solved iteratively for the elements of b. This approach is, however, extremely sensitive to noise, potentially causing the estimated R(t) to oscillate. A technique to solve (13) is SVD, which constructs matrices V, W, and UT, so that the inverse of A in (13) can be written:
Here, we propose an automatic data-driven algorithm for selecting the threshold for SVD for each ROI that is based on cross-validation. Specifically, for each TAC in each considered ROI, the threshold is automatically selected by:
For i = 1 to i = N (with N the total number of original sampling time points)
Removing the TAC point corresponding to the sampling time point ti;
For threshold T = 2% to T = 40% (with step = 1%), applying SVD with threshold value T to the TAC containing the remaining N – 1 data points;
Storing the threshold T that gives the best prediction of the TAC data at the time point ti that was left out (i.e., calculated as the minimum of the square difference between original TAC value at the excluded time point t i and the reconvolved TAC obtained with SVD at the same ti);
Selecting the average value (across excluded time points ti) of the stored thresholds.
The suggested threshold range is a generous range that has proven optimal before for physiologic signals.9,10
Metrics of Comparison for Simulated Data
For all simulated cases and noise conditions, we ran SVD with the automatic threshold selection, and calculated VT-NP1, VT-NP2, and VT-end from the deconvolved IRF estimates. BPP-NP1, BPP-NP2, and BPP-end were derived by subtracting VT-NP1, VT-NP2, and VT-end of the RR. We compared the distribution of outcome measures estimates obtained with the true simulated values, and calculated variance (across the 1,000 estimates of the outcome measure in each ROI, the median variance across ROIs is reported) and bias (as distance between median of the 1,000 estimates of the outcome measure in each ROI and the corresponding true value, the median bias across ROIs is then considered) of such outcome measures.
Application to Test–Retest Data
For all tracers, subjects, and ROIs, we calculated VT and BPP = VT – VTref using CMs (2TCM for CUMI, 14 PE2I, 16 and WAY; 17 1TCM for DASB 15 ); the RR was the gray matter (for CUMI 14 and DASB 15 ), the white matter (for WAY 17 ), or the totality (for PE2I 16 ) of the cerebellum. We also ran SVD with the automatic threshold selection, and calculated VT-NP1, VT-NP2, and VT-end from the deconvolved IRF. BPP-NP1, BPP-NP2, and BPP-end were derived by subtracting VT-NP1, VT-NP2, and VT-end, respectively, of the RR.
We first compared VT-NP1, VT-NP2, and VT-end versus VT values using correlation (r), slope and intercept of the regression line, with VT values being the independent variables. BPP-NP1, BPP-NP2, and BPP-end were similarly compared with BPP values.
We then compared the test–retest performance of VT, VT-NP1, VT-NP2, VT-end, and BPP, BPP-NP1, BPP-NP2, BPP-end, and using five metrics: 15
Percent difference (PD), calculated as the absolute difference between the test and retest values divided by their average (the lowest the PD, the highest the reproducibility);
Within-subject mean sum of squares (WSMSS): WSMSS =
Interclass correlation coefficient (ICC): ICC =
Variance of an outcome measure across subjects (sample variance, across subjects, of all observations for each ROI); and
Identifiability based on bootstrap resampling of residuals; we computed 100 bootstrap samples 26 of the TAC data for each subject's ROI, estimated the outcome measures for each of these samples, and measured the variability of these estimates using the robust median absolute deviation criterion: median [|Zj – median(Z1, Z2, ⃛, Z100)|, j = 1, 2, ⃛, 100].
RESULTS
Test–Retest Data
Comparison with CMs estimates. Figures 1 and 2 show for all tracers, subjects, and ROIs, the scatter plots of VT-NP1, VT-NP2, and VT-end, and BPP-NP1, BPP-NP2, and BPP-end, respectively, obtained using SVD versus the VT and BPP values obtained using CMs. The identity and regression lines, and the numerical results of the regression analysis, are reported in each plot. Correlation is high (> 0.90) in all cases, with exception of BPP-NP1 for PE2I (r = 0.740) and BPP-end for CUMI (r = 0.875).

Scatter plots of VT-NP1, VT-NP2, and VT-end obtained by singular value decomposition (SVD) versus the VT values obtained using compartment models (CMs) for all considered tracers, subjects, and regions of interest (ROIs). The identity (solid) and regression (dotted) lines are reported in each plot, together with the numerical results of the regression analysis. VT, total volume of distribution.

Scatter plots of BPP-NP1, BPP-NP2, and BPP-end obtained by singular value decomposition (SVD) versus the BPP values obtained using compartment models (CMs) for all considered tracers, subjects, and regions of interest (ROIs). The identity (solid) and regression (dotted) lines are reported in each plot, together with the numerical results of the regression analysis. BPp, binding potential.
However, the slope values differ across outcome measures and tracers. As expected from its formulation in comparison with VT, VT-end underestimation increases as the value of the estimated outcome increases. VT-end shows the lowest slope values across the different outcomes for all tracers: the minimum value is 0.466 (PE2I) and the maximum is 0.619 (DASB). VT-NP1 and VT-NP2 show similar results with each other, and the slope values are closer to unity than with VT-end (VT-NP1: minimum value is 0.511, PE2I, maximum is 1.019, WAY; VT-NP2: minimum value is 0.827, PE2I, maximum is 0.944, DASB). As expected by definition, VT-NP2 shows higher correlation than VT-NP1 for all tracers, while the slope is closer to unity than with VT-NP1 for DASB and PE2I.
For all tracers, slope values for BPP decrease with respect to the comparison with VT values. In general, BPP-NP2 shows the best correlation values for all tracers (0.922 for CUMI, 0.986 for DASB, 0.973 for PE2I, and 0.942 for WAY), and the best slope values for DASB (0.928) and PE2I (0.794). BPP-NP1 shows the best slope value for CUMI (0.858) and WAY (1.044).
Test–retest metrics. Figures 3 and 4 show the boxplots of the five considered metrics for all tracers and formulations of the outcome measure (VT, VT-NP1, VT-NP2, VT-end and BPP, BPP-NP1, BPP-NP2, BPP-end, respectively). In each box, the central mark is the median value of the metric (across subjects/pairs and ROIs), the edges of the box are the 25th and 75th percentiles, the whiskers extend to the most extreme data points of the metric not considered outliers, and outliers are plotted individually. A complete list of the numerical results corresponding to Figures 3 and 4 is reported in the Supplementary Material.

Boxplots of the five considered test–retest metrics for all considered tracers and the four different estimations of the outcome measure (VT, VT-NP1, VT-NP2, VT-end). In each box, the central mark is the median value of the metric (across subjects/pairs and regions), the edges of the box are the 25th and 75th percentiles, the whiskers extend to the most extreme data points of the metric not considered outliers, and outliers are plotted individually. ICC, interclass correlation coefficient; PD, percent difference; VT, total volume of distribution; WSMSS, within-subject mean sum of squares.

Boxplots of the five considered test–retest metrics for all considered tracers and the four different estimations of the outcome measure (BPP, BPP-NP1, BPP-NP2, BPP-end). In each box, the central mark is the median value of the metric (across subjects/pairs and regions), the edges of the box are the 25th and 75th percentiles, the whiskers extend to the most extreme data points of the metric not considered outliers, and outliers are plotted individually. BP, binding potential; ICC, interclass correlation coefficient; PD, percent difference; WSMSS, within-subject mean sum of squares.
Table 1 summarizes which outcome measure provides the best performance. For the clinical data, the outcome measures with the best correlation and slope (with the corresponding outcome measures estimated with CMs) are reported separately from the test–retest metrics; the best performance in the test–retest metrics is reported both considering and not considering VT-end (or BPP-end).
Summary of which outcome measure provides the best performance in each of the simulated and clinical data sets
Abbreviations: BPp, binding potential; CM, compartment model; ICC, interclass correlation coefficient; PD, percent difference; 1TC, one-tissue compartment; 2TC, two-tissue compartment; VT, total volume of distribution; WSMSS, within-subject mean sum of squares.
For the simulated data, the outcome measures with the best variance and best bias (with respect to the true value of the outcome measure) are reported. For the clinical data, the outcome measures with the best correlation and slope (with the corresponding outcome measures estimated with CM) are reported separately from the test–retest metrics; the best performance in the test–retest metrics is reported considering and not considering VT-end (or BPP-end).
VT-end shows the lowest WSMSS and variance, and the best identifiability, for all tracers. VT-end also shows the lowest PD for CUMI, DASB, and PE2I, and the highest ICC for CUMI and DASB. Correspondingly, BPP-end shows the lowest WSMSS and variance for all tracers. BPP-end also shows the highest ICC for all tracers, besides WAY, the lowest PD for DASB and PE2I, and the best identifiability for all tracers, with exclusion of DASB.
VT and BPP show the lowest PD and the highest ICC for WAY. BPP also shows the best identifiability for DASB.
VT-NP1 and BPP-NP1 show the worst identifiability for all tracers. BPP-NP1 also shows the lowest ICC for all tracers, besides WAY.
VT-NP2 shows the highest ICC for PE2I (0.80), while BPP-NP2 shows the lowest PD for CUMI.
Simulated Data
Figures 5 and 6 show the distribution of outcome measures obtained using SVD in the 1TCM and 2TCM case, respectively (top panel: VT-end, VT-NP1, VT-NP2; bottom panel: BPP-end, BPP-NP1, BPP-NP2), with simulated noise level corresponding to α = 1. Each region is reported separately (the cerebellum gray matter is the RR for calculation of BPP-end, BPP-NP1, BPP-NP2; therefore, it is not reported for that outcome measure); the dotted vertical lines represent the median of the distribution for each formulation of the outcome measure, while the black solid vertical line represents the true simulated value for that outcome measure and region.

Distribution of outcome measures estimated using singular value decomposition (SVD) and the three nonparametric formulations (top panel: VT-end, VT-NP1, VT-NP2; bottom panel: BPP-end, BPP-NP1, BPP-NP2) across the 1,000 simulated one-tissue compartment model (1TCM) cases in each region; the six simulated regions are reported separately (the cerebellum gray matter is used as the reference region (RR) for the calculation of BPP-end, BPP-NP1, BPP-NP2; therefore, it is not reported for that outcome measure); the dotted vertical lines represent the median of the distribution for each formulation, while the black solid vertical line represents the true simulated value for that outcome measure and region. The data underlying the simulations were derived from clinical [ 11 C]DASB scans modeled with 1TCM. BPp, binding potential; VT, total volume of distribution.

Distribution of outcome measures estimated using singular value decomposition (SVD) and the three nonparametric formulations (top panel: VT-end, VT-NP1, VT-NP2; bottom panel: BPP-end, BPP-NP1, BPP-NP2) across the 1,000 simulated two-tissue compartment model (2TCM) cases in each region; the five simulated regions are reported separately (the cerebellum gray matter is used as the reference region (RR) for the calculation of BPP-end, BPP-NP1, BPP-NP2; therefore, it is not reported for that outcome measure); the dotted vertical lines represent the median of the distribution for each formulation, while the black solid vertical line represents the true simulated value for that outcome measure and region. The data underlying the simulations were derived from clinical [ 11 C]CUMI scans modeled with 2TCM. BPp, binding potential; VT, total volume of distribution.
In the 1TCM case, VT-end always underestimates VT the most across the three formulations, except for the cerebellum gray matter, although with much less variability. In the 2TCM case, VT-end performs the best for the three regions with the lowest VT (cerebellum gray matter, occipital lobe, and cingulate), but underestimates the most the region with the highest VT (hippocampus). Correspondingly, BPP-end always underestimates BPP the most in both cases.
In the 1TCM case, VT-NP1 and VT-NP2 show similar median values for VT, with VT-NP2 performing better than VT-NP1 for the three regions with highest simulated VT (dorsal caudate, amydgala, ventral striatum). In the 2TCM case, it is VT-NP1 that slightly outperforms VT-NP2 in all regions in terms of bias, but not variance of the estimates distribution. Corresponding BPP-NP1 and BPP-NP2 values are similar in the 1TCM case, while BPP-NP1 tends to overestimate the BPP in all the regions in the 2TCM case. BPP-NP2 shows the smallest bias across regions and less dispersed distributions than the ones of BPP-NP1 in the 2TCM case.
As reported in Table 1, with the noise present at the ROI level (α = 1), VT-end and BPP-end show the best variance in both simulated cases: 0.039 and 0.084 (1TCM); 0.043 and 0.090 (2TCM). VT-end shows the smallest bias (–0.298) for 2TCM, while VT-NP2 shows the smallest bias for 1TCM (–0.426), and BPP-NP2 shows the smallest bias for both 1TCM (0.129) and 2TCM (0.351).
The results relative to the noise analysis are reported in the Supplementary Material. In both 1TCM and 2TCM cases, the bias of the VT-end and BPP-end estimates is almost constant across the five noise conditions. The corresponding variance, although it increases with the noise in the data, remains significantly lower than the corresponding variance of the other two nonparametric formulations for the same outcome measure.
In both 1TCM and 2TCM cases, the pattern with which the bias of VT-NP1 and BPP-NP1, and VT-NP2 and BPP-NP2, changes with noise is similar, with both formulations that tend to overestimate the true value at elevated noise levels. In the 1TCM case, the bias of VT-NP2 and BPP-NP2 increases the most with the noise level. The corresponding variance values increase with noise, with VT-NP1 and BPP-NP1 providing the worst performance in the 2TCM case. In the 1TCM case, the variance of VT-NP1 and VT-NP2, and BPP-NP1 and BPP-NP2, is almost identical.
DISCUSSION
We applied nonparametric deconvolution to estimate the IRF of PET reversible tracers in several regions, using both simulated and clinical test–retest data sets, computed several functionals of the IRF to estimate tracer VT and BPs, and extensively compared these functionals with traditional outcome measures derived using CMs. We showed that nonparametric deconvolution, which does not require any model assumptions, and the functionals that can be computed from the recovered IRF can provide estimates of VT and BPP that are not only very close to, and highly correlated with, the estimated obtained with traditional CMs, but can also show better test–retest performance than outcome measures derived from CMs. Therefore, nonparametric deconvolution and the proposed functionals offer a promising alternative to CMs for the quantification of PET dynamic data, especially when CMs assumptions do not hold or when CMs provide poor fits and/or unphysiologic estimates of the outcome measures for a specific PET tracer.
The optimized formulation of VT we proposed in Equation (10), and the corresponding BPP-NP2 that can be derived, provide the best performance in terms of bias with respect to, and correlation with, outcome measures with traditional CMs in the majority of the simulated and clinical considered data sets (Table 1). VT-end as in Equation (8), and the corresponding BPP-end that can be derived, provide the best test–retest performance across metrics and, overall, considered tracers; outcome measures derived with traditional CMs show superior test–retest performance than VT-end and BPP-end only for some of the metrics in the case of DASB (identifiability of BPP), PE2I (ICC of VT), and WAY (PD and ICC of VT and BPP). Therefore, VT-end and BPP-end are stable (with respect to noise in the TACs) and reliable outcome measures that could be used, for example, to discriminate different clinical populations from each other and from healthy controls. However, as one would expect from their definitions, results on simulated and clinical data sets suggest that VT-end and BPP-end often provide the greatest underestimation of VT and BPP (with the exception of PE2I BPP, for which BPP-NP1 shows the greatest underestimation). This is perhaps less of an issue if the purpose of the study is to examine differences between groups (or across values of some continuous variable) but if the primary goal of the analysis is estimation of standard outcome measures, the bias of these estimates must be considered. In such a situation, VT-NP1 and/or VT-NP2 (and the corresponding BPP-NP1 and BPP-NP2) could be considered, as they show better test–retest performance than CMs outcomes according to the majority of the considered metrics for all tracers, with the exception of WAY.
An intrinsic limitation of the nonparametric approach is that the IRF can be estimated only up to the end of the PET scan, and no inference up to infinity can therefore be made without some strong assumptions. For example, it may be the case that, for tracers that slowly reach equilibrium, the nonparametric estimation of VT and BPP according to equations (9 and 10) is biased by the fact that, by the end of the scan, the IRF has not yet reached a certain fraction of its initial value, and therefore the corresponding kth quantile of the residence cannot be determined. Estimation of VT-end and BPP-end instead might be preferred in these cases, with proper interpretation of the results.
We have implemented here nonparametric deconvolution using SVD9,10 with an automatic data-driven selection of the threshold for each ROI. This is a fast, easy to implement, and widely adopted approach to deconvolution that easily lends itself to implementation at the voxel level. However, we claim no optimality in either the choice of the deconvolution approach or the selection of the thresholds.
It is known that SVD performance can be sensitive to the level of the noise present in the data.27,28 Our simulation results at different noise level (see Supplementary material) suggest that, at the voxel level, or for particularly noisy tracers, either VT-end and BPP-end should be preferred as outcome measures because they are less affected by noise, or more robust approaches to deconvolution, such as nonlinear stochastic regularization 28 or kernel-based linear system identification,29,30 should be considered for reconstructing the IRF.
The cross-validation algorithm we propose here for thresholding the singular values in the decomposition to constrain the propagation of noise from the data to the IRF estimates is data driven and has proven to be flexible enough to provide reasonably regularized solutions for the considered four tracers and simulated data. However, other strategies exist,7,31–33 which may provide improved solutions. The optimization of the thresholding algorithm, which likely depends on the tracer at hand, is however beyond the scope of this article.
Singular value decomposition, as many other nonparametric deconvolution approaches, requires the input data to be equidistantly sampled. 23 The PET data involve radioactive decay, and are usually acquired using increasing frame lengths to ensure a comparable number of counts and thus statistical quality per data point. This implies that the PET data need to be interpolated on an equally-spaced grid of temporal values before applying SVD. The possible effect of the interpolation on the estimation of VT and BPP should be further investigated, especially when using SVD at the voxel level where, due to the large number of voxels to quantify, adopting a very fine interpolation grid can lead to out-of-memory issues during the quantification process. While Shannon's sampling theorem 23 implies that any band-limited signal can be reconstructed from its regularly spaced samples, the solution of the nonuniform sampling problem poses more difficulties. 24 There exist a vast literature on nonuniform sampling theory34,35 and methods claiming to efficiently reconstruct a function from its nonuniform samples,36–38 including nonuniform SVD. 39 The investigation of the performance on PET data of nonuniform deconvolution approaches is however beyond the scope of this article.
It is important to state that comparing the performance of the proposed nonparametric outcome measures with the corresponding CMs estimates is not generally useful when the CM is not valid for the tracer in question. Here, we have considered four tracers for which we have already assessed the validity of the CMs,14–17 so the proposed comparison is meaningful, although it may be questioned in certain individual regions. Certainly, the possibility of quantifying the binding of PET tracers model-free and nonparametrically is particularly attractive for those tracers for which the assumptions of CMs do not hold or CMs do not provide a good description of the data. In these cases, however, the CMs estimates of the binding cannot be used as a reference to evaluate the performance of the nonparametric approach, and validation with in vitro results would be recommended.
Future directions of investigation also include the extension of the presented nonparametric approach to quantification in the absence of a metabolite-corrected input function.
CONCLUSIONS
Nonparametric deconvolution does not require any model assumptions and allows for estimates of VT and BPP that are very close to the estimates obtained with traditional CMs, and can also show superior test–retest performance. Nonparametric deconvolution and the proposed functionals therefore offer a viable alternative to CMs for the quantification of PET dynamic data, especially when CMs assumptions do not hold or CMs provide poor fits and/or unphysiologic estimates of the outcome measures for a specific PET tracer.
AUTHOR CONTRIBUTIONS
Dr Zanderigo was responsible for ideation of the methodologies, drafting of the manuscript, creation of the software, analysis and interpretation of the data. Dr Parsey contributed to conception and revision of the manuscript, and interpretation of the data. Dr Ogden contributed to ideation of the methodologies, interpretation of the data, critical revision of the manuscript, and final approval.
DISCLOSURE/CONFLICT OF INTEREST
The authors declare no conflict of interest.
Footnotes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
