Abstract
Tissue perfusion plays a critical role in oncology. Growth and migration of cancerous cells requires proliferation of networks of new blood vessels through the process of tumor angiogenesis. Many imaging technologies developed recently attempt to measure characteristics pertaining to the passage of fluid through blood vessels, thereby providing a noninvasive means for cancer detection, as well as treatment prognostication, prediction, and monitoring. However, because these techniques require a sequence of successive imaging scans under administration of intravenous imaging tracers, the quality of the resulting perfusion data depends on the acquisition protocol. In this paper, we explain how to infer stability for stochastic curve estimation. The topic is motivated by two recent attempts to determine stable acquisition durations for acquiring perfusion characteristics using dynamic computed tomography, where in inference used inappropriate statistical methods. Notably, when appropriate statistical techniques are used, the resulting conclusions deviate substantially from those previously reported in the literature.
Keywords
Introduction
Many imaging technologies developed recently attempt to measure characteristics pertaining to the passage of fluid through blood vessels, thereby providing a noninvasive means to quantify vascular features. 1 Perfusion is of particular interest in oncologic imaging, where tissue, and in particular tumor perfusion, plays a critical role. The growth and migration of cancerous cells requires proliferation of networks of new blood vessels through the process of tumor angiogenesis, triggering modifications to the vasculature of the surrounding host tissue. In principle, measurements obtained from perfusion imaging provide physiological correlates for neovascularization induced by tumor angiogenesis. 2
Thus, many investigators in cancer biology and oncology are attempting to use these features to better understand the pathophysiological processes at play in the tumor microenvironment. Ultimately, these efforts aim to identify biomarkers based on perfusion phenotypes that could be utilized for cancer detection, disease prognostication, as well as prediction and monitoring of therapeutic response to intervention.3–6
Perfusion computed tomography (CTp) is one such functional imaging technology that enables noninvasive observation and quantification of perfusion characteristics. Physiological models have been developed to quantify a variety of perfusion characteristics (such as tumor blood volume, capillary permeability) that derive from measuring temporal changes in contrast enhancement obtained from CT images acquired over a period of time during intravenous administration of a contrast medium. 7 Consequently, CTp provides a quantitative basis for evaluating vasculature heterogeneity. The functional imaging technology has been utilized in a number of organs and tumors, including the prostate, colorectal, head and neck, lung, liver, and normal tissue.
Because such techniques require a sequence of successive scans under intravenous administration of a contrast medium, the quality of the resulting perfusion data depends on the manner in which the data is acquired. When specifying an acquisition protocol, investigators must determine several factors that could affect the quality of the resultant perfusion measurements. For example, one important factor involves the delineation of the preenhancement setpoint, or time/image at which the arterial up-slope is considered to first occur.8,9 In order to avoid excessive radiation exposure, patients are not scanned continuously, but rather at regular intervals over the course of the acquisition. Thus, investigators must determine the interscan subsampling interval to use in the acquisition. In a recent study, 10 the length of the subsampling intervals was shown to significantly impact the resulting perfusion characteristics. In addition, investigators must determine an acquisition duration, or the extent of time for which the patient must undergo repeated scanning, that yields stable quantification of the perfusion characteristics. To limit radiation exposure, the acquisition duration should be minimized. Moreover, because the tissue type maybe unknown before the diagnosis, any proposed duration of acquisition must ensure stable quantification of CTp characteristics for both malignant and healthy tissues before CTp can be used for detection and prognostication.
In the cases of two recent attempts to determine stable acquisition duration for acquiring perfusion characteristics in body tumors using dynamic CT,11,12 recommendations were put forth that were inferred using statistical methods that are inappropriate for addressing this objective. In this context, the investigators implemented t-tests between CTp values obtained at discrete acquisition durations using a traditional hypothesis-testing framework. Information pertaining to neighboring scans was ignored in the inference, and stability was concluded in the absence of significant differences for tests between successive scans. In the case of one study, 12 conclusions were also based on measures of linear dependence between pairs of intrapatient observations at successive scans.
It is well known that the traditional formulation of the hypothesis-testing problem considers equality of effects under the null hypothesis, with the alternative hypothesis characterizing inequality. The corresponding P-value provides a measure of evidence against the null hypothesis, not for it. Because the roles assumed by the null and alternative statements are logically asymmetric, equivalence should not be inferred from the absence of a significant difference, since, intuitively, any underpowered study would inevitably reach this conclusion. Moreover, it has been well described that measures of linear dependence are misleading and inappropriate for evaluating equivalence or “agreement”. 13 A proper analysis requires an equivalence-testing framework that measures the evidence against nonequivalence in relation to a prespecified equivalence region.
In addition, the pairwise approaches to inference utilized in both studies ignore temporal trends in the data, masking stabilization as a function of acquisition time. Notably, when appropriate statistical techniques are applied, conclusions deviate substantially from those provided by the aforementioned authors. 14 Heretofore, an appropriate method for inferring stable acquisition durations for acquiring imaging biomarkers from dynamic imaging modalities has yet to be explained. Nor has the concept of “equivalence testing” 15 been appreciated by the oncologic imaging community. In this paper, we explain how statistical modeling can be used to infer stable domains for stochastic curve estimation.
The ideas in this paper are presented in the following sequence. First we present the general method. Thereafter, we demonstrate the method by evaluating acquisition durations for a perfusion biomarker acquired in metastatic sites to the liver as well as healthy liver tissue using semiparametric model inference. We provide concluding remarks in the last section.
Inferring Stable Acquisition Durations from Stochastic Curves
This section presents a formal definition for stability as well as a general approach to inference based on equivalence testing.
Stability Criterion
Let t > 0 denote the acquisition duration, and let f(t) characterize the nonstochastic mapping of a perfusion-based biomarker as a function of t. Let
The function has attained stability at time t if its velocity is bounded within a neighborhood of zero for all subsequent time points. Thus, stability condition (1) is satisfied for all δ 0 if f'(t) reaches a steady state (or is time invariant) beyond t0: f”(t*) = 0, for all t* > t0. Therefore, we can evaluate acquisition durations for time invariance by fitting smooth curves to the observed data and conducting inference on the corresponding derivatives to assess their relative proximity to zero as a function of time.
Stability Inference
Let y
t
denote a stochastic response variable associated with a perfusion biomarker acquired for one patient region. A general nonparametric additive model applies local regression to a low-dimensional projection of the data. For example, we may assume that a one-to-one transformation of y, g(y), varies symmetrically about mean f(t) with random error ∊ and constant error variance
Mapping f(t) represents an arbitrary function of time, which can be estimated using smoothing splines or lowess. 16
The traditional approach to statistical analysis through hypothesis testing is valid when the aim of an experiment is to evaluate the evidence for differences among experimental conditions. However, the condition of “stability” is actually a statement of equivalence. A proper analysis requires an equivalence-testing framework that measures the evidence against nonequivalence in relation to a prespecified “equivalence” region.
Let τ > 0 denote the maximum observation period, 0 < t < τ, and let Lα(t) and Uα(t) denote the lower and upper bounds of the 100(1 − α)% simultaneous confidence band (CB1−α) for f'(t) over the interval (t, τ). Statistically, one should infer that f(t) is stable at acquisition duration t0 at significance level α if the corresponding CB1−α encompassing all subsequent acquisition durations are contained within a sufficiently small neighborhood of zero (−λ, λ), that is
The approach is analogous to testing null hypotheses of nonequivalence with equivalence region (–λ, λ). The boundary parameter λ represents the minimal magnitude of deviation that is meaningful in the context of the analysis. This may be specified as a scaled multiple of the estimated residual error standard deviation.
Case Study in CT Perfusion
In this section, we demonstrate the method for stability inference presented in the previous section using semiparameteric regression with implementation to the perfusion characteristic most commonly utilized in oncology, namely blood flow (BF). Specifically, spline regression is used to avoid prespecification of a parametric form for the underlying functional relationships, which are often unknown. As demonstrated in Ref. 14, deconvolution modeling of dynamic CT requires acquisition durations of sufficient length in order to achieve accurate quantification of a patient's perfusion characteristics. Before attaining steady states, these models yield biomarkers that are characterized by periods of noisy fluctuation. The dynamic periods are explained in part by the initial absorption of contrast. Ensuring stable quantification for the various perfusion scanning applications in oncology requires the implementation of acquisition protocols that use acquisition durations that yield relative time-invariant mappings. We will use the statistical model to flexibly estimate the mean velocity in the presence of stochastic curves. The stability criterion will be used to infer a minimum stabilization time for blood flow when acquired in metastatic sites in liver as well as healthy liver.
CT Perfusion Data
The study collected data on 16 patients with neuroendocrine liver metastases in whom CTp had been undertaken on a target lesion in the liver. CT perfusion images were obtained from a dual-phase protocol spanning a duration of 590 seconds. BF was acquired using a deconvolution analysis with the distributed parameter physiological model.7,17,18 BF is the rate measured as milliliters per minute per 100 grams of liver tissue (mL/min per 100 g). The dataset analyzed here consisted of 59 eight-slice cine images temporally sampled at 0.5 seconds from the phase 1 acquisition, together with 8 anatomically matched images from the phase 2 acquisition. A final BF value was obtained for each region of interest (ROI) by averaging across each of the eight CT slice images. There were 25 separate ROIs where BF was obtained in liver metastases and 27 separate ROIs where BF was obtained in normal liver tissue. The observed BF values were transformed to the log scale for the purpose of adjusting for conditionally asymmetric residual error at a given acquisition time and to mitigate heteroskedasticity as a function of acquisition time. Figure 1 provides the scatterplots of the observed log BF as a function of acquisition time for both types of tissue. Solid lines connect observations acquired from the same ROI, while dots characterize the observed scan times. The figure suggests that BF tends to be both elevated and more heterogenous in tumor sites when compared to normal liver.

Scatterplots of log blood flow measurements from the liver perfusion study in tumor (left) and normal liver (right) as functions of acquisition time. Solid lines connect repeated observations obtained from the same region of interest; dots characterize scan times.
Semiparametric Model
We model the CTp curves using penalized splines due to their smoothness properties and the fact that a unified framework for computing simultaneous confidence bands has recently been established. 19 Our case study analysis uses a truncated polynomial following the mixed model framework established by Ruppert et al. 20 , thereby enabling direct estimation and inference on the derivatives of the CTp curves as a function of acquisition duration.
Penalized Spline Regression
A spline basis is in essence a linear combination of piecewise polynomials. Let s denote a K × 1 vector of knot locations. At time t, a truncated polynomial spline basis of degree D is defined as
Penalized estimation uses constrained optimization to attempt to strike a balance between smoothness and a close fit to the data.
20
In the mixed model representation, the spline coefficients u1,…,u
k
, are modeled as independent and identically distributed (i.i.d.) random effects with variance
Let
After estimating the variance components with REML, we can obtain the estimated best linear unbiased predictor (BLUP) of f(t), which follows as
In addition, the residual sum-of-squared (RSS) errors can be used for cross validation,
Derivative Inference
Point and interval estimators can be derived for the rate of change as a function of acquisition time. Let
The estimated BLUP for the derivative
Using the mixed model formulation, Ruppert et al.
20
demonstrated that the corresponding large sample covariance
Thus, approximate 100(1 − α)% interval estimators can be computed by selecting an appropriate asymptotically justified critical value
A 100(1 − α)% pointwise confidence interval results from fixing
Results
Acquisition durations for BF from the liver perfusion study were inferred for CTp curves obtained in metastatic sites as well as regions of healthy liver (from the left or right lobes). For each tissue type, penalized spline regression analysis was implemented using truncated polynomial spline bases of D = 1, 2, 3. Our analysis used the package AdaptFitOS in statistical software R to implement REML and to select
Knots were placed at evenly spaced quantiles of the observed acquisition time points. While, in principle penalized spline estimators are robust to knot selection, because the sample sizes are rather small, the total number of knots were selected using the “corrected” version of AIC provided by Hurvich and Simonoff,
21
AIC c was also used to compare goodness of fit among splines of varying degree. Table 1 provides the AIC “optimal” numbers of knots for each spline degree and analysis in tumor and normal sites. In addition, the resulting AIC c and RSS are provided for each model. Figure 2 plots the point and interval estimates for the BF maps obtained in tumor (top) and normal liver (bottom), as functions of acquisition time. The third-degree truncated polynomial basis clearly resulted in a smooth fit when compared to the piecewise linear model (first degree). An intermediate degree of smoothness is evident for the fit corresponding to the second-degree spline. As evident in Table 1, the first-degree spline yielded the best tradeoff between goodness of fit and model complexity as defined by AIC c in tumor. For sites in normal liver, the extent of enhanced smoothness provided by the third-degree polynomial yielded the best tradeoff.

Estimated best linear unbiased predictors of log blood flow as functions of acquisition time in tumor (top) and normal liver (bottom) using penalized spline regression with truncated polynomial bases of specified degree. Estimated curves are represented by solid black lines. Shaded regions characterize interval estimates.
Statistical summaries obtained from semiparametric regression analysis of log blood flow from the liver perfusion study using penalized splines with truncated polynomial bases of specified degree. Boldfaced values mark the spline degrees that achieved minimum A/Cc.
Figure 3 provides the resultant estimated BLUP for the derivatives and the corresponding CB0.95, with an equivalence region (red) defined by ±0.5. The residual error standard deviation was estimated to be approximately 0.55 in tumor and 0.42 in normal tissue. Therefore, λ = 0.5 was chosen so that stability was measured in relation to the evidence that the mean curve varied less than approximately 1 standard deviation of random error. Using the approach described in the previous section, we concluded that CTp provides sufficiently stable characterization of BF when acquired for at least 220 seconds. Stabilization was evident sooner in normal liver, where a duration of 131 seconds yielded stable acquisition. This is not surprising, since, as noted in Figure 1, tissue perfusion tends to be more heterogeneous in regions undergoing tumor angiogenesis.

Estimated best linear unbiased predictors of the derivatives as functions of acquisition time in tumor (top) and normal liver (bottom) using penalized spline regression with truncated polynomial bases of specified degree. Point estimates are represented by solid black lines. Shaded regions characterize 95% simultaneous confidence bands over the entire acquisition duration. Red lines are used to depict an equivalence region defined by the neighborhood contained within ±0.5.
Discussion
In this paper, we described a statistically justified model-based approach for inferring stability for estimation of stochastic curves that eventually attain steady states. The effort was motivated in the oncologic imaging setting in the context of evaluating acquisition protocols for functional modalities that depend on a sequence of scans to acquire biomarkers that characterize biological processes associated with tumor angiogenesis. The approach was used to select acquisition durations that yield stable characterizations of a perfusion biomarker when acquired in metastatic sites in liver as well as normal liver tissue. It is important that the oncologic community recognize and use appropriate methods of inference when evaluating acquisition protocols for functional imaging modalities so that these promising technologies realize their full potential as tools for constructing biomarkers for guiding cancer detection, prognostication, and treatment selection.
Author Contributions
Conceived and designed the experiments: BH, CN. Analyzed the data: BH. Wrote the first draft of the manuscript: BH. Contributed to the writing of the manuscript: CN. Agree with manuscript results and conclusions: BH, CN. Jointly developed the structure and arguments for the paper: BH, CN. Made critical revisions and approved final version: BH, CN. Both authors reviewed and approved of the final manuscript.
