Abstract
Purpose:
To improve the contouring of clinical target volume for the radiotherapy of neck Hodgkin/non-Hodgkin lymphoma by localizing the prechemotherapy gross target volume onto the simulation computed tomography using [18F]-fluorodeoxyglucose positron emission tomography/computed tomography.
Material and Methods:
The gross target volume delineated on prechemotherapy [18F]-fluorodeoxyglucose positron emission tomography/computed tomography images was warped onto simulation computed tomography using deformable image registration. Fifteen patients with neck Hodgkin/non-Hodgkin lymphoma were analyzed. Quality of image registration was measured by computing the Dice similarity coefficient on warped organs at risk. Five radiation oncologists visually scored the localization of automatic gross target volume, ranking it from 1 (wrong) to 5 (excellent). Deformable registration was compared to rigid registration by computing the overlap index between the automatic gross target volume and the planned clinical target volume and quantifying the V95 coverage.
Results:
The Dice similarity coefficient was 0.80 ± 0.07 (median ± quartiles). The physicians’ survey had a median score equal to 4 (good). By comparing the rigid versus deformable registration, the overlap index increased from a factor of about 4 and the V95 (percentage of volume receiving the 95% of the prescribed dose) went from 0.84 ± 0.38 to 0.99 ± 0.10 (median ± quartiles).
Conclusion:
This study demonstrates the impact of using deformable registration between prechemotherapy [18F]-fluorodeoxyglucose positron emission tomography/computed tomography and simulation computed tomography, in order to automatically localize the gross target volume for radiotherapy treatment of patients with Hodgkin/non-Hodgkin lymphoma.
Introduction
The common clinical practice to treat limited stage (Ann Arbor I and II stage) lymphoma disease consists of a variable number of chemotherapy cycles followed by radiation therapy (RT). 1 –5 For the early-stage Hodgkin lymphoma (HL) and Non-HL (NHL), the combined modality treatment (CMT) has shown higher patient survival and better local tumor control. 6,7
Staging is performed by physical examination, blood test, and tomographic imaging (usually computed tomography [CT] preferably coupled with [18F]-fluorodeoxyglucose positron emission tomography [[18F]FDG-PET]), which provides anatomical and metabolic information about the lymph nodes involved. After chemotherapy is selected and delivered, lymph nodes usually shrink and postchemotherapy imaging might show just residual active regions or might be totally negative. However, current guidelines for CMT indicate that the affected primitive regions need to be treated with RT. 8 So the correct identification of the volume to be irradiated is fundamental.
Both chemotherapy and radiotherapy can lead to severe late side effects, including second malignancy, heart disease, thyroid dysfunction, sterility, gastritis, enteritis, pneumonitis, and so on. 9,10 Therefore, over the last few years, the trend is to reduce the aggressiveness of the treatments. More specifically, the tendency in RT is to reduce the dose and the volume of tissue to be treated, 11 –14 thus being very precise and specific. Ranging from the largest to the smallest size of the irradiated region, RT treatments can be classified as total lymphoid/nodal irradiation (TLI/TNI), extended field RT (EFRT), extensive mantle field, involved field RT (IFRT), and involved node RT (INRT). Nowadays, TNI and the EFRT are mostly abandoned, and common treatments are IFRT, if the RT field encompasses all of the clinically involved regions and INRT, which envisages delivering the dose on the initially involved nodes, rather than on the whole nodal chain area. 12,15,16 For both IFRT and INRT, the prechemotherapy gross tumor volume (GTV) provides the reference for determining the clinical target volume (CTV). The smaller is the target to be irradiated, the more important is to delineate the CTV precisely. 14 Recent studies proved high observer variability in tumor delineation for HL, 17 –19 thus highlighting the need of a robust and operator-independent methodology for target definition. A considerable improvement in treatment volume definition on simulation CT has been obtained by integrating the information provided by the [18F]FDG-PET/CT, acquired before chemotherapy for diagnosis and staging purposes. 4,5,13,14,16,20 –23
In order to combine the [18F]FDG-PET/CT outcome with the CT-based CTV delineation, the common routine is visual assessment: The physician compares the two imaging modalities, displayed on 2 different screens, and grounds the matching on anatomical landmarks. However, this approach is time consuming and operator dependent. Some authors proposed methods based on image rigid coregistration and overlay (image fusion) and highlighted more favorable results if the [18F]FDG-PET/CT scan is acquired on patients in treatment position. 13,16,24
However, in most cases, patients are scanned in different setup (position of the arms and/or neck) and/or using different scanners in different hospitals. In addition, weight loss and lymph node shrinkage occurring between the 2 imaging stages represent particularly challenging issues for CT-[18F]FDG-PET/CT matching based on rigid registration.
In this work, we propose a new methodology to overcome these limitations by using nonrigid image registration between diagnostic [18F]FDG-PET/CT and simulation CT images. This approach can be useful in most clinical situations, regardless of the specific imaging setup/acquisition protocol and the chemotherapy outcome. To our knowledge, this is the first study demonstrating the use of deformable image registration as a technique to map the initial involved lymph node regions onto the radiotherapy images.
We tested our method on a cohort of patients treated for neck HL or NHL disease. Data analysis also included the comparison between rigid and nonrigid registration. The ultimate aim is to provide clinicians with robust guidance during the contouring phase, for speeding up the planning phase and improving the treatment efficacy by identifying accurately the primary lesion on the simulation CT scans.
Materials and Methods
Clinical Protocol and Patients
We selected retrospectively 15 patients treated at the European Institute of Oncology in Milan (Italy) for neck lymphoma. In particular, 5 of them were diagnosed with HL and 10 with NHL. In all, 13 patients were treated with CMT and 2 patients refused chemotherapy and underwent RT only. Details about each patient, regarding diagnosis, staging, imaging setup, and radiotherapy treatment are included in Table 1.
Patients’ CT Information.a
Abbreviations: CT, computed tomography; [18F]FDG-PET, [18F]-fluorodeoxyglucose positron emission tomography; HL, Hodgkin lymphoma; NHL, non-Hodgkin lymphoma; PTV, planning target volume; IA, stage I, subclassification A; IEA, stage IE, subclassification A; IIIs, stage III included spleen.
aCTDIAGN and CTSIM are, respectively, the first diagnostic [18F]FDG-PET/CT and the radiotherapy simulation CT. Patients received radiotherapy only.
The institutional clinical protocol for CMT of lymphoma is as follows:
Prechemotherapy [18F]FDG-PET/CT (Discovery, GE Medical Systems, Milwaukee, Wisconsin) is acquired for diagnosis and staging purposes. Image readout is performed on a Xeleris Workstation (GE Medical Systems, Milwaukee Wisconsin), which allows visualizing [18F]FDG-PET, CT and fused [18F]FDG-PET/CT sections in transverse, coronal, and sagittal planes. [18F]FDG-PET/CT images are interpreted by an experienced nuclear medicine physician (informed about the clinical and the radiological findings of the case), in collaboration with an experienced radiation oncologist. The presence of pathological FDG uptake to define the GTV is indicated when tracer uptake increases with respect to surrounding tissues and normal structures. The nuclear medicine physician always checks the anatomical correspondence of the FDG target on CT images, since patient’s random or physiological movements can cause image mismatch. Depending on diagnosis and stage, 2 to 4 cycles of chemotherapy are delivered to the patient. Postchemotherapy [18F]FDG-PET/CT is acquired to assess the response to treatment. A simulation CT (HiSpeed CT/I, GE Medical Systems) is acquired in a standardized position for the planning of the RT treatment, after the manufacturing of a thermoplastic mask for patient immobilization with both arms along the body and hyperextended neck position. The radiation oncologist draws the GTV on the simulation CT by visually examining the first [18F]FDG-PET/CT; in case of partial response after chemotherapy (presence of pathological FDG uptake on the postchemotherapy [18F]FDG-PET/CT the residual mass is also contoured to receive a boost dose. Starting from GTV, the CTV is defined according to IFRT guidelines to include tissues that could host cancerous cells, although they do not show morphological or functional variations. Finally, a geometric isotropic margin of 1 cm is added to compensate setup errors and organ motion, for planning target volume definition. Usually, 3-dimensional (3D) conformal radiotherapy is applied for patient treatment; if the patient is a recurrence case, intensity-modulated RT is preferred.
Methodology
Figure 1 depicts the workflow of the methodology proposed in this study. The CT volume obtained by the first diagnostic [18F]FDG-PET/CT acquisition and the simulation CT were used as input data. We will refer to them as CTDIAGN and CTSIM, respectively. An open source registration software (Plastimatch; www.plastimatch.org) 25,26 was employed to register the CTDIAGN onto CTSIM. Before image registration, image histograms were stretched in order to enhance the soft tissues contrast. Histogram modification was performed using a piecewise linear function remapping intensity landmarks from (−1000, −800, −100, 0, 50, 100, 1000, 3071) to (−800, −800, −400, 0, 600, 800, 1000, 1000). Since the stretched image highlights soft tissues at the expense of the dense ones, the processed CT was added to the original CT, in order to increase the contrast of both soft tissues and bones. This process was applied to each couple CTDIAGN-CTSIM. A multistage iterative B-spline-based registration procedure, minimizing the sum of squared voxel differences, was run for image registration. The registration process consisted of a rigid alignment followed by 4 deformable stages, going from coarse to fine registration by means of a progressive reduction in B-spline grid tile. The Plastimatch configuration file is provided in Appendix A. The registration output was a vector field describing the voxel displacements from CTDIAGN to CTSIM.

Flowchart of the proposed methodology. Dotted paths represent activities carried out for testing purposes.
For validation purposes, we asked the radiation oncologist to contour on CTDIAGN organs at risk (OARs) typically included in the radiotherapy plan, such as parotid glands, spinal cord, trachea, larynx, and mandible (OARDIAGN). The CTV and OARs were already available on the CTSIM (OARSIM). The OARDIAGN and GTV (identified as stated in the clinical protocol, ‘Clinical Protocol and Patients’ section) were warped according to the output vector field, obtaining OARDIAGN_WARPED and GTVDIAGN_WARPED. In order to compare the impact of rigid versus deformable registration, the GTV volume after rigid alignment was also computed (GTVDIAGN_RIGID). The algorithm was executed on a GNU/Linux workstation equipped with an Intel Xeon CPU 2.5 GHz and 8 GB of random access memory, requiring approximately a computation time of up to 8 minutes.
Data Analysis
Image registration
Image registration outcome was assessed by quantitatively comparing each OARDIAGN_WARPED with the corresponding OARSIM (representing the ground truth). The Dice similarity coefficient (DSC) 27 was calculated for each couple OARDIAGN_WARPED/OARSIM according to the following formula:
DCS value was interpreted as a function of the structure volume, as described in Isambert
GTVDIAGN_WARPED Localization
Five radiation oncologists were asked to visually score the quality of GTVDIAGN_WARPED location on the CTSIM. By using 2 different screens, each rater was presented simultaneously with the original GTV onto CTDIAGN and with the GTVDIAGN_WARPED onto CTSIM. Their task was to evaluate whether or not the GTVDIAGN_WARPED was correctly localized on the CTSIM based on its relative spatial localization with respect to the neighbor soft tissues and landmark anatomical structures (eg, bones and organs). Scores ranged from 1 to 5, being 1 = wrong, 2 = poor, 3 = fair, 4 = good, and 5 = excellent. The median score for each patient was computed as final grade for GTVDIAGN_WARPED.
Finally, deformable and rigid registration outcomes were compared from both geometric and dosimetric point of view by computing the overlap index (OI), that is, the percentage of GTVDIAGN_WARPED (or GTVDIAGN_RIGID) included in the planned CTV (Equation 2)
with x = DIAGN_WARPED or x = DIAGN_RIGID, and the volume of GTVDIAGN_WARPED and GTVDIAGN_RIGID receiving the 95% of nominal dose (V95). Dosimetric analysis was performed by SlicerRT (http://slicerrt.github.io), 29 a plugin of 3D Slicer (www.slicer.org). 30 The Wilcoxon test was chosen to verify whether statistical difference existed between paired samples coming from a data distribution not strictly Gaussian.
Results
The time interval occurring between diagnostic and treatment planning scans ranged from 8 months to 1 day, with a median value of 4 months.
Figure 2 shows 2 representative cases of a good (patient 8) and a bad (patient 7) image registration result.

An example of good (A) and bad (B) image registration. In (A) and (B) the CTDIAGN (red) is overlaid to CTSIM (green) to show the initial situation while, later, the CTSIM, the CTDIAGN_WARPED and the OARSIM/OARDIAGN_WARPED are depicted in checkerboard mode with a different histogram window to improve the visualization of the result. In (A), anatomical landmarks between CTSIM and CTDIAGN_WARPED, at the edge of each checker box, match; the warped OARs are consistently overlaid on the original ones. In (B), the CTDIAGN_WARPED, depicted in the upper left and in the lower right checker boxes, present unnatural swirls due to excessive distortion of the b-spline grid; as a consequence, the warped OARs are broken into small islands, such as mandible (in light blue), right and left parotid (in black and dark blue respectively).
Figure 3 shows the median ± quartiles of DSC as a function of structure’s median volume. Best results were found for mandible (0.86 ± 0.02) and trachea (0.89 ± 0.02), and worst outcomes were obtained for right (0.69 ± 0.05) and left (0.75 ± 0.04) parotid and for larynx (0.76 ± 0.05). Spinal cord scored 0.78 ± 0.03. The DSC plotted versus volume showed an increasing trend, fitting a logarithmic curve at 95% prediction bounds (

Organs at risk (OARs): logarithmic correlation between Dice similarity coefficient (DSC) and volume.
In Table 2, the results of the performed survey are reported. Besides a general good performance of the algorithm reported by physicians (median ± quartiles: 4 ± 0.5), 2 of 15 cases were indicated as just sufficient (see patients 7 and 11), with median score equal to 3. In 2 circumstances, 1 rater largely disagreed with the others by underrating the outcome of the algorithm (see patients 9 and 13).
Physicians’ Scores of GTV Localization on CTSIM.
Abbreviation: GTV, gross tumor volume.
The comparison between rigid and deformable registration is reported in Figure 4. Median ± quartiles for OI were 0.21 ± 0.16 (rigid) and 0.74 ± 0.13 (deformable), and for V95, were 0.84 ± 0.38 (rigid) and 0.99 ± 0.10 (deformable). Wilcoxon matched-pair test, performed at 5% confidence level, revealed statistical difference both for OIDIAGN_RIGID versus OIDIAGN_WARPED (

Patient 8’s CTSIM, dose and contours. The left panel shows the result of the rigid alignment (clinical target volume [CTV] green, GTVDIAGN_RIGID yellow); in the right panel is depicted the result of the deformable registration (CTV green, GTVDIAGN_WARPED blue). At the bottom is the comparison between rigid and deformable registration in terms of overlap index (OI) and V95.
In Figure 4, a comparison between rigid and deformable registration in terms of GTV localization, dosimetric coverage and OI, and V95 is depicted.
Discussion
In this work, we described an image-based methodology to improve the delineation of the CTV for RT treatment of neck HL and NHL. Our approach was based on mapping the GTV, extracted from the diagnostic [18F]FDG-PET/CT, onto the simulation CT volume, by means of nonrigid image registration.
Other investigators proposed to use rigid image registration to integrate the [18F]FDG-PET/CT information with the simulation CT data set. 13,16,24 However, in those studies they recommend to acquire the diagnosis and the treatment planning image data sets with similar patient positioning and, possibly, with the same devices for patient setup and immobilization. Conversely, we enrolled patients who were scanned in different position (arms and neck particularly), in order to test the method in the most challenging circumstances.
Image registration was performed only between CT volumes that, by offering better anatomical detail compared to [18F]FDG-PET imaging, guarantee a more reliable result. The [18F]FDG-PET, coregistered by hardware to CTDIAGN, was used to contour the GTV, according to the clinical protocol. The image registration algorithm was executed by setting the same parameters (metric, number of stages, B-spline grid, number of iterations, optimizer, etc) for all patients. This choice was motivated by the fact that in a clinical environment, the system must be stable and robust. The parameters were tested and chosen on the basis of previous studies. 31,32 The choice to gradually reduce the B-spline grid tile was made in order to level first the macroscopic differences and then the finer ones. This strategy allows at the same time to avoid local minima during the searching of the optimal transformation. For the same reasons, and to speed up the process, the rigid transformation was computed by subsampling the images.
The quality of image registration was assessed by computing the DSC between reference OARSIM and warped OARDIAGN_WARPED. The median ± quartiles values of DSC ranged from 0.69 ± 0.05 to 0.89 ± 0.02. Mandible, spinal cord, and trachea exhibited the highest values, while parotids the lowest. This is not surprising since parotid glands are very difficult to contour and have large intra and interrater variability.
33
Moreover, we found a logarithmic correlation between volume size and DSC, in agreement with Isambert
When evaluating the GTVDIAGN_WARPED localization, the lowest median score was 3 (“fair”), and it occurred in 2 of 15 cases. In both cases, deformable registration did not perform well in the neck region as highlighted by the median DSC for each patient (0.78 and 0.63, respectively), being the lowest among the entire patient cohort. This caused an irregular warping of the GTVDIAGN_WARPED that was not judged fully acceptable by most of the raters. Compared to rigid alignment, deformable image registration is more affected by metal artifact effects. This is due to the fact that the displacement field is computed locally, unlike the rigid transformation where it is computed on the whole volume. However, we found that mild misregistration due to metal artifacts (5 cases in our data set) did not affect the final warping and localization of the GTV. When metal artifacts are a concern, algorithms for their reduction should be used in order to mitigate the problem.
As far as the comparison between rigid and deformable registration is concerned, both OI and V95 showed statistical difference. Although on a limited data set, the overall results showed that deformable registration performs better than rigid registration. This is due to the fact that we enrolled patients acquired in very different arms/neck position during the diagnosis and simulation CT. This large difference cannot be recovered by using only a rigid alignment but requires a deformable image registration approach. The gap between rigid and deformable registration was particularly evident for the OI. The V95 contrast was mitigated by the fact that IFRT was planned for these patients. We would expect higher differences, in favor of deformable registration, in case of INRT.
Conclusion
In conclusion, the proposed methodology turned out to improve image-based guidance for the radiation oncologists during the CTV contouring of neck lymph nodes for patients with HL/NHL. In clinical practice, the described method is put forward as a valuable tool for reducing the CTV delineation variability and improve the primary disease localization.
Footnotes
Appendix A
The complete list of Plastimatch registration parameters stages:
[STAGE] xform=align_center [STAGE] xform=rigid optim=versor impl=itk metric=mse max_its=100 convergence_tol=5 grad_tol=1.5 res=4 4 2 [STAGE] xform=bspline optim=lbfgsb impl=plastimatch metric=mse max_its=150 res=1 1 1 grid_spac=40 40 50 [STAGE] xform=bspline optim=lbfgsb impl=plastimatch metric=mse max_its=100 res=1 1 1 grid_spac=20 20 30 [STAGE] xform=bspline optim=lbfgsb impl=plastimatch metric=mse max_its=50 res=1 1 1 grid_spac=10 10 10 [STAGE] xform=bspline optim=lbfgsb impl=plastimatch metric=mse max_its=50 res=1 1 1 grid_spac=5 5 5
Acknowledgments
The authors thank the Doctors Mariaquila Santoro (Azienda ospedaliera “Pugliese-Ciaccio”, Catanzaro, Italy) and Giulia Marvaso (Medical Oncology Unit; Magna Graecia University of Catanzaro and T. Campanella Cancer Center, Catanzaro, Italy) for their contribution to visual scoring the automatic contours.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
