Abstract
Routine follow-up visits and radiographic imaging are required for outcome evaluation and tumor recurrence monitoring. Yet more personalized surveillance is required in order to sufficiently address the nature of heterogeneity in nonsmall cell lung cancer and possible recurrences upon completion of treatment. Radiomics, an emerging noninvasive technology using medical imaging analysis and data mining methodology, has been adopted to the area of cancer diagnostics in recent years. Its potential application in response assessment for cancer treatment has also drawn considerable attention. Radiomics seeks to extract a large amount of valuable information from patients’ medical images (both pretreatment and follow-up images) and quantitatively correlate image features with diagnostic and therapeutic outcomes. Radiomics relies on computers to identify and analyze vast amounts of quantitative image features that were previously overlooked, unmanageable, or failed to be identified (and recorded) by human eyes. The research area has been focusing on the predictive accuracy of pretreatment features for outcome and response and the early discovery of signs of tumor response, recurrence, distant metastasis, radiation-induced lung injury, death, and other outcomes, respectively. This review summarized the application of radiomics in response assessments in radiotherapy and chemotherapy for non-small cell lung cancer, including image acquisition/reconstruction, region of interest definition/segmentation, feature extraction, and feature selection and classification. The literature search for references of this article includes PubMed peer-reviewed publications over the last 10 years on the topics of radiomics, textural features, radiotherapy, chemotherapy, lung cancer, and response assessment. Summary tables of radiomics in response assessment and treatment outcome prediction in radiation oncology have been developed based on the comprehensive review of the literature.
Introduction
Lung cancer is one of the most common cancer in the world, accounting for the first place in men (16.7%) and third in women (8.7%). 1 When cancer is suspected, further confirmation and follow-up are required in order to assist clinicians with accurate diagnosis and develop properly indicated treatment regimens. Tumor biopsy and physiology analysis are the most accurate way for diagnosis. Yet biopsy has limitations, that is, the location of tumor biopsied may strongly affect physiology results. 2 Medical imaging systems are noninvasive approaches for tumor assessment and they provide substantial benefits, that is, tumor’s biological and functional information and its surrounding microenvironment, 3 beyond simple tumor visualization with newly developed contrast and tracer agents. Diagnostic interpretation of medical images has become more sophisticated and focused on disease types. For instance, most tumors exhibit heterogeneity, both spatially and temporally, which can be visualized and tracked with new radiomics applications on chest radiographs and computed tomography (CT) images. Lambin et al 4 was the first to introduce the concept of “radiomics” in 2012 and presented its key technique and application prospects. Kumar et al 5 further expanded the definition and analyzed the challenges of each step of its workflow, especially for lung cancer in the same year. Radiomics is essentially derived from the concept of computer-aided diagnosis, which uses computer assistance in processing a large number of imaging features extracted from various imaging modalities, including CT, positron emission tomography (PET), magnetic resonance imaging, and/or other medical imaging modalities. 4,5 This strategy transforms the regions of interest (ROIs) into high-resolution, minable data by introducing the big data technology and using automatic data feature algorithms. 5 Adopted from radiology, this technique has been recently explored in cancer treatments, including tumor targeted drug therapy, 6 preoperative assessment of tumor surgery, 7,8 and response and outcome assessments after radiotherapy and systemic therapy. 9 -13
The applications of radiomics for non-small cell lung cancer (NSCLC) include (1) treatment efficacy and response evaluation, (2) to assist noninvasive early diagnosis, and (3) prediction of treatment outcomes. It becomes possible to detect tumor occurrence at an early stage, as well as to predict treatment efficacy and detect recurrence or metastasis after a given treatment regime of radiation and/or systemic therapy. 9,14 With the established database, care providers can now more practically predict treatment outcomes and possible radiation-induced injuries and seek out personalized and precise treatment options for patients. 15
Previous review articles have provided summaries of radiomics analysis approaches, mathematical algorithms, and clinical applications in diagnosis, prognosis, or outcome predictions. 5,16 -18 In 2014, Mattonen et al 10 reviewed novel techniques focusing on quantitative imaging feature analysis for response assessment after radiotherapy for lung cancer. In 2016, Parekh and Jacobs 17 summarized approaches and mathematical algorithms applied to lung, breast, liver, and so on. The development of radiomics and its application have also been discussed in various aspects. 19 -21 More recently, several papers reviewed related texture analysis applied to PET/CT images for tumor response assessment 16,22,23 in the past 2 years. Scrivener et al 2 and Lee et al 18 focused on the application to lung cancer, while Sollini et al 24 proposed harmonizing PET radiomics methods for their applications in NSCLC.
Radiomics currently plays an important role in providing a fundamental methodology for future personalized treatment and follow-up in the era of precision medicine. Our focus is to elaborate the specific applications of radiomics in the context of radiotherapy with or without systemic regimen for NSCLC and its current research developments using CT and PET images. A comprehensive literature search was conducted using the PubMed database to include papers published from the year of 2007 to present, with the keywords of radiomics, textural features, radiotherapy, systemic therapy, chemotherapy, lung cancer, and response assessment. Summary tables in response assessment and treatment outcome prediction in radiation oncology have been developed based on the comprehensive review of the literature.
The Workflow of Radiomics
The workflow of radiomics, as shown in Figure 1, includes (1) imaging acquisition/reconstruction, (2) ROI definition/segmentation, (3) image feature extraction, and (4) image feature selection and classification. Research interests have been widely focused on metrologies and challenges, as well as clinical applications of these aspects. Herein we describe in details various stages in the radiomics workflow.

The workflow of radiomics.
Imaging Acquisition and Reconstruction
Acquiring high-quality and standardized images is a prerequisite for the accuracy and consistency of radiomics data. A large amount of the standardized image data can help establish a response assessment model to predict the probability of recurrence and metastasis after treatment. 6,25 Within those 2 commonly available types of image acquisition modalities for treating NSCLC, CT and PET, choosing a consistent image modality for the diagnosis and follow-up scans is essential to avoid divergence and discordance in the extracted feature data. In addition, even for the same imaging modality, the variations in acquisition parameters (kV, mAs, slice thickness, breathing control method, configurations, field strength, and contrast media) and reconstruction parameters (reconstruction kernels or filters) are likely to affect image values for scanning the exact same subject, which might complicate the process of ROI-based image segmentation and feature extraction. 5,26 -30 Thus, it is essential to maintain homogeneity in image scan protocols for a radiomics study.
Region of Interest Segmentations
Image segmentation for ROI creations is a crucial step, which directly affects the quality of subsequent feature extraction, thus affects the correctness and accuracy of research results. 20,31 Segmentation includes manual, semiautomatic, and automatic segmentations. Manual segmentation means that an experienced physician manually delineates the ROI on images, which is currently the most widely used method. 28,32 Manual segmentation has minimal needs in specialized algorithms but demands user specialty and experience. It is also time-consuming and can have significant intra- and interobserver variability, which may cause obstacles in big data analysis. 33 The automatic segmentation does not require manual intervention and functions through computer-aided automation with preset parameters. However, it relies on the accuracy of algorithms and their ability to differentiate ROIs from surrounding tissues. Semiautomatic segmentation takes advantages of both manual intervention and software automation, which makes it a preferable method for radiomics data analysis. 5,14,19,34 Semiautomatic segmentation that uses computer segmentation method cannot be reliably used alone. Further manual intervention is needed to ensure the accuracy of segmentation. 35
Reproducibility of quantitative PET and CT image features in NSCLC concerning variations caused by segmentation methods and patient factors had been studied and proved stable. 36 -41 Several studies used intraclass correlation coefficient (ICC) 42 to compare the repeatability in manual, automatic, and semiautomatic segmentation methods. 36,38,39 Among them, Parmar et al 39 concluded that semiautomatic features provide higher reproducibility than manual segmentations (ICC: 0.85 ± 0.15 vs 0.77 ± 0.17). Semiautomatic multiseed point segmentation can generate stable segmented volumes with similarity index above 0.93, compared to 0.73 in manual segmentation. 41 Overall, ROI identification and segmentation are critical factors in the process of imaging analysis and feature extraction. 43,44
Image Feature Extraction
Exploring possible correlations between image feature information with tumors’ phenotypes and prognostics is currently the main interest in radiomics research. Publications have shown potential diagnostic and prognostic powers from radiomics signatures for lung patients. 45 -50 Present literatures have reported clinical factors, conventional features, and texture features for radiomics or delta-radiomics studies. 19,45,51 -56 Different from clinical factors and conventional features that are acquired from hospital records or radiology readings, texture features are calculated mathematically by software platforms, such as MATLAB or Python. Texture feature is the spatial distribution of gray-level intensity in images and analyzed by 2 main mathematical techniques in NSCLC: statistical methods and transform-based methods, of which the first one is most used for lung cancer. 6,16,32,45,51,57 More specific description of radiomics features are summarized in Table 1. Currently, one of the predictive models is established based on size, concavity, contour, and spiculation. 48 In fact, the size-based features can be good predictors of cancerous nodules by itself and has significant correlation with overall survival (OS). 46 Additionally, the diversity of image features, such as tumor’s growth rate and volume change, and multivariate analysis can improve prediction accuracy. 46,51 It is even possible to determine benign or malignant lymph nodes status by using CT texture features. 49,50 More significant features are to be discovered and validated.
Features Extracted From PET and/or CT Images.
Abbreviations: AUC-CSH, area under the curve of the cumulative SUV-volume histogram; COV, coefficient of variation; CT, computed tomography; GLCM, gray-level co-occurrence matrix; GLRLM, gray-level run-length matrix; GLSZM, gray-level size zone matrix; HU, Hounsfield unit; IVH, intensity–volume histogram; Law’s, Law’s texture measures; LoG, Laplacian transform of Gaussian filter; MTV, metabolic tumor volume; NGTDM, neighborhood gray-tone difference matrix; PET, positron emission tomography; SD, standard deviation; SUV, standardized uptake value; SUVmax, maximum SUV; SUVmean, average SUV; SUVpeak, peak SUV; TLG, total lesion glycolysiss.
Feature Selection and Classification
Feature selection and classification are also crucial after extracting a large number of features, most of which may be noise or intercorrelated features. The procedure of selecting and classifying features is to reduce the dimension. 28,58 There are a few publications on comparing the abilities of different feature selection methods and machine learning classifying methods to predict outcomes for lung. 58,59
Feature selection preserves a subset of useful and unique features, reducing the computational costs and increasing the accuracy of predation, which can be applied to supervised and unsupervised learning. 60,61 Previous research had demonstrated that Wilcoxon selection method had highest performance in supervised learning, 62 while principal component analysis 58 had higher prediction performance than Wilcoxon (area under the receiver operating characteristic curve [AUC] = 0.70. vs 0.67) in unsupervised learning. 58 Feature selection algorithms, that is, filtering, wrapper, and embedded methods, have their merits and demerits. 60,63 The filtering methods are independent of the chosen predictors, with efficient numeracy and statistical scalability, which can reduce dimensionality and overfitting compared to the wrapper and embedded methods. 63 Comparison of 14 filter feature selection methods indicated that Wilcoxon selection has the highest performance in predicting OS (AUC = 0.65 ± 0.02) with maximum stability (0.84 ± 0.05). 62
Among the two feature classifications, that is, supervised and unsupervised classifiers, the former requires user to provide patients’ radiomics features and outcomes as training data (ie, support vector machines, random forest, decision tree, neural networks, boosting, discriminant analysis, k-nearest neighbors, etc), 64 -68 while the latter doesn’t require outcome data (ie, consensus clustering, k-means clustering, hierarchical clustering, etc). 17 Parmar et al 62 and Zhang et al 58 proved that random forest classifier has the highest prediction performance in OS (AUC = 0.66 and 0.71).
Moreover, a combination of different feature selection methods and classifiers can achieve different prediction results. It was found that a combination of Wilcoxon selection model with random forest classifier has the best performance for predicting OS, while near-zero variance selection model combined with random forest classifier is appropriate for predicting recurrence (AUC = 0.76), and zero variance selection model with naive Bayes classifier is the best to predict death (AUC = 0.77), and mixture discriminant analysis classifier alone (AUC = 0.73) was best for predicting recurrence-free survival . 58 Identification of feature selection methods, classification methods, and analyzing tools is a crucial step for improving accuracy, stability, and performance of features for assessing response and predicting clinical outcomes. Radiomics is an emerging area, so does the development of feature selection and classification algorithms. The optimal method is still being developed based on clinical needs.
Radiomics in Response and Outcome Evaluation
Literature on applying radiomics to assess response and predict treatment outcome for lung is based on pretreatment images (Table 2) and follow-up images (Table 3). Pretreatment imaging radiomics is used to discover the association of quantitative features extracted from images taken prior to treatment with response and outcomes monitored upon treatment completion. The focus of these studies is on the ability of predicting prognosis from the pretreatment features. 8,52,74,75 Delta-radiomics is a method that compares the changes within those features extracted from pretreatment images and follow-up images during and/or after the treatment, in order to identify signs of recurrence, metastases, or other morality. 15,56,77,81,83 Positron emission tomography and CT images have been widely used for lung patients and thus are readily available for radiomic analysis. Positron emission tomography images provide molecular metabolic information and thus detect disease early, while CT provides anatomical characteristics. The end points mostly studied in PET images are Response Evaluation Criteria in Solid Tumors (RECIST)-based responses 12,13,79 and outcomes, such as recurrence, 52,53,69,78,81 distant metastases (DM), 57,69 and survival. 54,55,71,72 The end points in CT images focus on pathological response, mutation status, and distinguishing radiation-induced lung injury (RILI). The image features with high correlation to those end points are listed in Tables 2 and 3. Most of these study trials are retrospective, thus patients’ images along with clinical outcome data are achieved from records of medical institutions or hospitals.
Pretreatment Imaging Radiomics in Response Assessment and Treatment Outcome Prediction.
Abbreviations: AIP, average intensity projection; AUC, area under the receiver operating characteristic curve; AUC-CSH, area under the curve of the cumulative SUV-volume histogram; c-index, concordance index; 95% CI, 95% confidence interval; COV, coefficient of variation; CRT, chemoradiotherapy; CT, computed tomography; DFS, disease-free survival; DM, distant metastases; DMFS, distant metastasis–free survival; DSS, disease-specific survival; ECOG, Eastern Cooperative Oncology Group; FB, free breathing; FDR, false discovery rate; GRD, gross residual disease; HR, hazard ratio; IVH, intensity–volume histogram; LoG, Laplacian transform of Gaussisn filter; LR, local recurrence; LRC, local-regional control; LRFS, locoregional recurrence-free survival; LRR, loco-regional recurrence; OS, overall survival; pCR, pathologic complete remission; PET, positron emission tomography; PFS, progression-free survival; R 2, coefficient of determination; RECIST, Response Evaluation Criteria in Solid Tumors; RFS, recurrence-free survival; rs, Spearman correlation; RT, radiotherapy; SBRT, stereotactic body radiotherapy; SUV, standardized uptake value; SUVmax, maximum standardized uptake value; TKI, tyrosine kinase inhibitors.
Delta-Radiomics in Response Assessment and Treatment Outcome Prediction.
Abbreviations: AUC, areas under the receiver operating characteristic curve; c-index, concordance index; 95% CI, 95% confidence interval; COV, coefficient of variation; CRT, chemoradiotherapy; CT, computed tomography; DM, distant metastases; DSS, disease-specific survival; EGFR, epidermal growth factor receptor; GLCM, gray-level cooccurrence matrix; GTV, gross tumor volume; HR, hazard ratio; HU, Hounsfield unit; Law’s, Law’s texture measures; LC, local control; LR, local recurrence; MTV, metabolic tumor volume; OS, overall survival; PET, positron emission tomography; PFS, progression-free survival; R 2, coefficient of determination; RECIST, Response Evaluation Criteria in Solid Tumors; RI, retention index; RILI, radiation-induced lung injury; RT, radiotherapy; SBRT, stereotactic body radiotherapy; SD, standard deviation; SHR, adjusted subhazard ratio; SUVmax, maximum standardized uptake value; SUVmean, average standardized uptake value; TKI, tyrosine kinase inhibitors; Δ, delta.
Tumor Response Assessments
Positron emission tomography images
Tumor response studied in PET images including stable disease and progress disease is assessed by the RECIST and divided into 3 types: complete response, partial response, and nonresponse. 12,13,79 The extracted features are pretreatment standardized uptake value (SUV) metrics, metabolic tumor volume (MTV), total lesion glycolysis, quantitative texture features, such as entropy, correlation, contrast, and uniformity, and “delta-radiomics.” Among them, texture features outperformed others in predicting tumor response to treatment. Cook et al 12,13 showed that texture features measured by coarseness, contrast, and busyness presented strong correlations with RECIST response to chemoradiotherapy (CRT; P = .004, .002, .027), while SUV parameters showed none. Texture features reflecting reduced heterogeneity measured by first-order standard deviation (SD), entropy, and uniformity (P < .01, = .001, = .001) also presented a stronger correlation with RECIST response to erlotinib than SUV metrics. Dong et al 79 found that even though pretreatment features, such as coefficient of variation, MTV, and contrast (AUC = 0.781, 0.686, 0.804) had a predictive capability for response to CRT, early changes in texture features can better predict response with higher specificity (80%-83.6%) and sensitivity (73.2%-92.1%).
Computed tomography images
Different from RECIST responses studied in PET images, tumor response studied in CT images mainly focuses on pathological response, and treatment responses such as tumor response to radiation and epidermal growth factor receptor (EGFR) mutation status reflected gefitinib response. 6 -8,86,87 In terms of predicting pathological response, pretreatment primary tumor sphericity, texture features, lymph node homogeneity, and changes in primary tumor volume, mass, histogram features, and texture features are potential predictors for patients with NSCLC after CRT, while primary tumor intensity variability and size zone variability are predictive for patients after tyrosine kinase inhibitor therapy. 7,86 It is also found that lymph node phenotype has a better performance in classifying pathologic complete response and gross residual disease than primary tumor. 7 A study tracked tumor response to radiation in daily CT images during radiotherapy course found that reductions in mean Hounsfield unit (HU) in gross tumor volume (GTV) had a strong correlation with accumulated GTV dose (R 2 > 0.99) and were significantly associated with survival rate. 87 In predicting EGFR mutation status and reflected gefitinib response, a pretreatment Law’s texture feature and changes in volume, maximum diameter, and a filter-based feature were proved significant.
The RECIST response has its limitation in diversified clinical applications. 88 Radiomic features such as texture features and volume changes have the potential to better predict tumor responses and thus may be considered as new tumor response phenotypes that provide diversified information in the future.
Survival Analysis
Studies focusing on survival analysis include OS, progression-free survival (PFS), locoregional recurrence-free survival (LRFS), distant metastasis–free survival (DMFS), disease-free survival (DFS), disease-specific survival (DSS), and so on.
Positron emission tomography images
For predicting survivals based on PET images, the best features ever found are: the high-order contrast for OS (P = .002), 12 AUC of the cumulative SUV-volume histogram (AUC-CSH) for PFS, LRFS, and DMFS (P < .001, = .002, = .003), 54 and mean SUV (SUVmean) greater than 3.45 for DSS (P = .007). 78 Kang et al 54 revealed that maximum SUV (SUVmax) and AUC-CSH reflecting tumor heterogeneity were significant prognostic factors for PFS, while AUC-CSH were for LRFS and DMFS. 54 Carvalho et al 55 showed that tumor’s relative volume containing >80% SUV significantly correlates with OS, and a larger tumor’s relative volume above a higher SUV can lead to better prognosis. Furthermore, the texture feature SumMean for OS by Ohri et al 71 and textural feature dissimilarity for DSS and DFS by Lovinfosse et al 70 are shown to be more powerful independent predictors compared to metabolic metrics. Aside from these features, conventional clinical factors, such as gender, age, histology, stage, and so on, have also been discussed by Lovinfosse et al 70 and Fried et al 72 The latter showed that the combination of quantitative features with conventional clinical features can improve OS risk stratification compared with conventional clinical features alone. 72 In addition, delta-radiomics features of fludeoxyglucose-PET are correlated with OS in patients with NSCLC. Carvalho et al 80 validated the predictive capacity in delta-radiomics features (volume, texture features, and intensity–volume histogram [IVH]) and demonstrated their correlations with OS.
Computed tomography images
Radiomics features in CT images can predict OS better than PET images. A radiomics model based on pretreatment CT and recalibrated cone beam CT images in van Timmeren et al study (concordance index = 0.69, P = 4.0 × 10−10) and a combination model of pretreatment features and delta radiomics features in Fave et al’s study (c-index = 0.675; P = 1.3 × 10−5) are significant predictors for OS. 74,85,89 Overall, in survival studies based on CT images, those radiomics features quantifying shapes, intensity, and texture, conventional features (such as Eastern Cooperative Oncology Group performance status, pleural retraction, and diameter), and delta-radiomics features are predictive, and when they are combined with clinical factors, the predictive capacity can be significantly improved. 32,45,74,85
Recurrence Prediction After Treatment
Positron emission tomography images
For recurrence prediction after treatment based on PET images, SUV metrics and texture features were both discussed in recent literature. Takeda et al, 81 Essler et al 78 and Zhang et al 53 assessed local recurrence (LR) in patients with NSCLC after stereotactic body radiotherapy (SBRT) by PET images and showed SUV metrics as strong LR predictors, that is, dual-time-point SUVmax or SUVmaxs in the study by Takeda et al, 81 SUVmean >3.44, SUVmax > 5.48, and their reduction in the study by Essler et al, 78 and the cutoff SUVmax of 5 with 100% sensitivity and 91% specificity in the study by Zhang et al 53 However, SUV metrics are reported less correlated when compared with other features, such as IVH, texture features, and so on. Pyka et al 69 assessed the relationship of texture features with LR in PET images for patients with NSCLC after radiotherapy and reported that several texture features such as entropy (AUC = 0.872) and correlation (AUC = 0.816) had higher AUC values than SUV metrics in receiver operating characteristic analysis.
Computed tomography images
Computed tomography–based radiomics features exhibited lack of prognostic power for locoregional recurrence (LRR) in many studies, 28,32 while 5 pretreatment statistic and texture features in the study by Huynh et al and end-treatment texture-strength feature in the study by Fave et al are prognostic for LR, and 3 pretreatment statistic and texture features in Huynh et al are prognostic for lobar recurrence. 28,32,81,85 Although CT-based radiomic features are limited in their ability to predict recurrence compared to other outcomes, they have the ability to distinguish tumor recurrences on follow-up CT images earlier than human eyes. Mattonen et al 83 compared the predictive ability between doctors and radiomic features. Although physicians’ average predictive accuracy rate was 83% for the average prediction period of 15.5 months, they misdiagnosed at an average error rate of 35%, a false-positive rate (FPR) of 1%, and a false-negative rate (FNR) of 99% when the follow-up period was shortened to 6 months after radiotherapy. In contrast, the studied five radiomics features can accurately determine the recurrence rate with AUC value of 0.85, classification error rate of 24%, FPR of 24%, FNR of 23% at 6 months.
These recurrence studies show that PET features own higher predictive abilities and accuracy in tumor recurrence than CT features. 81 Vaidya et al 52 also studied the capacity of combined PET and CT image features, such as SUV or HU, IVH, and texture features, for LRR prediction in patients with NSCLC after radiotherapy. It was found that a 2-parameter model of PET and CT features had higher prediction accuracy for LRR than PET or CT features alone. Thus, multimodality radiomics features are superior in predicting tumor recurrence.
Distinguishing RILI
Computed tomography images
Radiomic features from CT images also demonstrated high potentials in distinguishing tumor RILI from recurrence 15 and RILI severity levels. 84 Mattonen et al 15 showed that compared with RILI, tumor recurrence showed higher HU,and higher SD in ground-glass opacity (GGO) texture measure. When comparing conventional features (RECIST and volume) and quantitative changes in CT number (HU) with GGO textural feature, results showed the predictive time points to distinguish RILI and recurrence in advance is 9 months post-SBRT and 15 months post-SBRT, respectively. Another publication by the same group 82 described that GGO textural analysis has potential to predict recurrence within 5 months post-SBRT. Similar texture analysis methods were adopted by Moran et al, 84 where first-order and gray-level co-occurrence matrix texture features were extracted to distinguish RILI severity levels: none/mild, moderate, and severe. Gray-level co-occurrence matrix texture features (P = .012-.262; AUC = 0.643-0.750) have been reported to provide a better performance than first-order features (P = .100-.990; AUC = 0.543-0.661). Cunliffe et al 90 also combined radiomic features with radiation dose and demonstrated that radiomics can provide a quantitative, personalized measurement of radiation dose tolerance for individual patients, which can be used to determine the possibility of radiation-induced pneumonitis and monitor its progression.
Distant Metastases Evaluation
Positron emission tomography images
For DM evaluation in PET images radiomics studies, the optimal prognostic model including two quantitative image features of intratumoral heterogeneity and average of the voxel with SUVmax of the tumor region (SUVpeak; c-index = 0.71) were shown to be able to predict and categorize patients into low- and high-risk groups. Furthermore, when tumors’ histologic types were combined, the prognostic power of the model was significantly improved. 57 Interestingly, it was presented that neither PET metrics nor texture features were related to DM in the study by Pyka et al. 69
Computed tomography images
For DM evaluation in CT images radiomics studies, one pretreatment feature describing the range of voxel intensities (Wavelet LLH stats range; c-index = 0.67) and seven pretreatment average intensity projection CT radiomics features describing shape and heterogeneity (c-index = 0.638-0.676) in the study by Huynh et al perform well. 28,32 Also, Coroller et al found strong prognostic powers in radiomic features, among which Laplacian transform of Gaussian (LoG) filter features showed the best performance (c-index = 0.61, P < .001). 75 Besides, combining pretreatment texture features with clinical prognostic factors can significantly improve their predictive abilities. 74,85
It is worth noting that only three studies 57,72,80 provided feature validation. Therefore, even though results were shown promising, lack of feature validation may lead to false positive and might shadow doubts in the subsequent correlation studies.
Current Research Challenges and Prospects
Published studies show promising results in NSCLC treatments, yet there are still major challenges and limitations to resolve before they can be translated into reliable clinical application. A number of experimental clinical studies have found that current restrictions of radiomics in its therapeutic application are mainly subjected to (1) image standardization, 29,91 (2) image registration, 92 -94 and (3) data sharing. 5 The on-going research activities focusing on tackling these limitations are elaborated subsequently.
Standardization of Images
Variations in scanning devices, acquisition modes, reconstruction parameters, and scanning protocols may impact subsequent feature analysis as mentioned in “The Workflow of radiomics”. A study on feature stability in CT perfusion maps showed that none of radiomics parameters were stable without standardization, especially for voxel size, temporal resolution, HU threshold, image discretization, and so on. 91 There are two ways to solve the problem. One is to develop software to calibrate existing image data for retrospective analysis. But the excepted standardization is still difficult to achieve due to the vast variations in calibration algorithms. 95 The other method is to design prospective trials where all enrolled patients receive standardized image scans. In addition, there are national image centers that are currently being planned or constructed, where images can be obtained within one entity in order to fundamentally resolve the image nonstandardizing issue. 19,95
Image Registration
Image registrations, including both rigid and deformable, are commonly used in the course of radiotherapy treatment of NSCLC. 92 The concept of delta-radiomics demands high accuracy in image registration when comparing pretreatment images with posttreatment images and images during treatment. Image registration can be achieved by manual, automatic, or the combination of both. 96,97 At present, the commonly used automatic registration algorithms include image intensity-based method, and structure-based method. 94,97 Although the impact of image registration on texture features and change assessment of serial thoracic CT scans have been studied, 93,98,99 it still remains unclear how image registration accuracy can affect radiomics results and which method would work the best for delta-radiomics for NSCLC. Future study areas can be large-scale prospective clinical trials with the application of delta-radiomics.
Data Storage and Data Sharing
Radiomics study relies heavily on the statistical analysis. Sample size is a critical defect in current research. As shown in Tables 2 and 3, most studies cover limited number of patients, with one exception that contains a large patient sample of 647. 45 Small sample size poses an obstacle for obtaining high correlations between radiomics features and treatment outcomes with high confidence interval. Big data are the premise of using technology to mine the radiomics features and ensure statistical significance in data analysis. The establishment of patient database can strengthen standardized management and improve utilization efficiency. The high-number and high-quality database is the basis for radiomics study, in order to effectively predict outcome. Image data storage and standardization require joint efforts by companies and/or institutions. Institutions such as Cancer Learning Intelligence Network for Quality and Flatiron Health are working on data aggregating. 100
Additionally, the robustness and stability of the discovered features need improvement before they are applied to clinical treatment assistance in NSCLC, so validation and further clinical practice are essential. Future research can also be prospectively designing clinical trials with the focus on implementing discovered experimental features with high statistical significance.
Future Developments in Radiomics for Managing NSCLC
With artificial intelligence being adopted to medical field and the optimization of machine learning algorithm, limitations in image preprocessing, that is, ROI segmentation, feature extraction, and feature selection/classification, may be greatly improved or even eliminated in a foreseeable future.
The applications are still not adequate to provide satisfactory optimized summary information to direct clinicians in medical and radiation oncology on how to further manage their patients, and more “standardized” data are needed from institutions to create these programs. Standardizing is required not only for image acquisition but also for all the steps in the radiomics workflow as mentioned in “The Workflow of radiomics”, as well as all personnel involved, including those in medical and radiation oncology professions. There is a cultural change underway to capture these big data in pursuit of personalized medicine for patients with NSCLC, and many efforts are underway by American Society for Radiation Oncology, the American Association of Physicists in Medicine, and American Society of Clinical Oncology to name a few. 91
The Cancer Imaging Archive, a National Cancer Institute–funded information repository that aggregates images (radiology, pathology), radiation therapy information objects, annotations, clinical trial data, and information derived from quantitative image analysis to support big data analytics, is an example of a comprehensive approach to acquiring, archiving, and extracting data that will be useful for the predictive models that are needed for radiomics to be fully utilized and a great value to clinicians and their patients.
Conclusion
Radiomics process to assess tumor response and predict recurrence, survival, DM, and RILI in NSCLC mainly includes imaging acquisition/reconstruction, ROI definition/segmentation, image feature extraction, and image feature selection and classification. Features quantifying target shape, size, volume, intensity, texture, and so on, are able to predict outcome and assess response. Among them, texture features outperform the others for predicting tumor response, survival, recurrence, DM, and RILI in PET/CT images and CT images. Delta-radiomics, which compares the changes within those features extracted from pretreatment images and follow-up images during and/or after the treatment, are able to identify signs of recurrence, metastases, or death early. Image standardization, image registration, and data sharing are main challenges and limitations for the clinical applications of radiomics tools. Additionally, validation and clinical practice are also essential before radiomics applied to clinical treatment guidance for NSCLC.
Footnotes
Authors’ Note
Liting Shi and Yaoyao He contributed equally to the work.
Acknowledgments
Authors would like to thank Dr. T Li for her contribution in proofreading a couple paragraphs.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study received fundings from the China National Key Research and Development Program (2016YFC0103400) and Shandong Province Key Research and Development Program (2017GSF218075). Jianfeng Qiu is supported by the Taishan Scholars Program of Shandong Province.
