Abstract
Background
The national lung screening trial (NLST) demonstrated a reduction in lung cancer mortality with lowdose CT (LDCT) compared to chest x-ray (CXR) screening. Overdiagnosis was high (79%) among bronchoalveolar carcinoma (BAC) currently replaced by adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA) and adenocarcinoma of low malignant potential (LMP) exhibiting 100% disease specific survival (DSS).
Objective
Compare the outcomes and proportions of BAC, AIS, MIA, and LMP among NLST screendetected stage IA NSCLC with overdiagnosis rate.
Methods
Whole slide images were reviewed by a thoracic pathologist from 174 of 409 NLST screen-detected stage IA LUAD. Overdiagnosis rates were calculated from follow-up cancer incidence rates.
Results
Most BAC were reclassified as AIS/MIA/LMP (20/35 = 57%). The 7-year DSS was 100% for AIS/MIA/LMP and 94% for BAC. Excluding AIS/MIA/LMP, BAC behaved similarly to NSCLC (7-year DSS: 86% vs. 83%, P = 0.85) The overdiagnosis rate of LDCT stage IA NSCLC was 16.6% at 11.3-years, matching the proportion of AIS/MIA/LMP (16.2%) but not AIS/MIA (3.5%) or BAC (22.8%).
Conclusions
AIS/MIA/LMP proportionally matches the overdiagnosis rate among stage IA NSCLC in the NLST, exhibiting 100% 7-year DSS. Biomarkers designed to recognize AIS/MIA/LMP preoperatively, would be useful to prevent overtreatment of indolent screen-detected cancers.
Keywords
Introduction
The national lung screening trial (NLST) demonstrated an overall mortality reduction of 20% at a median of 6.5-years after three annual lowdose CT-screenings (LDCT) compared to the control arm screened by chest X-ray (CXR) among high-risk smokers [1]. Mortality reduction was associated with a stage shift towards the detection of early-stage disease whereby the majority of NSCLC are curable by surgery alone. Two follow-up studies have reported the frequency of overdiagnosis, defined as the excess cancers in the CXR-arm presenting clinically after screening cessation compared to the LDCT-arm at median time points of 6.4-years and 11.3-years [2,3]. The overdiagnosis rate for all cancers was reported as 18.5% at 6.4-years and 3.1% at 11.3-years; NSCLC as 22.5% at 6.4-years (not specifically reported at 11.3-years); and BAC as 78.9% at both the 6.4-year and 11.3-year time period.
Since the study period of the NLST, BAC was abandoned as a pathologic entity and replaced by the more specifically defined entities of adenocarcinoma in situ (AIS), minimally invasive adenocarcinoma (MIA), lepidic predominant adenocarcinomas, and invasive mucinous adenocarcinoma (IMA) [4]. AIS and MIA together exhibit long-term 100% DSS whereas lepidic predominant and IMA are regarded as low and intermediate grade cancers respectively [5,6]. While BAC comprised 27% of all screen-detected adenocarcinoma in the CTLS-arm, AIS and MIA together comprise < 5% of lung adenocarcinoma in most large series. We have proposed histologic criteria for adenocarcinoma of low malignant potential (LMP) which demonstrate long-term 100% DSS identical to AIS/MIA and together comprised 23% of a stage I adenocarcinoma cohort [7]. These criteria have been independently validated in an international cohort [8] and more recently we have shown substantial reproducibility (Fleiss kappa = 0.74) when applied by general surgical pathologists [9].
While no central pathologic review was performed in the NLST, H&E-stained slides were subsequently digitized from a subcomponent of the NLST. Using this digitized subset, the aims of this study are to determine the frequency of AIS/MIA/LMP among screen-detected stage IA NSCLC in the CTLS and CXR-arms of the NLST and determine their relationship to the historically classified entity BAC. We further seek to determine the prognostic significance of BAC as it relates to more specifically defined pathologic entities and to better understand its association with overdiagnosis among NSCLC.
Methods
Patients and samples
Details of the NLST have been published previously [1,2,3]. Briefly adults aged 55–74 years of age with a minimum of 30 pack-years of cigarette smoking and who were either current or former smokers who had quit within the past 15-years were enrolled between 2002–2004 at 33 United States based medical institutions and randomized to receive three annual protocol screens of either LDCT (26,722 participants) or single-view CXR (26,730 participants). CXR does not reduce lung cancer mortality compared to routine community care and was therefore deemed an appropriate control arm [10]. Participants were excluded prior to randomization if they had unexplained weight loss or hemoptysis in the prior year, a CT-scan 18-months prior to enrollment, or a history of lung cancer. Screendetected lung cancers were distinguished from non-screen detected cancers if they were diagnosed within 1-year of a positive screen with no intervening negative screens or > 1-year based on diagnostic procedures initiated because of the positive screen. All participants were actively followed up for lung cancer incidence and mortality until the end of 2009. Participants screened at states with cancer registries (22 of 33 screening centers, comprising 87.6% of trial participants) were followed passively for lung cancer incidence until the end of 2014. At the end of the active follow-up period there were 1089 total tumors of which 926 were NSCLC (111 BAC & 815 non-BAC NSCLC) in the LDCT-arm and 969 total tumors of which 793 were NSCLC (36 BAC & 757 non-BAC NSCLC) in the CXR-arm [2]. At the end of the passive follow-up period, there were 1701 total tumors of which 1397 were NSCLC (121 BAC & 1276 non-BAC NSCLC) in the LDCT-arm and 1681 total tumors of which 1343 were NSCLC (46 BAC & 1297 non-BAC NSCLC) in the CXR-arm [3].
Screen-detected cancers were staged using the 6th edition of the American Joint Committee on Cancer (AJCC). There were 649 screen-detected cancers in the LDCT-arm of which 591 were NSCLC (95 BAC & 496 non-BAC NSCLC) and 279 screen-detected cancers in the CXR-arm of which 247 were NSCLC (13 BAC & 234 non-BAC NSCLC). Cases of SCLC (LDCT 49 & CXR 28) and carcinoid (LDCT 5 & CXR 1) were excluded. We additionally excluded all stage IB-IV, cases of NSCLC of unknown stage (LDCT 5 & CXR 2), stage IA carcinomas of unknown histologic type designated by ICD-0-3 code 8000 (LDCT 2 & CXR 1) and cases where neither histologic type nor stage were known (LDCT 1 & CXR 0); yielding 325 and 84 stage IA NSCLC for analysis in the LDCT and CXR-arms respectively. Whole slide images (WSI) were previously generated from archived FFPE tumor blocks from 463 patients derived from one of the NLST screening networks (Lung Screening Study Network, LSS) which included 10 screening centers enrolling 34,612 participants that detected lung cancer in 1,284 participants. The WSI and linked but anonymized clinical and pathologic annotations were made available for download from the National Cancer Institute Cancer Data Access System after an approved application (Project ID NLST-867). All participants enrolling in the NLST signed an informed consent developed and approved by the institutional review board (IRB) at each screening site. Among screen-detected stage IA NSCLC, there were 182 with WSI of which 174 had tumor sufficient for classification. There were a median of 2 digitized tumor slides per patient (range 1–5).
Histopathological analysis
All WSI were reviewed by a single experienced thoracic pathologist (EJB). Adenocarcinoma in situ (AIS) was rendered for purely lepidic tumors
Survival, overdiagnosis and statistical analysis
Disease-specific survival (DSS), defined as time from surgery to death from lung cancer or time of last follow-up (unrelated deaths censored at time of event) was estimated using the Kaplan-Meier method comparing groups with the log-rank test. The probability of a screen-detected cancer being an overdiagnosis was calculated as previously described [2] where the excess number of total cancers (screen & non-screen detected) in the LDCT compared to CXR-arms previously reported at the end of two time periods (2009 and 2014) are divided by the number of screen-detected cancers from the original report. Statistical analyses were performed using SPSS version 28 (IBM). Chi Square Test for Homogeneity or the Fisher’s Exact test were used for categorical variables, as appropriate. Post hoc analysis involved pairwise comparisons using the z-test of two proportions with a Bonferroni correction. Continuous variables were compared between groups using Welch’s t-test. All tests were two-tailed.
Results
Cohort comparison
Clinicopathologic characteristics of NLST stage IA screen-detected NSCLC.
Clinicopathologic characteristics of NLST stage IA screen-detected NSCLC.
Note: The data are shown as the number and (%) unless otherwise indicated. Abbreviations: BAC, bronchoalveolar carcinoma; DSS, disease specific survival; LC, large cell carcinoma; LUAD, lung adenocarcinoma, LUSC, lung squamous cell carcinoma, NSCLC & Other, non-small cell lung carcinoma and cases coded as other rare subtypes of NSCLC. Small cell lung carcinoma, carcinoid tumors, and those with unknown histologic type (ICD-O-3 8000) or stage excluded. Stage IA as per AJCC 6th edition.
Clinicopathologic comparison of stage IA screen-detected NSCLC stratified by histology.
Note: The data are shown as the number and (%) unless otherwise indicated. Abbreviations: BAC, bronchoalveolar carcinoma; DSS, disease specific survival; LC, large cell carcinoma; LUAD, lung adenocarcinoma, LUSC, lung squamous cell carcinoma, NSCLC & Other, non-small cell lung carcinoma and cases coded as other rare subtypes of NSCLC. Small cell lung carcinoma, carcinoid tumors, and those with unknown histologic type (ICD-O-3 8000) or stage excluded. Stage IA as per AJCC 6th edition.
NLST screen detected stage IA NSCLC with histology stratified by classification.
Note: The data are shown as the number and (%) and unless otherwise indicated. Abbreviations: AIS, adenocarcinoma in situ; BAC, bronchoalveolar carcinoma; LC, large cell carcinoma; LCNEC, large cell neuroendocrine carcinoma; LMP, low malignant potential; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; MIA, minimally invasive adenocarcinoma; NSCLC & Other, non-small cell lung carcinoma and cases coded as other rare subtypes of NSCLC. Small cell lung carcinoma, carcinoid tumors, and those with unknown histologic type (ICD-O-3 8000) or stage excluded. Stage IA as per AJCC 6th edition.
Overdiagnosis calculations of NSCLC by stage and historic classification.
Abbreviations: BAC, bronchoalveolar carcinoma; CXR, chest X-ray arm; Diff, difference of LDCT – CXR arms at specified time points; LDCT, low-dose CT-arm; NSCLC, non-small cell lung cancer; Overdx, overdiagnosis. Stage IA as per AJCC 6th edition. All stage screen-detected NSCLC and total cancers at median of 6.4-years and 11.3-years from published reports as described in the methods section.
The proportion of screen-detected stage IA NSCLC among total screen-detected NSCLC was significantly greater in the LDCT-arm than the CXR-arm (325/591 = 55% vs. 84/247 = 34%, P < 0.001). The ratio of LDCT to CXR stage IA screen-detected NSCLC was 3.9 (325/84) compared to 1.6 (266/163) for all other stage subgroups. Clinicopathologic features of screendetected stage IA NSCLC in the LDCT-arm and CXR-arm are shown in Table 1. Compared to the CXR-arm, the LDCT-arm had a higher proportion of tumors historically classified as BAC (23% vs. 8%) and a correspondingly better outcome (7-year DSS: 87% vs. 79%, P = 0.046). WSI were available for 43% (174/409) of the NSCLC including 44% (142/325) of the LDCT-arm and 38% (32/84) of the CXR-arm. There were minimal clinicopathologic differences between the subset with WSI compared to those without (Table 2) and no significant difference in outcomes.
Table 3 shows the proportions of tumors histologically classified as indolent by the historic compared to the newer pathologic classification. AIS/MIA comprised a minority of stage IA cancers in the LDCT-arm (3.5%) and were not observed in the CXR-arm. LMP comprised 12.7% of the LDCT-arm but only 3.1% of the CXR-arm. The majority of BAC were AIS/MIA/LMP (20/35 = 57%) whereas a minority were AIS/MIA (5/35 = 14%) or IMA (3/35 = 9%). The majority (
Overdiagnosis pathologic correlation among stage IA screen-detected NSCLC.
Overdiagnosis pathologic correlation among stage IA screen-detected NSCLC.
Note: The data are shown as the number and (%). Abbreviations: AIS, adenocarcinoma in situ, BAC, bronchoalveolar carcinoma, LMP, low malignant potential, MIA, minimally invasive adenocarcinoma. Stage IA as per AJCC 6th edition.

Kaplan Meier Curves showing disease specific survival of screen-detected Stage IA NSCLC. (A) Total non-BAC NSCLC stratified by historic classification, (B) Total BAC vs. non-BAC NSCLC, (C) WSI BAC vs. non-BAC NSCLC, (D) WSI excluding AIS/MIA/LMP and remaining BAC vs. non-BAC NSCLC.
Figure 1 shows DSS for AJCC 6th ed. Stage IA NSCLC. Historic classifications of LUAD, LUSC, LC, and NSCLC & Other showed similar 7-year DSS (79–85%, P = 0.95 Fig. 1A) as did reclassified LUAD and LUSC (7-year DSS: 83% vs. 84%, P = 0.90, KM curves not shown). As such, all non-BAC NSCLC were grouped together for comparison with BAC (Fig. 1B–D). As expected, BAC showed better 7-year DSS than non-BAC NSCLC in the entire cohort (94% vs. 83%, P = 0.03 Fig. 1B). The same analysis performed with the smaller histologic cohort revealed a similar magnitude of difference (94% vs. 84%) but lacked power to reach statistical significance (P = 0.15 Fig. 1C). All cases of AIS/MIA/LMP showed 100% 7-year DSS (Fig. 1D) and when these were removed, BAC behaved similarly to the remaining non-BAC NSCLC (86% vs. 83%, P = 0.85 Fig. 1D).
Overdiagnosis rate and pathologic associations
The overdiagnosis rates of NSCLC and the data from which they are derived from are shown in Table 4. The calculated overdiagnosis rate for all stage NSCLC (22.5%) and non-BAC NSCLC (11.7%) at 6.4-years and BAC (78.9%) at both 6.4- and 11.3-years are identical to previously published results [2,3]. Using the same formula, we calculate overdiagnosis rate for AJCC 6th ed. stage IA tumors as these were the group most overrepresented in the LDCT compared to CXR-arms given the size of these tumors limits their detection by CXR alone. The overdiagnosis rate for stage IA NSCLC dropped from 40.9% at 6.4-years to 16.6% at 11.3-years. No excess BAC were detected from 6.4- to 11.3-years, remaining at 75 in spite of the fact that this number exceeded the total excess NSCLC of 54. A similar erroneous observation of 21 fewer non-BAC NSCLC in the LDCT compared to CXR-arm raise concerns about the comparability of these diagnostic categories over the follow-up period of this study. As such, the calculated overdiagnosis rates of BAC vs. non-BAC NSCLC (stage IA: 101.4% vs. −8.4% & all stage: 78.9% vs. −4.2%) may be an artifact of classification (see discussion).
Table 5 shows the proportion of pathologically predicted indolent screen-detected NSCLC in the LDCT-arm compared to the observed overdiagnosis rate. The combined proportion of tumors classified as AIS/MIA/LMP closely approximates the calculated overdiagnosis rate of stage IA NSCLC (16.2% vs. 16.6%) and importantly exhibited 100% 7-years DSS. In contrast, the combined proportions of AIS/MIA fell short of the observed overdiagnosis rate (3.5% vs. 16.6%, P < 0.001) despite also exhibiting 100% 7-year DSS. The proportion of historic BAC exceeded the observed overdiagnosis rate of stage IA NSCLC (22.8% vs. 16.6%, P = 0.049) and did not attain the 100% 7-year DSS necessary to predict overdiagnosis.
Discussion
Overdiagnosis is a major problem in cancer screening programs as it leads to patient anxiety associated with a malignant diagnosis, unnecessary procedures with associated risks, and costs to the health care system [11,12]. It has been proposed that indolent screen-detected cancers be reclassified as indolent lesions of epithelial origin (IDLE) [13]; however such terminology is challenging for pathologists to incorporate given that precise classification generally requires complete excision of a tumor which alters the natural history of the disease, i.e. 100% DSS after resection is not equivalent to 100% DSS without treatment. Aiming at the intent of the IDLE proposal, we proposed to expand the histologic spectrum of indolent adenocarcinoma beyond AIS/MIA to include a group of tumors termed LMP; a subset with identical behavior as determined by 100% long-term DSS after surgery shown in two previously published cohorts [7,14]. In the present study, we sought to advance this concept by determining the rate of AIS/MIA/LMP among a subset of screen-detected stage IA NSCLC from the LDCT-arm of the NLST with digitized images, confirm their indolent behavior after surgery, and compare this to the calculated overdiagnosis rate derived epidemiologically with extended follow-up. AIS/MIA/LMP tumors in the NLST exhibited 100% 7-year DSS and were nearly identical in proportion to the epidemiologically calculated overdiagnosis rate of stage IA NSCLC at median follow-up of 11.3-years (16.2% vs. 16.6%).
Overdiagnosis rates for all lung cancer at median of 6.4-years has been reported as 18.5% and 3.1% at 11.3-years [2,3]. Given the rapid growth rate of SCLC, the goal of annual lung cancer CT-screening is early detection and treatment of clinically significant NSCLC. As such, the overdiagnosis rates of NSCLC are most relevant for cancer screening and has been reported at 22.5% at 6.4-years, dropping to 9.1% at 11.3-years for all stage disease. Given that overdiagnosis must correlate with early stage (non-metastatic) lesions visible by CT-scans but not routine CXR, the AJCC 6th ed. Stage IA (
In the screening period and initial follow-up of the NLST cohorts, BAC was the category of NSCLC most associated with indolent behavior. As such, overdiagnosis rates for all stage BAC have been reported as 78.9% at both 6.4 and 11.3-years of median follow-up [2,3]. There are at least three problems with this conclusion. First, in 2011 – during the period of extended (passive) follow-up of the NLST cohorts – the IASLC/ATS/ERS proposed a new histologic classification for lung adenocarcinoma [15]in which BAC was abandoned and replaced with AIS, MIA, IMA, and lepidic predominant adenocarcinoma. As such, the low frequency of additional BAC in both the LDCT and CXR-arms (10 cases each) during the extended follow-up between 2009 and 2014 may reflect adoption of a new classification rather than a true change in BAC incidence. Second, not all BAC behaved indolently, with 6% of stage IA BAC dying of lung cancer at 7-years after surgery and thus precluding such an entity defining indolent/non-progressive cancer. Third, the most consistently applied histologic feature separating BAC from LUAD was the presence of a lepidic component, confirmed in this cohort (
AIS/MIA/LMP exhibited 100% 7-year DSS and occurred in a proportion nearly identical to the overdiagnosis rate (16.2% vs. 16.6%). In contrast, the proportion of AIS/MIA was significantly less than (3.5%) and the proportion of BAC was significantly more than (22.8%) than the observed overdiagnosis rate (16.6%). Moreover, we show that once AIS/MIA/LMP are removed, BAC behaves similarly to all other NSCLC (7-year DSS: 86% vs. 83%, P = 0.852). The proportion of AIS/MIA/LMP in the LDCT-arm was 22–24% compared to 5% in the CXR-arm. The former frequency is similar to the frequency observed in our original cohort (23%) [7] and in a follow-up study of stage I adenocarcinoma in a single-center LDCT cohort (18%) and a non-screen detected cohort of similarly matched high-risk smokers (20%) [14]. Conversely, non-screen detected tumors in low-risk and never smokers exhibited a higher proportion of AIS/MIA/LMP (33%) which may have implications for screening programs targeting light or never smokers [14].
Herein, we document an additional 19 cases of LMP which when combined with our previous reports totals 110 cases of LMP with long-term 100% DSS. More recently, LMP exclusive of AIS/MIA was found to comprise 12.4% of an Italian cohort of 274 stage IA LUAD [8]. Of the 34 LMP described, 5 recurred (14.7%) with 2 dying of lung cancer. Molecular confirmation of matched driver mutation was confirmed in only 1 case while 3 cases were late recurrences (6–9 years after surgery) raising the possibility of second primary lung cancers which are known to occur in 15–18% of patients within 7-years of treatment for a primary lung cancer [9,16,17]. Alternatively, some of these may be slow-growing local recurrences. We have observed only a single recurrence of an LMP, occurring at the staple line 1-year after wedge resection with a close surgical margin. The patient is alive and without metastatic spread > 10-years after local treatment with stereotactic radiation as was described in our original cohort [7]. While it is impossible to prove that all AIS/MIA/LMP would not progress without surgical intervention, the indolent behavior and proportions matching the overdiagnosis rate at 11.3-years are highly suggestive. Anecdotally, one of the authors (EJB) has observed a 2.5 cm LMP which was followed by CT-scan as a subsolid nodule for 4-years prior to excision with minimal ( < 5 mm) growth. The patient agreed to surgery after this monitoring interval and is now 4-years status post lobectomy with no recurrence or metastasis.
One goal of pathologically defining IDLE’s is for the development of biomarkers which would allow their identification prior to treatment [18]. To this aim, prior investigators have shown by radiomic analysis of CT-images that 18% of screen-detected stage I adenocarcinoma in the LDCT-arm can be predicted as “good-risk”, showing 100% 7-year DSS, using Computer-Aided Nodule Assessment and Risk Yield (CANARY) [19]. This would imply that many AIS/MIA/LMP might be predicted by radiomic features preoperatively (see Steiner et al. in this edition). Additionally, we have previously demonstrated by gene expression profiling the ability to predict aggressive histologic features, and thus exclude LMP, which we hope to apply as a tissue based biomarker [14]. A combined radiomic and tissue-based biomarker suitable for presurgical biopsies, might together provide even greater sensitivity and specificity for preoperative prediction of cancers likely to represent overdiagnosis. This knowledge could predict patients who might benefit from tissue sparing surgical approaches (wedge or segmentectomy) or those who might be better treated by non-invasive approaches such as stereotactic body radiation therapy (SBRT), cryoablation, or radiofrequency ablation. Alternatively, some of these patients might be better managed by active surveillance protocols similar to the management of low-grade prostate cancer [20].
This study has several limitations. First, the histologic assessment was only possible on the subset of screen-detected stage IA NSCLC with WSI (44% in the LDCT-arm) which were derived from a minority (10/33) of screening sites, but which enrolled the majority (34,612/53,452) of participants. Given the similar proportions of historically classified BAC and LUAD and lack of outcome differences between those with and without WSI, we believe our histologic findings are generalizable to the overall group of screen-detected NSCLC. Histologic classification was limited to representative tumor blocks (median of 2) rather than slides from the entire tumor which generally includes 3–4 tissue blocks for tumors of this size (
Conclusion
The combined pathologic subgroup of AIS/MIA/LMP in NLST correlates well with the epidemiologically observed rate of overdiagnosis among stage IA NSCLC (∼ 16%). Tumors thus classified exhibited 100% long-term DSS in this and most prior retrospective studies [7,14]. The development of radiomic and/or tissue-based biomarkers to predict this subgroup of adenocarcinoma may one day allow non-surgical management strategies, such as active surveillance, to prevent the overtreatment of this fraction of LDCT screendetected neoplasia.
Footnotes
Acknowledgments
The authors thank the National Cancer Institute for access to NCI’s data collected by the NLST (Project ID NLST-867).
This work was supported in part by 1R01CA275015-01A1.
Author contributions
Conception: EJB, TBS, KMRC.
Interpretation or analysis of data: EJB, TBS, KMRC.
Preparation of the manuscript: EJB, TBS.
Revision for important intellectual content: EJB, TBS, KMRC.
Supervision: EJB, KRC.
