Abstract
Background
Effective means for early diagnosis are imperative to reduce death rate of non-small cell lung cancer (NSCLC) patients. We aimed to find out high-performance serologic markers to distinguish early-stage NSCLC patients from benign pulmonary nodule patients and healthy controls (HC). Cystatin-SN (CST1) is an active cysteine protease inhibitor of the CST superfamily, involving in the processes of inflammation and tumorigenesis. This is the first exploration of the diagnostic and prognostic values of serum CST1 in NSCLC.
Methods
We analyzed the transcriptome data from The Cancer Genome Atlas and the Gene Expression Omnibus database, screened biomarkers for NSCLC, and verified the candidate markers via the ONCOMINE database. Then, we performed ELISA, western blotting, and immunohistochemistry analysis to detect the expression levels of CST1 in NSCLC cell lines, tumor tissues, and serum samples of clinical cohorts.
Results
We identified 3 up-regulated secreted protein-encoding genes, validated the expression levels of CST1 in NSCLC tumor tissues and cell lines, and found that serum CST1 levels of NSCLC (4289 ± 2405 pg/mL) were significantly higher than those of PBN patients (1558 ± 441 pg/mL,
Conclusions
Serum CST1 may serve as a novel diagnostic marker for differentiating early-stage NSCLC from PBN and HC, and could be used as a prognosis predictor in NSCLC patients.
Introduction
Non-small cell lung cancer (NSCLC) is the most common subtype of lung cancer, accounting for nearly 85% of lung cancer cases.1,2 Most NSCLC patients were initially diagnosed at advanced stage, of which the 5-year survival rate remains extremely poor; while the 5-year survival rate for the patients diagnosed at an early stage was as high as 71%-88%.3,4 Therefore, effective means for early diagnosis are imperative to reduce death rate of NSCLC.
As the gold standard for the diagnosis of NSCLC, the combination of bronchoscopy and pathological examination is an invasive and traumatic examination, leading to the poor compliance. Hematology testing is a convenient and safe diagnosis method, which plays an important role in the diagnosis and prognosis prediction of various tumors. Currently, serological markers including carcinoembryonic antigen (CEA) and cytokeratin 19 fragment (CYFRA21-1) are being employed extensively to diagnose NSCLC, as recommended by the National Academy of Clinical Biochemistry. 5 Although these serum markers exhibit high specificity (90%), their sensitivity is still quite low (50-60%), 6 which cannot meet the actual clinical needs. It is urgently necessary to develop innovative and high-performance diagnostic tumor markers.
Cystatin-SN (CST1), a secreted protein encoded by the CST1 gene, plays an important role in regulating the proteolytic activity of cysteine proteases and involving inflammation and tumorigenesis. 7 Numerous studies confirmed that high expression of CST1 was an independent indicator of poor prognosis in various cancers by multivariate analysis.8-12 In NSCLC, a study revealed that high level of CST1 in tumor tissue is associated with poor survival of patients through immunohistochemistry (IHC) analysis. 13 It has also been reported that the secreted protein CST1 can be detected in serum. 14 However, whether serum CST1 can serve as an independent predictor in NSCLC and aid the clinical diagnosis remains to be addressed.
In the present study, we identified CST1 as a candidate biomarker from the transcriptome data of TCGA and GEO database, and validated its expression in cell lines, tumor tissues and serum samples from NSCLC patients. We further evaluated the feasibility of detecting serum CST1 for differential diagnosis of clinical patients with NSCLC or benign nodules, and assessed its diagnostic efficacy and prognostic value.
Material and Methods
Patients and Specimens
In this retrospective clinical study, the NSCLC group included 201 NSCLC patients pathologically diagnosed at SYSUCC between December 2015 and February 2017. All patients met the following inclusion criteria: A clear pathological diagnosis of primary NSCLC was confirmed by two pathologists, and no surgery, chemotherapy or any other treatment was received before serum collection. A total of 94 patients with pulmonary benign nodules (PBN) were enrolled from SYSUCC between March 2015 and February 2016, which met the following criteria: Pulmonary benign nodules were identified by computed tomography (CT) scans, and at least 2-year follow-up showed no progression in the pulmonary nodules. Control samples were collected from 119 healthy volunteers without any malignant tumor or benign nodules confirmed by chest CT between March 2015 and February 2016. Each subject was asymptomatic without known disease, and underwent routine blood tests, serum biochemistry tests, liver function and kidney function tests in the PBN group and the healthy control (HC) group. A total of 19 NSCLC tumor tissues and adjacent non-tumor tissues were collected for IHC analysis. Besides, serum samples were retrospectively collected from 137 NSCLC patients who were treated at SYSUCC from May 2002 until April 2004 for survival analysis. The pathological stage was based on the American Joint Committee on Cancer (AJCC) 7th edition staging criteria. 15 Written informed consent was obtained from the patient(s) for their anonymized information to be published in this article. All the procedures performed in this study were in accordance with the Declaration of Helsinki (as revised in 2013).
Validation of Candidate Genes via the ONCOMINE Database
To validate the expression of candidate genes, we consulted the ONCOMINE database (http://www.ONCOMINE.org/), selected “cancer type” as “Non-small cell lung cancer”, chose all the transcriptome data of NSCLC, and we entered the gene symbols to be verified to obtain the expression levels in each study (the results of 32 studies were listed in supplement files).
Development of Enzyme-Linked Immunosorbent Assay (ELISA) for CST1
After screening multiple antibody pairs, a double antibody sandwich ELISA for CST1 antigen was developed using rabbit polyclonal antibody (1:2000, 16025-1-AP, Proteintech) as capture antibody and mouse monoclonal antibody (1: 6250, MAB1285-SP, R&D Systems) labeled with biotin as detection antibody. CST1 fusion protein (Ag9068, Proteintech) was used as standard/positive control. Other materials included 96-well plates (Corning), BSA (MPBio), PBS (Zhongshan Jinqiao), and TMB color reagent & stop solution (Kangwei Century). Low or high CST1 expression was defined by the median CST1 level (5463 pg/mL) determined in serum from NSCLC patients.
IHC Staining
The protocols are listed in the supplementary appendix.
Cell Culture
Human epithelial cells (16HBE and Beas2-B), and different NSCLC cell lines (A549, 95D, H1975, HCC827, PC-9, H358 and H460) were cultured in RPMI 1640 medium (Invitrogen) supplemented with 5-10% fetal bovine serum (Thermo Fisher Scientific) in a thermostatic incubator at 37°C with 5% CO2.
Western Blotting
Cells were lysed with sample buffer with proteinase/phosphatase inhibitors. The protein samples were separated by 6-15% SDS-PAGE, transferred to polyvinylidene difluoride membranes and then blocked with 5% skim milk for 45 min. The membranes were incubated with antibodies against CST1 (1: 400, 16025-1-AP, Proteintech) or β-actin (1:3000, A5441, Sigma-Aldrich) overnight at 4°C, followed by a horseradish peroxidase (HRP)-conjugated secondary antibody.
Statistical Analysis
Data was analyzed using Statistic Package for the Social Science 20.0 (SPSS, IBM). Nonparametric Mann-Whitney U-test or Kruskal-Wallis test were used to compare the differences in CST1 expression between two or more than two groups. The receiver operating characteristic curve (ROC) was established using GraphPad Prism 5 (San Diego, USA); the area under the curve (AUC), sensitivity, and specificity were utilized to evaluate the diagnostic efficacy. ELISA Calc was used to plot ELISA standard curve. Survival analysis was performed using the Kaplan-Meier method, and survival differences between the groups were determined by log-rank test. Multivariate analysis was performed using a Cox regression model. All statistical tests were two-sided and
Results
Identification of DEGs in NSCLC from the TCGA and GEO Database
To obtain potential diagnostic indicators for lung cancer, we firstly screened and analyzed The Cancer Genome Atlas (TCGA) database and the Gene Expression Omnibus (GEO) database according to certain filter conditions by bioinformatics methods (Supplematal Table S1). A total of 64 overlapped differentially expressed genes (DEGs) were identified from early-stage (stage I and stage II) lung adenocarcinoma patients in both TCGA and GEO datasets (Figure 1A), of which 10 genes were up-regulated (Figure 1B) and 54 genes were down-regulated; all the DEGs are shown in Supplemenatl Figure S1. Considering that secreted proteins are much more likely to be detected in serum than nuclear proteins and membrane proteins especially under tumor progression, we picked out three up-regulated secreted protein-encoding genes as the candidate genes including Cystatin-SN (CST1), secreted phosphoprotein 1-osteopontin (SPP1) and a disintegrin and metalloproteinase with thrombospondin motifs 1 (ADAMTS1) from 64 DEGs. Selection and verification of potential serum diagnostic indicators for NSCLC from the TCGA, NCBI, and Oncomine databases. A, in the TCGA and NCBI databases, 64 common genes were differentially expressed in early-stage lung adenocarcinoma patients, of which 10 genes were up-regulated and 54 genes were down-regulated; B, expression levels of 10 up-regulated genes in early-stage and advanced NSCLC compared with healthy controls; C, validation of the 3 candidate genes in 32 NSCLC studies from the ONCOMINE database, with ranking by the median expression levels of the genes in each study. s1, stage I; s2, stage II.
Verification of the Candidate Genes via the ONCOMINE Database
We validated the expression levels of three candidate genes in NSCLC-related studies using the ONCOMINE database. The results showed that CST1 (
Validation of CST1 Expression in NSCLC Cell Lines and Tumor Tissues
We further verified the expression of CST1 at the transcriptome level and the proteomic level in NSCLC samples. The expression of CST1 mRNA and protein in NSCLC tumor tissues were both significantly higher than that in adjacent tissues (Figure 2F-2G). Besides, IHC staining was performed to detect the expression levels of CST1 in 19 pairs of NSCLC tumor tissues and matched adjacent tissues. The results showed that the expression of CST1 protein was higher in 94.74% (18/19) of NSCLC tissues including lung adenocarcinoma and lung squamous cell carcinoma, compared with adjacent tissues (Figure 2A-2D), and the scores of CST1 levels were significantly higher in NSCLC tissues ( The expression of CST1 in NSCLC tumor tissues and adjacent tissues. Immunohistochemical (IHC) staining of CST1 in tumor tissues (A, left: 40 × and right: 200 ×) and adjacent normal tissues (B, left: 40 × and right: 200 ×) from lung adenocarcinoma patients; IHC staining of CST1 in tumor tissues (C, left: 40 × and right: 200 ×) and adjacent tissues (D, left: 40 × and right: 200 ×) from lung squamous cell carcinoma patients; Brown or tan particles are positively stained; E, CST1 protein expression levels were scored in 19 pairs of NSCLC tumor tissues and adjacent tissues; F, Western blotting of CST1 protein expression levels in 4 pairs of NSCLC tumor tissues and adjacent tissues and relative quantification analysis; G, the mRNA relative expression of CST1 in 4 pairs of NSCLC tumor tissues and adjacent tissues. T, the NSCLC tumor tissue; N, the adjacent normal tissue.
Serum CST1 Levels in NSCLC Patients
We recruited a total of 414 participants (201 NSCLC patients, 94 PBN patients and 119 healthy controls), of whom the main characteristics were presented in Supplemenatl Table S2.
The level of serum CST1 in NSCLC patients (4289 ± 2405 pg/mL) was significantly higher than in PBN patients (1558 ± 441 pg/mL, Diagnostic efficiency of serum CST1 level for NSCLC. A, serum levels of CST1 in HC (n = 119), PBN patients (n = 94), and NSCLC patients (n = 201); B-E, serum CST1 levels in patients with stage T0–2 (n = 150) versus T3–4 (n = 45), N0 (n = 88) versus N1–3 (n = 102), M0 (n = 143) versus M1 (n = 58), or stage I–II (n = 94) versus III–IV (n = 107); F, serum CST1 levels in the PBN patients with the longest diameter of the nodule (Dmax) less than or equal to 4 mm (n = 49), PBN patients with Dmax more than 4 mm (n = 45), and NSCLC patients with Dmax less than or equal to 10 mm (n = 9); The ROC analysis of serum CST1, Cyfra21-1, CEA, and three combined indicators in distinguishing between NSCLC and PBN (G), NSCLC and HC (H), early-stage NSCLC and HC (I), early-stage NSCLC and PBN (J).
Clinical characteristics and serum CST1 levels of 201 NSCLC patients.
The CST1 level was expressed as the median with the range. Mann-Whitney U-test or Kruskal-Wallis test were used to compare the differences in CST1 levels between two or more than two groups. AD, adenocarcinoma; SCC, squamous carcinoma.
Besides, we further analyzed the correlation between serum CST1 levels and the PBN size. The 94 cases of PBN patients were divided into two groups by the size of PBN: the longest diameter of the nodule ≤4 mm (
We also detected CST1 level in the serum of patients with pulmonary chronic inflammation, including chronic pneumonia, chronic obstructive pulmonary disease, pulmonary chronic granulomatous inflammation and so on. 40 patients with chronic inflammation, age and sex-matched 32 NSCLC patients and 24 healthy controls were included. We found that CST1 level in chronic inflammation patient group was higher than in healthy control group (
Diagnostic Efficiency of Serum CST1 for NSCLC Patients
The AUC of CST1 for differential diagnosis of NSCLC and PBN/HC was .809/0.808, the sensitivity was 90.80%/94.70%, and the specificity was 72.30%/66.30%, while the AUC of Cyfra21-1 and CEA in diagnosing NSCLC and PBN/HC was .799/0.788 and .758/0.783, respectively. The AUC of CST1 + Cyfra21-1 + CEA was as high as .910/0.919, the sensitivity was 83.20%/82.20%, and the specificity was 93.60%/93.30% (Supplemenatl Table S4, Figure 3G-3H).
CST1 identified early-stage NSCLC and PBN/HC patients with an AUC of .734/0.733, a sensitivity of 52.60%/63.20%, and a specificity of 94.70%/89.90%. Cyfra21-1 and CEA identified early NSCLC and PBN/HC patients with an AUC of .830/0.819 and .794/0.814, respectively. The AUC of CST1 + Cyfra21-1 + CEA for distinguishing early-stage NSCLC from PBN/HC was as high as .914/0.925, the sensitivity was 81.10%/88.40%, and the specificity was 90.40%/89.10% (Supplemenatl Table S5, Figure 3I-3J).
Serum CST1 is Associated with the Prognosis of NSCLC
Clinical characteristics and serum CST1 levels of 137 NSCLC patients for survival analysis.
The CST1 level was expressed as the median with the range.
1Mann-Whitney U-test or Kruskal-Wallis test were used to compare the differences in CST1 levels between two or more than two groups. Low or high CST1 expression was defined by the median CST1 level (5463 pg/mL).
2Statistical significance was analyzed by chi-square test. AD, adenocarcinoma; SCC, squamous carcinoma.
Univariate and multivariate analysis for OS prediction of NSCLC patients.
1Statistical significance was analyzed by Cox regression model; - means the data was not given; HR, hazard ratio; CI, confidence interval.
In this study, the Kaplan-Meier method was used to analyze the relationship between serum CST1 levels and the prognosis in 137 preoperative patients with NSCLC, and the log-rank test was used to compare the difference of survival curves of patients with high CST1 levels and low CST1 levels. All participants were followed up to 2017, of which the median survival time and mean survival time were 19 months and 45 months, respectively. The 1-, 3-, 5-, and 10-year OS rates were 72.99%, 35.04%, 23.36%, and 12.41%, respectively.
Survival analysis showed that 1-, 3-, 5-, and 10-year OS rates of NSCLC patients with high serum CST1 levels were 59.52%, 21.43%, 9.52%, and 7.14%, which were significantly lower than those with low serum CST1 levels (75.79%, 38.95%, 28.42%, and 14.74%, respectively) (Figure 4A-4D). These results suggested that serum level of CST1 was associated with poor prognosis of patients with NSCLC. Survival analysis of NSCLC patients stratified by median serum CST1 protein expression level. Kaplan-Meier curves show that patients with low CST1 expression had higher survival rates, including the total (A), 3-year (B), 5-year (C), and 10-year (D) OS rates. Red line, NSCLC patients with low CST1 expression; green line, NSCLC patients with high CST1 expression.
Discussion
Although the strategy for early diagnosis of lung cancer has been clinically recognized, the differential diagnosis of PBN and NSCLC is still an important issue to be resolved. 18 In the present study, we selected and verified a candidate gene CST1 from bioinformatics databases. We then validated the distinctions of serum CST1 levels in NSCLC patients (4289 ± 2405 pg/mL) compared with PBN patients (1558 ± 441 pg/mL) and healthy controls (1529 ± 416 pg/mL). Further analysis showed that serum CST1 levels of NSCLC patients in the early stage (I + II) were significantly lower than those in the terminal stage (III + IV), and meanwhile closely associated with the TNM stage.
CST1 is an active cysteine protease inhibitor of the CST superfamily, involving in the processes of inflammation and tumorigenesis. Numerous studies have reported that CST1 was a novel biomarker for several types of human cancer. Yoneda
This is the first exploration of the relationship between serum CST1 and NSCLC, and whether serum CST1 can serve as an independent diagnostic indicator for distinguishing between early-stage NSCLC and PBN. We performed the ROC analysis to clarify the role of serum CST1 in the diagnosis of NSCLC. The serum CST1 levels could be used to distinguish early-stage NSCLC patients from healthy subjects and PBN patients with the AUC of .733 and .734, respectively. Considering that the single indicator CST1 may not meet the needs of clinical diagnosis, we combined Cyfra21-1, CEA and CST1 to improve the diagnostic efficacy for early-stage NSCLC, with the AUC of combined indicators for differential diagnosis between early-stage NSCLC and PBN/HC patients reaching .914 and .925, respectively. Therefore, CST1-based combined indicators provided a great potential to recognize early-stage NSCLC and PBN cases.
We also explored the mechanisms involved in the relationship between CST1 and the prognosis of NSCLC. It has been previously reported that Cathepsin B (CTSB) concentration and activity were positively correlated with poor prognosis of tumors.19,20 CST1 can counteract the inhibitory effect of Cystatin C (CST3) on CTSB activity and indirectly lead to enhanced CTSB activity. 21 Moreover, CST1 and CTSB could delay cell senescence by inhibiting abnormal accumulation of glycogen. Overall, CST1 may be involved in regulating of CST3 and CTSB activity, thus indirectly affecting the prognosis.
In this study, we found that high serum CST1 levels were positively correlated with the poor prognosis of NSCLC patients, and the correlation was likely to persist throughout the course of disease progression. Our results also showed that 1-, 3-, 5-, and 10-year OS rates for NSCLC patients with high serum CST1 levels were 59.52%, 21.43%, 9.524%, and 7.143%, which were significantly lower than those with low serum CST1 levels (75.79%, 38.95%, 28.42%, and 14.74%). Moreover, we provided evidence that serum CST1 (HR = 1.605;
We also acknowledged the limitations of the present study. All participants in this study were enrolled from the single center, which may cover a narrower range. Multicenter cohort should be conducted in future to further strengthen these results. The expression levels of upstream and downstream effectors of CST1 should be examined, which could provide further evidence of the functions of CST1. Additional investigations of the detailed mechanisms of CST1 expression acting on NSCLC should be performed.
Conclusions
Serum CST1 was a prognostic predictor for NSCLC, and could be combined with Cyfra21-1 and CEA for distinguishing early-stage NSCLC patients from PBN cases and healthy controls. However, whether serum CST1 will become a novel clinical biomarker for NSCLC still needs to be validated in a larger sample size of cohorts in future.
Supplemental Material
Supplemental Material - Identification and Validation of Serum CST1 as a Diagnostic Marker for Differentiating Early-Stage Non-Small Cell Lung Cancer from Pulmonary Benign Nodules
Supplemental Material for Identification and Validation of Serum CST1 as a Diagnostic Marker for Differentiating Early-Stage Non-Small Cell Lung Cancer from Pulmonary Benign Nodules by Yanzhen Lai, Yu Wang, Yaxian Wu, Meng Wu, Shan Xing, Ying Xie, Shulin Chen, Xiaohui Li, Ao Zhang, Yi He, Huilan Li, Shuqin Dai, Junye Wang, Shudai Lin, Yunmeng Bai, Hongli Du, and Wanli Liu in Cancer Control
Footnotes
Acknowledgments
We thank the Cancer Genome Atlas (TCGA), the National Center of Biotechnology Information (NCBI), and the ONCOMINE team for using their data.
Author Contributions
Wanli Liu and Hongli Du designed the study; Yanzhen Lai, Yu Wang, and Yaxian Wu performed the experiments and wrote the paper; Meng Wu, Shan Xing, and Ying Xie analyzed the data; Shulin Chen, Xiaohui Li, Ao Zhang, Yi He, Huilan Li, Shuqin Dai, Junye Wang, Shudai Lin, and Yunmeng Bai collected the data. All authors discussed the results and revised the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by the National Natural Science Foundation of China (No. 81271902, No. 82002240), the National Key R&D Program of China (No. 2018YFC0910202), the Natural Science Foundation of Guangdong Province (No. 2019A1515010798), and the Science and Technology Planning Project of Guangzhou (No. 201704020176).
Ethical Approval
The study was approved by the Institutional Review Board of Sun Yat-sen University Cancer Center, Guangzhou, Guangdong Province, China (B2022-059-01).
Informed Consent
Written informed consent was obtained from the patient(s) for their anonymized information to be published in this article.
Supplemental Material
Supplemental material for this article is available online.
Appendix
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
