Abstract
BACKGROUND:
Ovarian cancer is the common tumor in female, the prognostic of which is influenced by a series of factors. In this study, 4 genes relevant to pathological grade in ovarian cancer were screened out by the construction of weighted gene co-expression network analysis.
METHODS:
GSE9891 with 298 ovarian cancer cases had been used to construct co-expression networks. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses was used to analyze the possible mechanism of genes involved in the malignant process of ovarian cancer. Hub genes were validated in other independent datasets, such as GSE63885, GSE26193 and GSE30161. Survival analysis based on the hub genes was performed by website of Kaplan Meier-plotter.
RESULTS:
The result based on weighted gene co-expression network analysis indicated that turquoise module has the highest association with pathological grade. Gene Ontology enrichment analysis revealed that the genes in turquoise module main enrichment in inflammatory response and immune response. Kyoto Encyclopedia of Genes and Genomes enrichment analysis revealed that the genes in turquoise module main enrichment in cytokine-cytokine receptor interaction and chemokine signaling pathway. In turquoise module, a total of 4 hub genes (MS4A4A, CD163, CPR65, MS4A6A) were identified. Then, 4 hub genes were effectively verified in the test datasets (GSE63885, GSE26193 and GSE30161) and tissue samples from Shengjing Hospital of China Medical University. Survival analysis indicated that the 4 hub genes were associated with poor progression-free survival of ovarian cancer.
CONCLUSIONS:
In conclusion, 4 hub genes (MS4A4A, CD163, CPR65, MS4A6A) were verified associated with pathological grade of ovarian cancer. Moreover, MS4A4A, CD163, MS4A6A may serve as a surface marker for M2 macrophages. Targeting the 4 hub genes may can improve the prognosis of ovarian cancer.
Introduction
Epithelial ovarian cancer (EOC) is one of the most common and lethal malignant tumors, and the 5-year survival rate is less than 30% [1]. Tumor grade is closely related to the prognosis of patients. High grade serous ovarian carcinoma (HGSOC) as the most common histological subtype of EOC has a worse prognosis than low-grade ovarian cancer [2]. Currently, the standard treatment for HGSOC is tumor cell destruction combined with postoperative taxol/platinum chemotherapy. However, the vast majority of patients eventually relapse for a variety of reasons [3]. Therefore, finding predictive markers related to ovarian cancer grade is very significant, which can not only predict the prognosis of ovarian cancer, but also serve as a therapeutic target for OC.
In recent years, bioinformatic analysis is commonly used in various diseases, especially tumors. Compared with other methods, bioinformatic analysis is more accurate, comprehensive and economical. Weighted gene co-expression network analysis (WGCNA) is a widely used method in systems biology, especially for studying biological networks based on paired correlations between variables. Based on the correlation between gene expression data and clinical phenotypes, the WGCNA network can be constructed to analyze the change of biological process integrally and identify hundreds of pathogenic genes and therapeutic targets simultaneously. WGCNA network replaced the hard threshold with soft threshold, and introduced the concept of scale-free distribution network. Through using the correlation coefficient of network weighted expression matrix, gene pairs in each network meet a certain biological significance. WGCNA technology, based on the concept of scale-free distribution, has been successfully applied to solve many biological problems and make great contribution to the finding of many important discoveries [4]. In cancer research, WGCNA had been used to find tumor markers for screening, diagnosis, and treatment [5]. Zhu et al. [6] screened out genes related to the prognosis of soft tissue sarcoma by constructing WGCNA. Several miRNAs were identified as prognostic biomarkers through co-expression network analysis [7]. Similarly, biomarkers serve as clear cell Renal cell Carcinoma were identified in the same way [8].
In our research, WGCNA was established based on gene expression data and clinical information download from GSE9891 dataset and the hub gene module closely related to the pathological grade of OC were screened. GO and KEGG pathways analysis related to genes in hub module were performed. Subsequently, hub genes, which were closely related to ovarian cancer pathological grade, were screened out and verified in other data sets. Through a series of analyses, we speculated that the hub genes may be served as surface markers of M2 macrophages.
Materials and methods
Obtain the specimen of OC
From April 2018 to July 2019, 70 OC tissue samples were collected from Shengjing Hospital of China Medical University, including 25 in the low-grade group and 45 high-grade group. There was no chemotherapy or any other treatment before operation, and the clinicopathological data was complete. This study was approved by the medical research ethics committee of the hospital.
GEO data acquisition
Ovarian cancer experimental data set GSE9891 [9], validation data sets GSE63885 [10], GSE26193 [11] and GSE30161 [12] and their clinical data were download from GEO (
Clustering dendrogram of 
The “WGCNA” package was used to establish co-expression network as the same way used in our previous article [13]. First, the Pearson correlation matrix is constructed by using the correlation coefficient of genes. Then, the pearson correlation matrix was transformed into a weighted adjacency matrix by using the formula of amn
Determination of soft threshold and inspection of scale-free network. (A) The correlation coefficients of log(K) and log(p(K)) corresponding to different soft-thresholding power (
Two methods were used to screen a hub module. First, gene significance (GS) was defined as the log10 transformation based on the
Hub gene screening and validation
In our research, hub genes were considered to be genes with high module connectivity (cor.GeneModule Membership
Real time fluorescence quantitative PCR experiment
The RNA of ovarian cancer tissues were extracted by Trizol according to the manufacturer’s instructions. Then, we used the ThermoScript reverse transcriptase (RT)-PCR System to synthesize single-stranded cDNA. The 20
Statistical analyses
SPSS 23.0 and R 3.5.3 were used for analysis. All statistical tests were bilateral, and a P value less than 0.05 was studied statistically significant. Continuous variables having to be in conformity with the customary distribution were compared by independent t test, while continuous variables with skewed distribution were compared by Mann-Whitney U test. The Kaplan-Meier curve was utilized to analyze the relationship between gene and overall survival. log-rank test is employed for evaluation.
Results
Construction of WGCNA and screening hub module
We used the “WGCNA” to assign genes with similar expression patterns into one module, and 7 gene modules (black, yellow, brown, blue, red, green and turquoise) were obtained (Fig. 3A). Then, we explored the correlation between the 7 gene modules and clinical traits, such as age, FIGO stage and grade. The result showed that the correlation between turquoise module and pathological grade of ovarian cancer was the highest (Fig. 3B,
Identification of hub module associated with the pathological grade of OC. (A) Dendrogram of all expressed genes clustered based on a dissimilarity measure (1-TOM), Dynamic Tree Cut corresponds to the original module and Merged Dynamic corresponds to the final module. Since no modules need to be merged, the results are exactly the same. (B) Heatmap of the correlation between Modular significance (MS) and clinical traits of ovarian cancer (OC), each row corresponds to a trait module and each column corresponds to a gene module, the numbers in the square brackets represent P values, the numbers in the square represent correlation coefficients, the red represents positive correlation, the blue represents negative correlation. (C) The heat map of interconnection degree after power conversion of correlation coefficient between genes in each module based on an adjacency matrix. In this heatmap progressively more saturated yellow colors indicate the high co-expression interconnectedness. Modules correspond to highly interconnected genes blocks. Genes of high intramuscular connectivity are located at the tip of the module branches because they show the highest interconnectedness with the rest of the genes in the module. (D) Heatmap plot of the adjacencies in the hub gene network includes the trait weight. Each column and row correspond to one module hub gene (labeled by color) or weight. In the heatmap, red represents high adjacency (positive correlation), while blue color represents low adjacency (negative correlation). Squares of red color along the diagonal are the meta-module.
GO and KEGG enrichment analysis was used to analyze the possible mechanism of genes in turquoise module involved in the malignant process of ovarian cancer. GO analysis revealed that the genes in turquoise module main enrichment in inflammatory response, immune response and lipoprotein metabolic process (Fig. 4A). KEGG analysis revealed that the genes in turquoise module main enrichment in cytokine-cytokine receptor interaction and chemokine signaling pathway (Fig. 4B). All of the results suggested that the genes in turquoise module may be involved in the malignant process of ovarian cancer by regulating the tumor microenvironment.
GO functional enrichment analysis and KEGG analysis. The size of the circle represents the number of genes. The y-axis shows the GO or KEGG pathway terms. The -log 10(pvalue) of each term is colored according to the legend. The redder the color the higher the value of -log 10 (pvalue) (A) GO functional enrichment analysis, the results revealed that the genes in turquoise module main enrichment in inflammatory response, immune response and lipoprotein metabolic process. (B) KEGG analysis, the results revealed that the genes in turquoise module were mainly enriched in cytokine-cytokine receptor interaction and chemokine signaling pathway. 
Hub gene screening. (A) Scatter plot for correlation between gene module membership in the turquoise module and gene significance. (B) PPI network analysis of top 25 genes in the turquoise module, the circle is the gene name, and the line represents the degree of connection between genes. The higher the connectivity, the stronger the interaction between genes. 
Hub gene validation. Differential expression of MS4A4A, CD163, CPR65, MS4A6A between Grade1/2 and Grade3/4 of OC tissues based on datasets GSE63885, GSE26193 and GSE30161. (A) Differential expression of MS4A4A in GSE63885 (
Survival analysis. Kaplan-Meier plotter database prognostic analysis of MS4A4A, CD163, CPR65, MS4A6A [progression-free survival (PFS)]. (A) High expression of MS4A4A indicated shorter PFS of OC (
In this study, the hub genes refers to the turquoise module were the genes most related to the pathological grade of ovarian cancer. WGCNA screened 4 genes (MS4A4A, CD163, CPR65, MS4A6A) as hub genes based on hub gene screening method (cor.ModuleMembership in turquoise module
The prognostic value of the 4 genes
To investigate the influence of screened hub genes on the prognosis of ovarian cancer patients, survival analysis based on the 4 genes (MS4A4A, CD163, CPR65, MS4A6A) were performed by website of Kaplan Meier-plotter. The results showed that high expression of MS4A4A, CD163, CPR65, MS4A6A indicated shorter progression-free survival (PFS) of OC (Fig. 7,
Real time fluorescence quantitative PCR verification of the expression of the 4 genes
The results of Real time fluorescence quantitative PCR revealed that MS4A4A, CD163, CPR65, MS4A6A were overexpressed in high-grade group than the low-grade group. The difference was statistically significant (Fig. 8).
Discussion
Tumor pathological grade is considered to be closely related to tumor prognosis. High-grade serous ovarian cancer (HGSOC) has a poorer prognosis than low-grade ovarian cancer because it is more likely to metastasize into the peritoneal cavity [15]. Therefore, identification of biomarkers related to the pathological grade of ovarian cancer and intervening in patients at an early stage is crucial to improve the prognosis of patients.
In this study, in order to screen the biomarkers most related to the pathological grade of OC, we used WGCNA package to establish the relationship matrix between genes and phenotypes. Finally, 4 genes (MS4A4A, CD163, CPR65, MS4A6A) most associated with pathological grade of OC in turquoise module were screened for further study. To improve the reliability of the experiment, the 4 genes were verified in three other independent datasets (GSE63885, GSE26193 and GSE30161) and tissue samples from Shengjing Hospital of China Medical University. The results further proved that all of the 4 genes were related to the pathological grade of OC. Survival analysis showed that high expression of the 4 genes predicted a shorter PFS.
MRNA levels verified by Real time fluorescence quantitative PCR experiment. (A) MS4A4A; (B) CD163; (C) CPR65; (D) MS4A6A.
MS4A4A and MS4A6A are a member of the tetracycline membrane, 4 domain families, subfamily A (MS4A), the expression of which was mainly confined to hematopoietic tissues [16]. At present, there are few related studies on MS4A4A and MS4A6A. The major research areas were limited to Alzheimer’s disease and blood diseases. Sanyal et al [17] reported that MS4A4A can be used as a surface marker for M2 macrophages, which suggest that targeting MS4A4A can inhibit the polarization of M2 macrophages. Wang et al. [18] revealed that MS4A4A correlates with stromal immune score and prognosis of gastric cancer patients. High expression of MS4A4A was associated with poor prognosis in gastric cancer, which was consistent with our findings. In addition, the role of MS4A4A in carcinogenesis and tumor progression has not been clearly found. Buddingh et al. [19] found that MS4A6A was specifically expressed on the surface of M2 macrophages in the process of screening differential genes related to osteosarcoma metastasis. Ma et al. [20] indicated that the hypermethylated and silenced of MS4A6A was associated with short-term survival in GBM patients, which was contrary to our research results. In addition, there is no more research on MS4A6A in tumor. CD163 is an important member of the scavenger receptor superfamily and mainly expressed on the surface of monocytes and macrophages. It is also served as a highly specific type marker of M2 tumor-associated macrophage [21]. CD163 not only acts as an immunomodulator against inflammation, but also plays an important role as a tumor-associated macrophage family member on tumor proliferation and metastasis. The expression of CD163 in malignant tumors has drawn more and more attention. Some studies have shown that CD163 is closely related to malignant cancers such as breast cancer [23], bladder cancer [24], lung cancer [25] and colorectal cancer [26]. A research also showed that CD163 is high-expressed in tumor-associated macrophages of OC and relevant to shorter PFS in OC, which was consistent with our findings. Besides, the research also showed that the number of CD163+ cells in OC was clearly associated with early relapse after first-line therapy [27]. However, there have not been any research associated with CPR65.
Through the review of the above literature, we found that MS4A4A, CD163 and MS4A6A are mainly related to immunity. Our functional analysis results also revealed that genes in turquoise module play a role in the malignant progression of ovarian cancer mainly through immune regulation. Tumor immune infiltration microenvironment is an important part of tumor genesis and treatment [28]. Many theoretical and experimental evidences had illustrated that tumor-associated macrophages (TAMs) as a component of immune infiltrate have a crucial effect on tumor immune-infiltrating microenvironment [29]. TAMs have two opposite phenotypes: M1 subtype macrophages have anti-tumor effect, while M2 type macrophages have tumorigenic activity. Numerous studies had shown that M2 TAMs promote tumor metastasis by communicating with tumor cells [32]. Chen et al. [33] reported that the communication between cancer cells and M2 TAMs can promote migration of EOC cell. In our research, MS4A4A, CD163, CPR65, MS4A6A were associated with the pathological grade and PFS of OC. In further research, we can extend the PFS of OC by inhibiting the 4 genes.
In conclusion, our research used a series of bioinformatic analysis methods to identify and validate the relationship between the 4 genes and ovarian cancer pathological grade. The MS4A4A, CD163, MS4A6A may serve as a surface marker for M2 macrophages. Targeting the 4 hub genes may be can extend the PFS of OC.
Footnotes
Acknowledgments
We thank the authors who provided the GEO public datasets.
Conflict of interest
The authors declare no competing interests.
