Abstract
Objective
Stroke is a severe complication of atrial fibrillation (AF). We aimed to discover key genes and microRNAs related to stroke risk in patients with AF using bioinformatics analysis.
Methods
GSE66724 microarray data, including peripheral blood samples from eight patients with AF and stroke and eight patients with AF without stroke, were downloaded from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) between AF patients with and without stroke were identified using the GEO2R online tool. Functional enrichment analysis was performed using the DAVID database. A protein–protein interaction (PPI) network was obtained using the STRING database. MicroRNAs (miRs) targeting these DEGs were obtained from the miRNet database. A miR–DEG network was constructed using Cytoscape software.
Results
We identified 165 DEGs (141 upregulated and 24 downregulated). Enrichment analysis showed enrichment of certain inflammatory processes. The miR–DEG network revealed key genes, including
Conclusion
Dysregulation of certain genes and microRNAs involved in inflammation may be associated with a higher risk of stroke in patients with AF. Evaluating these biomarkers could improve prediction, prevention, and treatment of stroke in patients with AF.
Introduction
Persistent or permanent atrial fibrillation (AF) is the most common sustained arrhythmia worldwide. AF is the second most common risk factor for ischemic stroke, the prevention of which is the main goal of treating patients with AF.1–3 In AF patients suffering stroke, a cardiac embolus originating from the left atrial appendage is a common cause. 4 , 5 Patients with AF-related stroke seem to have a worse prognosis, such as more severe disability and greater mortality, than those with stroke in the absence of AF. 6 , 7 Today, an increasing number of studies are focusing on the prevention and intervention of stroke in AF patients. One approach is to find suitable biomarkers by which to identify patients at greatest risk of stroke.
A number of traditional cardiovascular-related biomarkers have been explored to predict or clarify the risk of stroke in patients with AF. Troponin is one of these risk factors. A substudy of the RE-LY trial showed that troponin level is associated with thromboembolic events such as stroke. 8 Other research has suggested that cardiac troponin high-sensitivity (cTn-hs) can be a biomarker independently associated with risk of stroke in AF. 9 In contrast, a substudy of the ARISTOTLE trial suggested that plasma D-dimer, a marker of fibrin turnover, is a potential predictor of stroke, mortality, and major bleeding. 10 The RE-LY substudy showed that the level of N-terminal pro b-type natriuretic peptide (NT-proBNP) is related to the risk of thromboembolic events and cardiovascular mortality. 8
Recently, new biomarkers have been reported for prediction and treatment of stroke in AF. For example, phosphodiesterase 4D may be a risk factor for AF and stroke. 11 Asymmetrical dimethylarginine may be used to predict a pro-thrombotic state in AF and correlate with the risk of stroke based on the CHADS2/CHA2DS2-VASc score. 12 Variations in the gene encoding alanine-glyoxylate aminotransferase 2 may be involved in age-related thromboembolic complications in AF. 13 New therapeutic targets have also been studied. Heat-shock protein 70-kDa induction seems to delay thrombus formation with minimal bleeding risk, which is promising for reducing the risk of stroke in AF patients. 14
In this study, we downloaded microarray data from the Gene Expression Omnibus (GEO) database and screened differentially expressed genes (DEGs) between AF patients with stroke and those without stroke. MicroRNAs, which are small noncoding RNA molecules (containing about 22 nucleotides), targeting these DEGs were also included in our analysis. We aimed to explore new mechanisms, molecular biomarkers, and potential therapeutic targets for stroke in AF patients.
Methods
Microarray data
All of the data we used were obtained from open access databases on the Internet and did not require ethical permission or patient consent to use.
We downloaded the microarray data of GSE66724 from the GEO database (http://www.ncbi.nlm.nih.gov/geo/) with its microarray platform as GPL570 ([HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array). The gene expression files were deposited by Allende et al. 14 Peripheral blood was used in the microarray analysis. According to the original study, 14 16 nonvalvular AF patients were recruited, 8 of whom had suffered a cardioembolic stroke. Stroke and non-stroke patients were matched by CHAD index and sex. All patients were on anticoagulant treatment with acenocoumarol, with an international normalized ratio (INR) between 2 and 3 at the time of blood withdrawal. AF was diagnosed by electrocardiography and lasted more than 3 months. Cardioembolic stroke was diagnosed clinically by imaging techniques (magnetic resonance imaging or X-ray computed tomography). Patients were excluded from the study if they met any of the following criteria: carotid artery lesion occluding more than 50% of the lumen vessel diameter in the side of the infarction, cancer in progress, leukocytosis (>7,000 cells/mL), leukopenia (<3,500 cells/mL), history of venous thromboembolism in the last 3 months, acute coronary syndrome, infection, autoimmune disease, or surgery. Renal failure (creatinine value more than double the normal value), oral contraceptive use, hormonal therapy, and corticoid consumption were also exclusion criteria.
Screening DEGs
DEGs were screened using the online tool GEO2R in the GEO database. GEO2R is an R-programming-based language for analysis of gene expression datasets by
Functional enrichment analysis
We uploaded the selected DEGs to the Database for Annotation, Visualization, and Integrated Discovery (DAVID) v6.8 Beta (https://david-d.ncifcrf.gov/) for further analysis. The DAVID database provides useful functional annotation tools to help researchers understand the biological meaning of genes.
16
,
17
In our study, DAVID was used to investigate Gene Ontology (GO) annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways related to the selected DEGs. These biological processes and pathways might be associated with stroke in AF patients.
Protein–protein interaction analysis
Protein–protein interaction (PPI) pairs of DEGs were obtained from Search Tool for the Retrieval of Interacting Genes (STRING, http://string-db.org/). 18 The STRING database is an online database resource containing a collection of comprehensive information of predicted and experimental interactions of proteins. In our study, a PPI confidence score >0.6 was considered the threshold of significance to construct the PPI network. The generated list of PPI pairs was downloaded for further analysis.
MicroRNAs targeting the DEGs
The DEGs were uploaded to the database miRNet (http://www.mirnet.ca/faces/home.xhtml) 19 to obtain microRNAs (miRs) targeting these DEGs. miRNet is an online tool with comprehensive support for statistical analysis and functional interpretation of data generated in microRNA studies. It allows researchers to build miR–target interaction networks. The generated list of miR–DEG pairs was downloaded for further analysis.
Construction of miR–DEG network
A miR–DEG network composed of both miR–DEG pairs and PPI pairs was constructed and visualized using Cytoscape software 3.4.0. 20 To determine the key functional modules, module clustering analysis for the network was then performed by the Molecular Complex Detection (MCODE) 21 plugin. We set the degree cutoff value to 2 and the node score cutoff to 0.2 for the MCODE process.
Results
DEG screening
Data normalization and cross-comparability were assessed (Figure 1), and then DEGs were analyzed. A total of 165 DEGs were selected using the cutoff point of

Cross-comparability assessment of microarray data. Dataset GSE66724 included eight peripheral blood samples from AF patients with stroke (Cases) and eight from AF patients without stroke (Controls). The box plot shows the distribution of value data for the samples selected. The box extends from the first to the third quartile; the line inside the box displays the median; and the whiskers extend from the ends of the box to the smallest and largest data values. AF, atrial fibrillation.
Functional enrichment analysis
Twenty-one GO terms (
Construction of miR–DEG network
Taking the selected 165 DEGs into account, we identified 38 PPI pairs by STRING, and 4,739 miR–DEG pairs by miRNet. These two pairing pictures were merged and a complex miR–DEG network was generated in Cytoscape. To assess the key functional modules of this network, module clustering was then performed using the MCODE plugin of Cytoscape. Three modules were identified (Figure 2). Two DEGs,

The three functional modules. Module A involves
Discussion
One of the main goals when treating AF is to prevent stroke. The CHA2DS2-VASc score is a commonly used clinical prediction rule for estimating the risk of stroke in patients with non-rheumatic AF. Risk factors in CHA2DS2-VASc score include congestive heart failure, hypertension, age ≥75 years, diabetes mellitus, prior stroke, transient ischemic attack or thromboembolism, vascular disease, age 65 to 74 years, and sex. The major treatment based on the CHA2DS2-VASc score is the use of anticoagulants to decrease the risk of stroke. 22 , 23 However, despite careful management, some patients still suffer a stroke. In addition, anticoagulants may increase the risk of bleeding in some contraindicated patients. Therefore, new risk factors and targets of therapeutic intervention need to be identified.
Some previous studies proposed that inflammation is an independent risk factor for and contributes to the occurrence of thromboembolism and stroke in AF patients.24–27 Associations between miRNAs and certain cardiovascular diseases have also been uncovered in recent years. Zhang et al. 28 suggested a role for miRNAs in the mechanism of inflammation in AF. McManus and Freedman suggested that platelet-derived miRNAs have important roles as biomarkers of susceptibility, prognosis, or treatment of certain cardiovascular diseases, including stroke and AF. 29 Llombart et al. suggested that circulating miRNAs may be used in the diagnosis of cardioembolic stroke. 30
In the present study, based on the GSE66724 dataset from the GEO database, 165 DEGs were found between AF patients with stroke and those without stroke. Key biological processes were enriched based on these DEGs, many of which were associated with immune or inflammatory process. We constructed three miR–DEG modules, which contained DEGs and miRNAs that were also associated with inflammation or cardiovascular diseases. This implied a mechanism of inflammation precipitating stroke in patients with AF.
Myocyte enhancer factor 2A (MEF2A) is a transcription factor in the Mef2 family. We found that MEF2A was upregulated (
Cullin associated and neddylation dissociated 1 (CAND1) was another upregulated DEG (
The miR–DEG module A that we constructed suggested that miR-1, miR-1-3p, miR-21, miR-21-5p, miR-192, and miR-192-5p could be associated with stroke in AF patients via MEF2A or CAND1 pathways. The miR-1 family has important roles in heart diseases such as hypertrophy, myocardial infarction, and arrhythmias.38–40 Calcium signaling is a central regulator of cardiomyocyte growth and function. Ikeda et al. 41 suggested that miR-1 regulates cardiomyocyte growth responses by negatively regulating the calcium signaling components calmodulin, MEF2A, and GATA4. miR-21 has been shown to play important roles in the development of heart diseases. Its level was found to increase in fibroblasts of the failing heart, and in vivo silencing of miR-21 was shown to inhibit interstitial fibrosis and improve cardiac function in a pressure-overload cardiac disease mouse model. 42 In addition, atrial fibrosis is important for the pathogenesis of atrial fibrillation. miR-21 was found to be overexpressed in atrial tissue from patients with AF and was proposed to play a role in the pathogenesis of atrial fibrosis. 43 miR-192 is also upregulated in certain cardiac diseases, including heart failure after acute myocardial infarction and hypertrophic cardiomyopathy. 44 , 45
Pellino E3 ubiquitin protein ligase 1 (PELI1;
The miR–DEG module C that we constructed showed that PELI1 and PDCD4 were co-regulated by miR-155 and miR-155-5p. Previous studies have indicated an intimate relationship between inflammation, innate immunity, and miR-155 expression.53–58 miR-155 is essential for the generation and function of T follicular helper (Tfh) cells through a miR-155-PELI1-c-Rel pathway. 59 miR-155 is associated with cancer metastasis through pathways including PDCD4.60–62
As with AF, miR-155 was found to be upregulated in left atrial appendage from nonvalvular AF patients, which indicates a role of miR-155 in electric remodeling of AF. 63 In contrast, miR-155 polymorphism was found to be associated with the risk of ischemic stroke. 64 miR-155 mediates inflammatory responses in ischemic cerebral tissue and other CNS neuroinflammatory disorders. 65 More importantly, inhibition of miR-155 exerts an anti-inflammatory action and promotes recovery after stroke, 66 , 67 indicating that miR-155 may be a potential therapeutic target.
In summary, by bioinformatic approaches, we constructed a stroke-related miR–DEG network in patients with AF. Dysregulation of certain genes and miRNAs in the network involved in inflammation may be associated with a higher risk of stroke in AF patients, which suggests that inflammation might be a potential risk factor of stroke that should be considered when assessing and treating AF patients. Certain genes, including
Footnotes
Declaration of conflicting interest
The authors declare that there is no conflict of interest.
Funding
This work was funded by grants from the National Natural Science Foundation of China (Grant no. 81601711) and Science and Technology Project of Guangzhou (Grant no. 201904010418) to Zhongxing Wang. The funding sources had no involvement in study design; collection, analysis, and interpretation of data; writing the report; or the decision to submit the article for publication.
