Abstract
Introduction
It is unclear whether patients diagnosed according to International Classification of Headache Disorders criteria for migraine with aura (MA) and migraine without aura (MO) experience distinct disorders or whether their migraine subtypes are genetically related.
Aim
Using a novel gene-based (statistical) approach, we aimed to identify individual genes and pathways associated both with MA and MO.
Methods
Gene-based tests were performed using genome-wide association summary statistic results from the most recent International Headache Genetics Consortium study comparing 4505 MA cases with 34,813 controls and 4038 MO cases with 40,294 controls. After accounting for non-independence of gene-based test results, we examined the significance of the proportion of shared genes associated with MA and MO.
Results
We found a significant overlap in genes associated with MA and MO. Of the total 1514 genes with a nominally significant gene-based p value (pgene-based ≤ 0.05) in the MA subgroup, 107 also produced pgene-based ≤ 0.05 in the MO subgroup. The proportion of overlapping genes is almost double the empirically derived null expectation, producing significant evidence of gene-based overlap (pleiotropy) (pbinomial-test = 1.5 × 10–4). Combining results across MA and MO, six genes produced genome-wide significant gene-based p values. Four of these genes (TRPM8, UFL1, FHL5 and LRP1) were located in close proximity to previously reported genome-wide significant SNPs for migraine, while two genes, TARBP2 and NPFF separated by just 259 bp on chromosome 12q13.13, represent a novel risk locus. The genes overlapping in both migraine types were enriched for functions related to inflammation, the cardiovascular system and connective tissue.
Conclusions
Our results provide novel insight into the likely genes and biological mechanisms that underlie both MA and MO, and when combined with previous data, highlight the neuropeptide FF-amide peptide encoding gene (NPFF) as a novel candidate risk gene for both types of migraine.
Introduction
Two common forms of migraine exist that are clinically distinguished by the presence of aura symptoms prior to the headache phase and hence are called migraine with aura (MA) and migraine without aura (MO) (1). The International Classification of Headache Disorders (ICHD) specifies MO patients have only migraine attacks without aura, whereas MA patients are characterized by their migraine attacks with aura, but can also have attacks without aura (1). The migraine aura is likely caused by cortical spreading depression (CSD), a slowly self-propagating wave of neuronal and glial depolarization in the cortex (2). The headache is caused by activation of the trigeminovascular system, which leads to abnormal processing of pain signals in the brain stem and subsequent activation of higher-order brain centers giving the sensation of pain (3). CSD has been shown to activate this pain pathway in experimental animal studies, although such proof is essentially lacking in patients (4). Whereas the biological mechanisms in the early phase of an attack seem to differ between MA and MO, they may be similar in the headache phase.
Supportive evidence for converging headache-generating mechanisms in both migraine types comes from the fact that some drugs have equal efficacy in treating both types of migraines (5). Moreover, several studies have reported that MA and MO frequently coexist within the same family (6,7). A large population-based study has reported high co-occurrence of MA and MO, and found that 13% of patients with active migraine (at least one migraine attack in the past year) have both MA and MO, which is higher than expected from their individual prevalence (8). Thus, the frequent co-occurrence of the two disorders within families and individuals suggest MA and MO share—at least to some extent—the same biological mechanisms. In support of this idea, latent class and genetic analysis of migraine symptom data in large Australian (9) and Dutch (10) twin samples indicated the existence of a continuum of severity, with MA being more severe but not etiologically distinct from MO.
Genetics studies are starting to discover genes and loci associated with migraine. A recent genome-wide association (GWA) meta-analysis of 23,285 migraine cases and 95,425 controls of European ancestry that was conducted by the International Headache Genetics Consortium (IHGC) identified 12 independent single nucleotide polymorphism (SNP) loci (rs2651899, rs10915437, rs12134493, rs2274316, rs7577262, rs6790925, rs9349379, rs13208321, rs4379368, rs10504861, rs6478241, rs11172113) significantly associated with migraine (p values < 5 × 10–8) (11). Many genes in or near these loci are involved in neuronal function, thus supporting a role for neuronal signaling in migraine etiology as had previously been shown for monogenic familial hemiplegic migraine, a subtype of MA (12). In addition, SNPs at 134 independent loci showed suggestive association with migraine (p values < 1 × 10–5) (11). When considering only the genome-wide significant hits (11,13–15), GWA studies have been more successful using case samples satisfying criteria for only MO, with six loci (rs2274316, rs7577262, rs6790925, rs9349379, rs6478241, rs11172113) identified (13) compared to only one locus (rs1835740) using case samples satisfying criteria for only MA (15). However, in the GWA meta-analysis of 23,285 migraine cases and 95,425 controls (11), nine of the 12 genome-wide significant loci—which comprise five of the six loci previously identified using only MO cases (13)—produced the strongest evidence for association using case samples satisfying MA and/or MO; the remaining three loci (rs9349379, rs13208321, rs10504861) were most strongly associated in the only MO subgroup (16). A lack of detailed clinical data precluded a more in-depth analysis of migraine subtypes.
We previously compared the effects (odds ratios, OR) of the top SNPs from the 12 genome-wide significant loci in the MA and only MO subgroups, and showed that four SNPs (rs10915437, rs9349379, rs13208321, rs10504861) had a significant heterogeneous effect size (p value = 4.4 × 10–3, 3.2 × 10–4, 4.9 × 10–2, 4.5 × 10–3, respectively). However, for all 12 SNPs, the allele associated with increasing risk was the same across these subgroups and an analysis of ∼23,000 independent SNPs found the majority of genome-wide SNP effects to be in the same direction across the MA and MO subgroups (16).
To more thoroughly assess the genetic overlap between MA and MO (i.e. beyond inspection of the genome-wide significant loci and individual SNP effects), we performed gene-based tests of association using GWA data for MA and MO from the IHGC meta-analysis. We evaluated the gene-level genetic relationship between MA and MO patients by testing whether the number of overlapping (pleiotropic) genes—i.e. genes with nominally significant gene-based association p values for both MA and MO—was more than expected by chance. The overlapping genes were also used in pathway and network analyses, to identify potential canonical pathways, biological functions, and molecular networks underlying both MA and MO.
Methods
Ethics statement
For all study cohorts, participation was based on informed consent. Each study was approved by local research ethics boards in the country where the study cohort was collected. See the original publication of the IHGC GWA meta-analysis for full details of ethics and consent procedures in each study cohort (11).
Cohorts and sample collection
The IHGC GWA meta-analysis used SNP marker data from 23,285 cases with migraine and 95,425 controls of European ancestry from 18 GWA studies, comprising five clinic-based studies, mainly compared to population-matched control samples with unknown migraine status, as well as 13 population-based cohorts (11). For a subset of the 18 total GWA studies comprising the original GWA meta-analysis, sufficiently detailed phenotype information was available to allow sub-classification into either of the two migraine subtypes, MA (seven cohorts: total 4505 MA cases versus 34,813 controls) or MO (four cohorts: total 4038 MO cases versus 40,294 controls) (Table S1). For more detailed descriptions of the cohorts, please see the original report (11).
GWA data
GWA summary statistic results for MA and MO were obtained from seven studies with case samples satisfying criteria for MA, and four studies with case samples satisfying criteria for only MO from the 2013 IHGC GWA meta-analysis (11). The selected MA and MO studies do not contain overlapping individuals and are of similar total size.
Genome-wide SNP genotyping was performed independently in each cohort with the use of various standard genotyping technologies, and imputed for each study with reference to HapMap release 21 or 22 CEU-phased genotypes. Each study contributed summary statistic data from an association analysis performed using a frequentist additive model based on estimated SNP allelic dosages, adjusting for gender. SNPs were filtered on a per-study level based on inclusion criteria of minor allele frequency (MAF) > 0.1% and imputation quality measures of IA > 0.6 (IMPUTE 2) or r2 > 0.3 (MaCH). The 1,680,313 “consensus” SNPs analyzed in the original IHGC GWA meta-analysis (11) were used in the present study. GWA summary statistic results for the seven MA studies were meta-analyzed in a fixed-effect model using GWAMA (17). GWA summary statistic results for the 4 MO studies were analogously meta-analyzed using GWAMA (17). The resulting MA and MO GWA meta-analysis results were subsequently used in gene-based tests for association.
Gene-based association test
We obtained RefSeq gene information (hg19) from the UCSC genome browser (downloaded 20 March 2014). Overlapping isoforms of the same gene were combined to form a single full-length version of the gene. Isoforms that did not overlap were left as duplicates of that gene. For the duplicated genes, the gene with the lowest downstream gene-based p value was retained. This led to 24,383 unique genes. The IHGC SNP positions were converted from hg18 to hg19 using the liftOver utility (https://genome.ucsc.edu/cgi-bin/hgLiftOver), and were assigned to genes if they mapped to between 15 kb 5’ of the transcription start site (TSS) and 15 kb 3’ of the transcription end site (TES). The 15-kb gene boundary extension was chosen based on the observation that 90% of SNPs affecting expression quantitative trait loci (eQTLs) were within this proximity (18). Gene-based association tests were performed using the GATES test (19) implemented in the Fast ASsociation Tests (FAST) package (20). GATES performs the gene-based test by adjusting the observed p value of the most significant SNP located in a gene, by the effective number of independent SNPs tested across the gene.
Given the possibility that gene-based results of neighboring genes may in fact be correlated because the most significant SNPs are the same or in strong linkage disequilibrium (LD), we estimated the effective number of independent genes (gene-based tests) by examining the LD between the most significant SNPs within each gene. This calculation was performed using the Genetic type I Error Calculator (GEC) (21). GEC is a method addressing the multiple-testing issue with dependent SNPs. In short, GEC first divides the input SNPs into LD blocks, and assumes LD blocks as independent (r2 < 0.1), then by examining the eigenvalues obtained from spectral decomposition of the LD correlation matrix of SNPs, GEC estimates the effective number of independent SNPs in the blocks.
Overlapping genes and statistic tests
After the gene-based test, we generated gene sets with gene-based association p values less than three nominally significant thresholds (p values<0.01, < 0.05, or < 0.1) in the MA and MO datasets. For each gene set, the effective number of independent genes was calculated. Subsequently, we regarded the MA dataset as the “discovery” set and the MO dataset as the “target” set to test whether the proportion of overlapping genes between MA and MO for each of the thresholds is more than expected by chance. Here, the observed number of overlapping genes is defined as the effective number of genes with p values less than the threshold in both the discovery (MA) and target (MO) sets. The observed proportion of overlapping genes is the observed effective number of overlapping genes divided by the effective number of genes with a p value less than the threshold in the discovery set. The expected proportion of overlapping genes is the effective number of genes with a p value less than the threshold in the target set divided by the total effective number of genes in the target set. The statistical significance of whether the number of overlapping genes was more than expected by chance was calculated using exact binomial statistical tests.
Pathway analysis
The pathway analysis was designed to discover biological mechanisms that are shared between MA and MO. Here, we used Ingenuity Pathway Analysis (IPA) software, which is widely used to identify pathways enriched in a gene-set based on biological interactions and functional annotations (22,23). The significance of the identified pathways was evaluated by Fisher’s exact statistical test. The pathway data sources include “Canonical pathways,” “Biological Functions” and “Networks.” “Canonical pathways” contain well-characterized metabolic and cell signaling pathways from specific journal articles, review articles, text books, and KEGG LIGAND. “Biological functions” are used to explore the enriched functions and annotated diseases of input genes. “Networks” represent diagrams of known protein-protein interactions, and are generated based on the input data.
We performed the IPA analysis using genes with a p value < 0.1 in both the MA and MO GWA datasets. The results can potentially be biased owing to non-independent (neighboring) genes. For example, one pathway may include multiple genes tagged with the same top SNP or SNPs that are in strong LD. We, therefore, checked the IPA results to ensure such bias did not contribute to the identified associated pathways.
Given that significant canonical pathways or biological functions may include redundant genes and gene-gene interactions, we explored their relationships by constructing customized networks. First, the genes overlapping between MA and MO were input into IPA. Subsequently, the focus genes (i.e. the subset of overlapping genes) involved in the significant canonical pathways/biological functions were again input into IPA but now to build functional networks for the significant canonical pathways/biological functions. As per IPA recommendations, biological functions were defined as significant if their p values were < 0.01, and canonical pathways were considered significant if their p values were < 0.05.
Results
Effective number of genes for MA and MO
Effective number of genes in MA and MO.
Migraine with aura. bMigraine without aura. cEffective number of genes. dProportion of total effective number of genes.
Gene-level genetic relationships of MA and MO
Number of overlapping genes and binomial test results under three p value thresholds of gene-based association.
Migraine with aura. bMigraine without aura.
Combined gene-based association across MA and MO
Genes with Fisher’s combined p value lower than the threshold.
Genes close to single nucleotide polymorphisms (SNPs) significantly associated with migraine without aura (MO) in a genome-wide association (GWA) meta-analysis (11).
Genes not close to SNPs significantly associated with MO or total migraine in a GWA meta-analysis (11).
Gene-based p value for migraine with aura.
Gene-based p value for migraine without aura.
Association results for the top SNPs in the six genome-wide significant genes.
Effect allele. bNon-effect allele. cOdds ratio for effect allele calculated in the GWA data. dp value of SNP. eMigraine with aura. fMigraine without aura.
SNPs: single nucleotide polymorphisms; GWA: genome-wide association.
Pathway analysis on overlapping genes
The significant biological functions, canonical pathways, and networks found in the overlap between MA and MO are shown in Tables S2–S4. The most significant biological function is “chronic inflammatory disorder” (Table S2). Notably, two genes linked to the function “chronic inflammatory disorder”: TRPM8 (2q37.1) and UFL1 (6q16.1) have Fisher’s combined gene-based p values surpassing the genome-wide threshold (3.45×10-6). The most significant canonical pathway is “Notch Signaling” (Table S3), which plays important roles in neuronal function and development (24–27). The most significant network is “Cardiovascular Disease, Organismal Injury and Abnormalities, Cardiac Stenosis” (Table S4). Other biological functions and networks that were significantly enriched in the overlap between MA and MO were mostly involved in cardiovascular function, inflammation, development and connective tissue-related functions (Tables S3 and S4).
As shown in Tables S2 and S3, several significant biological functions and canonical pathways share the same candidate genes, which indicate that these functions are related. In order to find the relationship between them, we constructed networks by IPA (described in Methods). The significant canonical pathways and biological functions were combined into five networks without overlapping genes (Table S5).
Discussion
This study identified a significant gene-level genetic relationship between migraine cases satisfying ICHD criteria for MA and migraine cases satisfying criteria for only MO by integrating gene-based tests and estimating the effective number of genes using the GWA summary statistic results for both disorders from the recent meta-analysis conducted by the IHGC (11).
Our approach is different from single-SNP-based approaches, e.g. polygenic prediction (28) and
From the combined analysis of MA and MO gene-based p values, six genes surpass our genome-wide significance threshold: TRPM8, UFL1, FHL5, TARBP2, NPFF and LRP1 on chromosome 2q37.1, 6q16.1, 6q16.1, 12q13.13, 12q13.13 and 12q13.3, respectively. These results support the utility of our gene-based approach, especially given they are based on 14,742 fewer migraine cases and 20,318 fewer controls (∼41% smaller effective sample size) compared to the original IHGC GWA study of 23,285 migraine cases and 95,425 controls.
Four of these genes (TRPM8, UFL1, FHL5 and LRP1) were located at or in close proximity to previously reported genome-wide significant SNPs. Transient receptor potential cation channel, subfamily M, member 8 (TRPM8) is a calcium channel expressed in primary sensory neurons essential for cold sensation. It can also mediate cold-mediated analgesics in chronic pain models (30), suggesting a modulatory role for TRPM8 in pain sensation. TRPM8 was also identified in the vasculature where it plays a role in setting the vascular tone (31), another function through which the channel could play a role in migraine. UFM1-specific ligase 1 (UFL1) is a protein located at the ER membrane. GO molecular functions assigned to UFL1 are ligase activity, protein binding and UFM1 transferase activity. The biological roles of UFL1 are largely unknown, but the protein has recently been associated with female hormone signaling pathways (32), providing a potential link with female preponderance in migraine. Four and a half LIM domains 5 (FHL5) is a transcriptional activator of cAMP-responsive element modulator that activates transcriptional programs during spermatogenesis; however, a brain-related role is currently not known. Low-density lipoprotein receptor-related protein 1 (LRP1) is an endocytic receptor with many ligands. Many biological functions have been described for LRP1. In the brain, LRP1 associates with and regulates the expression of NMDAR1 and GluR1 at postsynaptic neurons and can thereby regulate neurotransmission (33), providing an apparent link with migraine pathophysiology.
Two genes, TARBP2 and NPFF, represent novel candidate risk genes for migraine. The known molecular functions of TARBP2, TAR (HIV-1) RNA binding protein 2, that is encoded by the TARBP2 gene, centers on its requirement for formation of the RNA-induced silencing complex. TARBP2 binds between the bulge and the loop of the HIV-1 TAR RNA regulatory element and activates HIV-1 gene expression in synergy with the viral Tat protein. TARBP2 is also an integral component of the DICER1-containing complex and involved in miRNA processing (34). Together with the gene ontology (GO) molecular function terms assigned to TARBP2 (double-stranded RNA binding, protein binding, siRNA binding, miRNA binding, and protein homodimerization activity), there is no obvious functional link between TARBP2 and migraine risk.
The neuropeptide FF-amide peptide precursor (NPFF), encoded by the NPFF gene, has wide-ranging physiologic effects, including the modulation of morphine-induced analgesia, elevation of arterial blood pressure, and increased somatostatin secretion from the pancreas. The GO molecular functions assigned to NPFF are G-protein coupled receptor binding, receptor binding, and neuropeptide hormone activity. Notably, neuropeptide FF potentiates and sensitizes acid-sensing ion channels ASIC1 and ASIC3. The ASICs represent proton-gated channels that are able to transport Na+ and Ca2+. A recent study using whole-cell patch-clamp electrophysiology showed that a decrease in extracellular pH can directly excite primary dural-afferent neurons via the opening of ASICs and produce migraine-related pain behavior, suggesting ASIC inhibitors may represent novel candidates for migraine therapy (35). Indeed, the ASIC inhibitor amiloride was recently shown to block CSD—the neurophysiological correlate of migraine aura—and inhibited trigeminal activation in in vivo in animals, via an ASIC1 mechanism (36). These previous findings suggest that NPFF is the more probable migraine risk gene on 12q13.13.
To gain better understanding of the biological mechanisms that are involved in the two migraine subtypes, we also performed pathway analyses on the genes that showed nominal association with both MA and MO. Pathway analysis based on biological functions provided 22 significant diseases/functions (Table S2). Among them, many were related to inflammatory disorders. A causal role for inflammation in migraine pathophysiology has been described previously, as CSD can activate an inflammatory cascade that can reach the meninges (37) where it may cause neurogenic inflammation and subsequent activation of trigeminal neurons and thereby activation of the migraine headache (37,38).
Gene functions and networks related to cardiovascular and connective tissue disorders showed enrichment in the overlap between MA and MO as well (Tables S2, S4 and S5). A possible role for the cardiovascular system in migraine has been suggested by the high coincidence of MA with stroke, several cardiac disorders, and cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL) (39–41). Very few brain-related functions were identified in the genes nominally associated with both migraine subtypes.
Although approximately three-quarters of the MA patients studied may also suffer migraine attacks without aura (15), our analyses are valid with respect to testing the hypothesis of a shared genetic susceptibility between patients experiencing migraine attacks with aura and patients experiencing only migraine attacks without aura. It is unclear what the relative proportion of the observed genetic overlap is due to patients suffering only migraine attacks with aura, and patients suffering migraine attacks both with and without aura. To specifically assess the genetic overlap between patients suffering only migraine attacks with aura and patients suffering only migraine attacks without aura, additional studies comparing results from GWA analysis of MA-only patients, to results from GWA analysis of MO-only patients are required. However, the MA-only GWA samples currently available are too small to deliver sufficient power to provide a valid conclusion regarding the genetic overlap between MA-only patients and the more common MA + MO and MO-only patients groups.
In summary, these results show a significant gene-based overlap (pleiotropy) between migraine cases satisfying criteria for MA and migraine cases satisfying criteria for only MO, further explaining—at least partly—their co-occurrence within the same patient or family. Our results also highlight four genes (TRPM8, UFL1, FHL5 and LRP1) that had earlier surfaced in GWA studies and two novel genes (TARBP2 and NPFF) as candidate risk genes for migraine, and indicate that inflammatory and cardiovascular processes may be involved in the etiology of both MA and MO.
Article highlights
A significant overlap in associated genes exists between the primary subtypes migraine with aura (MA) and migraine without aura (MO), further explaining—at least partly—their co-occurrence within the same patient or family. Combining gene-based association results across MA and MO confirmed association of four genes at three previously implicated loci, and when combined with previous data, highlight NPFF as a novel candidate risk gene for both types of migraine. Results from pathway analyses indicate that inflammatory and cardiovascular processes may be involved in the etiology of both MA and MO.
Footnotes
Funding
This work was supported by a grant from the National Health and Medical Research Council (NHMRC) (APP1075175), and by the European Union’s Seventh Framework program (2007–2013) under grant agreement no. 602633 (EUROHEADPAIN).
Conflict of interest
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Acknowledgments
Huiying Zhao was supported by a National Health and Medical Research Council (NHMRC) Early Career Fellowship (APP1091816). Dale R. Nyholt was supported by the Australian Research Council (ARC) Future Fellowship (FT0991022) and NHMRC Research Fellowship (APP0613674) Schemes. Grant W. Montgomery was supported by the NHMRC Fellowships Scheme (339446, 619667). See the original publication of the 2013 IHGC migraine GWA meta-analysis (
) for detailed cohort-specific acknowledgments.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
