Abstract
Background
Before the genome-wide association (GWA) era, many hypothesis-driven candidate gene association studies were performed that tested whether DNA variants in genes that had been selected based on prior knowledge about migraine pathophysiology were associated with migraine. Most studies involved small sample sets without robust replication, thereby making the risk of false-positive findings high. Genome-wide marker data of thousands of migraine patients and controls from the International Headache Genetics Consortium provide a unique opportunity to re-evaluate key findings from candidate gene association studies (and other non-GWA genetic studies) in a much larger data set.
Methods
We selected 21 genes from published candidate gene association studies and six additional genes from other non-GWA genetic studies in migraine. Single nucleotide polymorphisms (SNPs) in these genes, as well as in the regions 500 kb up- and downstream, were inspected in IHGC GWAS data from 5175 clinic-based migraine patients with and without aura and 13,972 controls.
Results
None of the SNPs in or near the 27 genes, including the SNPs that were previously found to be associated with migraine, reached the Bonferroni-corrected significance threshold; neither when analyzing all migraine patients together, nor when analyzing the migraine with and without aura patients or males and females separately.
Conclusion
The available migraine GWAS data provide no clear evidence for involvement of the previously reported most promising candidate genes in migraine.
Introduction
Disease susceptibility for common disorders, including migraine, is thought to be conferred by a combination of environmental factors and genetic factors that are either common (i.e. with a minor allele frequency (MAF) larger than 5% in the population) or rare. In the past decades, many genetic association studies have been performed by testing DNA variants in dozens of candidate genes in order to identify genetic factors for migraine (1,2). Genes were selected based on the hypothesis that the respective pathway was implicated in migraine pathophysiology; e.g. genes that play a role in serotonin and dopamine pathways (3). The majority of the studies investigated only a single or a limited number of DNA variants per gene and therefore had a low a priori likelihood of targeting the correct variant that confers disease susceptibility. Moreover, rather low numbers of cases and controls (rarely more than 300 per group) were studied, resulting in limited statistical power to evaluate their association. For the majority of the associations no replication of the findings in independent cohorts was provided (for review, see de Vries et al. (1)). Consequently, many of the associations may in fact represent false-positive findings. Similar experiences have been observed in other common diseases (4–6).
Over the last few years, genome-wide association studies (GWAS) have become the state-of-the-art approach to identify genetic factors involved in common disorders. Unlike candidate gene association studies that are hypothesis driven, GWAS are hypothesis free and hypothesis generating in nature. Typically they involve large cohorts of at least several thousand patients and controls and test the association with disease of hundreds of thousands of single nucleotide polymorphisms (SNPs) distributed over the genome (7). Importantly, initial association findings are always scrutinized by follow-up testing in multiple independent replication cohorts. Therefore, the GWAS approach is less susceptible to false-positive results and more powerful than candidate gene association studies. Two GWA studies that investigated large numbers of migraine cases from clinic-based cohorts and controls have been published (8,9). One study investigated migraine with aura (MA) (with 2731 cases and 10,747 controls) and revealed a single genome-wide significant migraine susceptibility locus on chromosome 8q22.1 that pinpointed the MTDH gene as the possible disease-causing gene in this region (8). The other study investigated migraine without aura (MO) (with 2326 cases and 4580 controls) and yielded four additional migraine susceptibility loci on 1q22, 3p24, 6p24 and 9q33 presenting evidence for involvement of the MEF2D, TGFBR2, PHACTR1 and ASTN2 genes, respectively (9). The latter study also confirmed genetic associations of SNPs in the TRPM8 and LRP1 genes (2q37 and 12q13, respectively) that had previously been identified as migraine susceptibility loci in a population-based GWA study (with 5122 cases and 18,108 controls) (10). A recent large meta-analysis (with 23,285 cases and 95,425 controls) that studied patients from clinic-based as well as population-based cohorts confirmed these loci and provided evidence for five additional migraine susceptibility loci (11). Notably, none of these genome-wide significant gene loci overlapped with genes that had been selected for candidate gene association studies in migraine.
The availability of GWAS data provides a unique opportunity to re-evaluate key findings from previous genetic studies in a much larger data set. We investigated 27 genes. Twenty-one genes were previously reported to be associated with migraine in candidate genes-based association studies. Three genes had been identified by positional cloning studies in families with familial hemiplegic migraine (FHM), a monogenic subtype of MA (12–14). Three genes came from direct sequencing of candidate genes in families and patients with monogenic migraine or common migraine (15–17). As the majority of the original studies investigated migraine patients who had been collected via specialized headache centers (i.e. patients who are clinic-based), we restricted our investigations to GWAS data of clinic-based migraine patients only (8,9,11).
Materials and methods
Selection of candidate genes for re-evaluation in the International Headache Genetics Consortium (IHGC) GWA data set
Summary of candidate gene association studies performed for migraine that reported at least nominal evidence for association (p < 0.05 for a single SNP) and that contained at least 300 cases and controls.
MA: migraine with aura; MO: migraine without migraine; NS: not significant; −: not tested/not available; SNP: single nucleotide polymorphism; Ins: insertion; Del: deletion; VNTR: variable number of tandem repeats. aNumber of cases and bp values are given for all migraine cases combined or, when specified between brackets, for migraine with aura cases only and/or migraine without aura cases only.
Migraine candidate genes from family studies.
FHM: familial hemiplegic migraine; SHM: sporadic hemiplegic migraine; MA: migraine with aura.
GWAS data sets
Description of the cohorts included in the previously published GWA studies for clinic-based migraine.
MA: migraine with aura; MO: migraine without aura.
Genome-wide marker data from 13,972 individuals from several pre-existing non-overlapping control cohorts that were population-matched to the cases were used as controls. The majority of the control cohorts were unselected for migraine status, implicating that they are expected to contain cases at the same frequency as the general population (Table 3). In the meta-analysis, SNPs missing from one of the studies, those with a MAF < 1%, and/or those showing excess heterogeneity (I2 > 0.75) were excluded.
Power calculation and significance threshold
Data for the selected genes were extracted from the existing GWAS data from an interval containing the candidate gene and the flanking region 500 kb in each direction, to have a reasonable chance of covering possible regulatory effects for the targeted genes. The threshold for evaluating the significance of SNPs located in the tested gene regions was 2.19 × 10−6, based on a Bonferroni correction for the number of unique SNPs that were tested (0.05/22,774). Our GWAS sample (5175 cases and 13,972 controls) has 99% power to detect association with an SNP under the assumption of an allele frequency (AF) of at least 0.05 and a relative risk of 1.5 or higher (allelic test, Genetic Power Calculator (http://pngu.mgh.harvard.edu/∼purcell/gpc) (19)). These thresholds are in line with published candidate gene association studies. On a more stringent level, we have 84% power to detect a variant with a relative risk of 1.4. See Supplemental Table 1 for power calculations at a range of different allele frequencies (0.05–0.4) and relative risks (1.15–1.5).
Effect size estimation
We used the Genetic Power Calculator to estimate the genotype frequencies for a marker with similar MAF and odds ratio (OR) as the MTHFR C677T risk allele, while assuming a disease prevalence of 12%, and using the sample size of the current study (5175 cases and 13,972 controls). A chi-square test for the resulting genotype frequencies was converted to a p value using a two-degree of freedom (df) chi-square test.
Results
We used GWAS data of clinic-based migraine patients (8,9,11) to re-evaluate 21 genes from migraine candidate gene association studies that had analyzed at least 300 migraine cases and controls and yielded associations of at least nominal p values (Table 1). Six additional genes were included that came from other non-GWA studies, i.e. either candidate gene sequencing studies (KCNK18, SLC1A3, SLC4A4) in common migraine and/or hemiplegic migraine or linkage studies in FHM (CACNA1A, ATP1A2, SCN1A) (Table 2). Within the 27 gene regions we investigated 22,774 SNPs for association with migraine, applying a significance threshold for individual SNPs of 2.19 × 10−6.
None of the SNPs, including the specific SNPs reported in the original publications (Supplemental Table 2), surpassed the significance threshold (Table 4, Supplemental Information). When analyzing MA and MO together, the best p value was seen for SNP rs805287 (p = 1.08 × 10−4) that is located within the surrounding region of the TNFA and LTA genes. However, this SNP is located in a gene-dense region over 130 kb downstream of both genes (Figure 1(a)) and lies within the major histocompatibility complex locus, where overall levels of noise are higher because of the complex linkage disequilibrium structure (49). When analyzing MA and MO separately, for MA, again the best p value was observed with an SNP (rs630379; p = 9.68 × 10−6) at the border of the region surrounding the TNFA and LTA genes (Supplemental Information). For MO, the best p value was seen for an SNP (rs13024246, p = 2.76 × 10−5) located in the FSHR gene region (Figure 1(b)) but this SNP was located far away from the originally selected gene. Only one gene region, namely that of the DRD3 gene, showed a potentially interesting peak (with best associated SNP rs1486008, p = 2.88 × 10−4; OR = 1.19) within the previously implicated migraine gene (Figure 1(c)).
Regional association plot (generated using LocusZoom) for SNPs within the (a) TNFA and LTA gene region and their association with migraine; (b) the FSHR gene region and their association with migraine without aura (MO); and (c) the DRD3 gene region and their association with migraine. The plots show the chromosomal position (based on NCBI build 36) for the SNP in the respective region against –log10 p values. The SNP with the highest association signal is represented as a purple diamond. Other SNPs are color coded according to the extent of LD with that specific SNP. SNP: single nucleotide polymorphism; NCBI: National Center for Biotechnology Information; LD: linkage disequilibrium.
Although the chance of observing associations that are gender-specific is limited, as the vast majority of the migraine patients are women, we performed a gender-specific analysis for the total migraine group. Analyzing males and females separately did not reveal SNPs with gender-specific signals surpassing the significance threshold (Supplemental Table 3).
Discussion
The most significant association results for the 27 gene regions that contain the previously proposed migraine gene and its 1 Mb window.
SNP: single nucleotide polymorphism; MA: migraine with aura; MO: migraine without aura. aChromosomal position is based on build 36. bSame SNP.
Based on current knowledge of effect sizes of common variants for many common diseases, the vast majority of the candidate gene association studies in the literature lacked sufficient power to detect an effect that can be realistically expected for a common allele in a common disorder like migraine. Therefore, the most probable reason for the lack of replication is that the results of the candidate gene association studies most likely represent false-positive findings. Although we did not show significant evidence for any of the genes previously implicated in common migraine as genetic migraine risk factors, we cannot, however, exclude the possibility that some of the previous findings are true-positive findings reflecting effects specific to a particular patient pool (such as individual families, in whom alleles that are rare in the general population can predominate). Possible additional reasons that could explain why we did not detect associations are that: (1) rare variants that may play a role may not be captured, either in candidate gene association studies or GWAS platforms, because of specific LD patterns that are not sufficiently reflected in the surrounding common markers; or (2) variants located in these candidate gene regions may play a role that have effect sizes too low to be detected, even with the current sample size, and will surface only with sample sizes on the order of several hundreds of thousands cases and controls.
In conclusion, our analysis shows no evidence for the involvement of any of the selected 27 genes in migraine pathophysiology of common migraine. For future studies, other approaches should be considered to identify migraine susceptibility genes. This finding is in line with experiences of candidate gene association studies in other common diseases (53).
Article highlights
Re-evaluation of previously reported migraine candidate gene hits shows no evidence for involvement in migraine pathology in a genome-wide association (GWA) data set. Small-scale genetic association studies lacking proper replication appear of limited value.
Footnotes
Funding
This work was supported by grants from the Netherlands Organization for the Health Research and Development (ZonMw) no. 90700217 and VIDI (ZonMw) no. 91711319 (to G.M.T.); the Netherlands Organisation for Scientific Research (NWO) VICI (918.56.602) and Spinoza (2009) grants (to M.D.F.); the EuroHead project (LSM-CT-2004-504837); the EUROHEADPAIN project (grant number 602633); and the Center for Medical Systems Biology (CMSB) established in the Netherlands Genomics Initiative/Netherlands Organisation for Scientific Research (NGI/NWO), project nr. 050-060‐409 (to R.R.F., M.D.F. and A.M.J.M.v.d.M.). The Wellcome Trust (grant number 098051 to AP); the Academy of Finland (grant number 251704 to AP, and 139795 to MW); the Academy of Finland, Center of Excellence in Complex Disease Genetics, (grant numbers 213506 and 129680 to AP); ENGAGE Consortium, (grant agreement HEALTH-F4-2007- 201413); EU/SYNSYS- Synaptic Systems (grant number 242167 to AP); the Sigrid Juselius Foundation (to AP); the Folkhälsan Research Foundation (to MW); the Medicinska Understödsföreningen Liv & Hälsa (to MW); the Helsinki University Central Hospital (to MK, VAr).
Conflict of interest
None declared.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
