Abstract
Background
Global gene expression analysis may be used to obtain insights into the functional processes underlying migraine. However, there is a shortage of high-quality post-mortem brain tissue samples for genetic analysis. One approach is to use a more accessible tissue as a surrogate, such as peripheral blood.
Purpose
Discuss the benefits and caveats of blood genomic profiling in migraine and its potential application in the development of biomarkers of migraine susceptibility and outcome. Demonstrate the utility of blood-based expression profiles in migraine by analysing pilot Illumina HT-12 expression data from 76 (38 case, 38 control) whole-blood samples.
Conclusion
Current evidence suggests peripheral blood is a biologically valid substrate for genetic studies of migraine, and may be used to identify biomarkers and therapeutic pathways. Pilot blood gene expression data confirm that expression profiles significantly differ between migraine case and non-migraine control individuals.
Introduction
Migraine is among the most pervasive medical conditions to afflict humankind, with a 1-year prevalence of 12% (1–4). It currently ranks as the most disabling neurological condition in terms of years lost to disability, and the seventh most disabling of all diseases worldwide (5). Current molecular evidence suggests migraine may be a disease of vascular origin, or one that involves vascular changes as a result of neuronal dysfunction (6,7). However, the exact causes are unknown, and there are no reliable pathological markers for clinical testing. Diagnosis is therefore based on the personal evaluation of headache symptoms, described by the diagnostic manual of the International Headache Society (8). Two migraine subtypes account for 90% of diagnoses, and include migraine without aura, characterised by moderate to severe headache and nausea accompanied in various combinations by vomiting and sensitivity to light and/or sound, and migraine with aura, which includes transient aura symptoms that manifest as visual, sensory or motor disturbances, followed by a headache phase similar to migraine without aura. The overlap of symptom profiles between these migraine subtypes, as well as other less prevalent forms of headache, means patients often satisfy multiple diagnoses. Both acute and prophylactic treatments are available to relieve migraine symptoms and reduce the severity and frequency of attacks, but are associated with a substantial amount of non-response, and their overuse can cause a rebound effect (9). Therefore, identifying new molecular targets could lead to improvements in the diagnosis of migraine and inform the development of more effective treatments.
Migraine is an inherited disorder with an estimated heritability in the order of 60% (10–14). Genome-wide association (GWA) studies have identified 13 common (minor allele frequency > 0.05) single-nucleotide polymorphism (SNP) loci associated with migraine (15–18), but their biological role in migraine susceptibility is currently unknown. As a complementary approach, the biological pathways underlying migraine may be identified using global gene expression analysis. Gene expression has an important role in the underlying susceptibility to complex disease and is influenced by genetic polymorphisms, as indicated by expression quantitative trait loci (eQTL). Epigenetic marks, including DNA methylation and post-translational histone modifications, also modify gene expression, in addition to a host of other factors, including age, sex, gonadal hormones, diet and medication use. Therefore, the gene expression patterns associated with migraine may reflect genetic and non-genetic effects and inform the development of biomarkers of migraine susceptibility and outcome.
Migraine is most likely caused by the dysfunction of multiple tissues, including blood vessels and the brain, so it is not obvious
A blood gene expression signature in migraine would greatly improve the clinical management of patients. In this article, we discuss the validity and potential of blood genomic profiling studies in migraine, including such profiling’s utility as a substrate for the interpretation of DNA polymorphisms. We present pilot data to support these methods of investigation and suggest blood constitutes a highly accessible and promising source of genomic biomarkers for migraine.
Post-mortem gene expression of brain tissue
Genes are differentially expressed between tissues. It is therefore desirable to perform genetic tests of association in the tissue primarily affected by disease. It can be argued that brain tissue is most appropriate for expression studies of migraine, where progressive structural and functional changes associated with the disorder have been observed (20–22). However, the collection of brain biopsies from living and otherwise healthy individuals is not feasible. Researchers must instead use samples collected post-mortem. Genetic studies of post-mortem tissues have been used to identify the genes and pathways of many brain-related diseases, including Alzheimer’s disease (23–25), Parkinson’s disease (26), amyotrophic lateral sclerosis (27) and schizophrenia (28). These and other neurodegenerative diseases are well characterised, can be diagnosed using objective clinical tests and are generally associated with a reduced lifespan, so the collection of post-mortem tissues is possible. Migraine, on the other hand, is non-fatal and clinically heterogeneous, lacks objective clinical biomarkers and the frequency of attacks varies over time and often shows a marked decline in later life. These factors preclude the collection of sufficient numbers of high-quality, well-diagnosed post-mortem samples required for a controlled experiment.
The use of post-mortem brain tissues is also beset with analytical issues that may impact study results. First, pre-analytical factors such as the manner of death, length of post-mortem interval, method of tissue dissection and length of storage all impact RNA quality and may confound downstream analyses, resulting in spurious associations (29). Second, the brain is composed of heterogeneous cell types and different regions of the brain express unique transcripts (30). Thus, there is a level of cell type specificity within the brain itself that means an additional choice must be made in terms of the brain region for genetic dissection. While single-cell purification assays such as flow cytometry (31) and laser capture microdissection (32) are available in order to mitigate tissue heterogeneity, they introduce a level of technical variability that may diminish study power. Finally, matching appropriate control subjects and controlling for potential confounders is difficult using post-mortem samples because phenotypic data are often limited and rely on retrospective data collection from patient medical records or hospital administrative databases.
Peripheral gene expression in migraine
Gene expression-based markers of disease, regardless of their tissue of origin, have inherent variation. Expression variation is especially pronounced in blood, where physiological factors, such as circulating hormones, circadian processes and cell type proportions, as well as age and gender, are known to influence the gene activity (33–36). Establishing the clinical utility of a biomarker often depends on the ability to control for these sources of variation and bias. This requires the evaluation of large numbers of tissue specimens in a controlled and prospective manner, including repeated measurements from the same individual over time. As noted above, post-mortem brain tissues cannot meet these criteria and can never be used in a controlled trial combining the use of genetic markers and various treatment interventions. Researchers must instead rely on a more accessible tissue, such as peripheral blood, and use carefully designed studies in order to limit unwanted variation in gene expression and extract biologically meaningful associations that can be generalised to the brain.
Peripheral blood is an attractive candidate for genomic studies of migraine due to the ease of acquisition and the ability to collect multiple samples with little discomfort to the patient. Current evidence suggests that the vast majority of genes expressed in brain tissue are also found in blood, with tissue-specific expression driven by a small number of genes (37). A study of matched whole blood and brain tissue in humans found that genes expressed in both tissues were significantly correlated and enriched for specialised brain processes, including axon guidance and neurogenesis (38). More recently, a direct comparison of peripheral blood monocytes and brain tissue reported a high correlation (
Global gene expression studies of peripheral blood have a number of other advantages. Blood can be collected from a large and representative group of age and sex-matched cases and controls, each with detailed phenotypic and anthropomorphic data, as well as follow-up information on migraine frequency and characteristics. Follow-up information may then be used to select more homogenous patient subgroups for analysis, which will increase the power to detect genetic associations. Blood can also be collected at a time and in a manner that is similar for all participants, and samples can be enriched for a cell type of interest (e.g. neutrophils or monocytes), thereby reducing unwanted variation and improving gene expression resolution. However, the post-extraction methods required to enrich a cell type of interest exposes the sample to pre-analytical factors (e.g. blood collection systems, cell purification methods and temperature changes) that can alter RNA and produce artefactual changes in expression, and the cell type of interest has to be defined
An alternative method is to simply use expression profiles derived from whole blood, which captures RNA profiles of all blood cell types, including erythrocytes, granulocytes, lymphocytes, monocytes and platelets. Popular blood sampling methods for gene expression profiling include the PAXgene™ and TEMPUS™ tube systems, which contain proprietary compounds that lyse blood cells and immediately stabilise RNA. While several studies have reported non-trivial differences in gene expression profiles between whole-blood samples and blood cell type-specific RNA sources (40,41), no single blood RNA source has outperformed the others. Therefore, the choice of blood RNA source should be based upon the measurement criteria of practicality and reliability. Whole blood is practical because it does not require complex post-extraction methods and produces more stable expression profiles over repeated measurements within the same individual compared with cell type-specific methods (42). In addition, it is possible to impute cell type proportions in whole blood and to adjust for their impact on disease associations in downstream analyses (43). As such, whole blood is arguably the most appropriate for large-scale genomic epidemiological studies of complex traits such as migraine.
Blood genomic profiling of migraine is supported by parallel studies of rodents, in which surgically induced neurological injuries designed to mimic ischaemia, stroke and hypoxia produced overlapping gene expression changes in matched peripheral blood and brain samples (44–46). In humans, unique blood signatures have been reported for several neurological disorders, including stroke (47–49), multiple sclerosis (50) and Alzheimer’s disease (51), as well as neuropsychiatric disease such as schizophrenia (52) and major depressive disorder (53). One large whole-blood expression study of major depression, which shares a higher than expected comorbidity with migraine (54), reported many experiment-wide significant (False Discovery Rate (FDR) < 0.05) gene associations, as well as clear differences between the expression profiles of depressed and non-depressed individuals (55). Furthermore, a follow-up investigation of potential confounders found that depression status–gene expression associations were largely unrelated to medication use and cell type counts. These data provide further support for blood genomic studies of complex brain-related traits.
Blood gene expression profiling studies in migraine.
Table 1 presents, from left to right, the author; the International Headache Society (IHS) or International Classification of Headache Disorders (ICHD-II) diagnostic criteria used (M: migraine; CM: chronic migraine; MOH: medication overuse headache; MRM: menstrual-related migraine); sample sized utilised (M: males; F: females; T total); age (years) of the participants; ethnicity; treatment or therapy followed by the participants prior or during the time of blood collection (NSAID: nonsteroidal anti-inflammatory drugs; azolmitriptan, sumatriptan, almotriptan; bamitriptyline, divalproex sodium, topiramate, propranolol); the blood sample collection (prior treat.: prior treatment; D1: blood obtained prior to treatment; D2: first follow-up after treatment); number of genes upregulated in cases; functional categories or names of the genes of upregulated transcripts.
A follow-up study of gene expression in chronic migraine sufferers with and without medication overuse headache was able to characterise the impact of medication use on chronic migraine (57). To do this, gene expression from whole blood in chronic migraine sufferers with and without medication overuse headache was measured at baseline and at 30–60 days following a standard treatment regimen (Table 1). Subjects who responded to treatment had a unique blood genomic expression pattern compared to non-responders at baseline and follow-up, suggesting that it may be possible to develop a predictive biomarker in order to inform treatment options for medication overuse headache. In addition, migraine-gene expression associations showed clear enrichment of the genes expressed in neuronal and epithelial/smooth muscle cells, supporting a role for both tissues in migraine pathophysiology and further demonstrating the utility of whole blood in studying the molecular pathways underlying migraine.
A genomic profile of gene expression variation in blood
The observed difference in gene expression between individuals is not only the product of physiological and environmental variation, but also genetic regulatory variation. The impact of regulatory variation on gene expression can be determined through association testing in order to identify regions of the genome where there is a significant correlation between genetic polymorphism and gene expression, known as eQTL mapping (62,63). eQTL data may then be used to prioritise gene candidates at a migraine GWA study locus, and also to characterise the genomic profile of regulatory variation that impacts on the activity of the numerous genes underlying migraine susceptibility.
Many eQTL, particularly those associated with disease, are specific to a given tissue, and the fold change in expression and direction of effect of shared eQTL can differ across tissues (64,65). Therefore, it is not clear whether the eQTL found in one tissue will be replicated in others. This impedes the biological interpretation and extrapolation of eQTL associations when measured in a substitute tissue. Migraine pathogenesis is thought to involve processes in multiple tissues, so tissue-specific effects can be expected. Many eQTL associated with human traits have been identified in blood (66–68) and show a high level of sharing with many tissues (69). However, it is not possible to characterise all disease-relevant eQTL using blood alone. This caveat supports detailed eQTL studies of multiple tissues – including blood and brain – in order to dissect the complex regulatory systems that govern migraine susceptibility.
Several studies have examined eQTL specificity or sharing between the blood and brain using samples derived from different patients. Two recent meta-analyses compared gene expression and genotype data from post-mortem brain samples with catalogued eQTL from large surveys of peripheral blood. The first study reported that 24% of eQTL in peripheral blood overlapped with genes with local eQTL in the brain, which was significantly greater than expected by chance (
Amalgamating eQTL data with other molecular information that is known to influence gene expression, such as epigenomic (e.g. DNA methylation) assays, and drawing on data from experimental genomic compendia, such as the Epigenomics Roadmap Project (73), will further characterise gene-regulatory variation between the blood and brain. In addition, tissue-specific effects of eQTL across the blood and brain may be assessed using data from the Genotype-Tissue Expression (GTEx) study (69), which contains genotype data from blood linked to genome-wide gene expression patterns across 54 distinct body tissues, including 13 brain regions. The use of diverse data types will aid with the characterisation of gene expression and causal variants in blood and improve the biological interpretation of the brain-related processes underlying migraine activity.
Blood gene profiles can separate migraine cases and non-migraine controls
We demonstrate the potential utility of blood genomic profiling in migraine by comparing blood gene expression profiles between migraine cases and non-migraine controls. Blood gene expression data were collated from the Brisbane Systems Genetics Study (BSGS), a family-based study designed to investigate the genetic factors affecting gene expression and the role of gene regulation in mediating phenotypes and complex disease (74). Informed written consent was obtained from each participant, and the study was approved by the Health Research Ethics Committee (HERC) of the QIMR Berghofer Medical Research Institute (Queensland Institute of Medical Research (QIMRB)). The baseline sample comprised 962 individuals from 314 families. Whole blood for gene expression profiling was collected directly into PAXgene™ tubes (QIAGEN, Valencia, CA). Total RNA was extracted using QIAGEN whole-blood gene RNA purification kits. All gene expression profiles were generated on Illumina HumanHT-12 v3 or v4 bead arrays, and the data were normalised as described in detail elsewhere (75,76). After quality control, 17,994 probes representing 13,355 RefSeq genes were available for analysis. Gene expression data are available at the Gene Expression Omnibus under the accession code GSE53195.
Unrelated migraine cases were selected using the ID Migraine™ screening tool, which has been shown to accurately identify 93% of migraine sufferers in a clinical setting (77). A total of 38 unrelated adult individuals (15 male, 23 female; aged 42–78 years) screened positive for migraine and were age and sex matched with unrelated individuals who screened negative for migraine. In order to assess the differential expression between the migraine cases and non-migraine control groups, we fitted generalised linear models for each of the 17,994 gene expression probes, adjusted for sex, age and age2. A total of 54 genes were identified as differentially expressed at a liberal threshold (FDR < 0.1). A Q–Q plot showed clear elevation of small Q–Q plot of -log10 Heatmap of mean expression across migraine case and non-migraine control groups. For the 54 genes that were differentially expressed between cases and controls (False Discover Rate (FDR) < 0.1), mean expression was colour coded for the non-migraine control and migraine case groups. Genes were ordered according to the dendrogram of the hierarchical cluster analysis on the basis of Euclidean distance.

Concluding remarks
In migraine, brain tissue that may hold the key to identifying causal genes and molecular pathways cannot be directly assessed from living patients in a clinical setting, and there is a lack of high-quality post-mortem samples for genetic dissection. Due to its accessibility, blood has been used as a substitute for brain tissue in order to gain insight into gene expression and gene regulation. There is debate as to whether peripheral blood provides biologically relevant information on pathological mechanisms within the brain (78). As we have discussed, current results suggest that many genes expressed in the brain are found in blood and may reflect brain-related disease processes. By analysing blood gene expression data from 38 migraine cases versus 38 non-migraine controls, we show that gene expression significantly differs between migraine and non-migraine individuals, adjusting for age and sex. These lines of evidence suggest that blood is a biologically valid substrate for the characterisation of the gene expression and genetic variation underlying migraine; however, confounding effects of blood cell counts and medication use cannot be excluded.
There are several important caveats in using blood as a surrogate for capturing gene expression and regulatory effects in brain tissue. First, genes are differentially expressed between tissues, so a surrogate may misrepresent both the expression direction and magnitude of effect in the tissue of interest. Moreover, genes expressed in one tissue may simply not be expressed in another, and therefore cannot contribute to a transcriptional profile of disease (37). Second, transcription in whole blood is dominated by a relatively small number of genes (37), which decreases the power to evaluate gene expression variation. This is especially problematic for complex diseases such as migraine, where small changes in the activity of many genes are thought to collectively drive disease susceptibility. Third, there are well-recognised differences in gene expression and regulatory effects between brain regions (79), limiting the ability to extrapolate findings from blood to the whole brain structure. Finally, more than any other tissue, blood gene expression is confounded by biological, clinical and technical variables, most of which are unobserved and introduce unwanted variation in gene expression. Taking these sources of variation into account, blood expression studies may require more samples in order to detect meaningful differences in gene expression compared to in the brain. For example, assuming a standard deviation of 0.7 for the gene intensity measurements on the log2 scale (realistic for genes expressed at moderate to high levels), for a two-sample two-sided
Notwithstanding these caveats, if a biomarker is to be used in a clinical setting to inform the diagnosis and treatment of migraine, it must be measureable in blood. The current evidence suggests that blood provides a window into the state of the brain in the presence of disease. While small sample sizes and the lack of matched brain tissue specimens has impeded the development of reliable blood biomarkers in migraine, studies have shown that blood gene expression differences exist between migraine cases and controls and can capture brain-related expression changes. Differential expression analyses using basic migraine information from the Brisbane Systems Genetics Study, representing one of the largest blood-based expression studies of migraine to date, corroborate these findings. The use of larger, more highly selected cohorts with detailed phenotype information will improve the power to detect the blood gene expression associated with migraine. Blood-based global gene expression levels also provide an ideal substrate for the integration of genomic factors that influence gene expression, such as DNA (SNP) polymorphism and epigenomic alterations. Linking these data with ongoing experimental genomic compendia will further prioritise blood gene expression signals and aid in the discovery of a genomic profile of migraine susceptibility and outcome.
Article highlights
Whole blood is an accessible and reliable source of RNA and has been widely used in gene expression studies of neurological and neurodegenerative traits for disease profiling and biomarker identification. Global gene (RNA) expression data can be used in order to study the genetic and non-genetic effects underlying migraine susceptibility. Analysing pilot Illumina HT-12 expression data from 76 (38 case, 38 control) PAXgene™ whole-blood samples, we found significant differences in gene expression between migraine cases and non-migraine controls.
Footnotes
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Dale R Nyholt (613674) and Grant W Montgomery (619667, 1078399) were supported by an Australian National Health and Medical Research Council (NHMRC) Research Fellowship. Joseph E Powell is supported by an Australian Research Council DECRA (DE1310691). Research reported in this publication was supported by NHMRC project grants (APP1046880, APP1075175, APP1083405) and by the European Union’s Seventh Framework programme (2007–2013) under grant agreement no. 602633 (EUROHEADPAIN).
