Abstract
The aim was to estimate lifetime sex and age-specific incidence of migraine. Data are from the American Migraine Prevalence and Prevention study, a mailed survey sent to 120 000 US households. Age-specific incidence was estimated using self-reported data relevant to identification of migraine cases, age of onset of migraine and age at interview. Migraine incidence peaked between the ages of 20 and 24 years in women (18.2/1000 person-years) and the ages of 15 and 19 years in men (6.2/1000 person-years). Cumulative incidence was 43± in women and 18± in men. Median age of onset was 25 years among women and 24 years among men. Onset in 50± of cases occurred before age 25 and in 75± before age 35 years. Four of every 10 women and two of every 10 men will contract migraine in their lifetime, most before age 35 years. The incidence estimates from this analysis are consistent with those reported in previous longitudinal studies.
Keywords
Evidence on the age-specific incidence of disease offers important clues to aetiology. Although migraine prevalence has been extensively described (1), relatively little is known about its incidence. Prospective studies of migraine incidence are uncommon and confined to selected age groups (2–8). Moreover, estimates from the relatively few incidence studies are highly variable and based on relatively few new-onset cases. As an alternative to longitudinal studies, cross-sectional data can be used to estimate age-specific incidence (9, 10). Although secular trends of increasing or decreasing migraine incidence can influence estimates derived from cross-sectional studies, evidence suggests that migraine prevalence has not increased in a 10-year period (11, 12).
In this study, we used data on current age and age of migraine onset from a large national survey to estimate age-specific incidence. Our findings are consistent with previously reported incidence estimates based on small samples. An advantage of the results from this study in contrast to longitudinal studies is the large sample and broad age coverage. Our analysis indicates that the cumulative lifetime risk of migraine is 2.5–3 times higher than prevalence. We believe these findings have implications for clinical care and for aetiological and health services research.
Methods
In order, we describe the source of the population, data collection method and data collected, and the statistical methods for estimating incidence.
Population survey
Data for estimating incidence rates were obtained from a US nationwide mailed survey of 163 186 individuals ≥ 12 years of age participating in the American Migraine Prevalence and Prevention (AMPP) study. The AMPP was modelled on the methods of the American Migraine Studies 1 and 2, described in detail elsewhere (11–13). A self-administered headache questionnaire was mailed to a stratified random sample of 120 000 US households, drawn from a 600 000-household nationwide panel maintained by the National Family Opinion, Inc. (NFO). All NFO participants first complete a NFO baseline questionnaire used to obtain data on the household, including details on the head of the household (i.e. the self-defined NFO point of contact), the total number of household members and demographic details (e.g. age, sex, education level) of household members. The AMPP questionnaire was sent to the designated head of household, who was instructed to identify all household members suffering from at least occasional self-defined severe headache. Each household member with severe headaches was asked to complete a symptom screening questionnaire, consisting of 21 questions assessing all headache features important in the detection of migraine. The screening questionnaire has been validated [i.e. taking the second edition of the International Classification of Headache Disorders (ICHD) as a reference (14)] in a population sample of migraineurs and of controls with other types of headache. The sensitivity of the survey for this sample was 100%; the specificity was 82.3% (15).
Migraine case definition
Identification of active migraine cases (i.e. reporting at least one attack in the previous year) was based on established ICHD-II criteria (14) and assigned if a respondent reported at least one severe headache in the previous 12 months, with unilateral or pulsatile pain, and either nausea, vomiting, phonophobia with photophobia, or visual or sensory aura before the headache. Individuals who reported having headaches every day, even though they met migraine symptom criteria, were excluded.
Estimating methods
In a typical longitudinal study, the age-specific incidence rate, Ij
, for the jth age interval is nj
The first method, labelled the ‘naive’ approach (9), uses all available data, assumes that active migraine cases identified at cross-section include all individuals who have ever had migraine (i.e. no remission). For this approach, nj is the count of active migraine cases who reported onset in age interval j; tj is the sum of person-time for the jth age interval contributed by those greater than or equal to age j at interview and who had not reported migraine onset before the beginning of the jth age interval. Individuals reporting migraine onset during the jth age interval contribute person-time for only part of the age interval.
The second method (9) uses only data from migraine cases who reported their onset to have occurred in the recent past. For the purposes of this analysis, recent onset was defined if an active case reported that their migraine first started within 3 years of the interview. In this approach, nj is confined to counts of active migraine cases who reported onset in the past 36 months and who reported it occurring in the jth age interval; tj is defined as it is for the naive method, except that person-time is confined to individuals whose age at interview was within 3 years of the upper bound of the jth age interval and who were migraine free at any time during this age interval. There are two advantages to this method. First, errors in reporting age of onset are likely to be minimal. Second, complete remission of migraine among recent-onset cases is unlikely. This method uses data from individuals represented by the diagonal in Table 1. We label this the ‘diagonal’ method (9). A disadvantage of this method is that the number of new-onset cases is limited and cannot be estimated for those ≤ 9 years old.
Distribution of active migraine cases and those currently without migraine by age at interview and age at onset (migraine cases only) for 163 186 respondents
tj , person-years (in 1000s of years) during the age interval, taking into account the number of cases who reported onset of migraine during or before the age interval; nj , total number of persons interviewed within each age.
The third method combines advantages of the naive (i.e. uses all data) and the diagonal (i.e. minimizes bias) methods. Like the naive approach, data from all individuals were used to improve precision of estimates. In contrast to the naive approach, regression methods were used to adjust for bias. Two sources of bias were of interest. Under-ascertainment occurs when a respondent forgets to report a history of severe headache or a cardinal migraine-defining symptom. In prevalence surveys of active migraine cases, under-ascertainment also occurs implicitly, as individuals with migraines that have remitted are not ascertained. Exclusion of inactive cases is likely to be the dominant source of bias and more common with increasing age. Temporal reporting bias, another error, occurs when case status is accurately reported (i.e. no ascertainment bias), but age of onset is over- or underestimated (16, 17). A more complete discussion of these errors has been reported elsewhere (9, 10). The third method assumes that errors are related to two temporal variables: age at interview and how long before the interview migraine first occurred. LAG was defined as the difference between the age at interview and the reported age of onset of migraine. It is important to recognize that LAG is only defined for active cases. LAG predicts variation in age-specific incidence. Variation occurs largely because inactive cases are not reported in the numerator and because the number of inactive cases naturally increases as LAG increases. We modelled data assuming that the number of active cases interviewed at age interval k with age of onset in interval j, njk , is approximated by a negative binomial with mean:
where Njk is the number of person-years at risk for that age-onset age group, X jk is a vector of covariates and γ is the corresponding vector of parameters. If there were no covariates, then exp(α j ) would be the event rate at age j. The naive method essentially uses all of the data to estimate α j . However, it yields a biased estimate, because remitted cases are not represented in the estimate. The diagonal approach uses only recent cases to estimate α j . The third approach uses all of the data, but adjusts the incidence estimate for recall and ascertainment errors (9). For example, consider the simple model exp(α j + γ1LAG jk ). Think of exp(α j ) as the true incidence, and exp(γ1LAG jk ) as the adjustment factor. As LAG increases (i.e. the years since onset), the incidence rate estimate would decrease (due to inactive cases), which implies that γ1 < 0. The event rate for someone with LAG = 10 would be the true incidence rate multiplied by exp(γ1 × 10). For example, if exp(γ1 × 10) = 0.5, this would suggest that half of the cases remitted (or were inactive) within 10 years. One advantage of this method over the diagonal method is that all data are used to estimate α j and γ. If the model is correctly specified, the incidence rate exp(α j ) will be less biased than the naive approach and more efficient than the diagonal approach. The model described above was just for illustration. More realistic models would include non-linear terms for AGE and LAG, as well as interactions between them. Interaction terms would be necessary if recall errors or inactive rates varied with age. The estimated true incidence curve is obtained by setting LAG to 0 (i.e. the expectation is there are no inactive cases) and using the predicted age-specific values from the model solution. Use of this method should minimize effects of underreporting of inactive cases and recall errors, while retaining all of the data and allowing us to extrapolate to ages not available with the diagonal method.
The negative binomial model, a generalized version of Poisson regression, was selected because of over-dispersion in the Poisson model (18). Cumulative incidence was estimated using the product-limit formula (19, 20) and median age at onset was defined as age at which cumulative incidence was 50%.
Separate estimates were derived for men and women, since the incidence of migraine is known to be different by sex. For this analysis, Ij was estimated for 5-year intervals. Data displayed in Table 1 for women and men, respectively, were used to derive age-specific incidence estimates for each of the three models. Model parameters were estimated using the GENMOD procedure in SAS version 8.1 (SAS Inc., Cary, NC, USA). In the model, we included age at interview, reported age of onset (ONST) and LAG as input variables. We considered higher ordered terms (i.e. to account for non-linear effects of incidence in relation to ONST and LAG) for each of these variables, as well as interactions between ONST and LAG. For regression modelling, ONST and LAG were centred. Model fit was assessed using a likelihood ratio test (LRT) to determine if higher order models significantly improved fit to the data compared with simpler models. A higher order model was deemed to improve fit to the data if the LRT statistic was equal to or greater than the critical value (i.e. P < 0.05) for the χ2 statistic with degrees of freedom equal to the number of additional parameters in the higher order model. The optimal model for both men and women was based on the following parameters: ONST + ONST2 + ONST3 + LAG + LAG2 + ONST × LAG.
Results
Survey participants were between 12 and 100 years of age; 55% were female. A total of 4364 (6%) of the 77 379 men and 14 604 (17%) of the 85 807 women met criteria for migraine. Person-years of observation, tj (i.e. must be multiplied by 1000), is displayed in the last row by age interval (Table 1).
Applying the naive method to data on men (Table 2, Fig. 1), migraine incidence peaks between 15 and 19 years of age at 2.2 cases per 1000 person-years. Among women, migraine incidence peaks during the same age interval at 8.09 cases per 1000 person-years. The naive median age of onset in men is 25.5 years and in women it is 23.2 years (Table 2, Fig. 2), and the cumulative incidence in men is 7.5% and in women 21% (Fig. 2).

(a) Male 5-year age-specific migraine incidence for the naive, diagonal and model-based estimates. (b) Female 5-year age-specific migraine incidence for the naive, diagonal and model-based estimates.

(a) Male 5-year age-specific cumulative incidence rates of current migraine headache comparing the model method and the naive method. (b) Female 5-year age-specific cumulative incidence of current migraine headache comparing the model method and the naive method.
Sex and 5-year age-specific incidence rates (per 1000) of migraine headache for the naive, diagonal and model-based estimates
The diagonal method uses data on only 431 male cases compared with 4364 male cases used in the other two methods. Diagonal estimates (Table 2, Fig. 1) cannot be derived for ages ≤ 9 years and estimates below age 20 are unstable (i.e. small n). Nevertheless, diagonal estimates are consistently higher than the comparable naive estimate, particularly for women (i.e. age-specific estimates are two to three times higher than the comparable naive estimate). Cumulative incidence was not calculated because incidence could not be estimated at or below age 9 years.
Using the parameters described in the methods, estimates were derived using the negative binomial model. Once fitted, estimated incidence curves (for men and women) were derived by obtaining predicted values for each value of age from the model with LAG equal to ‘0’. The model-based median age of onset for men is 24.1 years and for women it is 25.2 years (Table 2, Fig. 2), and the cumulative incidence is 18% for men and 43% for women. Among men, > 75% of new-onset cases occur after age 14 years and > 85% of new-onset female cases occur after age 14 years. For both men and women, approximately 75% of new-onset cases occur before age 35 years. The model-based 5-year age-specific incidence rates displayed in Table 2 and in Fig. 1 are substantially higher than the comparable naive estimates. In contrast, the model-based estimates for women, in particular, are comparable to the corresponding diagonal estimate.
Including the 273 daily headache cases has a negligible effect on incidence estimates. For example, the peak incidence in women (i.e. the greatest difference) increased from 18.2 to 18.4 per 1000 person-years.
Discussion
The model-based estimates suggest that the cumulative lifetime risk of migraine is very high and that the median age of onset for migraine occurs after puberty. At first glance, the assumption of no remission for the naive estimate is invalid, as evidence clearly indicates that migraine does remit over time (2–4). The naive method therefore underestimates incidence and cumulative risk. In contrast, the diagonal method relies on self-reports from active cases of migraine reporting onset in the past 3 years. While age of onset is subject to reporting errors, the diagonal method approximates a longitudinal study with relatively complete representation of all cases whose onset occurred in the recent past. Cases would be excluded in the diagonal method if migraine first occurred in the past 3 years and remitted at least 1 year before the interview, or the respondent simply forgot to report relevant symptoms or the occurrence of severe headache. The striking contrast between the naive and diagonal age-specific estimates is likely to be largely explained by remission and indicates that active prevalent cases represent only a share of the total number of historical cases.
The model-based approach overcomes limitations (e.g. unstable estimates, lower age limit to estimates) to the diagonal method. Our estimate of cumulative incidence is consistent with prevalence estimates. Previous studies have consistently reported a peak male prevalence of 8% and a peak female prevalence of 25–27% occurring around age 38 years (1, 11). If migraine remits, as indicated by the rapidly declining prevalence after age 40, the peak prevalence indicates that the cumulative incidence of migraine must be > 8% for men and > 25% for women.
Our model-based estimates are largely consistent with a Danish longitudinal study, but substantially lower than estimates from two other studies (Fig. 3a,b) (2–4). In a Danish 12-year follow-up study of 453 migraine-free adults (mean age 42 years at baseline) (3) we derived age-specific estimates from Fig. 2 and accompanying tables. The 95% confidence intervals for seven of the eight estimates (see Fig. 3a,b) overlap with the model-based estimates. Over a 2-year period, Breslau et al. (2) estimated an annual incidence of 19.5/1000 person-years among a predominantly female (79%) cohort with a mean age of 42 years. This estimate, which is a weighted composite of the rate for those with and without depression, is more than twice (i.e. statistically significant) as high as the female model-based estimate. Annual incidence of migraine was estimated in an occupational cohort of men (n = 180) and women (n = 60) with a median age of 46 years at baseline and an average age of 51 years between the two surveys. Annual incidence estimates, based on two surveys 10 years apart (4), were approximately 8.8/1000 person-years for women and 3.8/1000 person-years for men. Both of these estimates are substantially higher than the corresponding model-based estimates. Although there are differences between several of the longitudinal estimates and the corresponding models, in total these estimates show a declining incidence after age 30 years, a pattern consistent with our model.

(a) Model-based male 5-year age-specific migraine incidence compared with estimates from longitudinal studies. (b) Model-based female 5-year age-specific migraine incidence compared with estimates from longitudinal studies.
We completed sensitivity analysis to determine how age-specific incidence estimates varied in relation to the upper age limit of the study population (Table 3, Fig. 4). The model-based estimates did not vary substantially. In contrast, the naive age-specific and cumulative incidence estimates declined as the upper age limit increased. When the naive method is applied to a younger age population (i.e. restricted to those < 30 years old), age-specific and cumulative incidence estimates (i.e. < 30) are similar to the model-based estimates (Table 3). In contrast, as the upper age limit is increased, the naive age-specific and cumulative incidence estimates are lower than the comparable model-based estimates. For example, when the upper age limit is 50 years, the cumulative incidence up to age 30 years for women is 22% for the naive method compared with 28% for the model-based method. Similarly, the cumulative incidence for all age groups is 21% for the naive method vs. 43% for the model-based method. The striking difference between the naive and model-based cumulative incidence estimates suggests that a majority of those who report migraine remit or experience a substantial change in symptoms associated with headache. This finding is consistent with longitudinal studies of adults (4) and adolescents (8). For example, in a workplace cohort, migraine changed to a non-migraine headache or remitted 10 years later in 63% of active migraine cases 35–50 years old at baseline (8).

Model-based female 5-year age-specific migraine incidence compared with naive estimates derived for the total population and when the upper age was truncated below 30 and 50 years.
Age-specific (per 1000 person-years) and cumulative incidence (%) of migraine for women when naive and model-based estimates are applied to subpopulations truncated at an older age of 30, 40, and no limit
Ij , the age-specific incidence for the jth age category; Cj , the cumulative incidence up through the jth age category.
Our estimates of cumulative incidence (Fig. 2) suggest a very high liability for migraine in the general population. The median age of onset for migraine between 24 and 25 years for men and women would suggest that induction (onset) is largely attributable to factors that occur well past puberty.
Aetiological factors for migraine induction, persistence and remission are likely to be the product of genetic and environmental factors. Duration of illness reflects determinants of disease persistence and remission. Age of onset largely reflects determinants of disease induction. For example, the known relation of higher migraine prevalence with lower income could reflect the influence of environmental factors on the rate of induction or on disease persistence. In contrast, the finding that younger age of migraine onset in probands predicts familial aggregation for migraine (21, 22) suggests a possible genetic influence on early-life induction or persistence of migraine.
Acknowledgement
This study was sponsored by the National Headache Foundation through a grant from Ortho-McNeil Neurologics, Inc.
