Abstract
Objective
Habituation of visual-evoked potentials (VEPs) is typically described as deficient interictally in migraine patients, supposedly indicating altered cortical excitability. Use of this parameter for monitoring changes over time, e.g. under treatment, requires demonstration of test-retest reliability.
Methods
VEPs were recorded interictally in 41 episodic migraine patients and 40 controls. N75–P100 amplitudes were measured over six consecutive blocks of 75 VEPs each. Amplitude regression slopes and block ratios were used to quantify VEP habituation. Test-retest reliability was assessed over 15 minutes and two to three weeks.
Results
Controls showed significantly more negative VEP habituation slopes than migraine patients (−0.21 ± 0.40 vs. 0.04 ± 0.46 µV/block, p < 0.05). Results were similar for block ratios, though, in the migraine group, VEP habituation significantly increased from test to two- to three-week retest (p < 0.05). In addition, VEP habituation test-retest correlations were mostly poor both in migraine patients and controls (intraclass correlation coefficients, 15 minutes: −0.13 to 0.30, two to three weeks: 0.07 to 0.59).
Conclusions
Deficient VEP habituation in migraine was confirmed. However, the test-retest reliability of VEP habituation was rather weak. Therefore, we suggest that VEP habituation should be used for evaluation of cortical excitability under treatment only at the group level and only when a control group with sham treatment is included.
Introduction
Migraine is associated with considerable individual suffering and socioeconomic impact (1). It is ranked by the World Health Organization as cause number 19 of worldwide disability among all diseases (2). The underlying mechanisms are only incompletely understood. An interictal lack of habituation to sensory stimulation (e.g. visual, auditory, somatosensory and nociceptive) has been repeatedly reported (3). Also responses related to mental functions, such as contingent negative variation and P300 amplitude in oddball paradigms, show an interictal lack of habituation (4,5). Therefore, the lack of habituation is seen as a major neurophysiological hallmark of migraine and has been interpreted as being the consequence of increased cortical excitability, decreased cortical inhibition or reduced cortical pre-activation. This change in cortical excitability is regarded as having a causal role in migraine pathogenesis (6). Habituation of visual-evoked potentials (VEPs) is easy to assess, cost effective, and may be of special relevance to migraine because visual symptoms are common in migraine (photophobia, visual aura) (7). For assessment of VEP habituation, pattern-reversal VEPs are usually averaged in blocks of 50–100 and amplitude slopes or first-to-last block amplitude ratios are calculated over five to six blocks (8–10). Most studies find that healthy individuals exhibit a significant habituation of VEP amplitudes while migraine patients in the interictal period do not (8,11,12). There are several pharmacological and psychological interventions hypothesized to reduce migraine frequency and/or severity by regulating cortical excitability, e.g. antiepileptic drugs or psychological techniques such as progressive muscle relaxation or biofeedback (13–16). It would therefore be an obvious approach to use VEP habituation for monitoring of treatment success, for evaluation of new treatment approaches, and maybe even for individual prediction of treatment success. However, before using VEP habituation for longitudinal evaluation of treatment effects at the group level or individual level, test-retest reliability has to be demonstrated. In the present study, we confirmed group differences in VEP habituation between interictal migraine patients (n = 41) and healthy controls (n = 40) and evaluated the test-retest reliability of VEP habituation over a short period of 15 minutes (15 migraine patients, 15 controls) and over a longer period of two to three weeks (15 migraine patients, 24 controls). Test-retest reliability at the group level was assessed by comparing group means at test and retest. Test-retest reliability at the individual level was determined using intraclass correlation coefficients (ICCs).
Methods
Individuals
The study was conducted in accordance with the Declaration of Helsinki and approved by the local ethics committee at the University of Munich. Prior to participation, informed written consent was obtained. Patients with episodic migraine and healthy controls with normal or corrected-to-normal vision were recruited by advertisements on the hospital and university campus. Presence of episodic migraine with or without aura and absence of other headache types was diagnosed according to the current International Classification of Headache Disorders (ICHD-III beta version) (17) by an experienced physician of the outpatient headache clinic at the Department of Neurology. Healthy controls had to be free of any headaches. Participants had to be free of migraine-preventive medication for at least four weeks prior to participation. Analgesic medication and triptans were allowed, but not within 48 hours before the experiment. Migraine patients were tested interictally, meaning they were free of headache for at least 48 hours before and after the experimental session, which was assessed by a telephone or email interview 48 hours after the recording. In addition, all participants had to meet the following criteria: (1) age above 18 years, (2) sufficient knowledge of the German language, (3) no major neurological, psychiatric or medical conditions (apart from migraine in the patient group), (4) no chronic pain disorders, and (5) not pregnant or breast-feeding. All migraine patients completed the Migraine Disability Assessment Scale (MIDAS) (18,19). In addition, migraine patients were interviewed regarding headache frequency (days per month in the last month), headache intensity (average during the last month) and first onset of migraine. The period of recruitment was from July 2013 until April 2014. Follow-up was conducted from April 2015 to July 2015.
Group differences and test-retest reliability were assessed. Analysis of group differences in VEP habituation was based on all migraine patients for whom at least one interictal measurement could be obtained (n = 41) and on all controls (n = 40). For test-retest analysis over 15 minutes, data from 15 interictal migraine patients and 15 controls was available and for test-retest analysis over two to three weeks, data from 15 interictal migraine patients and 24 controls were available. A larger number of migraine patients had been tested for the group comparison (n = 50), for the test-retest over 15 minutes (n = 21) and for the test-retest over two to three weeks (n = 31), but some of the migraine patients had to be excluded from analysis because they developed headache within 48 hours after the experiment. The high incidence of headache in the two- to three-week retest condition is likely due to the use of transcranial magnetic stimulation to determine magnetic suppression of visual accuracy ((20) within the same experimental sessions, but after measurement of VEPs (data reported elsewhere (21)). Occipital transcranial stimulation may have induced headaches in some of the patients (22).
VEP recording
VEP recording was performed according to international standards (23,24). Participants were seated in a relaxed position in a quiet room with dimmed light. The screen was placed 135 cm in front of their eyes and the right eye was covered. All participants confirmed that they could see the checks clearly and got the instruction to concentrate on the screen and focus on the red fixation point in the middle of the checkerboard. During the session talking was not allowed. Standard surface electrodes (cup gold electrodes, GVB-geliMED KG, Bad Segeberg, Germany) were attached to Oz (active electrode) and Fz (reference), according to the 10/20 system (25). The ground electrode was fixed on the right wrist with a ground strap. Electrode impedances were kept below 5 kΩ. A black-and-white checkerboard reversal pattern presented on a Viewsonic E70fsB Monitor (check size 51′, reversal frequency 3 Hz, stimulated part of the visual field 14 degrees × 10 degrees, Michelson’s contrast 90.5%, mean luminance 52.5 cd/m2, vertical refresh rate: 60 Hz) was used to evoke VEPs that were recorded using an Oxford Instruments Medical EP system (Software Version 11, Medelec Synergy, Old Woking, Surrey, UK). In total, 450 responses obtained by continuous stimulation were recorded between 0 and 300 ms after stimulation, averaged in six consecutive blocks of 75 responses and bandpass filtered from 1 to 100 Hz. Traces were rejected automatically as artifacts if the signal amplitude exceeded 200 µV. The number of rejected traces was below 10% for every recording, therefore unlikely to affect the VEP habituation kinetics.
VEP data analysis
Individual N75 and P100 peaks of blocks 1 to 6 were detected automatically and confirmed visually by the investigator according to standard procedures (23). N75 to P100 peak-to-peak amplitudes were calculated and used as the measure of VEP amplitude in the present study. Linear regression over blocks 1 to 6 VEP amplitudes was used to determine the habituation slope. The ratio between the VEP amplitudes of the sixth and the first block in percentage (block ratio = [block 6 amplitude/block 1 amplitude] * 100) was calculated and used as a second measure of VEP habituation. Thus, larger habituation is indicated by more negative habituation slopes and smaller block ratios. Block 1 VEP amplitudes were also analyzed to allow comparison with published data on test-retest reliability of single-block VEP amplitudes (26,27).
Statistics
Statistical analysis was performed using the Statistical Package for Social Sciences (SPSS version 22, IBM Corporation, Armonk, NY, USA). Values are presented as means and standard deviations (SD) unless stated otherwise. Mean differences and correlations were considered significant if the p value was p < 0.05. The Kolmogorov-Smirnov-test was used to test for normal distribution before the use of parametric tests. Unpaired T-tests were used to test for group differences in VEP block 1 amplitudes, habituation slopes and block ratios. Test-retest reliability of block 1 amplitudes, habituation slopes and block ratios was assessed (1) at the group level by comparing test and retest group mean values using paired T-tests and (2) at the individual level by calculating ICCs, two-way mixed, single measure, absolute agreement). Effect sizes were quantified using Cohen’s d.
Results
Group differences
Description of the cohort. SD is given in parentheses.
MIDAS: Migraine Disability Assessment Scale.

Group differences in visual-evoked potential (VEP) habituation.
Group comparison of visual-evoked potential (VEP) amplitudes and habituation. SD is given in parentheses. Block I amplitude indicates the first block peak to peak amplitude. Slope shows off the slope of the linear regression over block 1 to block 6 amplitudes. Block ratio indicates block 6 amplitude in percentage of block 1 amplitude.
Test-retest reliability over 15 minutes
Reliability of VEP amplitudes and habituation after 15 minutes retesting in interictal migraine patients and in controls. SD is given in parentheses. Block 1 amplitude indicates the first block peak to peak amplitude. Slope shows off the slope of the linear regression over block 1 to block 6 amplitudes. Block ratio indicates block 6 amplitude in percentage of block 1 amplitude.
VEP: visual-evoked potential; ICC: intraclass correlation coefficient.

Test-retest reliability of visual-evoked potential (VEP) measures.
Test-retest reliability over two to three weeks
Reliability of VEP amplitudes and habituation after two to three weeks retesting in interictal migraine patients and in controls. SD is given in parentheses. Block 1 amplitude indicates the first block peak to peak amplitude. Slope shows off the slope of the linear regression over block 1 to block 6 amplitudes. Block ratio indicates block 6 amplitude in percentage of block 1 amplitude.
VEP: visual-evoked potential; ICC: intraclass correlation coefficient.
Habituation slopes between test and retest after 15 minutes and after two to three weeks were slightly different. In controls, habituation slopes tended to be more negative at the 15-minute test than at the retest (slope(test) = −0.22 ± 0.43; slope(retest) = −0.01 ± 0.43) (p = 0.16). Migraine patients showed no difference at the retest after 15 minutes compared to the first experimental session (slope(test) = 0.03 ± 0.40; slope(retest) = 0.10 ± 0.37) (p = 0.54).
Habituation slopes for test and retest after two to three weeks showed different results. Both groups showed more habituation at the retest session, but this was not significantly different for controls. In contrast, migraine patients showed a significantly different habituation between test and retest (controls: slope(test) = −0.19 ± 0.33, slope(retest) = −0.26 ± 0.50 (p = 0.56); migraine patients: slope(test) = 0.08 ± 0.34, slope(retest) = −0.18 ± 0.50 (p = 0.020)).
Discussion
Main results of the present study are: (1) deficient interictal VEP habituation was confirmed in episodic migraine patients, (2) VEP habituation group mean values showed significantly increased habituation in the migraine group at the two- to three-week retest, and (3) correlations between VEP habituation measures at test and retest were disappointing, both for the 15 minutes and the two- to three-week interval.
To the best of our knowledge, test-retest reliability of VEP habituation has not been assessed before. Test-retest reliability of VEP P100 latencies has been assessed before and yielded test-retest correlation coefficients around 0.7 over an unspecified interval (28) and of 0.59–0.93 over eight months (27). Test-retest reliability of VEP N75–P100 amplitudes (a single block of 100 trials) has been shown to be good over eight months (r = 0.72) (27). This is consistent with the results of the present study, in which test-retest ICCs of block 1 VEP amplitudes were above 0.74 for both test-retest intervals and both groups. In contrast, test-retest correlations of VEP habituation measures were mostly poor (ICC 0.07 to 0.59 for two to three weeks) in the present study, suggesting that VEP habituation may not be the best way to monitor individual changes of cortical excitability over time. Previous studies have shown that cortical excitability and habituation are dependent on the migraine cycle, with a maximum habituation deficit immediately before the attack and normalization of the habituation during the attack (4,29). These studies showed that every patient may be driven by his or her own “migraine-rhythm.” Therefore, we hypothesized that the poor test-retest correlations found over the two- to three-week interval might be due to unpredictable changes of cortical excitability during the individually varying migraine cycle (although all measurements were conducted in the interictal period as documented by telephone interview two days after the recordings). To exclude this possibility, we conducted an additional set of experiments investigating test-retest reliability over 15 minutes. However, correlations between VEP habituation at baseline and 15 minutes later were also poor (ICC 0.06 to 0.30), suggesting a large inherent variability of VEP habituation as quantified by habituation slopes and block ratios instead of physiological changes over time as the cause of poor test-retest correlations.
Other measures of cortical excitability also differentiate between migraine patients and controls and may exhibit better test-retest reliability than VEP habituation. Contingent negative variation (CNV) amplitudes are increased in migraine patients compared to controls (4,30). Test-retest reliability of CNV over 10 days has been examined by Kropp et al. (31) in 27 healthy volunteers. Test-retest correlation coefficients were good, especially for the early CNV component (iCNV, Pearson’s r = 0.86). The authors concluded that the standardized CNV recording procedure is reproducible and stable in healthy individuals. Therefore, the early CNV component may be more suitable to follow individual changes over time than VEP habituation, but this has not been shown until now in migraine patients.
In addition to mostly poor test-retest correlations of VEP habituation, our results also indicate that group mean values of VEP habituation measures may not necessarily be stable over time. Importantly, there was a significant increase in habituation in the migraine group over the two- to three-week interval. For the 15-minute interval, a possible systematic shift toward smaller habituation at retest compared to test was observed both in the migraine and the control groups, which has to be interpreted with much caution because it did not reach significance. The reasons for these shifts over time are not clear. It has been described that habituation may increase after repeated stimulation (32). This might in part explain the increase in habituation in the migraine group after two to three weeks, but is at variance with the possible shift toward smaller habituation over 15 minutes. However, in most experimental settings, net results of habituation are caused by the sum of concurrent habituation and sensitization processes that occur with different kinetics and therefore may give different results depending on the tested time frame (“dual-process theory” (32)). In migraine, it is strongly debated if the lack of net habituation seen, e.g. in VEP recordings, is the result of increased cortical excitability (leading to decreased habituation) or the result of decreased cortical pre-activation levels (leading to decreased VEP amplitudes at stimulation onset, which implies that according to the “ceiling model” of cortical excitation, the threshold for initiation of habitation is reached later in migraine patients than controls) (6,33). Therefore, habituation in migraine might also involve a combination of both mechanisms, possibly with different kinetics.
Several studies have monitored VEP amplitudes or habituation in parallel with clinical outcome during preventive migraine treatment (34–36). Diener et al. (34) recorded a single block (64 trials) of VEPs before and after four months of preventive treatment with beta blockers in 41 migraine patients. Migraine patients had significantly higher VEP amplitudes than healthy volunteers at baseline. The use of beta blockers led to a significant decrease in VEP amplitude in migraine patients, but was not related to the clinical treatment effect. After three months of washout, VEP amplitudes partially returned to the initial values. VEP habituation was not tested in this study. Ozkul and Bozlar (35) monitored the effect of fluoxetine on interictal VEP habituation (five blocks of 50 trials) in 79 migraine patients. At baseline, VEP block ratios were significantly higher in migraine patients compared to controls. After four weeks of fluoxetine, VEP block ratios were similar to the baseline values of the control group. A correlation with the clinical effect was not reported. One study also examined treatment effects over a 15-minute interval. Viganò et al. (36) recorded VEP habituation slopes (over six blocks of 100 trials) before and after 15 minutes of anodal transcranial direct current stimulation (tDCS) over the visual cortex, and again three hours later in 11 controls and 13 interictal migraine patients. VEP habituation slopes were significantly more negative after tDCS both in migraine patients and controls, and back to baseline values after three hours. Our data suggest that there may be a systematic shift in VEP habituation at the retest interval of 15 minutes, which may compromise the interpretation of treatment effects at this time window. However, in the cited study (36) the treatment effect was in the direction of increased habituation, while the systematic shift detected in the present study was in the direction of reduced habituation, suggesting that the effect of tDCS on VEP habituation was real. Nonetheless, our results, showing that VEP habituation is not necessarily stable over time, underline that treatment studies should always include a group with sham treatment or without treatment to determine the natural course of VEP habituation over the investigated time interval.
To the best of our knowledge, the present study is the first to investigate test-retest reliability of VEP habituation. An important limitation was the relatively small number of participants in the test-retest part of the study. However, the number was sufficient to show a significant increase in VEP habituation in migraine patients over the two- to three-week interval. Another limitation is the fact that we pooled together migraine patients with and without aura. However, there were only three migraine patients with aura, suggesting that their inclusion cannot affect fundamentally the results. A strength of the study is that participants of both groups were recruited from the general population. Including patients from a tertiary care facility might significantly bias the results because these patients are usually severely affected. A further strength of the study is the care that was taken to include individuals only during the interictal period (48 hours before and after the measurement). A further limitation is the relatively low mean age in the present study, limiting the generalizability of the results to older individuals. However, migraine is a disorder that is most prevalent in younger adults (37).
Conclusion
VEP habituation is not generally suitable to assess longitudinal changes of cortical excitability at an individual level. It may be acceptable for monitoring treatment effects at the group level, provided the group size is large enough, and a control group without treatment or with sham treatment is included since the VEP habituation may also change in these patients over time.
Article highlights
A significantly larger visual-evoked potential (VEP) habituation was found in controls compared to interictal migraine patients. VEP habituation in migraine patients showed a significant shift toward larger habituation at the two- to three-week retest interval, indicating that a sham treatment group should be included in studies using VEP habitation as a measure of treatment outcome. Test-retest reliability of VEP habituation was insufficient to follow individual changes of cortical excitability over time.
Footnotes
Acknowledgments
We would like to thank all participants of this study (as well as Mrs. K. Ogston for editing the paper). The study was approved by the local ethics committee at the University of Munich (Project number: 133-13). Participants gave informed consent before taking part.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
This work was supported by a grant to VR from the DFG (GRK 1091).
