Abstract
Evoked potentials (EP) characterize signal conduction in selected tracts of the central nervous system in a quantifiable way. Since alteration of signal conduction is the main mechanism of symptoms and signs in multiple sclerosis (MS), multimodal EP may serve as a representative measure of the functional impairment in MS. Moreover, EP have been shown to be predictive for disease course, and thus might help to select patient groups at high risk of progression for clinical trials. EP can detect deterioration, as well as improvement of impulse propagation, independently from the mechanism causing the change. Therefore, they are candidates for biomarkers with application in clinical phase-II trials. Applicability of EP in multicenter trials has been limited by different standards of registration and assessment.
Keywords
Introduction
Four years ago, the role of evoked potentials (EP) for diagnosis and monitoring of multiple sclerosis (MS) was discussed in this journal.1,2 The bottom line of the commentary by Hutchinson 3 was that despite some strong arguments for the use of EP in predicting and monitoring the disease course, emerging magnetic resonance imaging (MRI) techniques would finally become the methods of choice for these purposes. While advances in imaging and the understanding of its biological substrate have made considerable progress and provide a unique avenue for the characterization of tissue damage and repair,4,5 many of the proposed techniques remain to be validated and are available at specialized centers only. Information gained by EP is widely available at low cost, and it is complementary to structural data, as well as to biochemical and metabolic information. Most importantly, direct functional assessment of myelin, axon and synapses in multisynaptic eloquent sensorimotor pathways is only granted by electrophysiological techniques. In this topical review, we will discuss the current and possible future role of EP in MS with a focus on their suitability as biomarkers, especially in phase-II trials.
EP characterize impulse propagation in the central nervous system
Most clinical symptoms typical of MS are closely related to altered impulse generation and conduction in the central nervous system. Abnormal signal propagation can be due to different mechanisms including demyelination, localized conduction block, frequency-dependent block, and axonal damage, which may be due to different causes such as inflammation, axonal transection, or mitochondrial dysfunction6–10 (see also Figures 1 and 2). As an example, slowing and dispersion of conduction speed has been shown recently to interfere with motion perception. 11 A demyelinating lesion in the optic nerve of 10-mm length causes a conduction delay of approximately 25 ms. 12 Conversely, the exact mechanism for a delayed or diminished EP, for example, slowed conduction, prolongation, or even replacement of spatial by temporal summation at the synapse due to conduction block or axonal loss, cannot be determined with certainty. 7

Signal conduction at the level of single axons; left and right panels: input and output spike and spike trains; middle panel: (a) normal saltatory impulse conduction; (b) conduction block due to demyelinisation; (c) redistribution of sodium channels on demyelinated axon and non-saltatory conduction; (d) partly remyelinated axon with slowed saltatory conduction; and (e) ephaptic/mechanic impulse generation at the demyelinated axon (adapted from Smith 8 ; blue: axon; black dots: sodium channels at the nodes of Ranvier; green: myelin sheath; red arrows: impulse propagation).

Signal conduction at the level of tracts (left panels; blue: axons; green: myelin sheath; red: impulse propagation) and membrane potential (red) at the synapse (right panels; blue dotted line: depolarization threshold); (a) normal conduction in axons of different size; (b) blocked conduction as depolarization threshold at the synapse is not reached due to insufficient spatial (too few axons) and temporal (dispersed arrival of volleys) summation; and (c) delayed conduction due to slow impulse propagation but still reaching depolarization threshold.
EP are measures of central signal conduction in vivo and cross at least one central synapse. Sensory EP include brainstem auditory EP (BAEP), visual EP (VEP), and somatosensory EP (SEP). They are elicited by standardized stimuli and recorded over the cortex by averaging the response over a number of repetitions to cancel out background activity. Motor EP (MEP) are recorded over the target muscle in the upper and lower limbs (UL and LL). They are elicited by a short magnetic pulse which induces a depolarizing current in the motor cortex. In SEP and MEP, the duration of peripheral conduction is subtracted from the total latency to deduce the central conduction time (CCT) and central motor conduction time (CMCT), respectively.
Before the advent of MRI, EP were used to document clinically manifest and silent lesions in MS and were part of routine diagnosis (median SEP, 13 VEP, 14 and MEP 15 ). The sensitivity of an EP study to detect an abnormality depends on the length of the tracts measured and on the probability of the examined functional system suffering from a demyelinating lesion. Therefore, multimodal assessment using a combination of different EP modalities has been proposed.16,17 This approach parallels partly the clinical evaluation and is more appropriate for covering the heterogeneity of MS than single modalities. Several studies using multimodal EP (mmEP) have demonstrated a strong correlation between mmEP score and the Expanded Disability Status Scale (EDSS) cross-sectionally (median rho = 0.64, range: 0.16–0.79 over 13 cohorts; see Table 1).
Overview of studies in MS using EP scores to summarize results from different EP modalities.
VEP, BAEP, SEP, MEP: visual, brainstem auditory, somatosensory, motor evoked potentials; UL: upper limbs; LL: lower limbs; EPAS: evoked potential abnormality score; qEPS: quantitative evoked potentials score; mEPS: multimodal evoked potentials score; gEPS: global evoked potentials score; CEPS: combined evoked potentials score.
Correlation coefficients; significant results are given in bold;
Only RR subgroup with complete follow up (bs, y1, y2) reported in the table; p1 for correlation of EP score 1 year prior to EDSS.
In demyelinating disorders, conduction depends not only on the number of intact nerve fibers but can also be altered by temperature and medication interfering with ion channels. The effect of body core temperature on symptoms in MS has been known for a long time.
32
Action potentials become shorter when temperature increases as sodium-channel inactivation occurs earlier. The brevity of the action potential decreases the time for accumulation of current to reach the firing threshold of the axonal membrane. In demyelinated axons at the verge of conducting, the time may become too short to reach the threshold resulting in a temperature-dependent conduction block.
8
This observation is probably a partial explanation for the fact that an increase of only 0.2C°–0.4C° in body core temperature is sufficient in susceptible subjects to worsen symptoms.
33
Therapeutically, the potassium-channel blocking agent 4-aminopyridine (4AP) has been used to improve signal conduction
34
which has been shown experimentally to improve signal propagation along demyelinated axons.35,36 Besides clinical effects, short-term effects of 4AP on the elicitability of MEP and VEP latency and amplitude have been demonstrated.37,38 These mechanisms are at the base of the observation that only patients with a prolonged CMCT profited from fampridine medication for improvement of gait.
39
Here, MEP were a
Clinical research in MS requires novel biomarkers
To achieve the goal of successful interventional trials, especially for phase-II studies in progressive MS, novel biomarkers are desirable. Requirements include reliability, validity, quantifiability, tolerability, and efficiency. Intraclass correlation of VEP over 1 year in healthy subjects is 0.94 for latency and 0.73 for amplitudes. 40 Reliability of any EP can be improved further by standardization of recording procedures and central reading by appropriate tools (e.g. EPMark). 41 Construct validity is given by the close relationship between EP, pathological, and clinical alterations in MS. Criterion validity is documented by many observational studies showing significant correlations between mmEP and current state, as well as disease course and prognosis (see Table 1). Quantifiability of mmEP is obtained either by ordinally or numerically scaled scores. Tolerability of mmEP may limit compliance, but has proven not to be a problem in the majority of patients. Efficiency of EP in clinical trials is likely as their sensitivity to change is higher than that of EDSS and as correlations with clinical course are significant even with small numbers of patients (see Table 1).
According to Amur et al.,
42
there are four different types of biomarkers:
EP as diagnostic biomarker
Due to its high sensitivity to subclinical lesions and relatively high specificity, MRI has largely replaced EP to demonstrate dissemination in time and space in patients presenting with typical symptoms suspicious of a demyelinating event and to exclude alternative diagnoses. 43 However, the capability of EP to detect even subclinical lesions in pathways which are not well explored in routine MRI assessments, such as optic nerve and spinal cord, has been shown in many studies. Summarizing the results from several studies performed in the seventies and eighties (using Schumaker or Poser diagnostic criteria) including about a thousand (BAEP and SEP) or even nearly 2000 (VEP) patients, 44 the proportion of abnormal sensory EP was high in clinically possible, probable, and definite MS (SEP: 49%, 67%, 77%; VEP: 37%, 58%, 85%; BAEP: 30%, 47%, 67%, respectively), as well as in patients without a history of prior symptoms in the respective functional system (SEP: 51%; VEP: 51%; BAEP: 38%). The same applies to motor EP 15 with a strong correlation between CMCT to lower extremities and EDSS (rho = 0.53 45 ). In patients with primary progressive multiple sclerosis (PPMS), spinal syndromes often predominate, and VEP are frequently abnormal (in about 90% 20 ) even without corresponding clinical signs and, therefore, add diagnostic information.
The added value of mmEP to confirm a clinical diagnosis of MS has been shown in a sample of 189 patients, in which the reclassification sensitivity of a paraclinical test over clinical assessment alone was higher in MEP, SEP, and VEP (91%–96%) compared with conventional MRI (86%); mmEP allowed reclassification in 32% of patients in whom MRI did not change the diagnostic category. 46 However, as MRI standards are changing over time, reclassification sensitivity of MRI is probably higher nowadays. Nonetheless, diagnosis of MS may become more difficult in patients not presenting with a classical clinically isolated syndrome (CIS). In these cases, overreliance on imaging results may lead to misdiagnosis: 47 the final diagnoses of patients referred to a tertiary center for evaluation of MS were migraine (22%), fibromyalgia (15%), nonspecific symptoms with abnormal MRI (12%), psychogenic disorders (11%), and neuromyelitis optica spectrum disorders (6%). In these cases, normal cerebrospinal fluid and EP studies might have been helpful to the clinician.
EP as prognostic biomarker
Prognosis of disease course is important for individualized counseling and therapeutic decisions. Moreover, mmEP may be useful as a prognostic biomarker to select patients at high risk of progression for clinical trials. Enriching study samples lowers the risk of negative results due to a less-than-expected event rate.
Several studies with a total of more than 1000 patients in 13 cohorts have shown mostly a strong relationship between a baseline mmEP score and future disability measured by the EDSS (median rho = 0.57, range: 0.38–0.82; see Table 1). This general finding applies to all phases of the disease, but prognostic power is more pronounced in the early relapsing remitting phase and in primary progressive patients as compared to CIS or SP.21,22,25,28,31 The relationship increases with the length of the observation period, and mmEP at baseline have been shown to correlate with the EDSS even after 20 years. 48
To determine the added value of EP assessment over the EDSS alone, some studies have looked at the relationship between baseline mmEP and change of EDSS over time, and have still shown mainly significant correlations (median rho = 0.39, range: 0.21–0.88, Table 1). Using regression models, EDSS and EP scores at baseline were independent predictors of clinical outcome,26,28,29 and the amount of explained variability to predict EDSS after 3 years increased when EP data were included (EDSS alone:
The odds ratio for progression in mixed-patient samples was 4 over 2.5 years (RRMS, SPMS, and PPMS 20 ) and 11 over 10 years (CIS and early RRMS 31 ) in patients with EP score values greater than the median. Receiver-operating characteristic curves have shown sensitivities between 57% and 85% and specificities between 83% and 88% to detect EDSS progression in different cohorts.24,25,30,48 In a small sample of PPMS patients, the positive predictive value for EDSS progression after 3 years was actually 1, and the negative predictive value was 0.62. 28 The fact that different centers with different combinations of EP modalities and different scoring systems reached similar conclusions underlines the validity of this approach. However, a generally applicable cut-off value in EP scores remains to be determined, as does the selection of the modalities to be included into mmEP. Since upper limb EP (SEP-UL, MEP-UL) may only be affected in later stages or progressive disease and since lower limb EP (SEP-LL, MEP-LL) may be absent in these patients, the combination for EP depends on the patient sample in question. However, BAEP have shown the weakest association to future disability and have a low overall frequency of abnormal conduction in MS.20,22,23
EP as response biomarker
In MS, the relationship between structural measures from conventional MRI (brain atrophy, development of hypointense T1-lesions) and disease progression is moderate. 49 Non-conventional MRI techniques need to be validated in particular for their multicenter applicability.4,5 Measures from optical coherence tomography (OCT) reflect axonal degeneration and seem less sensitive than VEP to early damage from primarily demyelinating disorders in the optic nerve. 50 Given the fact that VEP and OCT assess the two main pathological processes in MS in a complementary way, the combination seems to be well suited for proof-of-concept studies in optic neuritis. 51 Body fluid markers as neurofilaments among others may reflect global axonal damage or other specific aspects of the disease process but need to be validated. 52
EP are more closely related to clinical disability than structural data. Improved signal conduction can be a “symptomatic” effect, for example, due to 4AP or cooling as discussed above. However, signal conduction may deteriorate with fever (Uhthoff’s phenomenon) or with agents acting on ion channels such as antiepileptic drugs. However, when excluding or balancing such confounding factors in clinical studies, improvement of signal conduction most probably reflects a true effect of remyelination or neuroprotection.
To detect treatment effects, the outcome measure must be able to reflect disease progression in the placebo group, and the effect size of the intervention has to be large enough for the chosen sample size. Studies with serial EP assessments have shown mostly significant correlations between change in EP score and change in EDSS, particularly when employing qEPS (median rho = 0.43, range: 0.18–0.69, see Table 1). Studies using EP latencies or EP scores to evaluate treatment effects in relapsing or progressive MS are summarized in Table 2 and studies in the visual system are summarized in Table 3. Possible treatment effects could be identified with EP latencies and mmEP scores in small cohorts of RRMS patients using natalizumab, 53 fingolimod, 54 and after re-infusion of the patient’s own bone-marrow cells. 55 A large study testing azathioprine in progressive MS has shown increasing latencies in sensory EP in the placebo, as well as in the treated group in parallel to clinical deterioration in both groups. 17 These studies indicate that EP change with disease course and provide rational rather than ordinal scores as the EDSS does. Therefore, EP may help to differentiate early between possibly effective and futile interventions in phase-II trials and thus may serve as response biomarkers.
Studies using EP to measure treatment effects in relapsing and progressive MS.
RCT: randomized-controlled-trial; VEP/BAEP/SEP/MEP: visual, brainstem auditory, somatosensory, motor evoked potentials; UL: upper limbs; LL: lower limbs; RMT: resting motor threshold; AR: amplitude ratio; CMCT: central motor conduction time; Lat: latency; IV: intravenous; MOA: mechanism of action
Studies using EP to measure treatment effects of neuroprotective and remyelinating agents in patients with ON.
RCT: randomized controlled trial; AON: acute (unilateral) optic neuritis; VEP: visual evoked potentials; Lat: latency; Amp: amplitude; RNFL: (thickness of the) retinal nerve fiber layer; MOA: mechanism of action.
Studies focusing on the visual system and ON as a model for testing remyelinating or neuroprotective agents are summarized in Table 3.51,62 In these studies, either VEP latency or measures from OCT were used as the primary outcome. Since baseline values in the affected eye are not reliable due to the effects of acute inflammation, for example, conduction block and optic nerve swelling, usually the values of the unaffected eye have been taken as the reference. Comparing the difference in latency change between treated and placebo groups showed probable treatment effects for simvastatin 57 and opicinumab, 58 whereas phenytoin 60 had no effect and amiloride 59 actually prolonged VEP latencies; OCT measures showed a probable effect under therapy with phenytoin, 60 a trend towards improvement under opicinumab, 58 and no effect of amiloride. 59 In a cross-over design in patients with MS and chronic demyelinating optic neuropathy, VEP latency but not OCT measures improved during treatment with clemastine while no changes were observed during the off-drug period. 61
These studies show that measuring effects of agents with proposed neuroprotective or remyelinating properties is still a challenge even in the well-defined visual system and in quite homogenous patients samples recruited for acute ON. This may mainly be due to the small effects of the tested interventions. However, agents interfering with ion channels may have a direct “symptomatic” effect on the VEP limiting its use, but VEP seem to have a higher sensitivity than OCT to detect effects of putatively remyelinating agents like opicinumab and clemastine.
Whatever the cause of improved signal conduction, EP offer a chance to detect or exclude a significant effect of the intervention. However, ceiling effects in patients with PMS may prevent using EP, especially SEP from and MEP to lower limbs. For this reason, mmEP including the upper extremities is recommended to monitor progressive disease.
Choice of EP scoring systems depends on context of use
Summarizing results from the different EP modalities into a one-number score yields an estimate of overall dysfunction from the different functional systems. Ordinal scores range from a qualitative assessment of the number of abnormal modalities 23 and the number of abnormal tests18,27 to graded scores with four 20 or six 22 steps per test. Accordingly, the dynamic range can be very small (0–3 23 ) or quite high (0–70 22 ) as given in Table 1. The global EP score (gEPS, dynamic range: 0–36 20 ) may be an attractive compromise between number of steps and complexity of definitions. The gEPS has shown robust prognostic correlations in four different cohorts.20,24,30,31
However, ordinal scores may be less well suited to detect change over time, as an increase or decrease in latency still within or still above upper limits of normal would not change an ordinal score. Furthermore, latencies are the most reliable or “solid” measures of early EP components;
63
consequently, they have been proposed to be used as the principal measure.
17
Latencies can be more easily quantified than any other parameter such as amplitudes or configurations. Z-transformation of latencies allows the normalization of all EP modalities summary into one rationally scaled number.
19
When comparing with ordinal EP scores, this quantitative EP score (qEPS) has shown similar cross-sectional and predictive correlations with the actual and future EDSS, but more frequently significant longitudinal correlations with EDSS change in three different cohorts (RRMS and SPMS;19,48 early RRMS;
25
PPMS
28
). Figure 3 shows an example of the qEPS over time in a patient with PPMS. Direct comparison of scoring systems has shown equal performance in cross-sectional correlations with the EDSS.64,65 However, the qEPS has a higher sensitivity to change as compared to the gEPS as illustrated by the lower number of patients needed to detect EP deterioration with 90% certainty over 6 months (

Multimodal evoked potentials (visual, somatosensory, motor EP; UL/LL: upper/lower limb) over time (baseline, weeks 48, 120, 172) in a sample case (39-year-old male, PPMS, disease duration: 10 month, one side per modality is shown). Red lines signify progressively longer latencies of the main EP components (VEP: P100, SEP-UL: N20, SEP-LL: P40, MEP: shortest cortico-muscular latency), bold blue lines in VEP and SEP are the mean of two replications (gray lines). In the case of SEP-LL, no P40 could be determined as indicated by the question marks; for quantitative analysis, the longest measured latency of the study sample is taken as an approximation.
Conclusion
What is needed most currently is detection of a safe and highly effective therapy for progressive MS. Efficient response biomarkers could be multidimensional, including mmEP to cover functional aspects of wanted treatment effect. MmEP scores are bi-directional, covering both improvement and deterioration. Furthermore, mmEP can be analyzed by central reading in a multicenter setting and their quantifiability is well suited for statistical analysis. EP recording is time-consuming and may not be tolerated by every patient. However, as sample size can probably be considerably smaller than when using other outcome measures, novel effective treatments may get discovered earlier, at lower cost and with less inconvenience to the whole community of patients suffering from MS. Limitations include insensitivity of EP to cerebellar, frontal, and cognitive dysfunctions and ceiling effects in advanced disease; moreover, they are not validated yet for evaluation of individual patients.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: M.H.’s research is or was supported by the Swiss Multiple Sclerosis Society and the Swiss National Science Foundation SPUM 33CM30_124115 and 33CM30_140338; L.L.’s research is or was supported by Gossweiler Foundation, Telethon, Multiple Sclerosis Italian Foundation, and by unconditional research grants from industry (Almirall, Biogen, Novartis, Merck KGaA); P.F.’s research is or was supported by the Swiss National Science Foundation SPUM 33CM30_124115 and 33CM30_140338 (PI), Swiss Multiple Sclerosis Society, Synapsis Foundation, Parkinson Schweiz, Novartis Research Foundation, Gossweiler Foundation, Freiwillige Akademische Gesellschaft Basel, Mach-Gaensslen-Stiftung, Botnar Foundation, Bangerter Foundation, and by unconditional research grants from industry (Roche, AbbVie, Biogen, General Electrics).
