Abstract
Novel or unexpected sounds that deviate from an otherwise repetitive sequence of the same sound cause behavioural distraction. Recent work has suggested that distraction also occurs during reading as fixation durations increased when a deviant sound was presented at the fixation onset of words. The present study tested the hypothesis that this increase in fixation durations occurs due to saccadic inhibition. This was done by manipulating the temporal onset of sounds relative to the fixation onset of words in the text. If novel sounds cause saccadic inhibition, they should be more distracting when presented during the second half of fixations when saccade programming usually takes place. Participants read single sentences and heard a 120 ms sound when they fixated five target words in the sentence. On most occasions (
Novel or unexpected sounds that occur in an otherwise repetitive sequence of the same sound cause cognitive distraction that is observed in both neurophysiological and behavioural responses (Berti, 2012; Escera et al., 2000; Horváth et al., 2008; Parmentier, 2014; Parmentier et al., 2019; Schröger, 1996). The detrimental effect of unexpected sounds on behavioural performance has been reported using visual, auditory and crossmodal categorization tasks, two-alternative forced-choice tasks, Go/ NoGo, visual matching, and serial recall tasks (Bendixen et al., 2010; Berti & Schröger, 2001; Hughes et al., 2005; Li et al., 2013; Ljungberg et al., 2012; Ljungberg & Parmentier, 2012; Pacheco-Unguetti & Parmentier, 2016; Parmentier, 2016; Röer et al., 2015, 2018). Interestingly, recent research has suggested that unexpected sounds may also lead to longer fixation durations during everyday tasks such as reading and scene viewing (Graupner et al., 2007; Vasilev, Parmentier, et al., 2019). In the present study, we tested the hypothesis that the increase in fixation durations during reading is due to inhibition of saccade planning. In addition, we examined whether novel sounds affect the execution of reading saccades.
Distraction by auditory novelty
Novel sounds are distracting not because of their low frequency of occurrence, but because they violate sensory predictions (Bubic et al., 2009; Parmentier et al., 2011; Schröger et al., 2007), even when the stimuli and responses in the primary task are predictable (Parmentier & Gallego, 2020). Novelty distraction is associated with specific electrophysiological responses: (a) a mismatch negativity (MMN) component, reflecting an early detection of auditory change in the brain (Näätänen et al., 1978, 2007); (b) a P3a component, reflecting the involuntary orientation of attention away from the main task (Berti, 2012; Escera et al., 2000; Schröger & Wolff, 1998); and (c) a re-orientation negativity (RON) component, interpreted as the re-orientation of attention to the main task (Berti & Schröger, 2001; Horváth et al., 2008; Schröger et al., 2000). Hence, novelty distraction is typically viewed as the outcome of an involuntary orienting response (Sokolov, 1963) towards the novel sound. The shift of attention to and away from the novel sound, as well as the processing aftermath of its involuntary semantic analysis, have been argued to be important determinants of the reduction in main task performance (Parmentier et al., 2008, 2014; Schröger, 1996).
Interestingly, recent evidence has suggested that novel sounds may also lead to general motor inhibition (Dutra et al., 2018; Wessel, 2017; Wessel & Aron, 2013, 2017). For example, Wessel and Aron (2013) found that novel sounds led to a reduction in cortico-spinal excitability some 150 ms after their presentation, which was interpreted as evidence for global inhibition of the motor system. This reduction in cortico-spinal excitability appears to be directly related to motor planning as it was positively correlated with action stopping in a Go/NoGo task (Dutra et al., 2018). These results suggest that unexpected novel sounds may trigger a transient and rapid inhibition of motor responses. Such inhibition may occur earlier in time than the attention orienting response (Wessel & Aron, 2017) and stop ongoing processes to enable a more rapid and effective analysis of unexpected sounds (Wessel, 2017).
While novelty distraction is typically measured from response times using forced-choice categorization tasks, recent evidence has suggested that deviant sounds may also affect oculomotor control (Graupner et al., 2007; Vasilev et al., 2019; Widmann et al., 2014; see also Marois & Vachon, 2018; Wetzel et al., 2016 for pupil dilation responses). For example, Graupner et al. (2007) presented visual and auditory distractors at every fifth fixation in a scene viewing task. Participants were presented with 17 standard distractors, 16 standard distractors, and 1 deviant distractor, or no distractors. Graupner et al. (2007) found that the deviant sound led to an increase in fixation durations, which was due to a reduction in the proportion of terminated fixations. This reduction occurred at two distinct time intervals: first at around 90 ms and then at around 150 ms after the sound’s onset.
In addition, Widmann et al. (2014) studied the categorisation of sounds using
Furthermore, Vasilev et al. (2019) presented a 50 ms sound when readers fixated five target words in a sentence. Participants either heard five standard sounds (a sine wave) or four standard and one deviant sound (a burst of white noise). The authors found that the deviant sound led to an increase in fixation durations immediately after its presentation. A time-course analysis revealed that the deviant sound began to affect fixation durations some 180 ms following the sound’s onset. Because the increase in fixation durations originated relatively late in the fixation duration distribution and did not appear to be related to the lexical processing of words in the sentence, Vasilev et al. (2019) hypothesised that this delay stemmed from the disrupted programming of the next saccade. In summary, these results suggest that unexpected sounds can affect oculomotor control during everyday tasks such as scene viewing and reading.
Eye-movement control during reading
During reading, the eyes alternate between short periods of relative stability (i.e.,
The average fixation duration during reading is about 240 ms (Reichle & Reingold, 2013). In that time, readers have to extract linguistic information from the fixated word to recognise it and then plan a saccade to the next word. There is evidence that word processing starts soon after fixation onset. For example, visual input during the first 60 ms appears to be crucial for word recognition as lexical processing occurs even if the word disappears afterwards (Liversedge et al., 2004; Rayner et al., 2003). In addition, linguistic variables typically have a quick influence on fixation durations. For instance, survival analyses have shown that lexical frequency, predictability, lexical ambiguity, and preview validity start to influence fixation durations between 120 and 140 ms from fixation onset (Reingold & Sheridan, 2014). Moreover, neurophysiological evidence has shown that lexical processing typically occurs around 127–172 ms after fixation onset on average (Reichle & Reingold, 2013). However, because some parafoveal pre-processing usually occurs before words are fixated, lexical processing often starts even earlier than that (Reichle & Reingold, 2013). Therefore, lexical processing of the fixated word typically begins prior to the planning of the next saccade.
This idea is illustrated well in serial-attention models of eye-movement control such as E-Z Reader (Reichle et al., 1998, 2009). In this model, word recognition starts with a 50 ms visual processing stage that reflects the time needed for visual information from the retina to reach the cortex (Pollatsek et al., 2006; see also Foxe & Simpson, 2002). This is then followed by two attention-dependent lexical processing stages —familiarity check (L1) and lexical access (L2). L1 roughly corresponds to the recognition of the orthographic wordform and is used to estimate the difficulty of accessing the meaning of the word. L2, on the other hand, reflects the actual act of word identification (i.e., recognising and retrieving the word’s meaning from memory; Reichle et al., 2003). Once L1 is completed, the programming of the saccade to the next word is initiated because completion of the second stage (L2) is imminent. The programming of the next saccade also occurs in two stages: (a) a labile stage (M1) in which the current saccade plan can be cancelled by another programme and (b) a non-labile stage (M2) in which the current saccade plan can no longer be cancelled. Once lexical processing of the fixated word is completed, attention shifts covertly to the next word in anticipation of the eye-movement.
Parallel-attention models such as SWIFT (Engbert et al., 2002, 2005) differ from E-Z Reader in that attention is allocated to more than one word at a time, but they also share a number of core assumptions. For instance, lexical processing in SWIFT also occurs in two stages, as does the programming of the next saccade. However, contrary to E-Z Reader, the trigger to start planning the next saccade in SWIFT is not determined by the completion of some initial lexical processing, but is instead based on a random saccadic timer with a pre-defined mean (Engbert et al., 2005). Nevertheless, this timer can be inhibited foveally by the processing difficulty of the fixated word, such that less frequent (i.e., more difficult) words will prolong fixation durations. This effectively delays the onset of the next saccade to allow for lexical processing of less frequent words to occur. Such delay mechanism could only work if some lexical processing occurs early enough to allow the difficulty of the fixated word to be estimated and the programming of the next saccade to be delayed as a consequence. Therefore, in both models, lexical processing of the fixated word must generally start early on in the fixation, whereas saccade planning by necessity occurs towards the end of fixations.
Neural control of saccades
The execution of saccades occurs immediately after their planning. However, it is currently not known whether saccade execution can be perturbed by novel sounds. Saccades begin with an acceleration phase that lasts until their peak velocity is reached, after which they start to decelerate. The duration and peak velocity of saccades increases non-linearly with greater saccadic amplitude (often called the “main sequence”; Bahill et al., 1975). Reading saccades usually have an amplitude of ~ 2º (7–9 letters; Rayner, 2009) and last for about 20–40 ms (Bouma & De Voogd, 1974; Pollatsek et al., 2006). Saccade execution is controlled by brainstem burst neurons, which receive their main input from the superior colliculus (SC) (Sparks, 2002; Watanabe & Munoz, 2011). Importantly, saccadic parameters are coupled to the activity of excitatory burst neurons (EBNs) in the brainstem. More specifically, peak saccade velocity is correlated with the maximum firing rate of EBNs, saccade duration is correlated with their burst duration, and the number of spikes in the burst is correlated with saccade amplitude (Fuchs et al., 1985; Galley, 1989; Sparks, 2002). Therefore, as the execution of saccades depends on the firing of brainstem neurons, any inhibition of these neurons should also be reflected in slower saccade velocities and longer saccade durations.
Present study
Unexpected sounds lead to an immediate increase in fixation durations during reading (Vasilev, Parmentier et al., 2019), which may occur due to global transient inhibition of motor responses (Wessel & Aron, 2013, 2017). Therefore, the increase in fixation durations may be due to saccadic inhibition during the planning stages of the next saccade. We will refer to this explanation as the saccadic inhibition hypothesis (SIH). This hypothesis is supported by the finding that deviance distraction occurs late in fixation duration distributions and does not appear to be modulated by the lexical frequency of words (Vasilev, Parmentier et al., 2019). However, this evidence is only indirect in nature. Here, we set out to test the SIH more directly by manipulating the timing of auditory distractors (standard versus novel) relative to the fixation onset of words.
As already mentioned, lexical processing of words starts soon after fixation onset, whereas programming of the next saccade by necessity occurs towards the end of fixations. Therefore, if the SIH is correct, novel sounds should be more distracting when played temporarily closer to the end of fixation (i.e., when saccade programming takes places). To test this hypothesis, we manipulated the temporal onset of sounds relative to the fixation onset of words. A 120 ms sound was played when participants fixated five target words in a sentence. The same sine wave tone (“standard”) was played on most occasions (
A secondary goal of our study was to examine the unexplored issue of whether novel sounds may affect not only the programming, but also the execution of the next saccade. This is a relevant question given both the temporal overlap between saccade planning and execution, and the anatomical overlap between structures thought to be implicated in the temporary suppression of motor activity and structures controlling the execution of saccades. Wessel and Aron (2013) have argued that unexpected events lead to a global inhibition of the motor system, which may be modulated by the subthalamic nucleus (STN). The STN contains visual-motor neurons (Fawcett et al., 2005; Matsumura et al., 1992) and receives direct projections from the cerebral cortex (Ma & Geyer, 2017). Furthermore, the STN may be involved in maintaining a sustained fixation before goal-directed saccades by suppressing neurons in the SC via an indirect route through the substantia nigra pars reticulata (SNr) (Hikosaka et al., 2000). Of interest, the SC also controls the execution of saccades in the brainstem (Sparks, 2002; Watanabe & Munoz, 2011). Therefore, it is possible, but yet unknown, that the motor inhibition thought to follow unexpected sounds may also affect the firing of EBNs, which is related to the peak velocity of saccades (Fuchs et al., 1985). If novel sounds affect the execution of saccades, then we should observe a reduction in their velocity and an increase in their duration. This should particularly be the case in the 120 ms delay condition, where the novel sound is played temporarily closer to the execution of the next saccade.
Hypotheses
Method
Participants
Sixty-four undergraduate students from Bournemouth University (53 female) participated for course credits. Their average age was 19.3 years (range: 18–31 years;
Materials and design
The method is illustrated in Figure 1. The reading stimuli consisted of 120 English sentences that were taken from Vasilev et al. (2019). In each sentence, there were five target words on which the sounds were presented using a gaze-contingent manipulation (see Inhoff et al., 2002). The target words were always the 3rd, 5th, 7th, 9th, and 11th word in the sentence. Each target word was followed by one non-target word on which no sounds were played. This ensured that the sounds were not played too quickly after one another. The target words were 6.75 letters long on average (

Diagrammatic illustration of the task’s key characteristics. (a) Standard and novel sounds used in the experiment. (b) Presentation of sounds in standard and novel trials. Standard trials contained five standard sounds, while novel sounds contained four standard and one novel sound (played equiprobably at positions 2–5). (c) Example sentence with the position of words on which the five sounds were played. (d) Gaze-contingent sound presentation in the experiment. Sounds were played when participants crossed an invisible boundary (Rayner, 1975; denoted by a vertical orange line) located before the five target words. (e) Sound onset delay manipulation. Sounds were played either immediately (0 ms delay) or with a 120 ms delay after crossing the boundary. The delay was the same for all sounds played in a given sentence. (f) Example of a comprehension question (presented after 33% of sentences).
Sixty novel sounds were taken from Andrés et al. (2006). The sounds were originally 200 ms long but were compressed to 120 ms (without alteration in pitch) for the purpose of the present experiment. The novel sounds consisted of different environmental sounds (e.g., sound of a drill, telephone ringing, engine, etc.). The standard sound was a 120 ms sinewave tone with a frequency of 400 Hz, with 10 ms fade-in and fade-out ramps. All sounds were monoaural, with a sampling rate of 44100 Hz, a bit depth of 16 bit. A silence condition was not included since (a) we previously showed that the presentation of five gaze-contingent standard sounds does not affect reading behaviour compared to silent reading (Vasilev, Parmentier et al., 2019); and (b) because it was not relevant to our objective, which was to determine whether the timing of the novel sounds influences oculomotor control.
The study had a 2 x 2 within-subject design, with
Apparatus
Participants’ eye-movements were recorded with an Eyelink 1000 eye-tracker at 1000 Hz. Viewing was binocular, but only the right eye was recorded. Participants’ head was stabilised with a chin-and-forehead rest to reduce head-movements. The stimuli were displayed on a Cambridge Research Systems LCD ++ monitor (resolution: 1920 x 1080 pixels; refresh rate: 120 Hz). The sound stimuli were played on a Creative Labs Sound Blaster X-Fi SB0770 sound card and were presented binaurally through Bose QuietComfort 25 noise-cancelling headphones at a 65 dB(A) SPL.
The experiment was programmed in Matlab R2014a (MathWorks, 2014) using the Psychophysics Toolbox v.3.0.11 (Brainard, 1997; Pelli, 1997) and Eyelink libraries (Cornelissen et al., 2002). The sound stimuli were played using the low-latency mode of the Psychophysics Toolbox (output latency was 14 ms). The sentences were formatted in a monospaced Courier New 18pt. font and appeared on a single line in the middle of the screen. The sentences were presented with a 50-pixel offset from the left side of the screen and appeared as black text over white background. The letter width was 14 pixels and the eye-to-screen distance was 80 cm. At this distance, each letter subtended ~0.34º of visual angle horizontally. The experiment was run on a PC in a Windows 7 64-bit environment.
Procedure
Participants were tested individually in a session that lasted about 40 min. Before the start of the experiment, a 3-point horizontal calibration was performed. Calibration accuracy was then monitored with a drift check presented before each trial and participants were recalibrated whenever necessary (the error was kept at < 0.3º throughout the experiment). All beeps during calibration and drift check were turned off. Participants were instructed to ignore any sounds they may hear and to read the sentences for comprehension. The experiment started with six practice items presented in silence. Each trial began with a black gaze box that was centred at the first letter in the sentence. Once a stable fixation inside the gaze box was detected, the box disappeared and the sentence was presented on the screen.
The gaze-contingent sound presentation was implemented by placing an invisible boundary (Rayner, 1975) before each of the five target words (for more details on this technique, see Eiter & Inhoff, 2010; Inhoff et al., 2002, 2004). Once the gaze position of the eye crossed an invisible boundary, the sound was presented. Depending on the sound onset condition, the sound was played either immediately after crossing the boundary (0 ms delay) or 120 ms after crossing the boundary (120 ms delay). This typically happened as participants made a forward saccade towards the target word. A third of the sentences were followed by a Yes/No comprehension question (see Figure 1f). Participants pressed the left button of the mouse to terminate the trial and to answer the comprehension questions. The questions were used to ensure that participants were reading for comprehension and were not a variable of main interest in the analyses.
Data analysis
The analysis focused on the immediate effect of the sound, which was measured on the first fixation and first saccade after it was played. We did not restrict the analysis only to cases where the Target Word was fixated during first-pass reading, but also included cases where it was skipped and another word was fixated. This was because our hypothesis predicted a general inhibition of saccade planning that should occur regardless of which word is fixated. However, the results did not change when restricting the analysis only to cases where the target word was fixated (see the online Supplementary Material A).
The fixation and saccade data were analysed only for sounds played at Target Word locations 2–5 since no novel sounds occurred on Target Word 1. Because the standard sound was presented much more frequently than the novel sounds, only one standard sound was sampled per trial. This sound was picked using the design matrix that was used to counter-balance the presentation of novel sounds. This ensured that a balanced dataset was formed, which had an equal number of standard and novel sounds that were presented equally often on Target Words 2–5. In the fixation data, we analysed only the first fixation duration during which the sound was played. This was because Vasilev et al. (2019) established that the first fixation was the source of the sound deviance effect. To measure the effect of novel sounds on the execution of the next saccade after the playing the sound, a few measures were used: saccade duration, saccade amplitude, peak and average saccade velocity.
The data were analysed with (Generalised) Linear Mixed Models (GLMM[s]) using the lme4 package v.1.1-21 (Bates et al., 2014) in the R software v.3.6.2 (R Core Team, 2019). The fixed factors in the model were sound type (standard vs. novel), sound onset delay (0 vs. 120 ms delay), and their interaction. Random intercepts were added for both participants and items (Baayen et al., 2008). In addition, we tried adding random slopes for sound type and sound onset delay (Barr et al., 2013). However, the models converged only with a random slope for sound onset delay (subjects) for the fixation duration and saccade velocity analyses, and sound type (subjects) for the saccade duration and amplitude analyses. Fixation durations, saccade durations, and saccade amplitude were log-transformed in the models as this improved the distribution of residuals. However, the results did not change when using the untransformed values. Treatment contrast coding was used for both the sound type (baseline: standard) and sound onset delay (baseline: 0 ms delay) conditions. The results were considered statistically significant if the |
In addition, a survival analysis was done to determine the time course of novelty distraction by estimating the earliest point in time when the effect could be reliably detected. This was done using the confidence interval divergence point analysis (CI-DPA; Reingold & Sheridan, 2014, 2018). This is a survival analysis technique that helps determine the earliest point in time where the distributions of the two sound conditions begin to significantly diverge from one another. The analysis was done with 10 000 bootstrap iterations following the method described in Reingold and Sheridan (2018).
Results
All participants exhibited a comprehension accuracy score greater than 84.6%, thus indicating they had no problems understanding the sentences. Participants’ mean comprehension accuracy was 95.7% in the standard (
First fixation duration
Descriptive statistics for the first fixation duration during which the sound was played are presented in Table 1 and are illustrated in Figure 2. The results from the LMM analysis are presented in Table 2. Consistent with
Mean descriptive statistics for the first fixation duration during which the sound is played and the first saccade immediately after playing the sound (SDs in parentheses).
1º = 2.9 letters.
LMM results for the first fixation duration during which the sound is played.
LMM: linear mixed models; BF: Bayes factor;
BF10: Bayes factor (values > 1 indicate evidence for the alternative hypothesis; values < 1 indicate evidence for the null hypothesis). Statistically significant

Duration of the first fixation during which the sound is played. Shading indicates ± 1
A simple-effects analysis (Lenth et al., 2019) of the interaction with Tukey
One possibility is that novel sounds may have affected word processing beyond the first fixation. If participants were more likely to make additional first-pass fixations on the target word, this may indicate lexical processing difficulty. To examine this, we did a post hoc analysis of first-pass re-fixation probability on the target word. The results (presented in the Supplementary Material C) indicated no significant differences in this measure. This suggests that the effect of novel sounds was largely constrained to the first fixation during which they are played.
To determine the earliest point in time when the novel sound started to significantly influence fixation durations, the CI-DPA analysis (Reingold & Sheridan, 2018) was used with the first fixation data. The results are illustrated in Figure 3. In the 0 ms delay condition, the distributions of the two sound conditions began to significantly diverge at 152 ms (95% CI = [130, 185]. In contrast, the two distributions in the 120 ms onset delay condition started to diverge at 178 ms (95% CI = [149, 198]) (58 ms post the sound onset). Therefore, while the divergence point for the 120 ms delay condition occurred some 26 ms later, the novel sounds in both conditions had a relatively quick influence on fixation durations. Nevertheless, as Figure 3c shows, the strongest reduction in terminated fixations between the novel and the standard sound did not occur until 199 ms in 0 ms delay condition and 241 ms in the 120 ms delay condition (199 and 121 ms post the sound onset, respectively). Therefore, while the difference was detectible statistically early on, it did not reach its greatest magnitude immediately in the distribution of fixation durations.

Results from the divergence point analysis of the first fixation duration during which the sound is played (a, b) and difference in survival between the novel and standard conditions (c). In panels (a, b), the survival curves are plotted for the two sound conditions, denoting the percentage of “surviving” (i.e., remaining) fixations at each duration bin. The divergence point is illustrated by a vertical solid line and the 95% confidence interval (CI) is illustrated by vertical dotted lines. Panel (c) shows the difference in “survival” curves between the standard and novel conditions, along with the time point of maximum difference (i.e., when the effect was strongest in the distribution of fixations).
Saccade execution
The descriptive statistics for saccade measures are shown in Table 1 and the LMM results are presented in Tables 3 and 4. Contrary to
LMM results for the duration and amplitude of the next saccade after playing the sound.
LMM: linear mixed models; BF: Bayes factor;
BF10: Bayes factor (values > 1 indicate evidence for the alternative hypothesis; values < 1 indicate evidence for the null hypothesis). Statistically significant
LMM results for the peak and average saccade velocity of the next saccade after playing the sound.
LMM: linear mixed models; BF: Bayes factor;
BF10: Bayes factor (values > 1 indicate evidence for the alternative hypothesis; values < 1 indicate evidence for the null hypothesis). Statistically significant

A scatterplot of peak saccade velocity and saccade amplitude (a) and saccade duration and saccade amplitude (b) in the two onset delay conditions (“main sequence”; Bahill et al., 1975). Each data point shows the mean value for novel (circle shape) and standard sounds (triangle shape) for each subject. Lines show the regression slopes.
Discussion
The present study tested whether the increase in fixation durations during reading in response to unexpected sounds reflects a perturbation of the planning of the next saccade. In addition, it examined whether novels sounds affect not only the planning, but also the execution of the next saccade. Consistent with previous results (Vasilev, Parmentier et al., 2019), novel sounds led to longer first fixation durations compared to standard sounds immediately after their presentation. Although the magnitude of the effect was small, it was comparable to that of other auditory distraction effects during reading (Vasilev et al., 2018; but see also Hyönä & Ekholm, 2016; Yan et al., 2018). Critically, however, novel sounds were more distracting when presented with a 120 ms delay compared to a 0 ms delay. Because the sound in the 120 ms delay condition was played temporarily closer to when readers usually plan their next saccade, the present results support the SIH, which stated that the observed distraction is due to the transient inhibition of saccade planning.
Interestingly, however, saccade execution parameters such as amplitude, duration, and velocity remained unaffected by novel sounds in both delay conditions. The Bayes Factor analyses favoured the null hypothesis of no difference over the alternative hypothesis, and the evidence for the null was substantial (Jeffreys, 1961) in all but two cases. This suggests that novel sounds affect only the planning but not the execution of saccades (at least when the sound is played during a fixation). Therefore, while novelty distraction was rapid enough to delay the onset of the next saccade, its effect was short-lived and did not affect the movement of the eyes during a saccade once this movement was triggered. In this sense, the present results suggest that novel sounds do not lead to a general inhibition of all oculomotor behaviour but, rather, to a transient one that affects only saccade programming (at least when the sounds are presented up to 120 ms into the fixation).
The greater distraction in first fixations durations in the 120 ms delay condition is consistent with the notion that unexpected sounds lead to a general and temporary inhibition of motor responses (Dutra et al., 2018; Wessel, 2017; Wessel & Aron, 2013). However, such inhibition would be expected to affect all oculomotor behaviour, including the execution of saccades. Because saccadic variables such as peak velocity are assumed to be closely correlated with the firing of brainstem neurons (e.g., Di Stasi et al., 2013; Fuchs et al., 1985), our results suggest that novel sounds do not affect the neural circuits that control the movement of the eyes during a saccade. While this result may appear at odds with Wessel and Aron’s (2013) global motor suppression account, this suppression could be very time-sensitive and disappear by the time the eyes are in motion. In fact, Wessel and Aron (2013) reported that inhibition of motor-evoked potentials was present 150 ms after the sound onset, but had already disappeared 25 ms later. Therefore, the inhibition in the present experiment may be too transient to affect the execution of the next saccade.
A stronger test of whether novel sounds inhibit saccade execution would be to play them during a saccade. However, as most saccades are considerably shorter than the 60 to 150 ms needed for the effect to occur, the influence of novel sounds on saccade execution may be better studied in smooth pursuit tasks where observers have to continuously follow a moving object with their eyes (e.g., Lisberger et al., 1987; Robinson, 1965). Of course, one can also try to adjust the timing of the sounds in the current paradigm to maximise the chance that the strongest inhibition occurs exactly during the saccade. However, as reading saccades are very short, this critical point can be easily missed. As such, simple saccade tasks where there is strong experimental control over when the next saccade occurs may be preferable.
The time-course of novelty distraction in first fixation durations revealed that novel sounds first began to have an effect some 150 ms after fixation onset in the 0 ms delay condition, and some 180 ms after fixation onset in the 120 ms delay condition. The 150 ms onset in the former condition is in line with Wessel and Aron’s (2013) findings. Interestingly, the distraction effect in the 120 ms delay condition first started some 60 ms after the sound’s onset (i.e., 180–120 = 60 ms). Therefore, novel sounds exerted a faster effect on fixation durations when the sound was played, on average, during the second half of fixations. This rapid effect echoes Graupner et al.’s (2007) results where an auditory deviant caused a “first” saccadic inhibition some 90 ms after sound onset (the sounds in their study were played 100 ms after fixation onset, making it similar to the 120 ms delay condition in our study). In addition, the early onset in the 120 ms condition is generally consistent with Widmann et al.’s (2014) finding that microsaccadic inhibition originating some 80–100 ms post the sound onset may indicate the fast categorisation of auditory sounds.
Nevertheless, it is important to emphasise that the divergence point analysis indicates only the
The earlier onset of the effect in the 120 ms delay condition may occur because novel sounds usually coincided with the critical period during which saccade planning takes place. Because very few fixations terminated within the first 100 ms from the sound’s onset in the 0 ms delay condition, such early inhibition may not be seen in that condition. 5 Therefore, the overall pattern of inhibition was somewhat similar across the two delay conditions, but it was much stronger and started earlier in the 120 ms delay condition. Thus, while novelty distraction was present in both delay conditions, the magnitude of the effect was greater in the 120 ms condition because it affected a larger proportion of fixations.
What is the purpose of this temporary inhibition of eye-movement control by novel sounds? It may represent an adaptive response in the face of an unexpected contextual change where readers delay their oculomotor plans to react to something unexpected (i.e., auditory novelty pointing towards potential danger). Unexpected sounds are arguably surprising events (e.g., Parmentier et al., 2019; Wessel et al., 2016). As such, this “freezing” of oculomotor plans prior to goal-directed saccades may allow readers time to process this contextual change and select an appropriate response (see also Wessel, 2017, 2018). Generally speaking, sounds violating sensory prediction can be regarded as one instance of an unexpected change in our immediate environment, which brings some uncertainty and calls for a reappraisal of current actions plans. In line with this contention, novel sounds have been shown to render the repetition of one’s behaviour more difficult to facilitate the adoption of a different action plan (Bendixen & Schröger, 2008; Parmentier, 2016; Roeber et al., 2005, 2009). Interestingly, there is some evidence that saccadic inhibition can also occur in response to visual deviants (e.g., Godijn & Kramer, 2008; Graupner et al., 2007). Therefore, one interesting question for future research would be to explore whether this inhibitory response is specific to auditory stimuli, or if it reflects a more general response to unexpected contextual changes, which also extends to the visual domain.
Currently, it is not clear whether the increase in fixation durations simply reflects a neural inhibition that slows down the neural firing leading up to a saccade or if the current saccade plan is cancelled to plan a different one. Computational models of reading (e.g., Engbert et al., 2005; Reichle et al., 1998) assume that saccade planning occurs in two stages: one cancellable and another non-cancellable. This proposition is based on Becker and Jürgens’ (1979) findings that saccades can be programmed in parallel and modified if a new target is presented sufficiently early in the planning stages. In the context of our study, the next saccade plan may be reset or cancelled due to the detection of auditory novelty and, perhaps, the following reappraisal of action plans, before the saccade is re-programmed. Alternatively, saccade plans may stay the same, but simply take longer to programme because they are interrupted by a transient global neuro-motor inhibition. Future research might possibly adjudicate between these two explanations by using the double-step paradigm (e.g., Becker, 1972), where a target jumps to a new location prior to the execution of a saccade towards it. If novel sounds affect saccadic plans, then they should also influence the parallel planning of saccades in response to double-step stimuli.
Given that the maximum inhibition in the 120 ms delay condition occurred close to when the “average” fixation duration in the experiment was ending and the next saccade was starting (~ 240 ms; see Figure 3c), it is perhaps surprising that novel sounds did not affect saccade execution. In the E-Z Reader framework, the observed inhibition was well within the last 25–50 ms of the fixation, corresponding to the non-labile stage where the direction and distance of the next saccade is communicated to the motor system (Reichle et al., 2003, 2009). However, the subsequent movement of the eyes during the saccade was not affected. We speculate that this may occur because the inhibitory effect is very time-sensitive and may affect behaviour only within a very narrow window of time when the novel sound is first detected (~ 25 ms in Wessel & Aron, 2013). In addition, saccades are very quick ballistic movements- their duration and velocity are not under voluntary control (Leigh & Zee, 1999) and they are usually too short to be influenced by perceptual feedback (Optican & Pretegiani, 2017). In this sense, there may be little adaptive value in inhibiting a saccadic movement once it has already started because the individual will have a limited ability to act upon the unexpected stimulus during the saccade. As this is the first study to look at this topic, more evidence is required to reach a firmer conclusion.
Finally, while the present data is consistent with the SIH, it does not prove that novelty distraction is completely independent of language processing. For instance, the experimental manipulation is not sensitive to higher-level syntactic or integration processes that may occur after readers have left the fixated word. While the available data suggest that distraction is mostly constrained to the current fixation (Vasilev, Parmentier et al., 2019), further research is needed to better understand if unexpected sounds can have any effect at all on language processing. Nevertheless, we anticipate that the present results will be useful in stimulating further research on how unexpected sounds may affect complex everyday tasks such as reading.
In summary, the present study showed that novel, task-irrelevant sounds exert greater distraction in a reading task when they occur temporarily close to the end of fixations, thus suggesting that they disrupt the planning of the next saccade. At the same time, novel sounds do not appear to influence the execution of the next saccade. This suggests that their inhibition is short-lived, and the distractive impact of the novel sounds generally does not extend to subsequent oculomotor behaviour. These results provide further evidence that unexpected sounds can lead to a rapid inhibition of oculomotor planning in everyday tasks such as reading.
Supplemental Material
sj-docx-1-qjp-10.1177_1747021820982267 – Supplemental material for Distraction by auditory novelty during reading: Evidence for disruption in saccade planning, but not saccade execution
Supplemental material, sj-docx-1-qjp-10.1177_1747021820982267 for Distraction by auditory novelty during reading: Evidence for disruption in saccade planning, but not saccade execution by Martin R Vasilev, Fabrice BR Parmentier and Julie A Kirkby in Quarterly Journal of Experimental Psychology
Footnotes
Acknowledgements
Authors’ note
These data were presented at the 2020 EPS meeting (8–10 January) in London, UK.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Martin Vasilev was supported by a doctoral and a post-doctoral fellowship from Bournemouth University. Fabrice Parmentier was supported by a research grant from the Spanish Ministry of Science, Innovation and Universities (PSI2014-54261 -P), the Spanish State Agency for Research (AEI), and the European Regional Development Fund (FEDER).
Data accessibility statement
The data and materials from the present experiment are publicly available at the Open Science Framework website: https://osf.io/dscjy and ![]()
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
