Abstract
Past research from our lab has suggested visual demands in video games serve to exercise attentional-oculomotor (A-O) processing in a manner beneficial to reading. However, testing the effect of video games on reading typically requires long timeframes (e.g., multiweek training or years of accumulated video game experience). The current study manipulated within-experiment peripheral and central demands to evaluate the effects of A-O exercise on task performance. Our study included two tasks: an orthographic lexical decision task (OLDT), designed to optimise orthographic lexical processing, and a novel graphic-based health bar decision task (HBDT). In Experiment 1, the stimuli were presented centrally in one block and peripherally in another block to manipulate A-O exercise. We observed greater improvements in the peripheral-first than the central-first group, particularly for the OLDT. In Experiments 2 and 3, we focused on the OLDT, with the HBDT serving as the A-O exercise task, and observed improvements in both centrally and peripherally trained participants. We additionally observed, through analyses of word and bigram frequency, a double dissociation, whereby increased target word frequency was associated with faster target reaction times and improved error rates, whereas increased foil bigram frequency was associated with slower foil reaction times and worse error rates. Taken together, the experiments demonstrate a mechanism beyond simple task learning that drives reading improvements, and A-O exercise, even if movements are small, appears to play a role in the improvements observed. We suggest future research should further develop this paradigm and examine its utility for reading remediation in dyslexia.
Keywords
When an athlete wants to improve at activities like weightlifting, or running, they will prepare a training regimen of exercises targeted to strengthen their muscles or their endurance. Nonathletes also engage in physical exercise to maintain their form and health, and just as lifting weights or practicing for a marathon improves your physical fitness, one can also engage in cognitive exercise to maintain or improve mental skills such as memory (Holmes et al., 2019) or arithmetic (Sella et al., 2016). There is also commercial interest in cognitive exercise, with mobile applications and games purporting to improve cognitive skills. Nintendo’s game, BrainAge, is one example of a commercial game which aims to improve cognitive skills through a variety of mini-games (Nouchi et al., 2012). In the case of BrainAge, researchers observed improvements in executive function and processing speed relative to the control group, thus demonstrating the potential for commercial video games to improve cognitive skills.
Reading is an incredibly valuable cognitive skill and uniquely human experience. Printed text is a common occurrence on signage, medication labels, employment contracts, or bank documents, which all need to be read and comprehended to act on appropriately. Difficulties with reading therefore lead to difficulties in carrying out many day-to-day tasks required for success and well-being. The goal of our present research is to test whether reading can be improved through cognitive exercise, namely attentional-oculomotor (A-O) exercise.
Reading requires cognitive exercise
Improving literacy is a global issue; the international median rate of students at or below the low reading benchmark was 16% (Brochu et al., 2018). Both children and adults can struggle with literacy. The Council of Ministers of Education Canada and Statistics Canada (2013) reported that 17% of adult Canadians are at the lowest levels for literacy.
Given the vital role that reading plays in everyday activities, and the currently existing gaps in many individuals’ reading skills, reading makes an excellent candidate to target through cognitive exercise. Extracurricular reading is perhaps the broadest option for improving reading skills, and higher levels of reading engagement are associated with better reading comprehension and vocabulary (Pfost et al., 2013), but this natural form of reading practice is partially reliant on the individual’s baseline ability and interest in reading. If individuals with dyslexia have negative associations with the reading experience, any anxiety or frustration that accompanies the activity could discourage them from extracurricular reading (Stanovich, 1986).
Rather than relying on organic reading, which could contribute to existing equity gaps in reading ability, educational organisations could instead use cognitive exercise that specifically targets reading (reading-focused exercise). Reading-focused exercise could include activities such as phonetic decoding (Foorman et al., 2018), text reading, orthographic rule instruction, multisensory instruction (Hall et al., 2022), and individually tailored reading interventions (Partanen et al., 2019). These exercises are often administered over multiple sessions, under the guidance of an educator or specialist, which can help ensure the individual is making regular progress in their reading improvement. However, the extended timeframes and personnel required in most reading interventions may pose barriers for children from impoverished communities and for adults juggling multiple responsibilities.
Alternatively, cognitive exercise could target important processes that make reading easier. Given the multimodal system of reading, this approach opens numerous avenues for activities to indirectly improve reading, particularly through visual attention, which we have already seen is a key component in reading processes (e.g., Ekstrand et al., 2016; Ekstrand, Neudorf, Gould, et al., 2019; Ekstrand, Neudorf, Kress, & Borowsky, 2019; Gabrieli & Norton, 2012).
Visual demands and reading performance
Some previous research from our lab found that greater experience with peripheral demands in video games was associated with faster reading speeds, which may indicate that peripheral visual demands exercise oculomotor control ability (Kress et al., 2023). This same study observed greater experience with central demands in video games was associated with slower reading speeds, demonstrating that visual-spatial demand location exhibits a double dissociation with reading speed.
This double dissociation suggests that visual-spatial demands employ oculomotor control processes differently depending on location, with peripheral demands exercising oculomotor control processes more than central demands. Other researchers (Minissi et al., 2024; West et al., 2013) have also argued that video games can affect oculomotor performance as well. From our findings (Kress et al., 2023), we argued that games with a high quantity of peripheral visual demands (e.g., enemies popping up in unexpected locations on the outskirts of the screen) encourage the player to regularly and systematically move their eyes across all parts of the screen, exercising the full range of oculomotor function. In contrast, when the player only needs to focus on the centre of the screen, their oculomotor function is not fully engaged. If the oculomotor system is not used to being engaged, it will be less prepared for the systematic eye movements required during reading. Based on that previous work, we propose A-O exercise as a form of cognitive exercise which may benefit reading performance through improved oculomotor control.
A-O exercise for reading
The form of cognitive exercise that we are interested in for the purposes of our research is A-O exercise, which we will define as activities that encourage processing in the visual-attentional system and promote oculomotor control. The visual attention system integrates a number of visual and attentional processes, including low-level visual processing, attentional alerting and orienting mechanisms, and higher-level executive control (Corbetta & Shulman, 2002; Fan et al., 2002). The visual-attentional system is traditionally conceptualised as a two-network system (Corbetta & Shulman, 2002). The ventral attention network handles bottom-up control processes, which are stimulus-driven responses that capture attention automatically (e.g., looking automatically to the source of a sudden sound or movement), and includes the inferior frontal and middle frontal gyri. The temporoparietal junction also plays a role in this network and is thought to act as a switching mechanism (interrupting top-down attention, engaging bottom-up attention) when a salient stimulus arrives in an unattended location. The dorsal attention network handles top-down attentional control processes, which are those goal-driven responses that are capturing attention through voluntary effort (e.g., instructions to attend to a certain stimulus). Regions such as the frontal-eye-field (FEF) and intraparietal sulcus/superior parietal lobule are noteworthy regions of the dorsal attention network.
The FEF is particularly important for its role in oculomotor control. Models of oculomotor control describe the system underlying our eye movements, which involves deciding when and to where eye shifts should take place (Tatler et al., 2017). This control over eye movements is critical in reading, where the reader must systematically shift their eyes across the page. Simulations by Li and Pollatsek (2020) demonstrate the role of eye movements in Chinese word processing, supporting the argument for oculomotor control’s involvement in reading.
Researchers in the field of dyslexia frequently propose links between attentional network dysfunction and the reading deficits that are the hallmark of the disorder (Boden & Giaschi, 2007; Boros et al., 2016; Facoetti et al., 2000; Vidyasagar & Pammer, 2010). Oculomotor control difficulties would contribute to difficulties with attentional orienting processes, such as maintaining fixation or moving eyes accurately (Boden & Giaschi, 2007; Morrison, 1984). In the reading network, these attentional orienting difficulties would be most likely to translate to difficulties with visual letter processing (orthographic feature encoding) or phonetic decoding (grapheme-to-phoneme conversion), as these processes are more associated with voluntary attention and oculomotor control than whole-word processing via the orthographic lexicon (Ekstrand, Neudorf, Kress, & Borowsky, 2019), and both orthographic and phonological processes are important to successful reading (Acha et al., 2024).
Video game play as A-O exercise
Based on the literature, video game play may be one activity that promotes A-O exercise. Video games expose players to an assortment of visual stimuli both in central and peripheral areas of the screen. The visual stimuli presented in video games can be thought of as highly valid attentional cues—if a stimulus appears on-screen (such as a quest marker that appears on the player’s mini-map, a glowing pop-up that indicates it is possible to interact with an object, or a damage notification that points the player in the direction from which they received the attack), it is almost always important and should be attended to. The high-frequency, high-relevance visual stimuli presented peripherally would encourage substantial use of the oculomotor control system, thus contributing to A-O exercise. Training studies have observed oculomotor control system changes in structure and performance following video game play in older adults (Diarra et al., 2019; West et al., 2013). In a structural brain connectivity study, Kowalczyk et al. (2018) found that the connectivity in visual-spatial regions was greater in real-time strategy players compared with people who do not play games regularly, and behavioural studies have suggested that action video game play may improve reading ability in children (e.g., Bertoni et al., 2021; Franceschini et al., 2017; Pasqualotto et al., 2022; see Puccio et al., 2023 for a meta-analysis).
The research on the relationship between game-based A-O exercise and reading ability has focussed on long timeframes. In training studies, participants receive multiple hours of training over the course of several days or weeks (over 10 hr on average; Puccio et al., 2023), whereas cross-sectional studies typically include variables that reflect months or years of accumulated video game experience (e.g., Kress et al., 2023). The current study seeks to determine whether the benefits of A-O exercise on reading performance can be realised in a brief and efficient timeframe. In addition, much of the past research has focused on the phonological benefits of video game experience (e.g., Bertoni et al., 2021, 2024; Franceschini & Bertoni, 2019; Pasqualotto et al., 2022). This study will extend the field’s understanding of the role A-O exercise plays in reading by using a lexical decision task (LDT) designed to isolate orthographic lexical processing from sublexical phonological processing, thus filling the knowledge gap in how A-O exercise may affect orthographic processing.
Measures of lexicality and sublexicality
Word frequency has been used in past reading research as a type of measure to help predict how reliant a stimulus might be on either the lexical or sublexical route (when considering dual-route reading models, Borowsky et al., 2013). Word frequency is a measure of how likely a word is to appear in the given corpus. Generally, researchers find that higher word frequency is associated with faster response times (Borowsky & Besner, 1993; Borowsky et al., 2013; Brysbaert & New, 2009), and greater activation in ventral brain regions (Borowsky et al., 2013), and is thus thought to reflect the level of involvement of the lexical route.
Bigram frequency is another corpus-based word characteristic and is a measure of how frequently a pair of letters appears in the corpus (Balota et al., 2007). There are multiple ways to combine the frequencies of each bigram to calculate a single measure for the word. One option is the sum bigram frequency, where the frequencies of each bigram are simply added together. Another option is the mean bigram frequency, where the sum bigram frequency is divided by the number of bigrams in the word. Bigram frequency is less well studied compared with word frequency, but higher bigram frequency is associated with faster reading response times and has been argued to reflect the speed at which spelling can be mapped onto sound, thus representing the efficiency of the dorsal-sublexical grapheme-to-phoneme processing route (Borowsky et al., 2013). Through reading aloud studies, lexical and sublexical processing has been shown to involve distinct regions of activation in the ventral stream and dorsal stream, respectively (Borowsky et al., 2006; Ekstrand, Neudorf, Kress, & Borowsky, 2019).
Not all models of reading employ a dual-route structure. As Owen and Borowsky (2003) describe, single-route models exist which employ only one nonsemantic route from visual text input to phonological representation. The connectionist models proposed by Seidenberg and colleagues (e.g., Harm & Seidenberg, 1999) are an example of this single-route model conceptualisation. Past research has found that single- vs. dual-route models offer different simulations of reading results depending on the task and stimuli, with some single-route models struggling to predict word frequency interaction effects (see Borowsky & Besner, 2006 for discussion of this issue; see also Besner & Borowsky, 2006; Plaut & Booth, 2006).
In the context of the LDT, most designs focus on word stimuli with nonword foils (see Borowsky & Besner, 1993; Masson & Borowsky, 1998). In this task, the participant must decide whether the stimulus is a word or not. These designs may not be ideal for isolating lexical and sublexical processing, as the word and nonword stimuli are not matched on their non-orthographic lexical features (Kress et al., 2021). For example, using nonspecific word targets (e.g., gave) and pronounceable but meaningless nonword foils (e.g., bave), a participant could rely on any combination of orthographic (spelling), phonological (sound), and semantic (meaning) representations to inform the correct response. An improved LDT design uses exception word targets (e.g., yacht) and pseudohomophone foils which are matched on the semantic and phonological features of the word (e.g., yawt is a pseudohomophone for the real word yacht). This specialised LDT forces one to rely on orthographic lexical representations because the sublexical assembly of phonology (and its resulting activation of lexical phonology) can no longer be used to successfully determine whether the stimulus spells a real word or not (Ekstrand et al., 2016; Neudorf, Ekstrand, Kress, & Borowsky, 2019; Neudorf, Ekstrand, Kress, Neufeldt, & Borowsky, 2019). For this unique quality of isolating orthographic lexical route processing, we call it an orthographic lexical decision task (OLDT). However, word and bigram frequency effects have not been studied in the context of the specialised OLDT, leaving a gap in the literature regarding whether this lexical decision variant can reflect orthographic lexical processing more clearly than tasks using the nonspecific word stimuli and unmatched nonword foils.
The current study
The goal of the current study is to investigate whether A-O exercise can have an immediate impact on visual processing and reading, such as during a single experiment session. One goal of our first experiment’s design was to test whether visual processing tasks in general, or reading tasks specifically, can benefit from A-O exercise. Inspired by the use of video games to improve reading ability, we developed a new type of visual stimulus and task design for use in A-O exercise—a novel health bar decision task (HBDT) which uses stimuli resembling the health bars found in video games in a decision task similar to the OLDT. In the HBDT, the participant indicates whether the health bar matches the number presented. This task will not utilise reading processes as heavily as the OLDT, so it reflects general visual processing. Because it uses stimuli resembling those found in video games, it may better reflect the benefits of video game play on reading in a more controlled environment than what is found in commercial games. The location of the stimuli presented in our two tasks can be manipulated to induce A-O exercise through peripheral or central visual demands.
If standard task practice effects are present (which we refer to as the task-learning hypothesis), participants will experience performance benefits between a first block and a second block, regardless of whether peripheral or central visual demands are presented first. If exposure to peripheral visual demands is beneficial and exposure to central visual demands is detrimental, as our previous reading aloud research suggests (Kress et al., 2023), then participants who are exposed to peripheral before central conditions should show improved performance compared with participants who are exposed to central before peripheral conditions (which we refer to as the A-O exercise hypothesis).
Thus, there are three hypothesised patterns of results we may expect to see in our single-session paradigm:
The task-learning hypothesis: If participants’ performance is based on general experience with the task, we will expect to see typical task-learning effects, whereby performance in the second block is better than in the first block (see Figure 1a).
The A-O exercise hypothesis: If our paradigm promotes A-O exercise in the same manner as previous studies on visual demands in video games, we expect to see a similar dissociation in performance between central and peripheral blocks, as was observed by Kress et al. (2023). In this case, there should be a beneficial effect when peripheral blocks are presented first and a detrimental effect when the central blocks are presented first (see Figure 1b).
A blend of task learning and A-O exercise: Both patterns could be present and result in a blended response pattern in our data (see Figure 1c).

Task learning (a), A-O exercise (b), and A-O exercise + task learning (c) hypotheses with difference scores.
In addition, in our OLDT, we will test correlations of performance with the lexical measure of word frequency and the sublexical measure of bigram frequency to confirm that targets in our OLDT are successfully isolating lexical processes, as we expect. In tasks that isolate lexical processing, researchers typically observe a word frequency effect in which performance is better for items with higher word frequency (Borowsky et al., 2013). We expect to replicate this typically observed word frequency effect in our study. Bigram frequency is less well studied, but it has been associated with sublexical processes (Borowsky et al., 2013). Our inclusion of word and bigram frequency analyses will help researchers better understand the role of the lexical and sublexical processes in the OLDT.
Experiment 1
Methods
Participants
This study was approved by the University of Saskatchewan Behavioural Research Ethics Board. We recruited 61 participants 1 (28 cisgender men, 29 cisgender women, 4 nonbinary or transgender participants; M = 37.97 years, SD = 12.90 years, mean video game experience = 8.86 hr/week, SD = 9.42, see Table 1) through the Prolific recruitment platform (https://prolific.co) who were covertly prescreened using Prolific’s prescreening filter options. Participants were prescreened for country of residence (Canada or the United States), first language (English), language-related disorders (none), vision (normal or corrected-to-normal), and handedness (right). Participants were recruited between November and December 2022. As compensation for participating, participants received £6.75 (Prolific is based in the United Kingdom and provides participants compensation in British Pounds). Participants provided informed consent by registering for the study on Prolific after reading the description (which contained the full written consent form). Informed consent was confirmed by repeating a brief version of the consent form in the PsychoPy experiment, which participants had to agree to with a keypress before the experiment would proceed.
Demographic characteristics as a function of group assignment.
Apparatus and stimuli
The experiment was designed in PsychoPy Version 2022.2.4 (Peirce et al., 2019; PsychoPy, 2022) and hosted online through Pavlovia (https://pavlovia.org/). Participants were requested to use a computer with a screen size of 14 inches or larger so that the stimuli and instructions would appear on-screen correctly. In addition, participants used standardised credit card dimensions to calibrate their screen size at the start of the experiment (code adapted from Morys-Carter, 2020) and were instructed to sit approximately 70 cm from their computer monitor to maximise the consistency in the size and visual angle of stimuli between participants.
Peripheral and central presentation
The two tasks in this study were both presented in two different versions that differed in target location. In both versions, a star-shaped visual-spatial cue appeared on-screen for 150 ms in the target location before the stimulus appeared. In the centrally presented conditions, the target location was always the centre of the screen. In the peripherally presented conditions, the target location was randomly selected from one of four locations (top, bottom, left, or right) which were 6.1 cm (4.98 degrees of visual angle using a sitting distance of 70 cm) from the centre of the screen. The stimulus appeared at the target location for 1,000 ms and participants could respond from the moment the stimulus appeared to 500 ms after the stimulus disappeared. If the participant responded before the time was up, then the next trial would begin. If the participant did not respond, then the experiment automatically proceeded to the next trial. The task versions were fully within-subjects (i.e., all participants experienced all conditions), but the order of task presentation was between-subjects.
OLDT
The OLDT is a reading task that focuses on orthographic lexical processes through its use of exception word targets and pseudohomophone foils. The OLDT blocks consisted of 120 exception words (target stimuli; e.g., ninth; mean loge-transformed HAL word frequency = 9.67, SD = 2.30; mean bigram frequency = 3,014.44, SD = 1,334.81) which require orthographic lexical processing to be decoded and 120 corresponding pseudohomophones (foil stimuli; e.g., nynth; mean bigram frequency = 2,296.45, SD = 1,650.25) which do not use orthographic lexical processing but still can activate phonological and semantic processing via phonetic decoding processes. We selected the exception word stimuli from Patterson and Hodges’ list of 126 monosyllabic exception words (Patterson & Hodges, 1992) and generated pseudohomophones by changing a letter or multiple letters within the word to disrupt the spelling as much as possible but maintain the same pronunciation (based on Canadian English pronunciations of the words). These exception word and pseudohomophone stimuli are listed in the online Supplementary Material A (Tables A.1 and A.2). The OLDT stimuli were presented in white text (Open Sans font) on a black background, with a maximum word height of 1.1 cm and a maximum word length of 3.5 cm, and flanked by two pairs of white letter x’s (i.e., XX), 2 one above and one below the letter-string (making the overall dimensions of these stimuli, including the flankers, 3.8 cm tall by 3.5 cm wide).
HBDT
The HBDT is a novel task designed to reflect aspects of video game play by using stimuli found in video games. The HBDT blocks of the experiment consisted of a set of images depicting a red and white bar (meant to resemble the health bar in a typical video game) which was 1 cm tall by 3.4 cm long and flanked with a two-digit number above and below the bar (overall dimensions of the stimuli, including the flankers, were 3.8 cm tall by 3.4 cm wide). The proportion of the bar that was red ranged from 10% to 90%, in increments of 10%. The flanker number ranged from 20 less than the red proportion of the bar to 20 greater than the red proportion of the bar, in increments of 10. The flanking number was restricted such that it could not be lower than 10 or greater than 90, so all flanking numbers were two digits. With these parameters, the set of health bar images totalled 39 unique images—five images each for the health bar values 30 to 70, four images each for the health bar values 20 and 80, and three images each for the health bar values 10 and 90 (see the online Supplementary Material A, Table A.3).
Procedure
After being recruited through Prolific, participants were directed to Pavlovia to complete the online study. The order of the task location blocks was counterbalanced such that half the participants received the centrally presented tasks first. The order of the HBDT and OLDT for each participant was the same in both location (central vs. peripheral) blocks and counterbalanced such that half the participants received the HBDT first:
1. Central-HBDT, 2. Central-OLDT, 3. Peripheral-HBDT, 4. Peripheral-OLDT.
1. Central-OLDT, 2. Central-HBDT, 3. Peripheral-OLDT, 4. Peripheral-HBDT.
1. Peripheral-HBDT, 2. Peripheral-OLDT, 3. Central-HBDT, 4. Central-OLDT.
1. Peripheral-OLDT, 2. Peripheral-HBDT, 3. Central-OLDT, 4. Central-HBDT.
In the HBDT, participants were instructed to press the left arrow key on the keyboard if the proportion of the health bar that was filled red and the number flanking the health bar matched and press the right arrow key if these did not match (240 trials). In the OLDT, participants were instructed to press the left arrow key if the stimulus spelled a real word and press the right arrow key if the stimulus did not spell a real word (240 trials). Figure 2 outlines the counterbalancing and trial process for both tasks. The two tasks would then repeat for the other location block (240 trials × 4 blocks = 960 trials total in the experiment). There was a set of eight practice trials before each block. As part of the instructions, participants were asked to use their index finger to press the left arrow key and their middle finger to press the right arrow key (see Figure 3). The instructed hand position on the keyboard was intended to match the hand position of a similar study where participants responded with mouse clicks (this study is not reported here). After completing the study, participants were redirected to SurveyMonkey (https://www.surveymonkey.com) to complete the postexperiment demographics questions.

Block counterbalancing and trial procedure for Experiment 1.

Instructed hand position on arrow keys for Experiment 1.
Results
For our analyses, we examined by-subjects and by-items median reaction time (which is more robust against outlier responses than mean reaction time) of correct responses to exception word targets and pseudohomophone foils. The values reported in text are the mean of these median response times. We also examined by-subjects and by-items mean error rates. Trials with missing responses were removed prior to analysis. OLDT item pairs with outlier error rates (error rates which exceeded 3 SD outside the mean error rate; 4 OLDT item pairs (climb/clime, dost/dawst, sieve/siv, and soot/suht) were also removed prior to the analysis. For Experiment 1, rather than reporting the by-subjects and by-items analyses together as F1 and F2 analyses, respectively, the analyses are reported separately, as the difference scores had to be calculated with distinct formulas for each analysis to handle the counterbalanced design of this study.
By-subjects results
Reaction time and error rate Block 1 − Block 2 difference scores were computed based on the correct responses to target or foil stimuli (see Tables 2 and 3). With this calculation, positive values indicate the performance benefits, whereas negative values indicate the performance detriments.
Experiment 1 reaction time, error rates, and difference scores (by-subjects, on targets).
Experiment 1 reaction time, error rates, and difference scores (by-subjects, on foils).
A set of General Linear Models (GLMs) was used to assess the difference scores in 2 (Task: HBDT vs. OLDT, within-subjects) × 2 (Order: central-first vs. peripheral-first, between-subjects) mixed-measures designs. For exception word targets, the main effect of Task (HBDT: M = 20 ms, SD = 98; OLDT: M = 24 ms, SD = 66), F(1, 59) = 0.20, MSE = 4,213.69, p = .660, and Task × Order interaction, F(1, 59) = 3.01, MSE = 4,213.69, p = .088, were not significant. There was a main effect of order on target stimuli, F(1, 59) = 36.25, MSE = 6,085.05, p < .001, whereby peripheral-first participants (M = 66 ms, SD = 56) had greater improvements in Block 2 than central-first participants (M = −19 ms, SD = 83). We also observed a significant double dissociation of visual demands on OLDT target performance in terms of the 95% confidence intervals, whereby the OLDT target difference scores were greater than zero for peripheral-first participants and less than zero for central-first participants (Masson & Loftus, 2003, see Figure 4).

Experiment 1 costs vs. benefits in reaction time as a function of Task and Order (by-subjects, on targets).
The same GLM analysis was also conducted on the Block 1 − Block 2 target difference scores for target error rate. There was a significant main effect of task on targets, F(1, 59) = 27.16, MSE = 68.85, p < .001, whereby the HBDT exhibited larger error rate improvements (M = 7.96%, SD = 11.43) than the OLDT (M = −0.02%, SD = 4.54). The main effect of Order (central-first: M = 2.71%, SD = 10.86; peripheral-first: M = 5.36%, SD = 7.70) and Task × Order interaction was not significant on target stimuli, F(1, 59) = 2.77, MSE = 76.87, p = .101 and F(1, 59) = 3.86, MSE = 68.85, p = .054, respectively.
The same analyses were conducted on the foil stimuli, given our interest in how the stimuli were processed differently in this specialised OLDT. Reaction time and error rate Block 1 − Block 2 difference scores were computed for pseudohomophone foil stimuli (see Table 3) in the same manner as for target stimuli. We used two 2 (Task: HBDT vs. OLDT, within-subjects) × 2 (Order: central-first vs. peripheral-first, between-subjects) mixed-measures GLMs to assess the difference scores.
On reaction time, there was a main effect of Order, F(1, 59) = 33.64, MSE = 7,762.25, p < .001, whereby peripheral-first participants exhibited larger improvements towards foils (M = 73 ms, SD = 70) than central-first participants (M = −20 ms, SD = 100). The main effect of Task (HBDT: M = 14 ms, SD = 120; LDT: M = 35 ms, SD = 70) and Order × Task interaction was not significant on foil stimuli (see Figure 5), all Fs < 2.06, all ps > .157 (MSE = 7,232.79). On foil error rate, there was a main effect of Order, F(1, 59) = 5.21, MSE = 42.84, p = .026, whereby peripheral-first participants exhibited larger improvements towards foils (M = 3.57%, SD = 6.25) than central-first participants (M = 0.86%, SD = 7.11). There was also a main effect of Task, F(1, 59) = 22.73, MSE = 35.21, p < .001, whereby there were larger improvements in the HBDT (M = 4.71%, SD = 7.75) than the OLDT (M = −0.42%, SD = 4.51). The Order × Task interaction was not significant, F(1, 59) = .01, MSE = 35.21, p = .905.

Experiment 1 costs vs. benefits in reaction time as a function of Task and Order (by-subjects, on foils).
By-items results
With the current design, the perceived benefit of peripheral-first presentation may be due to the peripheral tasks being more difficult than the central tasks, so participants simply perform better in the easier central tasks, regardless of order.
Analysing the data from a by-items perspective allows us to address this issue, as all items were presented in all conditions. With this perspective, a Peripheral Attentional Exercise Score is calculated as peripheral-first participants’ central block (Central Block 2) minus central-first participants’ central block (Central Block 1). The difference score between these two blocks can be considered a view of the peripheral attentional exercise effect, with any effects of difficulty accounted for (because we are comparing the same location). Similarly, a Central Attentional Exercise Score is calculated as central-first participants’ peripheral block (Peripheral Block 2) minus peripheral-first participants’ peripheral block (Peripheral Block 1).
Reaction time Block 1 − Block 2 difference scores were computed based on the correct responses to target or foil stimuli (see Tables 4 and 5, for reaction times and error rates).
Experiment 1 reaction time, error rates, and difference scores (by-items, on targets).
Experiment 1 reaction time, error rates, and difference scores (by-items, on foils).
A one-way GLM analysis was conducted on the Block 1 − Block 2 difference scores for target item reaction time. There was a significant main effect of Exercise Type, F(1, 115) = 12.98, MSE = 1,462.69, p < .001, whereby peripheral exercise was associated with greater target reaction time improvements (M = 35 ms, SD = 38) than central exercise (M = 17 ms, SD = 39). Exercise type was also significant for error rate on targets, F(1, 115) = 23.18, MSE = 37.48, p < .001, with peripheral exercise exhibiting larger target error rate improvements (M = 2.13%, SD = 5.80) than central exercise (M = −1.74%, SD = 6.40).
As in the previous analyses, reaction time and error rate Block 1 − Block 2 difference scores were computed based on the correct responses to foil stimuli (see Table 5). We used two one-way (Exercise Type: central vs. peripheral) repeated-measures GLMs to assess the difference scores. On foil reaction time, there was a main effect of exercise type, F(1, 114) = 56.21, MSE = 1,646.49, p < .001, whereby peripheral exercise exhibits greater improvements on foil reaction time (M = 53 ms, SD = 35) than central exercise (M = 13 ms, SD = 44). On foil error rate, there was also a main effect of exercise type, F(1, 114) = 65.94, MSE = 52.30, p < .001, whereby peripheral exercise exhibits greater improvements on foil error rate (M = 3.34%, SD = 5.27) than central exercise (M = −4.12%, SD = 7.96).
Word frequency and bigram frequency
To test whether our OLDT is successfully isolating lexical processes, we used simple correlations to analyse the relationship between word frequency, bigram frequency, and our two dependent variables (reaction time and error rate). We used loge-transformed HAL word frequency as our word frequency measure and mean bigram frequency as our bigram frequency measure (see the online Supplementary Material A). In a previous reading aloud study (Borowsky et al., 2013), these word and bigram frequency measures were used to reflect ventral-lexical and dorsal-sublexical processing, respectively. Loge-transformed HAL word frequency and mean bigram frequency were derived from the complete English Lexicon Project database (Balota et al., 2007). Pseudohomophones, being nonwords, do not have a word frequency measure associated with them, and as such, we used the base word frequency for these stimuli instead (i.e., the word frequency of the real word from which the pseudohomophone is derived). In our set of stimuli, exception word frequency was not correlated with exception word bigram frequency, t(118) = −0.04, r = −.003, p = .97, nor with pseudohomophone bigram frequency, t(118) = −0.92, r = −.08, p = .362, suggesting that our word and bigram frequency measures are independent.
We observed a significant correlation between word frequency and exception word performance, such that higher word frequencies were associated with faster reaction times and lower error rates (see Table 6). In exception words, there was no significant correlation with bigram frequency and performance. In contrast, we observed a significant correlation between bigram frequency and pseudohomophone performance such that higher bigram frequencies were associated with slower reaction times and higher error rates (see Table 6). There was no significant correlation with base word frequency and performance for the pseudohomophones.
Simple correlations between reaction time or error rate and word/bigram frequency for all three experiments (targets and foils).
Note. WF: loge-transformed HAL word frequency; BF: mean bigram frequency.
Significance levels for the aggregate correlations: *<.05; **<.01; ***<.001.
Interim discussion
The results of Experiment 1 show that short-term A-O exercise in the form of exposure to peripheral visual demands may benefit visual processing tasks, and the consistent trends in the word target reaction times and error rates (both in terms of block values and difference scores) suggest there are no speed-accuracy trade-offs. The pattern of results appears to support the blended attentional exercise + task-learning hypothesis, as we see improvements after peripheral blocks, whereas there is minimal change after central blocks (controlling for task learning via by-items analysis). Clearly, as demonstrated by the by-items analysis, a general learning effect is insufficient to explain the results of this experiment.
The results of Experiment 1 demonstrate a clear double dissociation between the higher word frequency and improved performance for exception words and higher bigram frequency and worse performance for pseudohomophones. For pseudohomophones to exhibit worse performance as a function of increased bigram frequency, it suggests that—due to increased familiarity and efficiency in sublexical grapheme-to-phoneme processing—those high-frequency bigrams may be making the pseudohomophones seem more word-like. This evidence supports our argument that our OLDT with exception word targets and pseudohomophone foils is well designed to isolate orthographic lexical processing, where strong word frequency effects support the idea that exception word targets optimally activate orthographic lexical route representations, and bigram frequency may be a useful measure to assess the involvement of the sublexical route for pseudohomophone foil processing in this task. This novel finding is an exciting contribution to the field of lexical decision but should be replicated in additional OLDT studies.
It is important to note that the counterbalance of our design was intended to examine whether reading performance (via the OLDT), general visual processing performance (via the HBDT), both, or neither could be affected by manipulations of A-O exercise. Given our interest in reading processes, our follow-up refines the paradigm to target reading processes specifically. These refinements can also modify the manipulation of A-O exercise in specific test blocks to control the attentional difficulty of the test task. To address concerns regarding increased task difficulty in the peripheral block contributing to the observed results, we developed a refined A-O exercise paradigm that controls the location of the pre- and post-exercise tasks. In addition, we used a similar OLDT to the current study, so that we can test the replicability of the word and bigram frequency double dissociation on performance.
Experiment 2
In Experiment 1, we were interested in the presence of any effect of attentional exercise, as this would support our theory that short-term attentional exercise may be able to benefit reading performance. For Experiments 2 and 3, we refined our A-O exercise paradigm to focus exclusively on the OLDT as our target task of interest, with the HBDT as a training task. These new designs will address Experiment 1’s issues regarding task difficulty, as the pre- and post-training tasks are matched on location. In addition, our use of the same OLDT from Experiment 1 gives us the opportunity to replicate the double dissociation between the word frequency and bigram frequency effects on exception word target and pseudohomophone foil performance, respectively.
Hypotheses
In this design, we expected to see an interaction between Training Group and Time, where participants who take part in peripheral A-O exercise should exhibit larger performance improvements than the participants who take part in central A-O exercise. In addition, we expected to see a replication of the double dissociation between the word and bigram frequency effects on exception word and pseudohomophone performance, respectively, which would further support the OLDT as a task to isolate orthographic processing.
Method
Participants
We recruited 40 participants (9 cisgender men, 31 cisgender women; M = 19.85 years, SD = 3.54, mean video game experience = 2.40 hr/week, SD = 4.40) through the University of Saskatchewan SONA participant pool during the Fall 2023 term. As compensation for participating, participants received two bonus course credits. The first language of all participants was English. In addition, although two participants were left-handed, all participants reported using the mouse with their right hand. This study was approved by the University of Saskatchewan Research Ethics Board, and all participants provided informed consent before participating.
Participants were randomly assigned to the training groups such that half the participants received the central HBDT as their training task and the other half received the peripheral HBDT as their training task. There were no significant differences between the training groups in terms of age or gender (see Table 7).
Demographic characteristics as a function of group assignment.
Note. All participants reported using the mouse with their right hand.
Apparatus and stimuli
The experiment was designed in PsychoPy Version 2023.1.3 (Peirce et al., 2019; PsychoPy, 2023) and run in a dimly lit, quiet experiment room. The experiment computer was a Lenovo ThinkCentre M70q computer running Windows 10. An HP E243 23.8-inch monitor displayed the experiment stimuli. Responses were recorded with a standard keyboard and mouse. The OLDT and HBDT stimuli were the same as in Experiment 1. The OLDT stimuli were split into two half-blocks (see the online Supplementary Material A). These half-blocks were counterbalanced such that half the participants in each training group received Half-Block A as the pretraining task and the other participants received Half-Block B as the pretraining task.
Procedure
An experimenter was present for the duration of the experiment, and there was a set of eight practice trials before each of the three tasks. After providing informed consent, participants began the experiment with the pretraining reading task, which was a 120-trial OLDT (half the length of the OLDT in Experiment 1). In the half-OLDT, a fixation cross would appear, and participants would press the spacebar to begin the trial. After a 150-ms interstimulus interval, where no fixation cross, stimulus, or cue was on-screen, a letter-string stimulus would appear. Participants were instructed to left-click if the letter-string spelled a real word, and right-click if the letter-string did not spell a real word. The OLDT stimuli were presented until response. Then participants received one block (240 trials) of either the central or peripheral HBDT as their training task, depending on the group to which they were assigned. The HBDT followed the same timing and procedure as in Experiment 1 with one exception. Instead of using the keyboard, participants were instructed to use the mouse to left-click for target responses and right-click for foil responses. After the training task, participants completed a post-training reading task, which was a half-OLDT following the procedure of the pretraining task but using the other set of 120 trials. Finally, participants completed their demographics form and were debriefed on the purpose of the study. Figure 6 depicts the design and trial procedure of this experiment.

Training group assignment and trial procedure for Experiment 2.
Results
As in Experiment 1, we examined participant (reported as F1) or item (reported as F2) median reaction time and mean error rate to correct responses on both exception word targets and pseudohomophone foils. The reaction times reported in the text are the mean of these median response times. OLDT item pairs with outlier error rates (error rates which exceeded 3 SD outside the mean error rate) were removed prior to the analysis (six item pairs: brooch/broatch, dost/dawst, sieve/siv, soot/suht, suave/swawv, suede/swaid). Linear mixed model and general linear mixed model analyses were conducted in R (R Core Team 2022), using the methods of Baayen et al. (2008), Bates et al. (2015), and Matuschek et al. (2017), and can be found in the online Supplementary Material B.
Target stimuli analyses
First, 2 (Training Group: central vs. peripheral) × 2 (Time: pretraining vs. post-training) mixed-measures GLMs were conducted on median reaction times to target stimuli to test the effect of A-O exercise (through the HBDT) on OLDT performance. There was a consistent main effect of Time, F1(1, 38) = 20.92, MSE1 = 1,878.54, p1 < .001, F2(1, 113) = 75.33, MSE2 = 3,305.54, p2 < .001, whereby responses to targets were faster post-training (M1 = 581 ms, SD1 = 69, M2 = 595 ms, SD2 = 75) than pretraining (M1 = 625 ms, SD1 = 85, M2 = 642 ms, SD2 = 86). There was no effect of Training Group (peripheral: M1 = 606 ms, SD1 = 87, M2 = 620 ms, SD2 = 85; central: M1 = 601 ms, SD1 = 74, M2 = 616 ms, SD2 = 83), F1(1, 38) = 0.05, MSE1 = 10,444.53, p1 = .820, F2(1, 113) = .70, MSE2 = 2,443.64, p2 = .403. There also was no Training Group × Time interaction, F1(1, 38) = 0.003, MSE1 = 1,878.54, p1 = .958 (see Figure 7), F2(1, 113) = .61, MSE2 = 2,000.89, p2 = .436.

Experiment 2 reaction time as a function of time and training group (by-subjects, on targets) depicted (a) as difference scores and (b) by Time.
The same GLMs were conducted on mean error rates to targets. There was a consistent main effect of Training Group on targets, F1(1, 38) = 6.29, MSE1 = 38.74, p1 = .017, F2(1, 113) = 24.23, MSE2 = 57.93, p2 < .001, whereby the peripheral training group had worse error rates (M1 = 9.77%, SD1 = 6.68; M2 = 9.82%, SD2 = 14.23) than the central training group (M1 = 6.28%, SD1 = 3.71; M2 = 6.32%, SD2 = 12.51). The main effect of Time (pretraining: M1 = 7.57%, SD1 = 5.34, M2 = 7.59%, SD2 = 13.04; post-training: M1 = 8.48%, SD1 = 5.97, M2 = 8.55%, SD2 = 13.96) was not significant F1(1, 38) = 0.82, MSE1 = 20.31, p1 = .372, F2(1, 113) = 1.33, MSE2 = 80.03, p2 = .252. The Training Group × Time interaction was also not significant, F1(1, 38) = 0.82, MSE1 = 20.31, p1 = .372, F2(1, 113) = 1.34, MSE2 = 79.15, p2 = .249.
Foil stimuli analyses
The same 2 (Training Group: central vs. peripheral) × 2 (Time: pretraining vs. post-training) mixed-measures GLMs were conducted on median reaction times to foil stimuli. There was a main effect of Time on foils (see Figure 8), F1(1, 38) = 38.38, MSE1 = 3,751.48, p1 < .001, F2(1, 113) = 144.14, MSE2 = 3,681.23, p2 < .001, whereby responses were faster after training (M1 = 651 ms, SD1 = 76; M2 = 665 ms, SD2 = 87) than before training (M1 = 735 ms, SD1 = 119; M2 = 733 ms, SD2 = 101). There was no effect of Training Group by-subjects (central: M1 = 699 ms, SD1 = 108; peripheral: M1 = 687 ms, SD1 = 109), F1(1, 38) = 0.16, MSE1 = 16,594.04, p1 = .696. However, the effect of Training Group was significant by-items, F2(1, 113) = 12.02, MSE2 = 3,425.37, p2 < .001, whereby foils were responded to more quickly by participants in the peripherally trained group (M2 = 690 ms, SD2 = 100) than participants in the centrally trained group (M2 = 709 ms, SD2 = 99). There was no Training Group × Time interaction, F1(1, 38) = 1.37, MSE1 = 3,751.48, p1 = .250, F2(1, 113) = 0.11, MSE2 = 4,294.18, p2 = .741.

Experiment 2 reaction time as a function of time and training group (by-subjects, on foils) depicted (a) as difference scores and (b) by Time.
On mean error rate for foils, there were no by-subjects effects or interactions, all F1s < 1.01, all p1s > .323 (Time-pretraining: M1 = 13.23%, SD1 = 8.93; Time-post-training: M1 = 13.57%, SD1 = 9.04; Training Group-central: M1 = 13.19%, SD1 = 10.23; Training Group-peripheral: M1 = 13.61%, SD1 = 7.55). There were also no by-items effects or interactions, all F2s < 2.38, all p2s > .126 (Time-pretraining: M2 = 13.33%, SD2 = 15.23; Time-post-training: M2 = 13.60%, SD2 = 15.91; Training Group-central: M2 = 13.20%, SD2 = 15.56; Training Group-peripheral: M2 = 13.73%, SD2 = 15.58).
Word frequency and bigram frequency
As in Experiment 1, we used correlations to analyse the relationship between word frequency, bigram frequency, and our two dependent variables (reaction time and error rate). We used the same word and bigram frequency measures as in Experiment 1. The word and bigram frequency results were consistent with Experiment 1 (see Table 6). Word frequency had a negative correlation with exception word reaction times and error rate (i.e., improved performance as a function of increased word frequency), whereas bigram frequency had a positive correlation with pseudohomophone reaction time and error rate (i.e., worse performance as a function of increased bigram frequency).
Interim discussion
With Experiment 2, we were able to replicate the double dissociation of a word frequency effect on exception word performance, a bigram frequency effect on pseudohomophone performance, and the absence of these effects with their counterparts. However, we did not observe a difference between the peripheral and central A-O exercise groups in this refined paradigm. It may be the case that in the new paradigm, the single block of A-O exercise was not enough to induce the additional benefit expected of peripheral A-O exercise. To address this, we extended the exercise duration in Experiment 3 to boost the paradigm’s sensitivity for detecting differences in these exercise groups.
Experiment 3
Method
Participants
We recruited 39 participants who were included in the analysis 3 (12 cisgender men, 25 cisgender women, 1 nonbinary or transgender participants, 1 preferred not to respond; M = 22.77 years, SD = 6.64, Mean video game experience = 8.09 hr/week, SD = 13.18) through the University of Saskatchewan SONA participant pool during the Winter and Spring 2024 terms. As compensation for participating, participants received two bonus course credits. As in Experiment 2, the first language of all participants was English, and all participants reported using the mouse with their right hand (although four participants were otherwise left-handed). This study was approved by the University of Saskatchewan Research Ethics Board, and all participants provided informed consent before participating.
Participants were randomly assigned to the training groups such that half the participants received the central HBDT as their training task and the other half received the peripheral HBDT as their training task. There were no significant differences between the training groups in terms of age or gender (see Table 8).
Demographic characteristics as a function of group assignment.
Note. All participants reported using the mouse with their right hand.
Apparatus and stimuli
The apparatus and stimuli in Experiment 3 were the same as those used in Experiment 2.
Changes in procedure
The experiment design and procedure were nearly the same as Experiment 2, with two major exceptions. The first change was in the training HBDT length. The training HBDT was doubled (i.e., two HBDT blocks instead of one), with a brief participant-controlled break of 10–30 s to reduce eye-fatigue during training. The second change was in the OLDT design. The OLDT blocks for Experiment 2 were peripherally presented to increase the difficulty of the task. This also made the design more similar to our earlier experiment where we first observed the double dissociation between central and peripheral visual demands on reading performance (Kress et al., 2023). The peripherally presented OLDT was not cued, to keep the stimulus presentation consistent between Experiments 2 and 3, and was presented until response or for a maximum of 1,500 ms (the HBDT also used this timing). The participant would see the fixation cross and press the spacebar to begin the trial. Then, after a 150-ms interstimulus interval, where no fixation cross, stimulus, or cue was on-screen, the letter-string stimulus would appear randomly in one of the four target locations, and the participant would make their response . The four target locations were the same as those used in the HBDT.
Results
The analysis for Experiment 3 followed the same steps as the previous experiment. We examined participant (F1) or item (F2) median reaction time and mean error rate to correct responses and the values reported in text are the mean of these median response times. OLDT item pairs with outlier error rates (error rates which exceeded 3 SD outside the mean error rate) were removed prior to the analysis (seven item pairs: dost/dawst, leapt/lept, mauve/moav, sieve/siv, soot/suht, suave/swawv, suede/swaid). Linear mixed model and general linear mixed model analyses can be found in the online Supplementary Material C.
Target stimuli analyses
As in the previous experiment, 2 (Training Group: central vs. peripheral) × 2 (Time: pretraining vs. post-training) mixed-measures GLMs were conducted on median reaction times to target stimuli to test the effect of attentional exercise (through the HBDT) on OLDT performance. There was a main effect of Time, F1(1, 37) = 29.91, MSE1 = 1,123.67, p1 < .001, F2(1, 112) = 79.74, MSE2 = 3,574.53, p2 < .001, whereby responses were faster post-training (M1 = 699 ms, SD1 = 70; M2 = 711 ms, SD2 = 72) than pretraining (M1 = 741 ms, SD1 = 71; M2 = 761 ms, SD2 = 90). There was also an effect of Training Group (central: M1 = 700 ms, SD1 = 68, M2 = 713 ms, SD2 = 81; peripheral: M1 = 741 ms, SD1 = 74, M2 = 759 ms, SD2 = 84), F1(1, 37) = 3.89, MSE1 = 8,261.10, p1 = .056, F2(1, 112) = 82.45, MSE2 = 2,962.74, p2 < .001. There was no significant Time × Training Group interaction, F1(1, 37) = 0.06, MSE1 = 1,123.67, p1 = .801, F2(1, 112) = 0.45, MSE2 = 3,605.77, p2 = .505. Figure 9 depicts the by-subjects main effect of Time in both groups.

Experiment 3 reaction time as a function of time and training group (by-subjects, on targets) depicted (a) as difference scores and (b) by Time.
When the same GLM was conducted on error rates, there was a significant effect of Training Group, F1(1, 37) = 7.48, MSE1 = 49.69, p1 < .001, F2(1, 112) = 37.19, MSE2 = 59.52, p2 < .001, with participants in the central group (M1 = 6.28%, SD1 = 4.69; M2 = 6.30%, SD2 = 10.83) responding with less errors than participants in the peripheral group (M1 = 10.65%, SD1 = 6.50; M2 = 10.73%, SD2 = 14.19). The main effect of Time was significant by-items but not by-subjects (pretraining: M1 = 7.69%, SD1 = 6.02, M2 = 7.78%, SD2 = 11.56; post-training: M1 = 9.12%, SD1 = 6.02, M2 = 9.25%, SD2 = 13.92), F1(1, 37) = 2.73, MSE1 = 14.74, p1 = .107, F2(1, 112) = 3.94, MSE2 = 62.33, p2 = .050. The Time × Training Group interaction was not significant, F1(1, 37) = 0.21, MSE1 = 14.74, p1 = .900, F2(1, 112) = 0.006, MSE2 = 61.13, p2 = .938.
Foil stimuli analyses
We also ran the 2 (Training Group: central vs. peripheral) × 2 (Time: pretraining vs. post-training) mixed-measures GLMs on median reaction times to foil stimuli (see Figure 10). There was a main effect of Time, F1(1, 37) = 48.22, MSE1 = 2,101.40, p1 < .001, F2(1, 112) = 154.02, MSE2 = 3,411.58, p2 < .001, whereby participants responded more quickly to foils after training (M1 = 766 ms, SD1 = 86; M2 = 776 ms, SD2 = 90) than before training (M1 = 838 ms, SD1 = 113; M2 = 844 ms, SD2 = 92). There was no effect of Training Group (central: M1 = 798 ms, SD1 = 103, M2 = 809 ms, SD2 = 95; peripheral: M1 = 806 ms, SD1 = 111, M2 = 812 ms, SD2 = 99), F1(1, 37) = 0.06, MSE1 = 18,596.49, p1 = .807, and there was no Training Group × Time interaction, F1(1, 37) = 0.03, MSE1 = 2,101.40, p1 = .873, F2(1, 112) = 0.69, MSE2 = 4,442.25, p2 = .407.

Experiment 3 reaction time as a function of time and training group (by-subjects, on foils) depicted (a) as difference scores and (b) by Time.
On error rate for foils, there was a main effect of Time, F1(1, 37) = 5.32, MSE1 = 22.41, p1 = .027, F2(1, 112) = 6.09, MSE2 = 99.95, p2 = .015, whereby participants responded with more errors after training (M1 = 14.50%, SD1 = 9.20, M2 = 14.51%, SD2 = 17.26) than before training (M1 = 12.03, SD1 = 9.41, M2 = 12.19%, SD2 = 14.24). This speed-accuracy trade-off as a function of time is explored in the online Supplementary Material D with an analysis of incorrect trial reaction times. The effect of Training Group (central: M1 = 14.56%, SD1 = 8.54, M2 = 12.08%, SD2 = 14.29; peripheral: M1 = 11.90%, SD1 = 10.03, M2 = 14.61%, SD2 = 17.20) was not significant by-subjects F1(1, 37) = 0.91, MSE1 = 151.75, p1 = .347, but was significant by-items, F2(1, 112) = 6.25, MSE2 = 116.31, p2 = .014. The Training Group × Time interaction was not significant, F1(1, 37) = 0.02, MSE1 = 22.41, p1 = .876, F2(1, 112) = 0.004, MSE2 = 95.19, p2 = .945.
Word frequency and bigram frequency
As in the previous experiments, we used simple correlations to analyse the relationship between word frequency, bigram frequency, and our two dependent variables (reaction time and error rate). We used the same word and bigram frequency measures as the previous experiments and followed the same procedure. Replicating the results of the previous experiments, we observed a significant correlation between word frequency and exception word performance, such that higher word frequencies were associated with faster reaction times and lower error rates, and a significant correlation between bigram frequency and pseudohomophone performance, such that higher bigram frequencies were associated with slower reaction times and worse error rates (see Table 6). As in the previous experiments, bigram frequency was not correlated with exception word performance, and base word frequency was not correlated with pseudohomophone performance.
Interim discussion
Overall, the results of Experiment 3 are consistent with the results of Experiment 2. The same pattern of word and bigram frequency effects was observed, so the double dissociation appears to be robust. There was also an overall main beneficial effect of Time on reaction time, with no interaction, similar to Experiment 2, and suggesting both our central and peripheral training conditions may be help reading performance. However, unlike Experiment 2, this was accompanied by some evidence for a detrimental effect of Time on error rates, suggesting a form of speed-accuracy trade-off or fatigue effect. These findings mean the length of the training period in our third experiment may be too long for a single session, and future studies should consider an intermediate number of trials between the length of Experiments 2 and 3 to maximise the benefits of training before reaching this fatigue effect.
Discussion
Bringing together the research fields of reading and attention, the goal of this set of studies was to test whether short-term attentional exercise could benefit reading performance. Our hypothesis that attentional exercise may benefit reading ability drew from the past literature on cognitive exercise (e.g., Nouchi et al., 2012), past links between reading and attention (Boden & Giaschi, 2007; Ekstrand, Neudorf, Kress, & Borowsky, 2019), and recent work on the relationship between video games and both reading and attention (Bertoni et al., 2021; Kress et al., 2023).
Potential A-O exercise effects
In both the by-subjects and by-items analyses of Experiment 1, we observed an effect of Order, whereby the peripheral A-O exercise demonstrated improved performance over time, whereas central A-O exercise demonstrated either minimal change or worsened performance over time. These reaction time benefits did not come at the cost to accuracy, meaning participants were not experiencing speed-accuracy trade-offs with this improved reaction time performance. This was replicated by-subjects and by-items, demonstrating that the effects could not be attributable to general learning processes.
The designs of Experiments 2 and 3 were intended to refine the paradigm by controlling the location of the pre- and post-training reading task. In this case, we observed a significant effect of Time, whereby all participants’ performance improved between pre- and post-training; however, there was no interaction that would suggest peripheral A-O exercise was more effective than central A-O exercise at inducing reading improvements in the OLDT. It may have been the case that Experiment 1’s effects were driven by task difficulty, although note that the interleaved design of the blocks in Experiment 1 meant that there was the potential for a longer period for differences to emerge relative to Experiment 2. Furthermore, the significant by-items effect of Task in the Experiment 1 analyses suggests that the effect cannot be fully accounted for by task difficulty alone. Likewise, general learning effects would not account for the findings observed in Experiment 1, so Experiments 2 and 3 must have other factors beyond task learning which drive the improvements observed after both central and peripheral training. The trade-off between speed and accuracy in Experiment 3 also indicates there is more than simple task learning at play in these designs. The next sections will unpack the reading and attentional processes present in our A-O exercise paradigm as we consider explanations for the results of Experiments 2 and 3 and options to further refine the A-O exercise programme.
Which attentional processes benefit from A-O exercise?
Our goal was to test whether short-term A-O exercise had the potential to provide any amount of reading performance benefit, and we can bring past findings together with the current study to make some predictions regarding the relevant systems. Our A-O exercise consisted of briefly flashing a star at the target location, which one could argue would activate both the alerting (when will the target appear) and orienting (where will the target appear) mechanisms of the attentional network (Dye et al., 2009; Fan et al., 2002). In the dyslexia research field, the dysfunction in the orienting system is most frequently discussed when exploring the overlapping reading + attention deficits in dyslexia (Boden & Giaschi, 2007; Gori et al., 2014, 2016; Taran et al., 2022). Oculomotor control and attentional orienting is also more frequently discussed over the alerting system when testing action video games as a cognitive exercise tool (Diarra et al., 2019; Dye et al., 2009; Pasqualotto et al., 2022; West et al., 2013). For these reasons, we suspect that attentional orienting, rather than attentional alerting, is the more relevant process when designing an A-O exercise task to benefit reading. These attentional orienting processes could involve large oculomotor shifts, such as those required to shift from the centre of the screen to the peripheral of the screen during a video game, or from the end of one line to the start of the next line while reading a book. Attentional orienting processes could also involve small oculomotor shifts, such as those required to precisely target the enemy you are looking at in a video game, or those required to shift between, or within, nearby words in a sentence.
When one considers attentional orienting processes, this can be subcategorised into the voluntary and reflexive orienting processes discussed by Corbetta and Shulman (2002). The star cue could be considered a reflexive orienting cue, albeit one with 100% validity. In the peripheral A-O exercise, this reflexive orienting cue required large oculomotor shifts to move from the central fixation cross to the target location for the HBDT. The reflexive orienting cue for the central A-O exercise condition would not require this same large oculomotor shift. It would be reasonable to expect the exercise of this reflexive orienting process to benefit reading, as we originally hypothesised, given that past research has suggested action video games (which emphasise peripheral processing) benefit reading processes (Bertoni et al., 2021; Franceschini et al., 2017; Pasqualotto et al., 2022).
Our designs, which varied the degree of large reflexive orienting movements, did not lead to differences in the reading performance between the central and peripheral A-O exercise groups. This could mean that both versions of our task contained a form of A-O exercise that was beneficial to reading.
Voluntary orienting processes, and in particular fine-grained voluntary oculomotor processes, would be good A-O exercise to consider in future designs. The HBDT involves small voluntary oculomotor movements from the health bar in the centre of the screen, up or down to the flanker number, and then potentially back to the health bar to evaluate its size. These small oculomotor movements are present in both the central and peripheral versions of the task. Reading also involves small systematic oculomotor movements between word segments or whole words, and large oculomotor movements generally only need to take place when you reach the end of the line or the end of the page. With this in mind, it could be the case that both peripheral and central versions of the HBDT were sufficient to induce the type of A-O exercise that could most benefit reading (i.e., small oculomotor movements).
With this new idea of exercising small, rather than large, oculomotor movements, future refinements of the A-O exercise paradigm should include a version of the HBDT to reduce small oculomotor movements in one of the training groups. This could be done by overlaying the number with the health bar or by briefly presenting the number at the target location, followed by the health bar in the same location. Either of these designs would minimise the involvement of small oculomotor movements in the given group and help determine what type of attentional process is most important for A-O exercise to improve reading. Ideally, a future study would include three conditions, one each for large movements, small movements, and no movements to compare the effectiveness of all three options against the others. In addition, eye-tracking could be employed to verify the magnitude of the oculomotor movements that are occurring in each condition.
Isolating orthographic processing
The reading task we used in this set of experiments was an OLDT, where participants indicated whether the presented letter-string spelled a real word or not. In this case, the foils were pseudohomophones, which are nonwords that sound like a real word if they are phonetically decoded (Cummine et al., 2015; Neudorf, Ekstrand, Kress, Neufeldt, & Borowsky, 2019; Wingerak et al., 2017). The use of exception word targets and pseudohomophone foils should force orthographic lexical processing in tasks such as the OLDT, as an incorrect response will be generated if participants focus on phonological and/or semantic instead of orthographic lexical representations.
Our theory that the OLDT forces orthographic lexical processing is supported by the results in our word and bigram frequency analyses which show that performance is better for exception words that have high word frequency, whereas performance is worse for pseudohomophones that have high bigram frequency. Our word frequency effect aligns with past research that has attributed the effect to enhanced orthographic processing (Borowsky & Besner, 1993; Borowsky et al., 2013; McCann et al., 1988). The research on bigram frequency effects is far more limited, but bigram frequency effects, whereby increased bigram frequency is related to faster naming RTs, have been attributed to the efficiency of mapping graphemes onto phonemes and thus represent a measure of efficiency of sublexical phonological processing (Borowsky et al., 2013). Given this attribution to phonological processing, we argue that the worse performance for high bigram frequency pseudohomophones represents a form of phonological interference, possibly due to activation of representations in the phonological lexicon via the sublexical grapheme-to-phoneme conversion route.
It is likely the case that both routes are running in parallel (Paap & Noel, 1991), so successful performance in the OLDT simply requires monitoring the orthographic lexicon, which exception words will activate strongly and quickly by utilising word frequency-sensitive connections (Borowsky & Besner, 1993; Borowsky et al., 2013; McCann et al., 1988). The pseudohomophones will not activate the orthographic lexicon as strongly, nor as quickly, so the activation from the bigram frequency-sensitive grapheme-to-phoneme conversion path in the sublexical route will have more opportunity to (incorrectly) intervene. Single-route models will have a difficult time accounting for this double dissociation because they can only make use of one nonsemantic path from orthographic to phonological representations to handle the opposing effects.
Given this double dissociation, our results suggest that both lexical and sublexical processing benefitted from attentional exercise in our studies. This is an exciting extension to the past research on attentional training during reading, as most previous studies have focused on the benefits of attentional training (through video game play) on sublexical processes specifically (Bertoni et al., 2024; Franceschini & Bertoni, 2019). Future studies could use reading tasks that isolate phonetic decoding, such as reading exception words and pseudohomophones aloud (e.g., Kress et al., 2023) or a phonological LDT with pseudohomophone targets and nonword foils (e.g., McCann et al., 1988) to further contrast the impact of short-term attentional exercise on the two reading systems.
In light of the current results, we should also consider the results of the naming task in Kress et al. (2023), where we observed experience with peripheral visual demands was beneficial whereas experience with central visual demands was detrimental to reading aloud performance. These results are somewhat different from those observed in our current experiments. Experiment 1 was consistent with the Kress et al. (2023) results, whereas Experiments 2 and 3 replicated the peripheral but not central A-O exercise effects. One common difference that exists in Kress et al. (2023) and Experiment 1, where the double dissociation between peripheral and central A-O exercise exists, relative to Experiments 2 and 3, is the mixing of peripheral and central visual demands. The A-O exercise in Kress et al. (2023) is provided by the participants’ video game experience, and video games include a mix of peripheral and central visual demands. Although the location of stimulus presentation was blocked, Experiment 1 included a combination of peripheral and central A-O blocks to serve as both training and testing measures, so its design is more mixed than Experiments 2 or 3, where only one type of A-O exercise was presented. It could be the case that mixed A-O exercise captures the shared benefits of both types of visual demands, allowing peripheral-exclusive A-O exercise benefits to be observed.
Limitations and future directions
One interesting issue to consider is the case of bigram legality in pseudohomophone construction. Our pseudohomophones were constructed with the intent to use legal bigrams where possible, but in some instances, a very rare (e.g., uz in duz—found in real words such as fuzz; wv in swawv—bigram frequency of 0 according to the English Lexicon Project) was used. To address this concern, we repeated our correlational analyses, excluding the exception word/pseudohomophone pairs where the pseudohomophone included a rare bigram, and saw consistent patterns in our correlation (see the online Supplementary Material E). Nevertheless, future studies should consider constructing pseudohomophones and nonwords with systematically varying levels of bigram legality to further assess the differences in orthographic and phonological processing.
In addition, there has been some debate over the consistency of bigram frequency effects (e.g., Schmalz & Mulatti, 2017). However, mixed exception/regular word stimulus lists (such as those used by Schmalz & Mulatti, 2017) are more likely to result in inconsistent bigram frequency effects, because regular words can utilise the dorsal stream grapheme-to-phoneme conversion system (and thus can be affected by bigram frequency), whereas exception words cannot use grapheme-to-phoneme conversion to pronounce the word or make a word/nonword decision (and thus should not be affected by bigram frequency in the same way). Future studies should explicitly test whether exception words and regular words demonstrate different bigram frequency effect patterns, and we may expect that the bigram frequency should have a facilitatory effect on lexical decision performance for regular words, unlike exception words (no effect, as shown here) and pseudohomophones (detrimental effect, as shown here).
The current study focused on measures meant to capture orthographic (word frequency) and phonological processing (bigram frequency). In future studies, it would also be useful to measure the degree of semantic access in this task using a semantic measure such as participant imageability ratings for the base words.
A question raised by a reviewer was whether demographic differences could potentially contribute to the different patterns of results across the three experiments (Experiment 1 participants were the oldest and were not prescreened for education levels, whereas Experiment 2 had the lowest video game experience). However, we consider this unlikely, because the different demographics could counteract each other (e.g., age differences could be balanced by education differences or video game experience). To fully evaluate demographic differences, future studies should prescreen the participants on their demographic factors and baseline performance, and then make sure the two training groups are matched on these factors during the group assignment, or include continuous measures of these as variables in the analysis.
Other paradigm refinements should include manipulations of the exercise duration, such as by comparing single-session designs to multisession designs. Manipulating the exercise duration will determine where the optimal window is to maximise the benefits of A-O exercise while minimising the temporal costs for the individual. For example, reading intervention programmes in schools may have sessions cut short because of limited time and resources (Hall et al., 2022). By identifying the optimal A-O exercise window, a programme could be designed that would better accommodate the limited time and resources and thus be able to reach more individuals who could benefit from such exercise. In comparisons of single-session versus multisession designs, researchers may also be able to bring participants back for delayed post-tests to evaluate the persistency of the benefits gained through large or small A-O exercise (e.g., Pasqualotto et al., 2022).
Conclusion
Our set of experiments manipulated the location of stimuli to test whether A-O exercise may support reading processes. Although Experiment 1 offered some support of our A-O exercise hypothesis, Experiments 2 and 3 raised questions about whether large or small oculomotor shifts matter most in visual attention processes when trying to improve reading ability. Regardless, when the results of these studies are considered together, general learning effects are insufficient to explain the results, and A-O exercise, even if the movements are small, appears to play a role in the reading improvements observed. Future research comparing our small and large movements in A-O exercise to a zero A-O exercise condition will be an important next step in measuring the extent of A-O exercise benefits.
If future studies successfully narrow down an A-O exercise effect, programmes could be developed to specifically target particular reading processes. This will benefit individuals with dyslexia, as they can focus on the exercise that is most relevant to their individual reading deficit. For example, someone with surface dyslexia can focus on the exercise that targets orthographic processing, whereas someone with phonological dyslexia can focus on the exercise associated with phonological processing (Castles & Coltheart, 1993; Franceschini & Bertoni, 2019; McDougall et al., 2005; Sotiropoulos & Hanley, 2017).
Short A-O exercise activities such as a “game-like” version of the task in these studies could provide a boost in reading performance that is worthy of future research. With further study, the A-O exercise demonstrated in our experiments can be refined to be a quick and engaging activity to add to the toolkits of educators who support individuals with reading difficulties.
Supplemental Material
sj-docx-1-qjp-10.1177_17470218241310440 – Supplemental material for Effects of central vs. peripheral attentional-oculomotor exercise on lexical processing
Supplemental material, sj-docx-1-qjp-10.1177_17470218241310440 for Effects of central vs. peripheral attentional-oculomotor exercise on lexical processing by Shaylyn Kress, Scott Caron, Josh Neudorf, Braedyn Borowsky and Ron Borowsky in Quarterly Journal of Experimental Psychology
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) through an Alexander Graham Bell Canada Graduate Scholarship to S.K., Postdoctoral Fellowship to J.N., and Discovery Grant 18968-2013-25 to the senior author R.B.
Data accessibility statement
Supplementary material
The supplementary material is available at: qjep.sagepub.com.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
