Abstract
Previous research has shown the effects of task demands on pupil responses in both normal hearing (NH) and hearing impaired (HI) adults. One consistent finding is that HI listeners have smaller pupil dilations at low levels of speech recognition performance (≤50%). This study aimed to examine the pupil dilation in adults with a normal pure-tone audiogram who experience serious difficulties when processing speech-in-noise. Hence, 20 adults, aged 26 to 62 years, with traumatic brain injury (TBI) or cerebrovascular accident (CVA) but with a normal audiogram participated. Their pupil size was recorded while they listened to sentences masked by fluctuating noise or interfering speech at 50% and 84% intelligibility. In each condition, participants rated their perceived performance, effort, and task persistence. In addition, participants performed the text reception threshold task—a visual sentence completion task—that measured language-related processing. Data were compared with those of age-matched NH and HI participants with no neurological problems obtained in earlier studies using the same setup and design. The TBI group had the same pure-tone audiogram and text reception threshold scores as the NH listeners, yet their speech reception thresholds were significantly worse. Although the pupil dilation responses on average did not differ between groups, self-rated effort scores were highest in the TBI group. Results of a correlation analyses showed that TBI participants with worse speech reception thresholds had a smaller pupil response. We speculate that increased distractibility or fatigue affected the ability of TBI participants to allocate effort during speech perception in noise.
Introduction
The pupil dilation response to speech-in-noise processing is considered to be an objective measure of listening effort (Pichora-Fuller et al., 2016). There is a large range of studies showing that the pupil response is sensitive to properties of the speech perception tasks and stimuli used (for a review see, Zekveld, Koelewijn, & Kramer, 2018). In our previous studies, we demonstrated that pupil response is sensitive to the level of speech intelligibility (e.g., Kramer, Kapteyn, Festen, & Kuik, 1997; Zekveld, Kramer, & Festen, 2010) and type of background sound (Koelewijn, Zekveld, Festen, & Kramer, 2012; Wendt, Koelewijn, Książek, Kramer, & Lunner, 2018). We found this effect in both normal hearing (NH) and hearing impaired (HI) listeners.
According to the framework for understanding effortful listening (FUEL; Pichora-Fuller et al., 2016), listening effort depends on more than just task demands. Listener factors may also control the allocation of resources needed to overcome obstacles in goal pursuit. Research has shown that hearing status is one type of listener factor, which affects the pupil dilation response. To illustrate, while replicating the effects of intelligibility level and masker type on the pupil response, Zekveld, Kramer, and Festen (2011) observed a
Apart from hearing loss, there are other listener factors that have shown to be associated with the pupil response. Examples are cognitive and linguistic ability (for a review see, Zekveld et al., 2018). An individual’s linguistic processing ability can be assessed using a visual sentence completion test, known as the text reception threshold (TRT) task (Zekveld, George, Kramer, Goverts, & Houtgast, 2007). First, the TRT scores are known to be correlated with effects of interfering speech on both the speech reception threshold (SRT) and the pupil dilation response (e.g., Koelewijn, Zekveld, Festen, Rönnberg, & Kramer, 2012). Second, TRT scores correlate with working memory capacity measures (e.g., Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012). This is of interest since a person’s cognitive capacity could limits one’s ability to exert effort (Pichora-Fuller et al., 2016). In addition, Wang et al. (2017) showed that during speech processing in noise, a higher level of fatigue was related to a smaller pupil response. These findings together indicate that changes in the pupil dilation response evoked during speech recognition in noise not only depend on task-related factors, but the magnitude of the pupil response seems to also vary across different listener groups. Importantly, previous research does not allow for fully dissociating the impact of hearing loss as diagnosed by a pure-tone audiogram (PTA) or by a speech-in-noise task on the pupil dilation response. Although FUEL suggests the pupil response to reflect central processing, to our knowledge, there is no study yet that has investigated the pupil response in people who report difficulties in speech processing in noise but who have a normal PTA. Hence, it is unclear whether the observed differences in the pupil response between NH and HI listeners are related to differences in the PTAs alone or whether differences in central (functional) speech processing may underlie these differences.
A significant proportion of people with traumatic brain injuries (TBIs) report difficulties and high levels of effort when listening to speech in a noisy background even when these individuals have normal thresholds in the audiogram (Zhao & Stephens, 2007). A recent review by Werff (2016) encompasses an extensive overview of the impact mild TBI may have on auditory processing. Often both frontal and temporal regions of the brain are damaged in people with TBI caused by external force. These regions are typically vulnerable for trauma (French, Taber, Helmick, Hurley, & Warden, 2011), caused by fast acceleration or deceleration in movement, for example, a car accident or a fall. Given that temporal regions are often damaged, it is generally known that TBI may affect functions related to primary auditory processing needed for speech segregation. In this study, we included participants with TBI caused by external force or a CVA. All participants had a normal PTA but they reported serious difficulties understanding speech in a noisy background. These characteristics make the TBI group a unique group of listeners. The aim of this study was to examine their pupil dilation responses evoked by an SRT task (Plomp & Mimpen, 1979) and compare their responses with those of NH and HI listeners. Testing this group of TBI participants allowed the effects of functional hearing difficulties on the pupil response to be investigated, despite normal thresholds in the audiogram.
For this study, we used the same design and tasks as used in our previous studies including NH and HI participants without any reported neurological problems (Koelewijn, Zekveld, Festen, & Kramer, 2014; Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012). We used the SRT task to measure an individual’s ability to perceive sentences presented in noise. Participants listened to speech in fluctuating noise and to speech masked by a single talker at two different levels of intelligibility. The SRT task was used in combination with pupillometry, allowing us to investigate the effort individuals exploited in these different listening situations. Both peak pupil dilation (PPD) and mean pupil dilation (MPD) can be used as parameters to measure listening effort (Zekveld et al., 2011). PPD latency is an indicator of the time needed for cognitive processing (e.g., Hyönä, Tommola, & Alaja, 1995), and the baseline pupil size prior to the pupil response is considered to be an index of arousal or cognitive resources exerted in preparation for the task (e.g., Aston-Jones & Cohen, 2005). An individual’s linguistic processing ability was also assessed by administering the TRT task (Zekveld et al., 2007). We used the same setup as in our previous studies because our primary goal was to examine whether we would be able to replicate the effects of intelligibility level and masker type on both the SRT and pupil parameters in this unique group of participants with TBI.
In summary, the objective of this study was twofold. The first goal was to examine whether we would be able to replicate the effects of intelligibility and masker type on the pupil response in a group of listeners with TBI. Based on previous research showing that the pupil response is sensitive to intelligibility level and masker type independent of hearing status, we hypothesized (H1) that TBI participants would show similar effects. The second goal was to compare the results of the TBI participants with those of NH and HI listeners. We hypothesized (H2a) that the participants with TBI would have an increased SRT, indicating worse speech processing in noise, compared with what has been previously shown for NH participants without any reported neurological problems (Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012). The pupil dilation response was recorded while participants performed the SRT tasks. Based on the previously shown difference in PPD between NH and HI participants, we hypothesized (H2b) that a smaller PPD for the TBI compared with NH participants would be observed. Finally, we tested whether any speech processing deficit, when observed, would be restricted to the auditory domain. This was verified by having the TBI participants performing a TRT task. We hypothesized (H2c) that if the effect of TBI is restricted to auditory processing, TRT scores for the TBI group should be similar to those of NH and HI participants without any neurological problems (Koelewijn et al., 2014; Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012). This would indicate that cognitive capacity needed for language processing was not affected in the included group of TBI participants.
Methods
Participants
Twenty adults (aged between 26 and 62 years, mean age 47 years, four males) with a history of acute TBI participated in this study. Of them, seven participants reported TBI, nine reported a CVA, and four participants had another unspecified cause of TBI. All participants were recruited via a Dutch TBI information website (hersenletsel-uitleg.nl) and at meeting points (brain injury-cafés) where Dutch people with traumatic or non-TBI gather. All participants reported difficulties with listening to speech in background sound and for at least 1 year did not receive TBI-related medical treatment or rehabilitation therapy. All participants had a normal PTA, which averaged over both ears and across octave frequencies 1 to 4 kHz. was 9.1 dB Hearing Level (HL,
SRT—Task
Speech perception in noise was measured using SRT tasks (Plomp & Mimpen, 1979) with speech presented in fluctuating noise or with a single-talker masker (Festen & Plomp, 1990; Koelewijn et al., 2014). Target sentences were everyday Dutch sentences (Versfeld, Daalder, Festen, & Houtgast, 2000), spoken by a female voice. An example of an everyday sentence is “Hij maakte de brief snel open,” which directly translates to “He quickly opened the letter.” The single-talker masker contained concatenated sentences from another set uttered by a male talker. Fluctuating noise was created by multiplying noise by the speech amplitude envelope of the single-talker masker for two separate frequency bands below and above 1 kHz (Festen & Plomp, 1990). Both the single-talker masker and the fluctuating noise had a long-term average frequency spectrum identical to the spectrum of the target speech signal. The value of the SRT (dB signal-to-noise ratio [SNR]) was estimated for speech presented at 50% and at 84% intelligibility (sentences entirely correct) using a 1-up-1-down (Plomp & Mimpen, 1979) and a 4-up-1-down staircase procedure (Levitt, 1971), respectively. For each condition, the target speech level was fixed at 55 dB SPL.
The onset of the masker was 3 s prior to the onset of the target sentence and continued for 3 s after the offset of the target sentence. The length of each trial varied with the length of the presented sentence, which had a mean duration of 1.9 s (range 1.4–2.5 s). At the end of the trial, a 1000 Hz prompt tone was presented for 1 s after which participants were instructed to respond. Manipulation of both masker type and intelligibility level resulted in a total of four different conditions that were presented in a blocked fashion (i.e., 50% fluctuating noise, 50% single talker, 84% fluctuating noise, and 84% single talker). Each block contained 39 trials and the order of the blocks was counterbalanced (Latin square) over participants. The SRT was calculated for each condition (block) by averaging the SNRs at which the sentences were presented, with the exception of the first four sentences that were excluded. Prior to the experiment, participants were familiarized to the task at an intermediate sentence intelligibility level of 71% (2-up-1-down procedure) by listening and responding to 13 practice sentences for both masker types.
SRT—Self-Ratings
While performing the SRT tasks, listeners did not receive any feedback. After each block, participants were asked to rate their effort, performance, and quitting (task persistence; Koelewijn, Zekveld, Festen, & Kramer, 2012; Zekveld et al., 2010). For the effort rating, participants indicated how much effort it took on average to perceive the speech during that last block. This was rated on a visual analog scale from 0 (
TRT—Task
As in previous studies (Koelewijn et al., 2014; Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012), participants performed a TRT task (Zekveld et al., 2007). During this task, sentences were visually presented on a computer screen in a red font (lower case Arial, vertical visual angle of 0.48°) on a white background partially masked by black vertical bars. These bars were evenly distributed across the computer screen and the width of the bars was varied depending on the required percentage of unmasked text. Sentences were presented word-by-word with onset timings similar to the corresponding recorded SRT sentences. After the onset of the last word, the full sentence remained on screen for 500 ms (Besser, Koelewijn, Zekveld, Kramer, & Festen, 2012). Before this experiment, an adaptive 1-up-1-down procedure was applied, targeting the percentage of unmasked text required to read 50% of the sentences without any error. Four lists of 13 sentences each were presented. The first list was for practice purposes only and was excluded from the analysis. The TRT score was defined by the average percentage of unmasked text in the three remaining tests under exclusion of the first four sentences of each list. Lower scores indicate better performance.
Apparatus and Procedure
All testing was performed in a sound-treated room. After recording the participant’s audiogram, the test session started with the SRT tasks. During the SRT tasks, participants were asked to fixate their gaze at a dot (diameter 0.47°) located at eye level on a white wall at 3.5 -m distance. During the SRT test, the pupil diameter of the left eye was measured using an infrared eye-tracker (SMI, 2D Video-Oculography, version 4). The light intensity was adjusted such that, for each participant, the pupil diameter was around the middle of its dynamic range as measured by examination of the pupil size at 0 l× and 250 l× (see also, Zekveld et al., 2010). For the SRT tasks, audio in the form of stereo wave files (44.1 Hz, 16 bit) was presented binaurally by an external soundcard (Creative SoundBlaster, 24 bit) through headphones (Sennheisser, HD 280, 64 Ω). The TRT test was administered following the SRT tasks. During the TRT task, participants were seated in front of a computer screen (Dell, 17 in.) at 60 cm viewing distance. All tests were presented by a PC running Windows XP (Dell, Optiplex GX745, 2.66 GHz 2 Core). The whole procedure, including measurement of pure-tone hearing thresholds, near vision acuity, fitting the eye-tracker, practicing and performing the SRT tasks, practicing and performing the TRT task, and a 15-min break halfway through plus additional breaks requested by participants took 2 to 3 h. Note that seven participants requested an additional break, one participant requested a longer break halfway through without an additional break, and one participant took a nap during the break halfway through without requesting an additional break.
SRT—Pupillometry
During the SRT tasks, for each participant and trial, the pupil diameter and pupil
Statistical Analysis
To examine whether we would be able to replicate the effects observed in previous studies (H1), a repeated measures analysis of variance (ANOVA) testing the effects of intelligibility (50% and 84%) and masker type (fluctuating noise and single-talker masker) was performed on the SRT score, the pupil measures, and the subjective ratings of the TBI participants. Statistically significant (
For testing the Hypotheses 2a to 2c, the current results (PTA, SRT, PPD, self-rated effort, and TRT) were compared with data previously recorded for NH and HI participants without any neurological problems (Koelewijn et al., 2014; Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012). The NH and HI groups used in our previous studies each included 32 participants. From each of these groups, we selected a subgroup of 20 participants that was age matched to the current group of 20 TBI participants. This allowed us to compare the scores of the current TBI group comprising 20 participants with those of the NH and HI groups. The creation of age-matched samples was accomplished by removing the 12 oldest participants from the NH and HI groups. After matching, the average age of the NH group was 46.5 years (
Results
Speech Reception Threshold
Analyses on the SRTs (see Table 1 and Figure 1) revealed a significant main effect of intelligibility, SRTs (dB SNR) at two intelligibility levels for both masker types, averaged over subjects. The error bars show the standard errors for each condition. SRT = speech reception threshold; SNR = signal-to-noise ratio. Average SRT, MPD, PPD, PPD Latency, Pupil Baseline, and Self-Rated Effort, Performance, and Quitting Rate Scores, as a Function of Intelligibility Level and Masker Type in the TBI Group.
Pupil Measures
Analysis on the MPDs (see Table 1 and Figure 2) revealed a significant main effect of intelligibility, Pupil responses per condition averaged over participants. The onset of the sentences is at 0 s. The baseline is indicated as the average pupil diameter over 1 s preceding the start of the sentence. The area between the second and third dotted lines indicates the time window used for calculating the mean pupil dilation.
Self-Rated Scores
Analysis of the self-rated effort ratings (see Table 1) revealed a significant main effect of intelligibility,
Correlation Analyses
To make sure that interindividual differences in pupil dilation could not be explained by the absolute noise level of the masker, Pearson correlations were calculated between the pupil baseline and the SRTs for each of the four conditions (α = .0125). None of these correlations were significant (single talker 50%, SRT (dB SNR) as a function of PPD both averaged over intelligibility level and masker type. TBI = traumatic brain injury; PPD = peak pupil dilation.
Comparison of TBI, NH, and HI Groups (Testing H2a, H2b, and H2c)
The PTA between the TBI group, PTA (1–4 KHz) =9.1 dB HL, and NH group, PTA (1–4 KHz) = 9.0 dB HL, did not differ significantly (
Summary
The results of this study within the group of adults with TBI revealed significantly lower SNRs at 50% compared with 84% intelligibility and a significantly lower SNR for the single-talker masker compared with fluctuating noise at 50% intelligibility. The MPD, PPD, and PPD latency were all larger at the 50% compared with 84% intelligibility level, indicating higher listening effort over a longer period of time at lower intelligibility. However, contrary to what we observed in our previous studies, no effect of masker type on the pupil responses was found. The intelligibility effect was also observed for all
The results of the TBI group were statistically analyzed against results of the age-matched NH and HI groups. A group difference was observed for the SRTs that were slightly worse (higher) for the TBI group compared with the NH group, which confirms H2a. The PPD did not differ significantly between groups suggesting no difference in actual processing load while listening to speech in noise, which does not confirm H2b. Remarkably, the current result for the TBI group showed a negative correlation between SRT and PPD, showing participants with a worse SRTs to have overall smaller PPDs. The absence of a significant difference between the TBI, NH, and HI groups for the TRT suggests that the ability to complete sentences in the visual modality was not affected for this group of TBI participants, confirming H2c. In addition, the absence of a difference in TRT scores provided no indication of a deviation in cognitive capacity for the TBI participants compared with the NH and HI groups.
Discussion
H1. Replication of Findings
This study revealed that the previously observed main effects of intelligibility level on the SRT and the pupil responses were replicated in the TBI group. A main effect of masker type on SRT as previously shown for NH participants was not observed in the TBI group. In our previous studies, we found a slightly but significantly better SRT for speech presented with a single-talker masker in the background compared with the condition with fluctuating noise. In the TBI group, this effect was only observed in the 50% intelligibility condition. These results are generally consistent with previous studies including NH participants without reported neurological problems (Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012). Still, in our previous studies, a main effect of masker type on the pupil dilation response was found, with a larger response for the single-talker masker compared with the fluctuating masker at 50% and 84% intelligibility level (Koelewijn et al., 2014; Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012). This effect was not found in the TBI group.
People with TBI are known to be easily distracted (Kaipio et al., 2000) and often report the inability to suppress irrelevant sounds (Lipowski, 1975). They report being constantly aware of every sound around them, which is often described as annoying or confusing. High distractibility may hinder the processing of incoming auditory information implying that it cannot be sufficiently sorted out and filtered accordingly. Hence, high level of distractibility in the TBI group might have overshadowed any performance benefit for the single-talker masker over the fluctuating noise. In all, these results may indicate disrupted attentional processing required during speech processing for the TBI group compared with that of both NH and HI groups without any reported neurological problems (Koelewijn et al., 2014; Koelewijn, Zekveld, Festen, Rönnberg, et al., 2012).
H2. Comparison of TBI Results With Those of NH and HI Listeners
The significantly higher (poorer) SRTs for the TBI group compared with the NH group in the absence of a difference in PTA confirms H2a. The results indicate that TBI participants have difficulties with listening to speech in noise despite a normal audiogram. Remarkably, no significant differences were observed for the PPD between the TBI and NH group, indicating that there was no group effect of TBI on the PPD. Therefore, the idea that TBI would have a limiting effect on the deployment of cognitive resources, resulting in a smaller PPD (H2b), was not confirmed. Nevertheless, a trend was observed. This is further discussed in the following section.
Allocation of Effort
Figure 3 is an updated version of a previous figure (Koelewijn et al., 2014), which now includes the SRT and PPD data of the TBI group and age-matched NH and HI groups. Although no significant differences in PPD between the TBI and NH group were observed, the mean values of the PPDs measured for the TBI group were systematically smaller than those of the NH group for all conditions (see Figures 3). This is consistent with the previous observation showing that HI participants had a smaller pupil dilation response than NH participants (e.g., Zekveld et al., 2011) in listening conditions targeting 50% intelligibility. It is important to note, however, that the pupil dilation response in NH, HI, and probably also in TBI listeners depends on what intelligibility range is tested. This was shown by Ohlenforst et al. (2017). They examined the entire range from 0% to 100% sentence recognition performance and found that at higher levels of performance (≥60%), pupil dilation was larger for HI than for NH listeners. It is thus possible that the same interaction effect would have appeared in the TBI group if we had included a larger range of intelligibility levels in this study. This is an issue that needs to be addressed in future studies. Nevertheless, the question of why the pupil dilation response would be smaller for TBI group remains. Attentional problems, but also fatigue may underlie this phenomenon. This will be discussed in the following section.
Average PPD as function of the average SRT for both masker type and intelligibility conditions for participants of the ABI, NH, and HI group. The ovals denote the standard errors of both the SRTs and PPDs. TBI = traumatic brain injury; NH = normal hearing; HI = hearing impaired.
The Possible Influence of Fatigue
Interestingly, there was a significant correlation between SRT and PPD in the TBI group. As shown in Figure 4, TBI participants with an overall higher (worse) SRT score showed a smaller PPD. This was observed while within participants a worse SRT for a relatively difficult listening condition normally related to a larger PPD (e.g., Zekveld et al., 2010). This result was not related to an increased peripheral hearing loss, as all TBI participants had a normal PTA.
The significant correlation might be related to diversity in levels of fatigue in the TBI group. High levels of listening effort over a longer period of time can result in fatigue (Hornsby, Naylor, & Bess, 2016). A substantial number (almost one third) of the people with TBI report abnormally high levels of fatigue (French et al., 2011; van Zomeren & van den Burg, 1985). To illustrate, 40% of a sample of Swedish stroke patients reported feeling fatigue
The Pupil Baseline
Finally, the current results showed an effect of intelligibility on the pupil baseline in the TBI group. In previous studies among NH and HI listeners using the same setup (Koelewijn, Zekveld, Festen, & Kramer, 2012; e.g., Zekveld et al., 2010), no effect of intelligibility on the pupil baseline was shown. A lower pupil baseline in the 84% intelligibility than in the 50% condition, as observed in this study, could indicate a higher level of arousal at 50% intelligibility, which might reflect participants getting closer to depleting their cognitive resources in order to perform a more difficult task (Pichora-Fuller et al., 2016). Note that no significant correlations between baseline and SRT were observed indicating that it is unlikely that the absolute noise level used in the SRT task was the underlying cause of the effect of intelligibility on the baseline.
Limitations
Future research, if a sufficiently large sample of participants can be included, could incorporate ‘type of traumatic brain damage’ as a between-subject variable. However, it was beyond the goals of this study and was not achievable based on the small number of participants. In addition, future research should include independent and objective measures for distractibility and fatigue.
Conclusion
Participants with TBI in this study had normal PTAs and TRTs but showed difficulties with speech processing in noise, as reflected by worse SRTs compared with the NH group. This made them a unique group to study listening effort evoked by speech intelligibility in noise independently from differences in PTA. Previously reported effects of intelligibility on the SRT but not masker type were replicated in listeners with TBI. Most remarkably, in the TBI group, a significantly negative correlation between SRT and PPD was observed, indicating that given a fixed intelligibility level, higher (worse) SRTs related to smaller PPDs (Figure 4). Together with the absence of main effect of masker type for the TBI participants on SRT and the pupil dilation response, the results may suggest increased distractibility as a direct consequence of the TBI (Kaipio et al., 2000) or as suggested, related to high levels of fatigue (e.g., Boksem, Meijman, & Lorist, 2005). Explaining these observations in terms of depleted resources because of increased distractibility and fatigue is consistent with the FUEL (Pichora-Fuller et al., 2016) but still speculative. Further investigation is requested.
Footnotes
Acknowledgments
The authors thank Becky Desjardins and Bethany Plain for their helpful textual comments on the current version of the article.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was funded by The
