Abstract
In recent years, there has been a growing interest in the relationship between effort and performance. Early formulations implied that, as the challenge of a task increases, individuals will exert more effort, with resultant maintenance of stable performance. We report an experiment in which normal-hearing young adults, normal-hearing older adults, and older adults with age-related mild-to-moderate hearing loss were tested for comprehension of recorded sentences that varied the comprehension challenge in two ways. First, sentences were constructed that expressed their meaning either with a simpler subject-relative syntactic structure or a more computationally demanding object-relative structure. Second, for each sentence type, an adjectival phrase was inserted that created either a short or long gap in the sentence between the agent performing an action and the action being performed. The measurement of pupil dilation as an index of processing effort showed effort to increase with task difficulty until a difficulty tipping point was reached. Beyond this point, the measurement of pupil size revealed a commitment of effort by the two groups of older adults who failed to keep pace with task demands as evidenced by reduced comprehension accuracy. We take these pupillometry data as revealing a complex relationship between task difficulty, effort, and performance that might not otherwise appear from task performance alone.
Introduction
Since Kahneman’s (1973) seminal publication of
It is Kahneman’s limited-resource principle that underlies current arguments that, with degraded hearing, the perceptual effort needed for successful speech recognition draws resources that would otherwise be available for encoding what has been heard in memory (Pichora-Fuller, Schneider, & Daneman, 1995; Rabbitt, 1968, 1991; Surprenant, 2007; Wingfield, Tun, & Mccoy, 2005) and for comprehension of sentences that express their meaning with complex syntax (Wingfield, McCoy, Peelle, Tun, & Cox, 2006; see also Van Engen & Peelle, 2014).
One of the earliest extensions of Kahneman’s emphasis on a relationship between effort and performance was seen in Norman and Bobrow’s (1975) contrast between
Consistent with the earlier observations, FUEL defines effort as the allocation of mental resources to meet a perceptual or cognitive challenge, with the recognition that the effort put into a task will reflect a balance between listening difficulty, task demands, and motivation to expend the necessary effort to meet the processing challenge (Pichora-Fuller et al., 2016). One aspect of this relationship is the postulate that the degree of effort an individual is willing to expend toward a task will be greater if he or she believes that task success is possible, and hence that the required effort is justified (Richter, 2016).
The issue of processing effort takes on special significance in adult aging, where age-related hearing impairment is often accompanied by reduced working memory capacity and executive function (McCabe, Roediger, Mcdaniel, Balota, & Hambrick, 2010; Salthouse, 1994), and a general slowing in a number of perceptual and cognitive tasks (Salthouse, 1996). In this regard, it has been shown that older adults who can perform at ceiling or near-ceiling levels in comprehension of spoken sentences that have a simple syntactic structure (e.g., a canonical noun–verb–noun structure) may show significant difficulty comprehending sentences that express their meaning with complex syntax. Notably, this can occur even when the complex sentences contain exactly the same words as sentences with a simpler structure, are recorded by the same speaker, and are presented at the same sound level as the structurally simpler sentences (e.g., DeCaro, Peelle, Grossman, & Wingfield, 2016; Obler, Fein, Nicholas, & Albert, 1991; Stewart & Wingfield, 2009; Wingfield et al., 2006). Such findings, however, should not necessarily imply that older adults’ successful comprehension of syntactically simpler sentences may not require more effort than young adults’ to achieve the same level of comprehension performance. Nor do they directly address the question of whether listeners will continue to commit maximum effort when the linguistic complexity of the speech materials crosses a threshold of difficulty where successful comprehension may seem beyond reach.
In the following experiment, older adults with clinically normal hearing, age-matched older adults with mild-to-moderate hearing loss, and normal-hearing young adults were tested for comprehension accuracy and processing effort for spoken English sentences that varied the comprehension challenge in two ways: increasing the syntactic complexity of a sentence and increasing the gap distance between the agent performing an action in a sentence and the action being performed.
Syntactic Manipulation
The simpler of two syntactic structures we used were sentences with a
A number of reasons underlie the greater processing demands of sentences with an object-relative structure compared with those with a subject-relative structure. In object-relative sentences, the order of thematic roles is not canonical (i.e., the first noun is not the agent of the action) such that they require a more extensive thematic integration than subject-relative sentences (Warren & Gibson, 2002) and place a heavier demand on working memory (Cooke et al., 2002; DeCaro et al., 2016). In addition, because they are less common in everyday discourse (Goldman-Eisler, 1968; Goldman-Eisler & Cohen, 1970), they violate listener’s expectations of the likely structure of a sentence being heard, thus requiring a reanalysis of the sentence meaning (cf., Gibson, Bergen, & Piantadosi, 2013; Levy, 2008; Novick, Trueswell, & Thompson-Schill, 2005; Padó, Crocker, & Keller, 2009). Consistent with these arguments, past studies have reliably shown that object-relative sentences produce more comprehension errors than subject-relative sentences, especially for older adults (e.g., Carpenter, Miyake, & Just, 1994; Wingfield, Peelle, & Grossman, 2003). For this reason, the subject-relative and object-relative contrast offers a good basis for an examination of performance and effort.
Agent-Action Gap Distance
To further vary the processing challenge, both the subject-relative and object-relative sentences we used had a four-word adjectival phrase inserted in a position that put a short or long gap between the agent performing an action and the action being performed. For example, the following is a subject-relative sentence with a short gap: “
DeCaro et al. (2016) presented normal-hearing young adults and normal-hearing and hearing-impaired older adults with such sentences with instructions to indicate who was agent of the action. They found that all three groups’ comprehension was close to error-free for subject-relative sentences with a short gap between the agent and action. By contrast, all three participant groups began to make comprehension errors when hearing object-relative long gap sentences, with this degree of difficulty larger for the normal-hearing older adults than the young adults, and still larger for the hearing-impaired older adults (DeCaro et al., 2016). Consistent with a limited-resource model (Kahneman, 1973; Pichora-Fuller, 2016; see also Rabbitt, 1991; Wingfield et al., 2005), it was concluded that the listening effort associated with age-related hearing loss drew on older adults’ limited resources which, in turn, put a special burden on successful comprehension of the most computationally demanding object-relative long gap sentences. In offering this interpretation, however, DeCaro et al. acknowledged that the reference to effort was inferred from participants’ performance, rather than independently measured.
Measuring Processing Effort
A number of approaches have been taken to the measurement of processing effort and resource allocation associated with task performance. Dual-task studies have seen frequent use following the principle that the effort needed for success on, for example, a primary speech task will be revealed in a performance decrement on a concurrent unrelated task (Naveh-Benjamin, Craik, Guez, & Kreuger, 2005; Pals, Sarampalis, & Baskent, 2013; Sarampalis, 2009; Tun, Mccoy, & Wingfield, 2009). Such dual-task studies, however, are prone to trade-offs in the momentary attention given to each task that may complicate interpretation (Hegarty, Sha, & Miyake, 2000). Ratings of subjective effort, although having potential ecological interest, have shown mixed reliability as well as being an inherently off-line measure (see the review in McGarrigle et al., 2014). In contrast,
Besides its response to ambient light and emotional arousal (Kim, Beversdorf, & Heilman, 2000), a transient change in pupil size has been shown to correspond to perceptual and cognitive effort (Beatty & Lucero-Wagoner, 2000; Kahneman & Beatty, 1966). Relevant to our present interests, numerous studies have reliably shown that increasing the perceptual or linguistic processing challenge for speech materials is accompanied by a progressive increase in pupil dilation. In the former case, this is true whether perceptual difficulty is increased by acoustically degrading the speech signal (e.g., Kramer, Kapteyn, Festen, & Kuik, 1997; Kuchinsky et al., 2013, 2014; Winn, 2016; Winn, Edwards, & Litovsky, 2015; Zekveld, Festen, & Kramer, 2013; Zekveld & Kramer, 2014; Zekveld, Kramer, & Festen, 2010, 2011) or whether the perceptual challenge results from impaired hearing acuity (e.g., Ayasse, Lash, & Wingfield, 2017; Kramer et al., 1997; Kuchinsky et al., 2013; Zekveld et al., 2011). Increased pupil dilation is also seen when listeners are tested for comprehension or recall of clearly spoken sentences that increase in linguistic complexity (e.g., Engelhardt, Ferreira, & Patsenko, 2010; Just & Carpenter, 1993; Piquado, Isaacowitz, et al., 2010; Wright & Kahneman, 1971).
Experimental Question
Testing comprehension of sentences that vary in syntactic complexity and gap distance, our experimental question was whether one would see a progressive increase in effort with increasing levels of sentence complexity, or whether older adults with hearing loss might reach a tipping point, in which the difficulty of the comprehension task is no longer accompanied by an increased commitment of effort.
The possibility that increasing task difficulty may lead to such a tipping point appeared early in the pupillometry literature. Such an effect was observed by Peavler (1974) in a digit-list recall study with young adults. Peavler observed that pupil dilation as an indication of cognitive effort was larger while recalling longer compared with shorter digit lists but that pupil dilation plateaued for supraspan lists (i.e., digit lists too long for accurate recall). These results suggest that, so long as additional effort results in additional gains, one will see the expected increase in pupil dilation as an index of this effort. Beyond this point, however, a further increase in task demands may result in a plateau, or potentially a decrease, in effort and its concomitant pupillary response.
Subsequent to Peavler’s (1974) findings for digit recall by young adults, analogous findings have appeared or been postulated in the context of comprehension of degraded speech and listeners’ willingness or ability to commit additional effort when difficulty exceeds a certain point (cf., Kuchinsky et al., 2014; Ohlenforst et al., 2017; Richter, 2016; Wang et al., 2018; Wendt, Dau, & Hjortkjaer, 2016; Zekveld & Kramer, 2014; Zekveld et al., 2011). This question takes on special significance for older adults, and especially for those with hearing impairment, when faced with linguistically complex sentences as can occur in everyday speech communication.
By making continuous recordings of changes in pupil dilation while normal-hearing young adults, normal-hearing older adults, and hearing-impaired older adults were tested for sentence comprehension, we wished to test the alternative possibilities that (a) mean pupil sizes would be incrementally larger with increasing sentence complexity indicative of a progressive increase in commitment of effort, more so for older adults and older adults with impaired hearing or (b) whether pupil size as a measure of effort might plateau when sentence complexity reaches a difficulty threshold, with this potentially most likely to occur for older adults with impaired hearing.
Methods
Participants
Participants were 28 older adults, 14 with clinically normal hearing (3 males and 11 females) and 14 older adults with a mild-to-moderate hearing loss (4 males and 10 females). Audiometric assessment was conducted using an AudioStar Pro clinical audiometer (Grason-Stadler, Madison, WI, USA) using standard audiometric procedures in a sound attenuating testing room. The participants in the older adult normal-hearing group had a mean better ear pure tone average (PTA) of 18.75 dB HL (
The older adult hearing-impaired group had a mean better ear PTA of 36.43 dB HL (
The normal-hearing older adult group ranged in age from 69 to 79 years (
To ensure that the two older adult groups did not accidentally differ in cognitive ability, working memory capacity was assessed with the Reading Span task (RSpan) modified from Daneman and Carpenter (1980; Stine & Hindman, 1994). The RSpan task requires participants to read sets of sentences and respond after each sentence whether the statement in the sentence was true or false. Once a full set of sentences has been presented, participants are asked to recall the last word of each of the sentences in the order in which the sentences had been presented. Participants received three trials for any given number of sentences, with a working memory score calculated as the total number of trials in which all sentence-final words were recalled correctly in the correct order.
The RSpan task was chosen because it draws heavily on both storage and processing components that represent the characteristics of working memory (McCabe et al., 2010; Wingfield, 2016) and in written form would not be confounded with hearing acuity. Spans for the normal-hearing and hearing-impaired older adults, respectively, were 8.50 (
For purposes of comparison, we also included a group of 14 young adults (3 males, 11 females), ranging in age from 18 to 24 years (
It is common for older adults to have superior vocabulary scores compared with young adults (e.g., Kempler & Zelinski, 1994; Verhaeghen, 2003). This held true for the young adults’ vocabulary scores in the present sample, (
All participants reported themselves to be in good health, with no history of stroke, Parkinson’s disease, or other neuropathology that might compromise their ability to carry out the experimental task. All participants reported themselves to be monolingual native speakers of American English. Written informed consent was obtained from all participants according to a protocol approved by the Brandeis University Institutional Review Board.
Stimuli
Examples of Sentence Types.
To discourage listeners from developing incidental processing strategies based in limited sentence types, 100 filler sentences were prepared in addition to the 144 test sentences. Fifty-two of these were 6 - to 10-word sentences similar in content to the test sentences but that did not contain an embedded clause, and 48 consisted of 6-word sentences similar in structure to the test sentences but without the inclusion of a 4-word adjectival phrase.
The test sentences and fillers were recorded by a female native speaker of American English using Sound Studio v2.2.4 (Macromedia, Inc., San Francisco, CA, USA) that digitized (16-bit) at a sampling rate of 44.1 kHz. Recordings were equalized within and across sentence types for root-mean-square intensity using MATLAB (MathWorks, Natick, MA, USA).
Procedure
Stimulus presentations
Each participant heard 96 test sentences, 24 in each of the four sentence types (24 subject-relative short gap, 24 subject-relative long gap, 24 object-relative short gap, and 24 object-relative long gap) along with 100 filler sentences. Prior to each sentence presentation, the names of the agent and recipient of the action (e.g.,
Stimuli were presented binaurally over Eartone 3 A insert earphones (E-A-R Auditory Systems, Aero Company, Indianapolis, IN, USA) with a nominal presentation level of 20 dB above the individual’s better ear SRT (20 dB SL). Prior to the main experiment, audibility was tested by presenting two low predictability sentences taken from the IEEE/Harvard sentence corpus (IEEE, 1969) at 20 dB SL with the instruction to repeat each sentence as it was presented. An example sentence was “The lake sparkled in the red, hot sun.” Eleven older adults (7 normal hearing and 4 hearing impaired) were unable to accurately repeat back either of the two sentences. For these participants, the presentation level was increased to 25 dB SL, and two additional IEEE sentences were presented. All 11 were able to accurately repeat the sentences at this level. Twenty-five dB SL was used as the presentation level for these participants.
The main experiment was preceded by a brief practice session to familiarize the participant with the sound of the speaker’s voice and the experimental procedures. Ten sentences, representing a mix of test sentence and filler types, were used in the practice session. None of these sentences was used in the main experiment.
Pupillary response data acquisition and preprocessing
Throughout the course of each trial, the participant’s moment-to-moment pupil size was recorded via an EyeLink 1000 Plus eye-tracking apparatus (SR Research, Ontario, Canada), with pupil size data acquired at a rate of 1000 Hz and recorded via MATLAB software (MathWorks, Natick, MA, USA). The EyeLink camera was positioned below the computer screen that showed the names of the agent and recipient for the particular sentence. To facilitate reliable pupil size measurement, the participant’s head was stabilized using a customized individually adjusted chin rest that positioned the participant’s eyes approximately 60 cm from the EyeLink camera.
Pupil diameters below three standard deviations of a trial mean were coded as a blink (Wendt et al., 2016; Zekveld et al., 2010, 2014; Zekveld & Kramer, 2014). These blinks were removed, and linear interpolation was performed starting 80 ms before and ending 160 ms after each blink. This procedure was used to reduce artifacts resulting from partial closures of the eyelids at the beginning and ending of a blink that would cause brief partial obscurations of the pupil (Siegle, Ichikawa, & Steinhauer, 2008; Winn et al., 2015). A 20-sample moving average smoothing filter was then passed over the data (e.g., Winn et al., 2015).
To adjust for individual differences in pupil size dynamic range, at the beginning of the session, pupil sizes were recorded while the participant viewed a light screen (199.8 cd/m2) presented for 60 s followed by a dark screen for 60 s (0.4 cd/m2). This range was used for calculation of adjusted pupil size as will be discussed. Ambient light in the testing room was kept constant throughout the experiment.
Peak pupil dilation (PPD) was quantified as the peak pupil size for correct trials occurring after the onset of the verb in the embedded clause and before the participant’s response. Pupil size was baseline-corrected by subtracting measured pupil size from a pretrial baseline averaged over a 2-s window prior to the onset of the sentence. Pupil size changes were also scaled to account for age differences in the pupillary response (
Results
Comprehension Accuracy
Figure 1 depicts the accuracy data for all three participant groups. It can be seen that all three groups reached ceiling or near-ceiling accuracy levels for both the short and long gap subject-relative sentences. All three groups, however, appear to show poorer comprehension for object-relative compared with subject-relative sentences, and for the object-relative sentences, all three participant groups appear to perform worse in the long gap compared with the short gap condition. In addition, the presence of a long agent-action gap in the object-relative sentences appears to be associated with differentially poorer comprehension for the hearing-impaired older adults.
Mean comprehension accuracy for SR and OR sentences with a short or long gap distance between the agent performing an action and the action being performed. Data are shown for young adults with normal hearing acuity (left panel), older adults with clinically normal hearing for speech (middle panel), and older adults with a mild-to-moderate hearing loss (right panel). Error bars represent one standard error.
These data were analyzed using a logistic mixed-effects model, with syntax, gap distance, and participant group as fixed effects, and whether a trial was correct or incorrect as the dependent variable; individual participants and items were included as random effects using an intercept model based on Matuschek, Kliegl, Vasishth, Baayen, and Bates’s (2017) suggested method for choosing the most parsimonious model. The fixed effects were added into the model in the aforementioned order with the respective interactions entered after the main effects. The effects of the fixed effects on model fit were evaluated using model comparisons of the change in log-likelihood using the analysis of variance function (Bates, Maechler, Bolker, & Walker, 2015; Matuschek et al., 2017), with the young adult group treated as the baseline for comparison. All analyses were carried out in R version 3.4.4 using the
This analysis revealed a significant main effect of syntax,
A second analysis was conducted to compare the two groups of particular interest, the normal-hearing and hearing-impaired older adults (with the normal-hearing group treated as the baseline for comparison). This analysis was conducted on the object-relative sentences alone because participants from all groups reached ceiling or near-ceiling accuracy for the subject-relative sentences. A logistic mixed-effects model was run with gap distance and participant group as fixed effects, added in that order, and with the interaction term added last. Individual participants and items were entered as random effects. This analysis revealed a significant main effect of gap distance,
Pupillary Responses
Figure 2 shows the mean-adjusted PPD associated with correct comprehension of subject-relative and object-relative short and long gap sentences for each of the three participant groups. Two important features are suggested by visual inspection of Figure 2. First, there appears a progressive increase in PPD across the three participant groups, with the older adult hearing-impaired group showing the largest PPD and the young adults the smallest. Second, while the young adults show an increase in PPD in response to long gap sentences relative to short gap sentences for both subject-relative and object-relative sentences, the two older adult groups show this only for the subject-relative sentences. For the more challenging object-relative sentences, adding a long gap between agent and action in the sentences did not result in a further increase in PPD for the older adults.
Mean-adjusted peak pupil size associated with comprehension of SR and OR sentences with a short or long gap distance between the agent performing an action and the action being performed. Data are shown for young adults with normal hearing acuity (left panel), older adults with clinically normal hearing for speech (middle panel), and older adults with a mild-to-moderate hearing loss (right panel). Error bars represent one standard error.
These data were analyzed using a linear mixed-effects model in a similar manner to the comprehension accuracy data but the use of the
A second analysis was conducted to compare the two groups of primary interest, the normal-hearing and hearing-impaired older adults. Again, syntax, gap distance, and group were entered into a linear mixed model in that order with the respective interactions entered after the main effects. Participants and items were included as random effects and normal-hearing older adults were treated as the baseline for purposes of comparison. This analysis revealed a significant main effect of syntax,
An alternative to peak pupil diameter as an index of effort is the calculation of mean pupil size across a selected region of a trial (cf., Ahern & Beatty, 1979; Verney, Granholm, & Dionisio, 2001; Zekveld et al., 2010). For comparison, we calculated the mean pupil size over a 2 s time bin beginning at the onset of the verb in the embedded clause, the approximate time point at which the meaning of the sentence could be resolved. Two seconds was chosen as a sufficient window to capture the majority of the pupillary response to cognitive processing (Bitsios et al., 1996). At least for our data, calculation of mean pupil size yielded a similar pattern in response to sentence type and participant group as observed with PPD.
Effects of Working Memory Capacity and Hearing Acuity as Continuous Variables
As would be expected from extant literature, the young and older adults in this study differed in both working memory capacity and hearing acuity. This raises the question of the degree to which each of these variables may have contributed to the group differences in comprehension accuracy and processing effort as indexed by the pupillary response. To address this question, we conducted mixed-model analyses using the continuous variables of hearing acuity, working memory, and age as predictors, first of comprehension accuracy, and then the size of the pupillary response.
Predictors of comprehension accuracy
Logistic Mixed-Effects Models of Continuous Variables for Comprehension Accuracy.
Unstandardized coefficient (of standardized variables).
χ2 value for comparisons of each step of the model.
Degrees of freedom for the χ2 test.
SR and OR indicate subject-relative and object-relative syntactic constructions, respectively. The terms
For the subject-relative short and long gap sentences, there was no significant main effect of adding hearing acuity, working memory, or age onto the model. However, for the object-relative short gap sentences, there was a significant effect of adding hearing acuity into the model, while the effects of working memory and age were not significant once hearing acuity had been accounted for. For the most challenging condition, the object-relative long gap sentences, both hearing acuity and working memory, were significant predictors, while age again was not. That is, increased hearing thresholds (decreased hearing acuity) predicted decreased accuracy, and increased working memory capacity predicted increased accuracy.
The absence of a contribution of any of the predictor variables for the subject-relative sentences in either their short gap or long gap versions can be attributed to the participants’ near-ceiling performance for these less challenging sentence types. In the case of the object-relative short gap sentences, one sees a significant contribution of hearing acuity, indicative of a single-resource model in which effortful listening attendant to hearing impairment had a detrimental effect on comprehension accuracy even though presentations were at a perceptually audible level. However, when a long gap was imposed between the agent and action in the already challenging object-relative sentences, a significant contribution of hearing acuity was joined by a significant effect of working memory span as predictors of comprehension accuracy. Once these sensory and cognitive variables were taken into account, participant age added no additional significant effect.
Predictors of peak pupillary response
Linear Mixed-Effects Models of Continuous Variables for Peak Pupil Dilation.
Unstandardized coefficient (of standardized variables).
χ2 value for comparisons of each step of the model.
Degrees of freedom for the χ2 test.
SR and OR indicate subject-relative and object-relative syntactic constructions, respectively. The terms
Discussion
The so-called
Comprehension Accuracy
As would have been expected from prior research (DeCaro et al., 2016), both syntactic complexity and the gap between the agent performing an action in a sentence and the action being performed had significant effects on comprehension accuracy. The present finding that all three participant groups performed at ceiling or near-ceiling regardless of gap distance for subject-relative sentences is emblematic of older adults’ generally effective comprehension of spoken sentences when meaning is expressed in a canonical syntactic form and presented at a suprathreshold intensity level. When the sentence meaning was expressed with a more complex object-relative structure, however, an age difference now appeared. Most striking was a sharp decline in comprehension accuracy for object-relative sentences with a long agent-action gap for the older adults with hearing impairment when compared with older adults with better hearing acuity.
In noting this differential effect of linguistic challenge on the hearing-impaired older adults’ comprehension, it is important to emphasize that the long gap object-relative sentences had the same words, were recorded by the same speaker, and were presented at the same sound level as the accurately comprehended subject-relative short and long gap sentences. Consistent with Rabbitt’s (1968, 1991) effortfulness hypothesis, and the related central resource models (Kahneman, 1973; Pichora-Fuller et al., 2016; Wingfield, 2016), we would interpret this finding as demonstrating that the extra resources (effort) required by the hearing-impaired participants for success at the perceptual level drew on resources that would otherwise be available for higher level comprehension operations. This extra draw on resources would have little overt consequence for comprehension of computationally less demanding speech materials such as the subject-relative sentences used in the present experiment and might go unnoticed in everyday discourse. A detrimental effect of this same resource draw, however, would be revealed when the hearing-impaired listener is confronted by speech materials with high resource demands at the linguistic level, such as revealed in the present experiment with the object-relative long gap sentences.
A possible mechanism underlying Rabbitt’s
Although this concept has been tested at the level of word-list recall (Cousins, Dar, Wingfield, & Miller, 2014; Miller & Wingfield, 2010; Piquado, Cousins et al., 2010), one may speculate that a similar principle of sensory-based interference underlies errors in comprehension of sentence meanings, with such an effect appearing most prominently for computationally demanding sentences such as those used in the present experiment. This postulate might imply that the increased effect of interference would also be revealed in longer latencies to peak pupillary responses. This possibility was examined but did not appear in the present study. However, slowed perceptual processing could interfere with comprehension at the sentence level without necessarily resulting in a delayed pupillary response. This would be so, for example, if the primary effect of the interference is to reduce available resources for additional poststimulus processing. We suggest this as an area for future research.
Whatever the mechanism underlying the interference effect of degraded but identifiable stimuli on comprehension and memory, the comprehension data in the present experiment add to a number of studies showing poorer comprehension and recall of impoverished but suprathreshold auditory stimuli, and especially so for older adults, and older adults with mild-to-moderate hearing impairment (e.g., DeCaro et al., 2016; Pichora-Fuller et al., 1995; Surprenant, 2007; Ward et al., 2016; Wingfield et al., 2005, 2006; Winn, 2016).
Pupillary Response
Our pupillometry results join others in showing that the task-evoked pupillary response is sensitive to task difficulty. Such studies, some conducted with young adults and some with middle aged or older adults, have shown increased pupil dilatation when listeners have been presented with speech that has been acoustically degraded, that has complex syntax, or that has lacked helpful contextual constraints (e.g., Just & Carpenter, 1993; Kramer et al., 1997; Piquado, Isaacowitz, et al., 2010; Wendt et al., 2016; Winn, 2016; Winn et al., 2015; Zekveld et al., 2010, 2011; Zekveld & Kramer, 2014). Our present findings reveal the sensitivity of the pupillary response in a number of expected but previously untested ways. There were also findings that might have been less expected.
Among our expected outcomes, we observed that even though comprehension accuracy was at or near ceiling for the subject-relative sentences regardless of gap distance, comprehension of the long gap subject-relative sentences tended to be accompanied by larger pupil dilations than short gap subject-relative sentences across the three participant groups. As also might have been predicted from an extension of Rabbitt’s (1968, 1991) effortfulness hypothesis, decreased hearing acuity predicted a larger pupillary response. The importance of hearing acuity to processing effort was further confirmed for all four sentence types when hearing acuity, age, and working memory were considered as continuous variables.
Treating hearing acuity, working memory as measured by the RSpan, and age as continuous variables showed hearing acuity to have a significant contribution to comprehension accuracy only for the more resource-demanding object-relative sentences, while hearing contributed significantly to pupillary responses for all four sentence types. Working memory appeared as a factor on comprehension only for the most computationally demanding object-relative long gap sentences and not at all on pupillary responses.
A potentially less intuitive finding was the absence of an increase in pupil dilation for the more challenging long gap object-relative sentences compared with the short gap object-relative sentences for the two older adult participant groups. To the extent that the pupillary response serves as an index of processing effort, this would appear to reflect a plateau in the amount of effort the older adults were able, or willing, to commit to the comprehension task when that task had reached a tipping point of processing difficulty marked by a combination of complex syntax and a long agent-action gap. We consider this possibility in the following section.
Effort as Task Engagement
Although a direct extrapolation from the less complex sentence types might lead one to expect a further increase in task difficulty to be accompanied by a further increase in relative pupil dilation, we saw instead a more complex relationship between pupil size as an index of effort and processing challenge.
A plateau in pupil size when task difficulty begins to exceed processing capacity is not without precedence in the literature. Peavler (1974), for example, reported that pupil size plateaued when the size of digit lists exceeded young adults’ digit spans, while Zekveld and Kramer (2014), in a speech in noise study, observed a plateau in pupil dilation and a decrease in self-reports of expended effort, at especially low intelligibility levels (see also Kuchinsky et al., 2014; Ohlenforst et al., 2017; Wang et al., 2018).
The question might thus be raised as to whether there is a specific level of difficulty that can define a difficulty tipping point. In a speech in noise recall task, Ohlenforst et al. (2017) found the pupillary response to peak at approximately 50% accuracy in that half of the sentences were recalled correctly. In the present study, we saw a higher accuracy level leading to a plateau in pupil dilation for the older adults. It is difficult to compare these two findings, however, as Ohlenforst et al. (2017) were testing recall, with scores that could vary from 0% to 100%, while in the present study, we tested comprehension, where simple chance would yield 50% correct.
We cannot say with our current data whether the plateau in effort we saw for the older adults in the most difficult linguistic condition (long gap object-relative sentences) reflects a reduced ability to engage in effortful processing consequent to age-related changes in frontal attention networks, or an unwillingness to expend the necessary effort when the task demands make it uncertain whether additional effort will achieve success (cf., Kuchinsky et al., 2014; Richter, 2016). In either case, it can be seen that the plateau of effortful engagement was associated with markedly reduced comprehension accuracy for the hearing-impaired older adults under the dual challenge of object-relative sentences with a long agent-action gap.
Conclusions
It is the case that
Two points are nevertheless clear from the current study. The first is that differences in effort were observed even when comprehension accuracy was at near-ceiling levels of performance, demonstrating the importance of additional metrics for evaluating task difficulty beyond comprehension accuracy or intelligibility alone. The second is an observation consistent with a tipping point principle in performance and effort. That is, although ideally the effort given to a task should increase with the challenge represented by the task, in cases where a task crosses a threshold of difficulty task-evoked pupillary responses may reveal an inadequate commitment of effort, potentially associated with a decline in task success. It is thus clear that the use of pupillometry reveals a complex relationship between task difficulty, effort and performance that might not otherwise appear from task performance alone.
Footnotes
Acknowledgments
We thank Mario Svirsky for his insightful comments on a tipping point in task difficulty and its implications for performance. We also thank Victoria Sorrentino for her help in data collection.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Institutes of Health under award R01 AG019714 from the National Institute on Aging (to A. W.). N. D. A. acknowledges support from NIH training grant T32 GM084907. The authors also gratefully acknowledge the support from the W.M. Keck Foundation.
