Abstract
This study investigated the role of contextual information in speech intelligibility, the influence of verbal working memory on the use of contextual information, and the suitability of an ecologically valid sentence test containing contextual information, compared with a CNC (Consonant-Nucleus-Consonant) word test, in cochlear implant (CI) users. Speech intelligibility performance was assessed in 50 postlingual adult CI users on sentence lists and on CNC word lists. Results were compared with a normal-hearing (NH) group. The influence of contextual information was calculated from three different context models. Working memory capacity was measured with a Reading Span Test. CI recipients made significantly more use of contextual information in recognition of CNC words and sentences than NH listeners. Their use of contextual information in sentences was related to verbal working memory capacity but not to age, indicating that the ability to use context is dependent on cognitive abilities, regardless of age. The presence of context in sentences enhanced the sensitivity to differences in sensory bottom-up information but also increased the risk of a ceiling effect. A sentence test appeared to be suitable in CI users if word scoring is used and noise is added for the best performers.
Keywords
Introduction
Cochlear implants (CIs) are currently the treatment of choice for bilateral severe to profound postlingual sensorineural hearing loss, with significant improvements reported in speech intelligibility and quality of life (Gaylor et al., 2013; McRackan et al., 2018). The effect of a CI on speech intelligibility is usually measured with standardized speech tests. However, much variation in used speech materials and scoring methods exists between studies, as reported in Table II of the study of McRackan et al. (2018). Most studies used lists of CNC (Consonant-Nucleus-Consonant) words with a score of either percent correct words or percent correct phonemes. Besides CNC words, several studies reported the use of sentence tests. One of the most important differences compared with word tests is the possibility of using context, because the words in the sentences are related to each other. Although not all words of a sentence may be perceived correctly, a listener may reconstruct the correct sentence based on a few perceived words. The amount of available contextual information in the sentences of a test has a substantial effect on the score that will be obtained. More context will lead to a better predictability of missing parts and hence to a higher speech score (Boothroyd & Nittrouer, 1988), although the resulting score may depend on the ability of the listener to make use of this contextual information (Grant & Seitz, 2000). However, in the literature, it is reported that sentence tests may be too difficult for use in CI listeners (van Wieringen & Wouters, 2008) or that listening to sentences may require much listening effort (Theelen-van den Hoek, Houben, & Dreschler, 2014). This is not in accordance with the finding of Winn (2016) that understanding of high-context sentences in CI users required less effort than understanding of low-context sentences. Given these observations, it is important to consider whether clinically available sentence tests may be a better choice for evaluating the effect of CI treatment compared with CNC word tests. Especially the effect of contextual information in the sentences needs to be considered.
Several studies that focused on sentence tests for CI users mainly reported on test properties, like the risk of floor or ceiling effects and good reproducibility (test–retest reliability). A floor effect exists if a relatively large proportion of a group of listeners obtains a score on or very nearby the minimum score of a test (in case of a speech test, this is usually 0% intelligibility). A ceiling effect exists if a relatively large proportion of a group of listeners obtains the maximum score of a test. For example, Gifford, Shallop, and Peterson (2008) reported that with the Hearing in Noise Test (HINT) sentence test 28% of 156 adult CI users achieved the maximum score and 71% reached a score above 85% sentence intelligibility in quiet. This makes the HINT not responsive to differences in stimulation strategies or different signal processing options for high-performing CI users. The HINT sentences were selected from the Bamford-Kowal-Bench (BKB) sentences (Bench, Kowal, & Bamford, 1979). These sentences have an easy structure and consist of relatively easy, frequently used words. Words that are unintelligible in the first instance are identified easily, because they are highly predictable. According to Boothroyd and Nittrouer (1988), sentences with high predictability result in higher scores than sentences with low predictability and are therefore more prone to ceiling effects. Ebrahimi-Madiseh, Eikelboom, Jayakody, and Atlas (2016) showed that a ceiling effect also exists in the City University of New York sentence test (Boothroyd, Hanin, & Hnath, 1985) if used in CI recipients. Gifford et al. (2008) recommended the use of the Arizona Biomedical Institute sentence test (Spahr et al., 2012), because this test contains more difficult, less predictable sentences, spoken by different talkers in a casual style. Only 0.7% of the CI users reached the maximum score. The Minimum Speech Test Battery for adult CI users (Luxford, Ad Hoc Subcommittee of the Committee on Hearing, & Equilibrium of the American Academy of Otolaryngology-Head and Neck Surgery, 2001; Minimum Speech Test Battery, 2011) recommends assessment of performance with both CNC word and sentence materials, to increase the probability that a patient’s performance will be within the range of at least one test, not confounded by either ceiling or floor effects.
Several studies reported on the reproducibility of sentence tests by describing the test–retest variability (e.g., Firszt et al., 2004; Spahr et al., 2012). The test–retest variability is, among other factors, related to the effective number of statistically independent elements in the speech, which depends on the amount of contextual information within the sentence (Boothroyd et al., 1985; Boothroyd & Nittrouer, 1988; Spahr et al., 2012; Versfeld, Daalder, Festen, & Houtgast, 2000).
Until recently, relatively little attention has been paid in the literature to the ecological validity of a speech test. Ecological validity means that the speech used must be characteristic of everyday speech in different aspects, for example, speaking rate and clarity, sentence structure, and topics. An important aspect of ecological validity is that the speech contains contextual information, as in real speech. The performance on an ecologically valid speech test may better reflect the perceived difficulties with speech intelligibility in real life. A test with sentences could arguably serve as more representative of everyday conversation than a word test. The Arizona Biomedical Institute sentences have relatively good ecological validity (Spahr et al., 2012). Another test that is designed to be more ecologically valid is PRESTO (Perceptually Robust English Sentence Test Open-set), which incorporates variability in words, sentences, talkers, and regional dialects (Gilbert, Tamati, & Pisoni, 2013). In the Netherlands, the VU sentences (Versfeld et al., 2000) have good ecological validity, because they are taken from newspapers, have variation in sentence structure and topics, and are spoken with a normal speaking style and rate.
However, when testing CI recipients, ecological validity is often secondary to the ease of the test material or properties that are thought to better suit the capabilities of CI users. For example, the Dutch Leuven Intelligibility Sentence Test (LIST) (van Wieringen & Wouters, 2008) uses a relatively low speaking rate of 2.5 syllables/s and clear speech, to make the test easier for CI recipients. Theelen-van den Hoek et al. (2014) investigated if it was possible to reliably measure the speech-reception threshold in noise (SRTn) in CI listeners with the Dutch matrix test. A matrix test generates sentences with a length of five words from a matrix that contains 10 alternatives for each word position. This results in meaningful semantically unpredictable sentences with a fixed grammatical structure. These sentences contain little contextual information and are not very representative for everyday speech. The BKB Speech-in-Noise (BKB-SIN) Test is often used to test CI users because of its easy sentences (Bench et al., 1979). In all these examples, the specific material or test characteristics lead to a reduced ecological validity of the test.
CI recipients have more difficulties with speech perception, because their CI delivers a degraded signal. The quality of the speech signal is reduced due to limited spectral resolution (Friesen, Shannon, Baskent, & Wang, 2001; Henry & Turner, 2003; Winn, Chatterjee, & Idsardi, 2012) and temporal fine-structure cues (Loizou, 2006; Rubinstein, 2004). In other words, the bottom-up information is limited. Consequently, CI users have to rely more on top-down processing based on linguistic context (Kong, Donaldson, & Somarowthu, 2015; Nittrouer et al., 2014; Oh, Donaldson, & Kong, 2016; Winn et al., 2012).
Therefore, it is reasonable to assume that in CI recipients, speech intelligibility depends also on nonauditory factors like linguistic skills and cognitive abilities. Some studies investigated the relationship between speech intelligibility and linguistic skills or cognitive abilities in adult CI users. Heydebrand, Hale, Potts, Gotter, and Skinner (2007) found that better intelligibility of CNC words 6 months after cochlear implantation was associated with better verbal learning scores and verbal working memory (letter span) but not with general cognitive ability. Holden et al. (2013) reported a significant positive correlation between a composite measure of cognition (including a vocabulary test, a forward and backward digit span tests, and a verbal learning test) with CNC word recognition scores. In contrast, Moberly, Harris, Boyce, and Nittrouer (2017) found no significant correlation between sentence intelligibility in noise (percentage of words correct) and verbal working memory accuracy scores for serial recall of spoken nonrhyming words. Given these inconclusive findings, in the current study, we explored the relation of working memory capacity with sentence intelligibility and word intelligibility within the same group.
Some studies have investigated the use of contextual information in CI users. Amichetti, Atagi, Kong, and Wingfield (2018) reported that CI users made effective use of linguistic context. Older CI users were able to use context to compensate for their initial disadvantage in recognizing words in low-context conditions compared with young CI users but were also more hindered by interference from other words that might also be activated by context. Winn (2016) showed that listening effort as measured by the pupillary response is higher in CI users than in NH listeners, but the listening effort is less for high-context sentences than for low-context sentences. Results from Başkent et al. (2016) suggest that top-down restoration of interrupted speech can only be achieved in a more limited manner in CI listeners compared with NH listeners. Uncertainty still exists about whether CI users make more or less use of contextual information compared with NH listeners.
In summary, contextual information in a speech test is an important factor because of its influence on test scores, reliability, the relation with ecological validity, and the relation with cognitive and linguistic abilities. In this study, we investigated these aspects of contextual information in an ecological sentence test and a CNC words test in CI users. The purpose was to answer the following questions:
What is the effect of contextual information from the speech materials on speech intelligibility in CI users? Are sentence intelligibility and the use of contextual information related to verbal working memory in CI users? To what extent is an ecologically valid sentence test suitable in CI users with respect to a possible ceiling effect, the responsiveness to differences in the CI signal and the reproducibility of the test compared with CNC wordlists?
Materials and Methods
Participants
Fifty adult CI recipients were included in this study, with a mean age of 63 years (
Twenty-seven participants had an Advanced Bionics implant with at least 14 active electrode contacts and a Naida Q70 sound processor with all sound enhancement algorithms switched off. Twenty-three participants had a Cochlear Ltd implant with at least 21 active electrode contacts and a Nucleus 5 sound processor with Autosensitivity and ADRO active, as in their daily life program. Volume adjustments were not allowed during the test session.
For the speech-in-noise test, the reference data for normal hearing (NH) was based on 16 subjects, with a mean age of 22 years (
Participants signed a written informed consent form, and the Erasmus Medical Center Ethics Committee approved the study protocols of the original studies whose data were taken (as described in the Design and Procedures section).
Speech Intelligibility Tests
Speech intelligibility was measured with Dutch female-spoken, unrelated sentences (Versfeld et al., 2000). These sentences were representative for daily-used communication and mainly selected form a newspaper database. The sentences were pronounced in a natural, clear manner with normal vocal effort and speaking rate. For the estimate of the amount of context, we needed sentences with a fixed number of words (see Context Parameters section). Therefore, we selected sentences with a length of six words and grouped them into lists of 26 sentences. The presentation level of the sentences was fixed at 70 dB (SPL). This speech level is often reached in noisy situations (Pearsons, Bennett, & Fidell, 1977). Participants were instructed to repeat as many words as possible of each sentence and to guess when unsure about any word.
The proportion of correct recognized words in quiet (PCq) was measured at an SNR of 40 dB (i.e., a noise level of 30 dB). This is equivalent to the speech score in quiet, but it has the advantage that it is a distinct point on the psychometrical curve, instead of being the asymptotic value. The SRTn at 50% word intelligibility was measured in steady-state noise with a speech spectrum that corresponds to the long-term spectrum of the sentences. The noise level was varied following an adaptive procedure based on a stochastic approximation method with step size 4 (PC(
Phoneme perception in quiet was measured with the clinically used Dutch word lists for speech audiometry of the Dutch Society of Audiology (Bosman & Smoorenburg, 1995), which consist of 11 phonetically balanced CNC words. Data were obtained from a participant’s clinical record if it was measured within 6 months before the visit or measured just before the experiment otherwise. The phoneme perception score was measured at 65 and 75 dB (SPL). These scores were averaged to reduce measurement variability and to obtain an estimate of the score at 70 dB (SPL).
For the reference data of the speech-in-noise test in the NH group, the SRTn was measured along with the proportion of correct words at four SNRs around the individual SRTn.
Context Parameters
There are several approaches to quantify the use of context information in speech perception. In this study, we used the approaches of Boothroyd and Nittrouer (1988) and Bronkhorst et al. (1993). Boothroyd and Nittrouer (1988) described two equations to quantify the role of context. The first equation describes the relationship between the recognition probability
The second equation describes the relationship between the recognition probability
The parameter
We calculated a
Bronkhorst et al. (1993) developed a more extensive model for context effects in speech recognition. Their model gives predictions of the probabilities
The context parameters
Definition of Three Different Context Measures.
The context model of Bronkhorst et al. was fitted to the data of this study, resulting in a set of context parameters
To model the relation between scores for the CNC speech material and the VU sentence material, we regarded the CNC word scores as proportions correct of isolated words (without context) that could be used as input in the context model of the sentences. However, the words in the sentences have different lengths, varying from 2 phonemes to 10 phonemes (mean: 4.4), while CNC words have 3 phonemes. Therefore, we designed a transform of the CNC word scores to scores for words of five phonemes (as the first integer value above the mean phoneme length of 4.4). This transform is a simplification, because in fact the transform should be the weighted sum of the transforms for each number of phonemes. However that would result in too many parameters. Because we only fit the relation between the score of isolated phonemes and Illustration of the transform of a CNC word score to a word from sentence score, using the context model of CNC words (solid line in left panel), a context model of words with five phonemes (dashed line in left panel), and the context model of sentences (solid line in right panel).
Responsiveness and Reproducibility
We defined the responsiveness to bottom-up differences as the change in a speech score in reaction on a change in the PC of isolated phonemes (Δpisol_ph). We regarded the last as an adequate measure of sensory bottom-up information in accordance with Boothroyd and Nittrouer (1988). It was not possible to measure these proportions correct, because no recordings of isolated phonemes were available. However, measured values were not needed, because the context model provided us with the relations between the PC-isolated phonemes and the other speech measures
For example, in Figure 1, the slope of the curve for CNC words (left panel) is almost one. This slope is the responsiveness for CNC words. For sentences, the transform of Figure 1 was used to obtain the responsiveness.
We also defined a measure of reproducibility with the influence of context included. As already described by Thornton and Raffin (1978), each score from trials having two response options (“true” or “false”) can be modeled according to a binomial distribution. In a sentence test with word scoring, the recognition of each word can be true or false. However, in a sentence, the recognition of each word is not independent from the recognition of the other words. According to Equation 1, there are only
We calculated also responsiveness-reliability ratios. Use of context may enhance the responsiveness, but may also enlarge the
Reading Span Task
We used a computerized Dutch version of the Reading Span Task as a measure of verbal working memory capacity (van den Noort, Bosch, Haverkort, & Hugdahl, 2008). Participants had to read sentences aloud, which appeared on a computer screen for 6.5 s, and to remember the final word of each sentence. After reading the sentence, they had to press the space bar to go to the next item. If participants could not finish the sentence within this time, the next sentence was shown automatically. Sentences were presented in different set sizes of 2, 3, 4, 5, or 6 sentences in random order. After a set, the word “recall” appeared, and the participants had to recall the final word of each sentence in the set (in free order). The reading span (Rspan) score was the average of the number of correctly recalled words for three sets of 20 sentences, giving a Rspan score range from 0 to 20.
Design and Procedures
The speech intelligibility and reading span data were available from three recent studies of our Department of Otorhinolaryngology: data of Vroegop, Dingemanse, van der Schroeff, Metselaar, and Goedegebure (2017) and data of Dingemanse and Goedegebure (2018) and Dingemanse, Vroegop, and Goedegebure (2018). From Dingemanse and Goedegebure (2018), we included only 11 participants, because the other participants were already included from Dingemanse et al. (2018). In all studies, each participant was tested in one test session following partly the same protocol. First, a practice run of the sentence-in-noise test was done to make the participants familiar with the voice and the task and to obtain a first estimation of a participant’s SRTn. Second, sentence tests in quiet and in noise were performed. Next, tests were performed that were specific of the aforementioned studies where the data are taken from. At the end of the test session, a Reading Span Task was performed to obtain a measure of the verbal working memory span.
Equipment
All testing was performed in a sound-treated room. Participants sat 1 m in front of a Westra Lab 251 loudspeaker that was connected to an external soundcard (MOTU UltraLite mk3 Hybrid and after failure of the MOTU card a Roland Octa-capture UA-1010, calibration was checked) and a computer. The tests were presented in a custom application (cf. Dingemanse & Goedegebure, 2015) running in MATLAB.
Data Analysis
Speech performance scores were transformed to rationalized arcsine unit scores in order to make them suitable for statistical analysis according to Studebaker (1985), but not for use in the context models. In cases of multiple comparisons, we used the Benjamini–Hochberg method to control the false discovery rate at level 0.05 (Benjamini & Hochberg, 1995). Data analysis was performed with MATLAB (MathWorks, v9.0.0).
Results
Descriptive Values of Mean (
CNC = consonant-nucleus-consonant; SRTn = speech-reception threshold in noise; PCq = proportion of correct recognized words in quiet; PCn: proportion of correct recognized words in noise; CI = cochlear implant.
Use of Context
Figure 2 shows the results for each of the three context parameters Left panel: Context parameters 
The center panel of Figure 2 shows the calculated
The context model was also fitted to the sentence intelligibility data, following the same approach as in the fitting of the CNC words. Both the data of sentences in quiet and in noise were used, because we found that the speech intelligibility in quiet (PCq_wrd) and in noise (SRTn) were highly correlated (ρ = 0.87, Left panel: Context parameters 
The center panel of Figure 3 shows the calculated
Speech Intelligibility and Context Factors in Relation to the Reading Span
Spearman Correlation Coefficients of Speech Intelligibility Measures (PC and SRTn), and Context Factors (
The correlation is significant (<.05) after correction for multiple testing. Variables that were partialled out were given between brackets.
None of the
Table 3 provides also Spearman correlation coefficients for correlations of speech intelligibility measures with age. All speech scores tend to be lower for higher age, but the correlations were not significant, except for the SRTn measure. The
Responsiveness and Reproducibility
We plotted relations between the different scoring methods and the different speech materials in Figure 4 to obtain information about floor and ceiling effects and to get more insight into the suitability of the materials and scoring methods in individual CI users. In Panel a of Figure 4, the CNC word scores (PCq_wrd_CNC) are plotted against the CNC phoneme scores (PCq_ph_CNC). The Relations between proportions correct recognition for different scoring methods and different speech materials. Panel a shows the relation between CNC phoneme scores (PCq_ph_CNC) and CNC word scores (PCq_wrd_CNC). Panel b shows the relation of the proportion of correctly recognized words from sentences (PCn_wrd) and the proportion correct recognized sentences (PCn_sen). The curves in Panels a and b are the result of fitting of the context model of Bronkhorst et al. (1993) to the data. Panels c and d show a comparison of CNC phoneme scores with scores from the sentence material. See the text for more information. Data from speech in noise are plotted with a 
Panel c of Figure 4 shows that, on average, the PC words from sentences were higher than CNC phoneme scores for phoneme scores >0.5. Panel c shows an apparent ceiling effect for words from sentences. Panel d shows that the PC sentences were less than the PC phonemes, except for phoneme scores >0.8. For sentence scoring, no ceiling effect was seen, but a floor effect was obvious.
The plotted curves in Panels c and d of Figure 4 are based on a fitted transform of CNC word scores to sentence scores, as described in the Methods section and illustrated in Figure 1. The resulting values of the ci (
Interestingly, the sentence scores in Panel d differ largely between subjects in a range of 0.15 to 1 for phoneme scores between 0.5 and 0.8, suggesting that the ability to use contextual information differs between subjects. Therefore, we calculated the correlation between sentence scores and the
The left panel of Figure 5 shows the PC of the different scoring methods and the different speech materials, plotted against the PC for isolated phonemes. From this figure, it is clear that differences in ceiling effects between materials are related to the amount of context within the material. For sentences, the PC score is already near maximum if still not all isolated phonemes were recognized. If the wholes are scored (CNC words or sentences), a larger PC-recognized isolated phonemes is needed for correct understanding of the wholes.
Left panel: Proportion correct values of the different speech materials and scoring methods and plotted against the proportion correct for isolated phonemes as obtained from the context models. Center panel: Standard deviations of the proportion correct values of the different speech materials and scoring methods from Equation 5. Right panel: Responsiveness-reliability ratios for CNC words and sentences with different scoring methods from the CI group relative to the responsiveness-reliability ratio of isolated phonemes.
The center panel of Figure 5 shows the
The right panel of Figure 5 presents the relative responsiveness-reliability ratios for CNC words and sentences with different scoring methods. As explained in the Methods section, the slope of the curves of the left panel was divided by the
Discussion
Use of Context
This study has shown that contextual information from the speech materials has several effects on speech intelligibility in CI users. First, an important finding of this study was that CI users rely significantly more on contextual information in speech perception than normal-hearing listeners. This was true for both CNC words and sentences. In CNC words, the contextual information comes mainly from phonotactic constraints: the permissible phoneme sequences or syllables in a language. In the recall of sentences, the difference with NH listeners was largest if three, four, or five words were missing, that is, if relatively little information is available initially (see left panel Figure 3). For sentences, the difference between the CI group and the NH group is mainly the difference in the
A second effect of the extensive use of contextual information in CI users is that the variance in performance scores is somewhat increased, especially in CNC phoneme scores. This observation resulted from Equation 5, which shows that a lower
Third, this study showed that the use of contextual information from sentences could enhance the responsiveness of the speech test to changes in sensory bottom-up information on speech scores. This follows from the interpretation of Figure 5 (left panel) that due to the use of contextual information the responsiveness (the slope of the curves) was greater than one, meaning that a change in sensory bottom-up information (isolated phonemes) leads to an even greater change in word scores. This finding is in accordance with the study of Kong et al. (2015) who reported that the measured effect of electric-acoustic stimulation was larger if measured with high-context sentences compared with low-context sentences. So, the use of speech materials with context information is more sensitive to changes in bottom-up information than tests that aim to measure the amount of bottom-up information directly, for example, a nonword repetition test (e.g., Moberly et al., 2017).
Speech Intelligibility and Context Factors in Relation to the Reading Span and Age
The use of contextual information differed between CI users. This individual ability was best reflected by the individual
Interestingly, the capacity of using contextual information in sentences was only associated with working memory and not with age. As we found a negative correlation between working memory and age, as expected, we could also expect that older people have more difficulty in using context. This idea is supported by Wingfield, Alexander, and Cavigelli (1994) who found that older adults are less effective in retrospective identification of an unrecognized word that is followed by context words. Other studies reported a greater degree of interference from other words in older adults that may negatively affect the retrospective identification from contextual information (Amichetti et al., 2018; Lash, Rogers, Zoller, & Wingfield, 2013; Sommers, 1996; Sommers & Danielson, 1999). However, there is also an effect of aging on using context in the opposite direction, as older adults have on average a larger vocabulary size than younger adults (Burke & Peters, 1986; Verhaeghen, 2003), which could help with recognition of indistinct words from context. The combined effect of these factors is that in older adults, word recognition is facilitated by sentence context to an equal or greater degree than in young adults (Amichetti et al., 2018; Dubno, Ahlstrom, & Horwitz, 2000; Grant & Seitz, 2000; Nittrouer & Boothroyd, 1990; Pichora-Fuller, Schneider, & Daneman, 1995). This might explain our finding that the
Suitability of an Ecologically Valid Sentence Test for Testing CI users and Recommendations for Clinical Practice
The results of this study suggest that an ecologically valid sentence test is suitable for testing speech intelligibility in CI users if word scoring is used. It appeared that the sentences were not too difficult to recognize for CI users.
The suitability of a test depends on the goal of the test. If the goal is to investigate differences in stimulation strategies or different signal processing options, it is recommended to use speech materials with contextual information within the sentences, word scoring, and a target PC in the mid-range (between 0.3 and 0.7). For CI users having a PC words from sentences in quiet ≥0.7, the addition of noise is advised to bring the PC in the responsive mid-range. This recommendation is based on the results in Figure 5, showing that the sensitivity to reliably measure differences between conditions is best if a sentence test with word scoring is used. As explained earlier, the context effect increases the responsiveness to differences in sensory bottom-up information on speech scores.
If the goal is to measure the longitudinal improvement in speech perception due to treatment with CI, the use of the same speech tests pre- and postoperatively is required. From the two speech materials used in this study, the CNC words with phoneme scoring seem to be the best candidate for a longitudinal analysis, because with CNC phoneme scoring there is less risk of a floor or ceiling effect than in a sentence test. The use of phoneme scoring is recommended, because the responsiveness-reliability ratio is better for phoneme scoring than for word scoring (Figure 5, Panel c).
If one wants to combine both goals, we recommend the use of an ecologically valid sentence test with word scoring in combination with a CNC word test with phoneme scoring. The scoring of elements is recommended because it has the best test–retest variability. The combination of a CNC test and an ecologically valid sentence test allows the calculation of the
Limitations
This study had several limitations. First, the test–retest reliability was derived from Equation 5 and was not actually measured. However, the test–retest reliability may not only originate from variance due to the binomial distribution but may be also influenced by variability between sentence lists. List equivalency is only known for NH listeners, not for CI users. But since lists were randomized over participants and the number of sentences was relatively large (
Conclusions
CI users rely significantly more on contextual information in speech perception than normal-hearing listeners. This was true for both isolated words and sentences. The ability to use contextual information differs between CI recipients, and this ability is related to verbal working memory capacity regardless of age, indicating that postprocessing of the scarce sensory information is dependent on cognitive abilities. The Presence of contextual information in the speech of a test improves the responsiveness of the test to differences in sensory bottom-up information between conditions. Contextual information increases the risk of a ceiling effect in the speech test, at least for high-performing CI listeners, but this potential problem can be mitigated by adding noise to bring the scores back into the responsive range.
Footnotes
Author Contributions
J. G. D. designed the study, did the analyses, and drafted the manuscript. A. G. revised the manuscript. Both authors approved the final version of the manuscript for submission.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
