Abstract
Localizing a sound source requires the auditory system to determine its direction and its distance. In general, hearing-impaired listeners do less well in experiments measuring localization performance than normal-hearing listeners, and hearing aids often exacerbate matters. This article summarizes the major experimental effects in direction (and its underlying cues of interaural time differences and interaural level differences) and distance for normal-hearing, hearing-impaired, and aided listeners. Front/back errors and the importance of self-motion are noted. The influence of vision on the localization of real-world sounds is emphasized, such as through the ventriloquist effect or the intriguing link between spatial hearing and visual attention.
Introduction: The Auditory Cues to Direction
In many simple listening circumstances, a normal-hearing person’s percept of the position of a sound source is reasonably veridical: the perceived direction corresponds closely to the actual direction, and the perceived distance is at least passably accurate. For hearing-impaired listeners, however, their performance in experimental spatial-hearing tasks is usually worse, and often hearing aids exacerbate the decrements in performance. There are situations in which even normal-hearing percepts become noticeably inaccurate (e.g., in large amounts of reverberation or at adverse signal-to-noise ratios [SNRs]), entirely wrong (e.g., perceiving a source to be behind when it was really in front), or overridden by vision (e.g., ventriloquism), and in many of these domains there is experimental evidence that hearing-impaired listeners perform even less well. This article outlines some of the major experimental results in these areas, concentrating on the fundamental physics and psychophysics of localization, the effects of hearing impairment and hearing aids, and the importance of vision in the real world. The detailed theoretical reasons for the effects will be saved for a future work. 1
The fundamental auditory cues to direction arise because the ears are on either side of the head (e.g., Blauert, 1997; Moore, 2012; Warren, 1999). The sound from a source on the right side of the head will arrive at the left eardrum after it arrives at the right eardrum, because the left ear is further away, and it will also be lower in level at the left than at the right because the head is a solid object and so casts an acoustic shadow. These differences are termed, respectively, interaural time differences (ITDs) and interaural level differences (ILDs). The auditory system likely uses some form of mapping to decode ITDs (though quite how is unresolved: e.g., Carr & Macleod, 2010; Harper & McAlpine, 2004), whereas the analysis of ILDs may be as a simple as a comparison of the sound levels at the two ears (Hartmann & Constan, 2002). But it is important to remember that though ITDs and ILDs are usually described separately, and often manipulated separately in experiments, the sound from any real source must have
To a first approximation, the relationship between direction and ITD can be found from a simple geometrical calculation of the additional distance to the far ear divided by the speed of sound (Blauert 1997; Moore, 2012; Woodworth, 1938). If the head is assumed to be spherical in shape, and the source of sound is sufficiently far away for the wavefronts to be planes, then the additional distance is given by
The magnitudes of ILDs also vary with direction and frequency. They are generally larger at higher frequencies and are mostly larger at larger azimuths. However, unlike ITDs, there are sharp dips in ILD at some frequencies but strong peaks at others, and an ILD found for one direction may bear little resemblance to the ILD found for a neighboring direction (e.g., Shaw & Vaillancourt, 1985). The dips and peaks occur because of diffraction and reflection of the incoming sound with the torso, head, and pinnae and can be of the order of 20 dB. These are crucial for differentiating up versus down and front versus back (e.g., Zhang & Hartmann, 2010). There is no simple formulae that describe how they vary with frequency and direction, though they can be computed from boundary-element models (e.g., Kreuzer, Majdak, & Chen, 2009) or calculated analytically if simplifying assumptions are made, such as the head being an exact sphere (e.g., Macaulay, Hartmann, and Rakerd, 2010; Rayleigh, 1894/1945).
Many experiments have measured the magnitudes of the just-noticeable differences (JNDs) that listeners can detect in ITD, ILD, or actual direction, while varying the frequency, duration, overall intensity, waveform, onset, offset, masker type, signal-to-masker ratio, and so forth of the stimuli (e.g., Durlach & Colburn, 1978). A few important results are considered here. For normal-hearing listeners, the JND for ILD is of the order of 1 dB (e.g., Grantham, 1984; Hartmann & Constan, 2002; Yost & Dye, 1988). There is little frequency dependence to the JND, except a slight worsening around 1 kHz by no more than a quarter of a decibel. In contrast, the data for the JND for ITD show an extreme frequency dependence. For pure-tone stimuli, the JND is about 60 μs for a frequency of 250 Hz, 10 μs for 1000 Hz, 20 μs for 1250 Hz, but then essentially becomes unmeasurably large for frequencies above about 1500 Hz (e.g., Brughera, Dunai, & Hartmann, 2013; Klump & Eady, 1957). The rate of change of ITD JND with frequency around 1500 Hz is perhaps one of the steepest functions in all of auditory psychophysics. Nevertheless, it is not the case that ITDs are impossible to detect at high frequencies: if complex stimuli are used in which the ITDs are carried by modulations in the temporal envelope, then JNDs can still be measured (e.g., Bernstein & Trahiotis, 2010; Henning, 1980).
The JND for actual direction (known as the minimum audible angle, MAA) is, at best, about 1°. This is found for pure-tone stimuli at around 750 Hz, using sound sources directly ahead and for changes in direction limited to the horizontal plane (e.g., Mills, 1958). It reaches a maximum (about 3°) at frequencies around 2000 Hz before reducing again for frequencies up to about 8 kHz. The MAAs are much higher for changes in azimuth for sound sources located to the side (Mills, 1958) or for MAAs for changes in elevation for any direction (e.g., Grantham, Hornsby, & Erpenbeck, 2003).
The standard explanation of the midfrequency maximum is that people use ITDs to locate sound sources at lower frequencies but not at higher frequencies and ILDs at higher frequencies but not at lower. The theory is known as the
The Effects of Hearing Impairment and Hearing Aids
In general, hearing-impaired listeners perform worse in spatial-hearing experiments than those with normal hearing. For example, Hausler, Colburn, and Marr (1983) measured minimal audible angles for white-noise stimuli as part of a comprehensive set of experimental tests on spatial hearing. For presentation from the side, they found that the
Hearing aids do not improve the localization of sound sources: indeed, in many cases, they interfere. A few examples follow; all found larger errors in horizontal localization for aided than unaided listening. First, Drennan, Gatehouse, Howell, van Tasell, and Lund (2005) compared the accuracy in localization for aided versus unaided listening, using single words in a speech-shaped noise at a SNR of 0 dB. Despite 10 to 15 weeks of acclimatization to the hearing aids, the localization errors when the listeners (
A domain in which hearing aids can cause particular problems is in distinguishing sound sources in front from those behind. A confusion between whether a source is ahead or behind occasionally happens to normal-hearing listeners—for example, people often comment how hard it is to locate the emergency siren of an ambulance or fire engine. The reason is because the head and ears are fairly front/back symmetric. Indeed, Woodworth’s
Laboratory experiments on aided listeners indicate high numbers of front/back errors. Best et al. (2010), in the same experiment described earlier, found that in normal-hearing control listeners, a front-to-back (or a back-to-front) error was made in about 5% of the trials, whereas it was about 12% in unaided, impaired listeners and between 25% and 45% in aided, impaired listeners. A second example is from Vaillancourt, Laroche, Giguere, Beaulieu, and Legault (2011). They found the proportion of front/back errors (for a short broadband noise stimulus) for a large group (
Outside the laboratory, however, it is uncertain whether front/back errors are as prevalent as those experiments would seem to indicate. The reason is because people are continually moving their heads, even if just fidgeting, and head movements can resolve front/back errors (Brimijoin & Akeroyd, 2012; Wallach, 1940). Whenever someone rotates their head by any amount (e.g., 10° to the left), then from
The importance of motion for resolving front from back can be demonstrated by using motion to invoke front/back errors (Brimijoin & Akeroyd, 2012). This can be done by playing a sound from a loudspeaker that is physically behind someone, who then moves their head, say to the left by
Distance, Vision, and the Real World
The phenomena described earlier have been concerned either with the perception of direction or with the underpinning ITD and ILD cues. But in the real world, there is the third dimension of distance, which is crucial to giving the auditory perception of a scene a naturalness. Distance is inherently linked to level: the closer a source is, then the more intense it is (e.g., Blauert, 1997; Zahorik, Brungart, & Bronkhorst, 2005). Normal-hearing listeners do not perceive distance linearly, in that the psychophysical function relating physical distance to perceived distance is compressive: distant sources are reported as closer than they physically are, but close sources are reported as slightly further away than they physically are (Zahorik, 2002). But hearing-impaired listeners may perceive a substantially contracted world, with external sources perceived closer than in reality (Boyd, Whitmer, Soraghan, & Akeroyd, 2012). The JND for changes in distance is distance dependent and in some conditions is affected by hearing loss (Akeroyd, Gatehouse, & Blaschke, 2007). Given that distances are cued by levels, and hearing-aid level compressors change levels, one would expect hearing-aid compressors to affect distance perception, but in a specific test of distance JNDs in aided listening, we did not find any effect of compression ratio (Akeroyd, 2010).
There is another aspect of real-world listening that is generally excluded from many experiments, namely vision: one can often see any sources of sound. There are various reasons for believing that spatial hearing is inherently linked to vision. One is based on the smallness of the MAA, which for pure-tone stimuli in quiet presented from in front can be as little as 1°. This corresponds to about the width of the index fingernail held at arm’s length (O’Shea, 1991). Though it is always possible that the evolutionary pressure for such good directional hearing may be a need to accurately locate sounds in quiet, it is perhaps more likely that it is a corollary (or a
A second demonstration of the importance of vision for spatial hearing comes from the ventriloquist effect. The purpose of the act of a good ventriloquist is to make the audience think that the sound comes from the mouth of the ventriloquist’s dummy, not from its actual source, the mouth of the ventriloquist. There are many laboratory experiments quantifying its strength in terms of localization (e.g., Alais & Burr, 2004; Bertelson, 1999; Bertelson, Vroomen, de Gelder, & Driver, 2000; Jackson, 1953), though it would be of particular interest to know how much it influenced perception of sound sources in everyday life (it would also be of interest to know its strength in hearing impairment or aided listening, as there are few, if any, ventriloquist experiments which have used such listeners). To take just one example, imagine the situation of someone sat at the back of a large hall, listening to a lecturer in front but with a ceiling-mounted public-address system providing the sound heard, so the physical direction of the sound is overhead. It is arguable that the listener might not notice that the acoustic direction is quite different to the visual direction, and even if it is noticed, it may be tolerated then quickly ignored. Does this reflect some form of ventriloquist processing?
Taken together, these effects indicate that auditory perception of spatial position is not solely based on the sounds arriving at the ears, instead vision plays at least some role. Thus, to understand how hearing-impaired listeners perceive the location of sound sources in their everyday, real-life listening, we need know more about the localization of sounds when, like in real life, they can see a potential source. We also need experiments to study how tolerable are mislocations, and, vice versa, when mislocations are annoying. If the importance of localization lies in locating (in both direction and distance) sound sources in noise, dealing with motion, both of sources and of one’s self, and in guiding vision, then these are the domains to ask scientific questions in, to run experiments in, and in which to judge the success of hearing aids. The challenge is then to ask new science questions about how people deal with localization—and mismatched localization—in real, visual environments.
Footnotes
Declaration of Conflicting Interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Medical Research Council (grant number U135097131) and by the Chief Scientist Office of the Scottish Government.
Acknowledgments
This article is based on the text of the third Stuart Gatehouse Lecture, given at the August 2012 International Conference on Hearing Aids (IHCON), Lake Tahoe, California. The author thanks everyone at the Scottish Section for their contributions to the work.
