Now You Hear Me, Later You Don’t: The Immediacy of Linguistic Computation and the Representation of Speech

Abstract

What happens to an acoustic signal after it enters the mind of a listener? Previous work has demonstrated that listeners maintain intermediate representations over time. However, the internal structure of such representations—be they the acoustic-phonetic signal or more general information about the probability of possible categories—remains underspecified. We present two experiments using a novel speaker-adaptation paradigm aimed at uncovering the format of speech representations. We exposed adult listeners (N = 297) to a speaker whose utterances contained acoustically ambiguous information concerning phones (and thus words), and we manipulated the temporal availability of disambiguating cues via visually presented text (presented before or after each utterance). Results from a traditional phoneme-categorization task showed that listeners adapted to a modified acoustic distribution when disambiguating text was provided before but not after the audio. These results support the position that speech representations consist of activation over categories and are inconsistent with direct maintenance of the acoustic-phonetic signal.

Keywords

language, speech processing, immediacy of computation, mental representation, acoustic maintenance, open data, open materials, preregistered
