还在等什么？摩擦音线索的实时整合表明了封装的听觉记忆。

What Are You Waiting For? Real-Time Integration of Cues for Fricatives Suggests Encapsulated Auditory Memory.

机构信息

Department of Psychological and Brain Sciences, University of Iowa.

Interdisciplinary Program in Neuroscience, University of Iowa.

出版信息

Cogn Sci. 2019 Jan;43(1). doi: 10.1111/cogs.12700.

DOI:10.1111/cogs.12700

PMID:30648798

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6338078/

Abstract

Speech unfolds over time, and the cues for even a single phoneme are rarely available simultaneously. Consequently, to recognize a single phoneme, listeners must integrate material over several hundred milliseconds. Prior work contrasts two accounts: (a) a memory buffer account in which listeners accumulate auditory information in memory and only access higher level representations (i.e., lexical representations) when sufficient information has arrived; and (b) an immediate integration scheme in which lexical representations can be partially activated on the basis of early cues and then updated when more information arises. These studies have uniformly shown evidence for immediate integration for a variety of phonetic distinctions. We attempted to extend this to fricatives, a class of speech sounds which requires not only temporal integration of asynchronous cues (the frication, followed by the formant transitions 150-350 ms later), but also integration across different frequency bands and compensation for contextual factors like coarticulation. Eye movements in the visual world paradigm showed clear evidence for a memory buffer. Results were replicated in five experiments, ruling out methodological factors and tying the release of the buffer to the onset of the vowel. These findings support a general auditory account for speech by suggesting that the acoustic nature of particular speech sounds may have large effects on how they are processed. It also has major implications for theories of auditory and speech perception by raising the possibility of an encapsulated memory buffer in early auditory processing.

摘要

言语是随着时间展开的，即使是单个音素的线索也很少同时出现。因此，为了识别单个音素，听众必须在几百毫秒的时间内整合信息。先前的研究对比了两种解释：（a）记忆缓冲区解释，即听众在记忆中积累听觉信息，只有在接收到足够的信息后才会访问更高层次的表示（即词汇表示）；（b）即时整合方案，即词汇表示可以基于早期线索部分激活，然后在出现更多信息时进行更新。这些研究一致表明，即时整合适用于各种语音区别。我们试图将其扩展到摩擦音，这是一类需要不仅对异步线索（摩擦音，然后是 150-350 毫秒后出现的共振峰过渡）进行时间整合，还需要在不同的频带之间进行整合，并对协同发音等上下文因素进行补偿的语音。视觉世界范式中的眼动研究清楚地表明了记忆缓冲区的存在。结果在五个实验中得到了复制，排除了方法因素的影响，并将缓冲区的释放与元音的开始联系起来。这些发现通过提出早期听觉处理中可能存在封闭的记忆缓冲区的可能性，为言语的一般听觉解释提供了支持，这表明特定语音的声学性质可能对其处理方式有很大影响。它还对听觉和言语感知理论产生了重大影响，因为它提出了早期听觉处理中可能存在封闭的记忆缓冲区的可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8b07/6338078/4492e26bd806/nihms-995502-f0001.jpg

相似文献

What Are You Waiting For? Real-Time Integration of Cues for Fricatives Suggests Encapsulated Auditory Memory.还在等什么？摩擦音线索的实时整合表明了封装的听觉记忆。

Cogn Sci. 2019 Jan;43(1). doi: 10.1111/cogs.12700.

Listeners can anticipate future segments before they identify the current one.听众在识别当前片段之前就能预测未来的片段。

Atten Percept Psychophys. 2019 May;81(4):1147-1166. doi: 10.3758/s13414-019-01712-9.

Tracking the time course of phonetic cue integration during spoken word recognition.追踪口语单词识别过程中语音线索整合的时间进程。

Psychon Bull Rev. 2008 Dec;15(6):1064-71. doi: 10.3758/PBR.15.6.1064.

Assessment of Spectral and Temporal Resolution in Cochlear Implant Users Using Psychoacoustic Discrimination and Speech Cue Categorization.使用心理声学辨别和语音线索分类评估人工耳蜗使用者的频谱和时间分辨率

Ear Hear. 2016 Nov/Dec;37(6):e377-e390. doi: 10.1097/AUD.0000000000000328.

Gradient and categorical patterns of spoken-word recognition and processing of phonetic details.口语单词识别及语音细节处理的梯度和类别模式

Atten Percept Psychophys. 2019 Jul;81(5):1654-1672. doi: 10.3758/s13414-019-01693-9.

Integration efficiency for speech perception within and across sensory modalities by normal-hearing and hearing-impaired individuals.正常听力和听力受损个体在感觉模态内及跨感觉模态的语音感知整合效率。

J Acoust Soc Am. 2007 Feb;121(2):1164-76. doi: 10.1121/1.2405859.

Unified Coding of Spectral and Temporal Phonetic Cues: Electrophysiological Evidence for Abstract Phonological Features.统一编码的光谱和时间语音线索：抽象语音特征的电生理证据。

J Cogn Neurosci. 2022 Mar 5;34(4):618-638. doi: 10.1162/jocn_a_01817.

Perceptual integration of acoustic cues to laryngeal contrasts in Korean fricatives.韩语擦音中喉部对比的声学线索的感知整合

J Acoust Soc Am. 2016 Feb;139(2):605-11. doi: 10.1121/1.4926435.

Individual Differences in Categorization Gradience As Predicted by Online Processing of Phonetic Cues During Spoken Word Recognition: Evidence From Eye Movements.个体在类别渐变中的差异，如在口语识别过程中对语音线索的在线处理所预测的那样：来自眼动的证据。

Cogn Sci. 2021 Mar;45(3):e12948. doi: 10.1111/cogs.12948.

Labeling of /s/ and [see text] by listeners with normal and impaired hearing, revisited.听力正常和听力受损的听众对/s/及[见文本]的标注，再探讨。

J Speech Lang Hear Res. 2003 Jun;46(3):636-48. doi: 10.1044/1092-4388(2003/050).

引用本文的文献

Decoupling speech processing from time.将语音处理与时间解耦。

Trends Cogn Sci. 2025 Jun 25. doi: 10.1016/j.tics.2025.05.017.

The consistency of categorization-consistency in speech perception.分类的一致性——言语感知中的一致性。

Psychon Bull Rev. 2025 Apr 24. doi: 10.3758/s13423-025-02700-x.

Linguistic diversity shapes flexible speech perception in school age children.语言多样性塑造了学龄儿童灵活的言语感知能力。

Sci Rep. 2024 Nov 21;14(1):28825. doi: 10.1038/s41598-024-80430-1.

Temporal dynamics of coarticulatory cues to prediction.预测协同发音线索的时间动态。

Front Psychol. 2024 Sep 9;15:1446240. doi: 10.3389/fpsyg.2024.1446240. eCollection 2024.

Neural evidence suggests phonological acceptability judgments reflect similarity, not constraint evaluation.神经证据表明，语音可接受性判断反映的是相似性，而不是约束评估。

Cognition. 2023 Jan;230:105322. doi: 10.1016/j.cognition.2022.105322. Epub 2022 Nov 10.

I'm not sure that curve means what you think it means: Toward a [more] realistic understanding of the role of eye-movement generation in the Visual World Paradigm.我不确定那条曲线的意思是你所想的那样：朝向对眼动产生在视窗范式中的作用的[更]现实的理解。

Psychon Bull Rev. 2023 Feb;30(1):102-146. doi: 10.3758/s13423-022-02143-8. Epub 2022 Aug 12.

Decoding the temporal dynamics of spoken word and nonword processing from EEG.从 EEG 解码口语和非口语处理的时间动态。

Neuroimage. 2022 Oct 15;260:119457. doi: 10.1016/j.neuroimage.2022.119457. Epub 2022 Jul 14.

The development of lexical competition in written- and spoken-word recognition.词汇竞争在书面和口头词识别中的发展。

Q J Exp Psychol (Hove). 2023 Jan;76(1):196-219. doi: 10.1177/17470218221090483. Epub 2022 Apr 27.

Gradient activation of speech categories facilitates listeners' recovery from lexical garden paths, but not perception of speech-in-noise.言语范畴的渐变激活有助于听者从词汇歧途中恢复，但无助于感知语音噪声。

J Exp Psychol Hum Percept Perform. 2021 Apr;47(4):578-595. doi: 10.1037/xhp0000900.

Listeners can anticipate future segments before they identify the current one.听众在识别当前片段之前就能预测未来的片段。

Atten Percept Psychophys. 2019 May;81(4):1147-1166. doi: 10.3758/s13414-019-01712-9.

本文引用的文献

Discrimination and streaming of speech sounds based on differences in interaural and spectral cues.基于双耳间和频谱线索差异对语音进行辨别和分流。

J Acoust Soc Am. 2017 Sep;142(3):1674. doi: 10.1121/1.5003809.

Waiting for lexical access: Cochlear implants or severely degraded input lead listeners to process speech less incrementally.等待词汇通达：人工耳蜗或严重退化的输入会导致听众对言语的处理不再是逐步进行的。

Cognition. 2017 Dec;169:147-164. doi: 10.1016/j.cognition.2017.08.013. Epub 2017 Sep 14.

Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency.基于基频的有声和无声语音的顺序流分离。

Hear Res. 2017 Feb;344:235-243. doi: 10.1016/j.heares.2016.11.016. Epub 2016 Dec 5.

Eye movement evidence for an immediate Ganong effect.眼动证据表明存在即时加农效应。

J Exp Psychol Hum Percept Perform. 2016 Dec;42(12):1969-1988. doi: 10.1037/xhp0000269. Epub 2016 Aug 15.

Some people are "More Lexical" than others.有些人比其他人更“词汇化”。

Cognition. 2016 Jun;151:68-75. doi: 10.1016/j.cognition.2016.03.008. Epub 2016 Mar 14.

The Effect of Residual Acoustic Hearing and Adaptation to Uncertainty on Speech Perception in Cochlear Implant Users: Evidence From Eye-Tracking.残余听觉和对不确定性的适应对人工耳蜗使用者言语感知的影响：来自眼动追踪的证据。

Ear Hear. 2016 Jan-Feb;37(1):e37-51. doi: 10.1097/AUD.0000000000000207.

The time-course of speaking rate compensation: Effects of sentential rate and vowel length on voicing judgments.语速补偿的时间进程：句子语速和元音长度对语音判断的影响。

Lang Cogn Neurosci. 2015;30(5):529-543. doi: 10.1080/23273798.2014.946427.

The verbal transformation effect and the perceptual organization of speech: influence of formant transitions and F0-contour continuity.言语转换效应与言语的知觉组织：共振峰过渡和基频轮廓连续性的影响

Hear Res. 2015 May;323:22-31. doi: 10.1016/j.heares.2015.01.007. Epub 2015 Jan 22.

Vowels as Islands of Reliability.元音是可靠性的孤岛。

J Mem Lang. 1987 Oct;26(5):564-573. doi: 10.1016/0749-596X(87)90143-4.

Tracking perception of the sounds of English.追踪对英语语音的感知。

J Acoust Soc Am. 2014 May;135(5):2995-3006. doi: 10.1121/1.4870486.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

还在等什么？摩擦音线索的实时整合表明了封装的听觉记忆。

What Are You Waiting For? Real-Time Integration of Cues for Fricatives Suggests Encapsulated Auditory Memory.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献