Acoustic noise and vision differentially warp the auditory categorization of speech.

Affiliation

School of Communication Sciences & Disorders, University of Memphis, 4055 North Park Loop, Memphis, Tennessee 38152, USA.

Publication information

J Acoust Soc Am. 2019 Jul;146(1):60. doi: 10.1121/1.5114822.

Abstract

Speech perception requires grouping acoustic information into meaningful linguistic-phonetic units via categorical perception (CP). Beyond shrinking observers' perceptual space, CP might aid degraded speech perception if categories are more resistant to noise than surface acoustic features. Combining audiovisual (AV) cues also enhances speech recognition, particularly in noisy environments. This study investigated the degree to which visual cues from a talker (i.e., mouth movements) aid speech categorization amidst noise interference by measuring participants' identification of clear and noisy speech (0 dB signal-to-noise ratio) presented in auditory-only or combined AV modalities (i.e., A, A+noise, AV, AV+noise conditions). Auditory noise expectedly weakened (i.e., shallower identification slopes) and slowed speech categorization. Interestingly, additional viseme cues largely counteracted noise-related decrements in performance and stabilized classification speeds in both clear and noise conditions, suggesting more precise acoustic-phonetic representations with multisensory information. Results are parsimoniously described under a signal detection theory framework and by a reduction (visual cues) and increase (noise) in the precision of perceptual object representation, which were not due to lapses of attention or guessing. Collectively, findings show that (i) mapping sounds to categories aids speech perception in "cocktail party" environments; (ii) visual cues help lattice formation of auditory-phonetic categories to enhance and refine speech identification.
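
As an illustration of what a "shallower identification slope" means, the sketch below models identification along a speech continuum with a logistic function. It is not the authors' analysis: the 5-step continuum, the midpoint, the slope values, and the function name `identification_curve` are all hypothetical, chosen only to show how slope and a lapse (guessing) term act on the curve.

```python
import math

def identification_curve(step, midpoint, slope, lapse=0.0):
    """Probability of reporting the second category at a continuum step.

    Hypothetical logistic model: `slope` sets how sharp the category
    boundary is, and `lapse` is the proportion of trials answered by
    guessing, which pulls both ends of the curve toward 0.5.
    """
    core = 1.0 / (1.0 + math.exp(-slope * (step - midpoint)))
    return lapse / 2.0 + (1.0 - lapse) * core

# Assumed 5-step continuum with the category boundary at step 3.
steps = [1, 2, 3, 4, 5]
clear = [identification_curve(s, midpoint=3.0, slope=3.0) for s in steps]
noisy = [identification_curve(s, midpoint=3.0, slope=1.0) for s in steps]

for s, c, n in zip(steps, clear, noisy):
    print(f"step {s}: clear={c:.3f}  noisy={n:.3f}")
```

In this toy model the shallower curve stays closer to 0.5 at the continuum endpoints, which is the sense in which categorization is "weakened". A nonzero `lapse` compresses the asymptotes rather than the slope, which is one way a fit can separate imprecise representations from guessing.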
