Suppr
超能文献

语音中语音识别的频段重要性。

Band importance for speech-in-speech recognition.

作者信息

Buss Emily, Bosen Adam

机构信息

Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA.

Center for Hearing Research, Boys Town National Research Hospital, Omaha, Nebraska 68131, USA

出版信息

JASA Express Lett. 2021 Aug;1(8):084402. doi: 10.1121/10.0005762. Epub 2021 Aug 2.

DOI:10.1121/10.0005762

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8499852/

Abstract

Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify stimulus features affecting auditory stream segregation. Here, band importance is estimated by quantifying the relationship between speech recognition accuracy for full-spectrum speech and the target-to-masker ratio by channel at the output of an auditory filterbank. Preliminary results provide support for this approach and indicate that frequencies below 2 kHz may contribute more to speech recognition in two-talker speech than in speech-shaped noise.

摘要

预测掩蔽语音感知通常依赖于对支持识别的线索频谱分布的估计。当前用于估计噪声中语音频段重要性的方法使用滤波后的刺激。这些方法不适用于语音中语音的情况，因为滤波会改变影响听觉流分离的刺激特征。在这里，通过量化听觉滤波器组输出端全频谱语音的语音识别准确率与通道处目标与掩蔽比之间的关系来估计频段重要性。初步结果支持了这种方法，并表明在双说话者语音中，低于2kHz的频率对语音识别的贡献可能比在语音形状噪声中更大。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3c0c/8499852/87b0f7f5a829/JELAAE-000001-084402_1-g001.jpg

相似文献

1

Band importance for speech-in-speech recognition.

JASA Express Lett. 2021 Aug;1(8):084402. doi: 10.1121/10.0005762. Epub 2021 Aug 2.

2

Differential benefits of unmasking extended high-frequency content of target or background speech.

J Acoust Soc Am. 2023 Jul 1;154(1):454-462. doi: 10.1121/10.0020175.

3

Does it take older adults longer than younger adults to perceptually segregate a speech target from a background masker?

Hear Res. 2012 Aug;290(1-2):55-63. doi: 10.1016/j.heares.2012.04.022. Epub 2012 May 16.

4

Estimates of basilar-membrane nonlinearity effects on masking of tones and speech.

Ear Hear. 2007 Feb;28(1):2-17. doi: 10.1097/AUD.0b013e3180310212.

5

Speech recognition in one- and two-talker maskers in school-age children and adults: Development of perceptual masking and glimpsing.

J Acoust Soc Am. 2017 Apr;141(4):2650. doi: 10.1121/1.4979936.

6

Development of Open-Set Word Recognition in Children: Speech-Shaped Noise and Two-Talker Speech Maskers.

Ear Hear. 2016 Jan-Feb;37(1):55-63. doi: 10.1097/AUD.0000000000000201.

7

Delayed Stream Segregation in Older Adults: More Than Just Informational Masking.

Ear Hear. 2015 Jul-Aug;36(4):482-4. doi: 10.1097/AUD.0000000000000139.

8

Masked Speech Perception Thresholds in Infants, Children, and Adults.

Ear Hear. 2016 May-Jun;37(3):345-53. doi: 10.1097/AUD.0000000000000270.

9

Masked Speech Recognition and Reading Ability in School-Age Children: Is There a Relationship?

J Speech Lang Hear Res. 2018 Mar 15;61(3):776-788. doi: 10.1044/2017_JSLHR-H-17-0279.

10

Speech recognition in noise: estimating effects of compressive nonlinearities in the basilar-membrane response.

Ear Hear. 2007 Sep;28(5):682-93. doi: 10.1097/AUD.0b013e31812f7156.

引用本文的文献

1

Impact of High- and Low-Pass Acoustic Filtering on Audiovisual Speech Redundancy and Benefit in Children.

Ear Hear. 2025;46(3):735-746. doi: 10.1097/AUD.0000000000001622. Epub 2025 Jan 31.

2

Band importance for speech-in-speech recognition in the presence of extended high-frequency cues.

J Acoust Soc Am. 2024 Aug 1;156(2):1202-1213. doi: 10.1121/10.0028269.

3

Effects of entropy in real-world noise on speech perception in listeners with normal hearing and hearing lossa).

J Acoust Soc Am. 2023 Dec 1;154(6):3627-3643. doi: 10.1121/10.0022577.

4

Spectral weighting for sentence recognition in steady-state and amplitude-modulated noise.

JASA Express Lett. 2023 May 1;3(5). doi: 10.1121/10.0017934.

5

On the use of the TIMIT, QuickSIN, NU-6, and other widely used bandlimited speech materials for speech perception experiments.

J Acoust Soc Am. 2022 Sep;152(3):1639. doi: 10.1121/10.0013993.

6

Maturation of Speech-in-Speech Recognition for Whispered and Voiced Speech.

J Speech Lang Hear Res. 2022 Aug 17;65(8):3117-3128. doi: 10.1044/2022_JSLHR-21-00620. Epub 2022 Jul 22.

本文引用的文献

1

A binaural model implementing an internal noise to predict the effect of hearing impairment on speech intelligibility in non-stationary noises.

J Acoust Soc Am. 2020 Nov;148(5):3305. doi: 10.1121/10.0002660.

2

Contribution of Stimulus Variability to Word Recognition in Noise Versus Two-Talker Speech for School-Age Children and Adults.

Ear Hear. 2021 Mar/Apr;42(2):313-322. doi: 10.1097/AUD.0000000000000951.

3

The importance of a broad bandwidth for understanding "glimpsed" speech.

J Acoust Soc Am. 2019 Nov;146(5):3215. doi: 10.1121/1.5131651.

4

The effect of target/masker fundamental frequency contour similarity on masked-speech recognition.

J Acoust Soc Am. 2019 Aug;146(2):1065. doi: 10.1121/1.5121314.

5

Development of a Test Battery for Evaluating Speech Perception in Complex Listening Environments: Effects of Sensorineural Hearing Loss.

Ear Hear. 2018 May/Jun;39(3):449-456. doi: 10.1097/AUD.0000000000000567.

6

Effectiveness of Two-Talker Maskers That Differ in Talker Congruity and Perceptual Similarity to the Target Speech.

Trends Hear. 2017 Jan-Dec;21:2331216517709385. doi: 10.1177/2331216517709385.

7

Speech recognition for multiple bands: Implications for the Speech Intelligibility Index.

J Acoust Soc Am. 2016 Sep;140(3):2019. doi: 10.1121/1.4962539.

8

Band importance functions of listeners with cochlear implants using clinical maps.

J Acoust Soc Am. 2016 Nov;140(5):3718. doi: 10.1121/1.4967298.

9

Methods and applications of the audibility index in hearing aid selection and fitting.

Trends Amplif. 2002 Sep;6(3):81-129. doi: 10.1177/108471380200600302.

10

Band importance for sentences and words reexamined.

J Acoust Soc Am. 2013 Jan;133(1):463-73. doi: 10.1121/1.4770246.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

文档翻译

学术文献翻译模型，支持多种主流文档格式。