On the use of the TIMIT, QuickSIN, NU-6, and other widely used bandlimited speech materials for speech perception experiments.

Affiliations

Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois 61820, USA.

Department of Otolaryngology/HNS, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27514, USA.

Publication information

J Acoust Soc Am. 2022 Sep;152(3):1639. doi: 10.1121/10.0013993.

Abstract

The use of spectrally degraded speech signals deprives listeners of acoustic information that is useful for speech perception. Several popular speech corpora, recorded decades ago, have spectral degradations, including limited extended high-frequency (EHF) (>8 kHz) content. Although frequency content above 8 kHz is often assumed to play little or no role in speech perception, recent research suggests that EHF content in speech can have a significant beneficial impact on speech perception under a wide range of natural listening conditions. This paper provides an analysis of the spectral content of popular speech corpora used for speech perception research to highlight the potential shortcomings of using bandlimited speech materials. Two corpora analyzed here, the TIMIT and NU-6, have substantial low-frequency spectral degradation (<500 Hz) in addition to EHF degradation. We provide an overview of the phenomena potentially missed by using bandlimited speech signals, and the factors to consider when selecting stimuli that are sensitive to these effects.
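As a rough illustration of the kind of spectral check the abstract describes, the sketch below estimates what fraction of a signal's energy lies above 8 kHz (the EHF range) and below 500 Hz. It is not the authors' analysis method. The function name `band_energy_fraction` and the two-tone test signal are assumptions made for this example, and a real check would load a corpus recording instead of synthesizing one.

```python
import numpy as np

def band_energy_fraction(x, fs, f_lo, f_hi):
    """Return the fraction of the total spectral energy of x in [f_lo, f_hi) Hz.

    Hypothetical helper for illustration; not taken from the paper.
    """
    power = np.abs(np.fft.rfft(x)) ** 2
    # The one-sided spectrum holds each interior bin once, so weight those by 2;
    # the DC bin (and the Nyquist bin for even-length input) appears only once.
    weights = np.full(power.shape, 2.0)
    weights[0] = 1.0
    if len(x) % 2 == 0:
        weights[-1] = 1.0
    power = power * weights
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    in_band = (freqs >= f_lo) & (freqs < f_hi)
    total = power.sum()
    return float(power[in_band].sum() / total) if total > 0 else 0.0

# Synthetic stand-in for a recording (not real speech):
# a 1 kHz tone plus a weaker 10 kHz tone, one second at 44.1 kHz.
fs = 44100
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 1000 * t) + 0.5 * np.sin(2 * np.pi * 10000 * t)

# Upper edge is set just past Nyquist so the Nyquist bin is included.
ehf_fraction = band_energy_fraction(x, fs, 8000, fs / 2 + 1)
low_fraction = band_energy_fraction(x, fs, 0, 500)
print(f"Energy above 8 kHz: {ehf_fraction:.3f}")
print(f"Energy below 500 Hz: {low_fraction:.3f}")
```

A recording whose energy fraction above 8 kHz is near zero, or whose sampling rate places the Nyquist frequency at or below 8 kHz, cannot convey EHF cues regardless of how it is presented to listeners.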
