Gender and speech material effects on the long-term average speech spectrum, including at extended high frequencies.

Affiliations

Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois 61820, USA.

Boys Town National Research Hospital, Center for Hearing Research, Omaha, Nebraska 68131, USA.

Publication information

J Acoust Soc Am. 2024 Nov 1;156(5):3056-3066. doi: 10.1121/10.0034231.

Abstract

Gender and language effects on the long-term average speech spectrum (LTASS) have been reported, but typically using recordings that were bandlimited and/or failed to accurately capture extended high frequencies (EHFs). Accurate characterization of the full-band LTASS is warranted given recent data on the contribution of EHFs to speech perception. The present study characterized the LTASS for high-fidelity, anechoic recordings of males and females producing Bamford-Kowal-Bench sentences, digits, and unscripted narratives. Gender had an effect on spectral levels at both ends of the spectrum: males had higher levels than females below approximately 160 Hz, owing to lower fundamental frequencies; females had ∼4 dB higher levels at EHFs, but this effect was dependent on speech material. Gender differences were also observed at ∼300 Hz, and between 800 and 1000 Hz, as previously reported. Despite differences in phonetic content, there were only small, gender-dependent differences in EHF levels across speech materials. EHF levels were highly correlated across materials, indicating relative consistency within talkers. Our findings suggest that LTASS levels at EHFs are influenced primarily by talker and gender, highlighting the need for future research to assess whether EHF cues are more audible for female speech than for male speech.
