• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个无回声、高保真、多方向语音语料库。

An Anechoic, High-Fidelity, Multidirectional Speech Corpus.

作者信息

Miller Margaret K, Delaram Vahid, Trine Allison, Ananthanarayana Rohit M, Buss Emily, Monson Brian B, Stecker G Christopher

机构信息

Center for Hearing Research, Boys Town National Research Hospital, Omaha, NE.

Department of Speech & Hearing Science, University of Illinois Urbana-Champaign.

出版信息

J Speech Lang Hear Res. 2025 Jan 2;68(1):411-418. doi: 10.1044/2024_JSLHR-24-00296. Epub 2024 Dec 2.

DOI:10.1044/2024_JSLHR-24-00296
PMID:39620949
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11842069/
Abstract

INTRODUCTION

We currently lack speech testing materials faithful to broader aspects of real-world auditory scenes such as speech directivity and extended high frequency (EHF; > 8 kHz) content that have demonstrable effects on speech perception. Here, we describe the development of a multidirectional, high-fidelity speech corpus using multichannel anechoic recordings that can be used for future studies of speech perception in complex environments by diverse listeners.

DESIGN

Fifteen male and 15 female talkers (21.3-60.5 years) recorded Bamford-Kowal-Bench (BKB) Standard Sentence Test lists, digits 0-10, and a 2.5-min unscripted narrative. Recordings were made in an anechoic chamber with 17 free-field condenser microphones spanning 0°-180° azimuth angle around the talker using a 48 kHz sampling rate.

RESULTS

Recordings resulted in a large corpus containing four BKB lists, 10 digits, and narratives produced by 30 talkers, and an additional 17 BKB lists (21 total) produced by a subset of six talkers.

CONCLUSIONS

The goal of this study was to create an anechoic, high-fidelity, multidirectional speech corpus using standard speech materials. More naturalistic narratives, useful for the creation of babble noise and speech maskers, were also recorded. A large group of 30 talkers permits testers to select speech materials based on talker characteristics relevant to a specific task. The resulting speech corpus allows for more diverse and precise speech recognition testing, including testing effects of speech directivity and EHF content. Recordings are publicly available.

摘要

引言

目前,我们缺乏忠实于现实世界听觉场景更广泛方面的言语测试材料,例如对言语感知有显著影响的言语指向性和扩展高频(EHF;>8kHz)内容。在此,我们描述了一种多向、高保真言语语料库的开发,该语料库使用多通道消声录音,可用于未来不同听众在复杂环境中进行言语感知研究。

设计

15名男性和15名女性说话者(年龄在21.3 - 60.5岁之间)录制了班福德 - 科瓦尔 - 本奇(BKB)标准句子测试列表、数字0 - 10以及一段2.5分钟的无脚本叙述。录音在消声室内进行,使用17个自由场电容式麦克风,以48kHz采样率围绕说话者在0° - 180°方位角范围内进行录制。

结果

录制得到了一个大型语料库,其中包含由30名说话者生成的四个BKB列表、10个数字和叙述内容,以及由六名说话者子集生成的另外17个BKB列表(共21个)。

结论

本研究的目标是使用标准言语材料创建一个消声、高保真、多向的言语语料库。还录制了更自然的叙述内容,可用于创建嘈杂声和言语掩蔽器。30名说话者的大群体使测试人员能够根据与特定任务相关的说话者特征选择言语材料。由此产生的言语语料库允许进行更多样化和精确的言语识别测试,包括测试言语指向性和EHF内容的影响。录音可公开获取。

相似文献

1
An Anechoic, High-Fidelity, Multidirectional Speech Corpus.一个无回声、高保真、多方向语音语料库。
J Speech Lang Hear Res. 2025 Jan 2;68(1):411-418. doi: 10.1044/2024_JSLHR-24-00296. Epub 2024 Dec 2.
2
Comparing the AzBio Sentence-in-Noise Test in English and Spanish in Bilingual Adults.比较双语成年人中英语和西班牙语的AzBio噪声环境下句子测试。
J Am Acad Audiol. 2025 Jan 1;36(1):2-10. doi: 10.3766/jaaa.230120. Epub 2025 Feb 11.
3
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Using Pupillometry in Virtual Reality as a Tool for Speech-in-Noise Research.在虚拟现实中使用瞳孔测量法作为噪声环境下语音研究的工具。
Ear Hear. 2025 Jul 2. doi: 10.1097/AUD.0000000000001692.
6
A systematic review of speech, language and communication interventions for children with Down syndrome from 0 to 6 years.对0至6岁唐氏综合征儿童言语、语言和沟通干预措施的系统评价。
Int J Lang Commun Disord. 2022 Mar;57(2):441-463. doi: 10.1111/1460-6984.12699. Epub 2022 Feb 22.
7
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
8
Music interventions for acquired brain injury.后天性脑损伤的音乐干预措施
Cochrane Database Syst Rev. 2017 Jan 20;1(1):CD006787. doi: 10.1002/14651858.CD006787.pub3.
9
Comparison of cellulose, modified cellulose and synthetic membranes in the haemodialysis of patients with end-stage renal disease.纤维素、改性纤维素和合成膜在终末期肾病患者血液透析中的比较。
Cochrane Database Syst Rev. 2001(3):CD003234. doi: 10.1002/14651858.CD003234.
10
Technological aids for the rehabilitation of memory and executive functioning in children and adolescents with acquired brain injury.脑损伤儿童和青少年记忆与执行功能康复的技术辅助手段。
Cochrane Database Syst Rev. 2016 Jul 1;7(7):CD011020. doi: 10.1002/14651858.CD011020.pub2.

本文引用的文献

1
Gender and speech material effects on the long-term average speech spectrum, including at extended high frequencies.性别和语音材料对长期平均语音频谱的影响,包括扩展高频。
J Acoust Soc Am. 2024 Nov 1;156(5):3056-3066. doi: 10.1121/10.0034231.
2
Predicting speech-in-speech recognition: Short-term audibility, talker sex, and listener factors.预测言语中的言语识别:短期可听度、说话人性别和听者因素。
J Acoust Soc Am. 2022 Nov;152(5):3010. doi: 10.1121/10.0015228.
3
On the use of the TIMIT, QuickSIN, NU-6, and other widely used bandlimited speech materials for speech perception experiments.关于在语音感知实验中使用 TIMIT、QuickSIN、NU-6 和其他广泛使用的带限语音材料。
J Acoust Soc Am. 2022 Sep;152(3):1639. doi: 10.1121/10.0013993.
4
Extended high-frequency audiometry in research and clinical practice.扩展高频测听在研究和临床实践中的应用。
J Acoust Soc Am. 2022 Mar;151(3):1944. doi: 10.1121/10.0009766.
5
Effect of Masker Head Orientation, Listener Age, and Extended High-Frequency Sensitivity on Speech Recognition in Spatially Separated Speech.掩蔽头方向、听众年龄和高频扩展灵敏度对空间分离语音中的语音识别的影响。
Ear Hear. 2022 Jan/Feb;43(1):90-100. doi: 10.1097/AUD.0000000000001081.
6
Extended high-frequency hearing and head orientation cues benefit children during speech-in-speech recognition.扩展高频听力和头部方向线索有助于儿童在语音干扰环境下进行语音识别。
Hear Res. 2021 Jul;406:108230. doi: 10.1016/j.heares.2021.108230. Epub 2021 Apr 8.
7
Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition.扩展高频段可提供频谱和时域信息,从而提高语音内语音识别性能。
Trends Hear. 2020 Jan-Dec;24:2331216520980299. doi: 10.1177/2331216520980299.
8
Extended high frequency hearing and speech perception implications in adults and children.成人和儿童扩展高频听力和言语感知的意义。
Hear Res. 2020 Nov;397:107922. doi: 10.1016/j.heares.2020.107922. Epub 2020 Feb 18.
9
Extended high-frequency hearing enhances speech perception in noise.扩展高频听力可增强噪声环境下的言语感知。
Proc Natl Acad Sci U S A. 2019 Nov 19;116(47):23753-23759. doi: 10.1073/pnas.1903315116. Epub 2019 Nov 4.
10
Ecological cocktail party listening reveals the utility of extended high-frequency hearing.生态鸡尾酒会听力揭示了扩展高频听力的实用性。
Hear Res. 2019 Sep 15;381:107773. doi: 10.1016/j.heares.2019.107773. Epub 2019 Aug 3.