• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于短语的语音识别测试的合成语音开发。

Development of a Phrase-Based Speech-Recognition Test Using Synthetic Speech.

机构信息

Institute of Hearing Technology and Audiology, Jade University of Applied Sciences, Oldenburg, Germany.

Medizinische Physik, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.

出版信息

Trends Hear. 2024 Jan-Dec;28:23312165241261490. doi: 10.1177/23312165241261490.

DOI:10.1177/23312165241261490
PMID:39051703
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11273571/
Abstract

Speech-recognition tests are widely used in both clinical and research audiology. The purpose of this study was the development of a novel speech-recognition test that combines concepts of different speech-recognition tests to reduce training effects and allows for a large set of speech material. The new test consists of four different words per trial in a meaningful construct with a fixed structure, the so-called phrases. Various free databases were used to select the words and to determine their frequency. Highly frequent nouns were grouped into thematic categories and combined with related adjectives and infinitives. After discarding inappropriate and unnatural combinations, and eliminating duplications of (sub-)phrases, a total number of 772 phrases remained. Subsequently, the phrases were synthesized using a text-to-speech system. The synthesis significantly reduces the effort compared to recordings with a real speaker. After excluding outliers, measured speech-recognition scores for the phrases with 31 normal-hearing participants at fixed signal-to-noise ratios (SNR) revealed speech-recognition thresholds (SRT) for each phrase varying up to 4 dB. The median SRT was -9.1 dB SNR and thus comparable to existing sentence tests. The psychometric function's slope of 15 percentage points per dB is also comparable and enables efficient use in audiology. Summarizing, the principle of creating speech material in a modular system has many potential applications.

摘要

语音识别测试在临床和研究听力学中都得到了广泛应用。本研究的目的是开发一种新的语音识别测试,该测试结合了不同语音识别测试的概念,以减少训练效应,并允许使用大量的语音材料。新测试由每个试验中的四个不同单词组成,这些单词具有固定结构的有意义的构词,即所谓的短语。各种免费数据库被用于选择单词并确定它们的频率。高频率名词被分为主题类别,并与相关形容词和不定式结合使用。在剔除不适当和不自然的组合,以及消除(子)短语的重复之后,剩下了总共 772 个短语。随后,使用文本到语音系统对这些短语进行合成。与使用真实说话者录制相比,这种合成方法大大减少了工作量。在排除离群值后,在固定信噪比(SNR)下,31 名正常听力参与者对短语进行的语音识别测试,得到了每个短语的语音识别阈值(SRT),范围在 4dB 以内。中位数 SRT 为-9.1dB SNR,与现有的句子测试相当。15 个百分点每分贝的心理物理函数斜率也相当,可在听力学中有效使用。综上所述,在模块化系统中创建语音材料的原理具有许多潜在的应用。

相似文献

1
Development of a Phrase-Based Speech-Recognition Test Using Synthetic Speech.基于短语的语音识别测试的合成语音开发。
Trends Hear. 2024 Jan-Dec;28:23312165241261490. doi: 10.1177/23312165241261490.
2
Automated Measurement of Speech Recognition, Reaction Time, and Speech Rate and Their Relation to Self-Reported Listening Effort for Normal-Hearing and Hearing-Impaired Listeners Using various Maskers.使用不同掩蔽噪声对正常听力和听力障碍者的言语识别率、反应时、言语率的自动测量及其与自我报告的听力努力度的关系。
Trends Hear. 2024 Jan-Dec;28:23312165241276435. doi: 10.1177/23312165241276435.
3
No association between idiopathic hidden hearing loss and behavioral adaptation to noise in humans.特发性隐匿性听力损失与人类对噪声的行为适应性之间无关联。
Hear Res. 2025 Aug;464:109321. doi: 10.1016/j.heares.2025.109321. Epub 2025 May 24.
4
Reference Speech-recognition curves for a German monosyllabic test in noise: effects of loudspeaker configuration and room acoustics.德语单音节噪声测试的参考语音识别曲线:扬声器配置和室内声学的影响。
Int J Audiol. 2025 Jul;64(7):695-704. doi: 10.1080/14992027.2024.2401519. Epub 2024 Nov 7.
5
Comparing the AzBio Sentence-in-Noise Test in English and Spanish in Bilingual Adults.比较双语成年人中英语和西班牙语的AzBio噪声环境下句子测试。
J Am Acad Audiol. 2025 Jan 1;36(1):2-10. doi: 10.3766/jaaa.230120. Epub 2025 Feb 11.
6
Adaptation to Noise in Spectrotemporal Modulation Detection and Word Recognition.声谱时变调制检测和单词识别中的噪声适应。
Trends Hear. 2024 Jan-Dec;28:23312165241266322. doi: 10.1177/23312165241266322.
7
On the Feasibility of Using Behavioral Listening Effort Test Methods to Evaluate Auditory Performance in Cochlear Implant Users.关于使用行为性听力努力测试方法评估人工耳蜗使用者听觉表现的可行性
Trends Hear. 2024 Jan-Dec;28:23312165241240572. doi: 10.1177/23312165241240572.
8
Evaluation of Speaker-Conditioned Target Speaker Extraction Algorithms for Hearing-Impaired Listeners.针对听力受损听众的说话者条件目标说话者提取算法评估
Trends Hear. 2025 Jan-Dec;29:23312165251365802. doi: 10.1177/23312165251365802. Epub 2025 Aug 11.
9
Speech Recognition and Spatial Hearing in Young Adults With Down Syndrome: Relationships With Hearing Thresholds and Auditory Working Memory.唐氏综合征青年的语音识别与空间听觉:与听力阈值和听觉工作记忆的关系。
Ear Hear. 2024;45(6):1568-1584. doi: 10.1097/AUD.0000000000001549. Epub 2024 Aug 2.
10
A Quantitative Protocol for Calibrating Short Speech Signals (Monosyllabic Words) Based on the 50-ms Segment of the Voiced Phoneme(s) with the Maximum Root-Mean-Square Amplitude.一种基于具有最大均方根振幅的浊音音素50毫秒片段校准短语音信号(单音节词)的定量协议。
J Am Acad Audiol. 2025 Mar 1;36(2):68-94. doi: 10.3766/jaaa.21126. Epub 2025 Mar 14.

本文引用的文献

1
Speech Recognition and Listening Effort of Meaningful Sentences Using Synthetic Speech.使用合成语音识别和理解有意义的句子的听力努力。
Trends Hear. 2022 Jan-Dec;26:23312165221130656. doi: 10.1177/23312165221130656.
2
Is speech intelligibility what speech intelligibility tests test?言语可懂度测试测试的是言语可懂度吗?
J Acoust Soc Am. 2022 Sep;152(3):1573. doi: 10.1121/10.0013896.
3
Measuring Speech Recognition With a Matrix Test Using Synthetic Speech.使用合成语音的矩阵测试测量语音识别。
Trends Hear. 2019 Jan-Dec;23:2331216519862982. doi: 10.1177/2331216519862982.
4
Impact of Lexical Parameters and Audibility on the Recognition of the Freiburg Monosyllabic Speech Test.词汇参数和可听度对 Freiburg 单音节语音测试识别的影响。
Ear Hear. 2020 Jan/Feb;41(1):136-142. doi: 10.1097/AUD.0000000000000737.
5
Processing Mechanisms in Hearing-Impaired Listeners: Evidence from Reaction Times and Sentence Interpretation.听力受损听众的处理机制:来自反应时间和句子理解的证据。
Ear Hear. 2016 Nov/Dec;37(6):e391-e401. doi: 10.1097/AUD.0000000000000339.
6
[Phonemic balance of the Freiburg monosyllabic speech test].[弗莱堡单音节言语测试的音素平衡]
HNO. 2016 Aug;64(8):557-63. doi: 10.1007/s00106-016-0185-z.
7
Talker- and language-specific effects on speech intelligibility in noise assessed with bilingual talkers: Which language is more robust against noise and reverberation?使用双语者评估噪声环境下特定说话者和特定语言对言语可懂度的影响:哪种语言对噪声和混响更具抗性?
Int J Audiol. 2015;54 Suppl 2:23-34. doi: 10.3109/14992027.2015.1088174. Epub 2015 Oct 21.
8
The multilingual matrix test: Principles, applications, and comparison across languages: A review.多语言矩阵测试:原理、应用及跨语言比较:综述
Int J Audiol. 2015;54 Suppl 2:3-16. doi: 10.3109/14992027.2015.1020971. Epub 2015 Sep 18.
9
Development and evaluation of a linguistically and audiologically controlled sentence intelligibility test.言语可懂度测试的语言学和听力学控制的开发和评估。
J Acoust Soc Am. 2013 Oct;134(4):3039-56. doi: 10.1121/1.4818760.
10
How does linguistic complexity influence intelligibility in a German audiometric sentence intelligibility test?语言复杂性如何影响德语测听句可懂度测试中的可懂度?
Int J Audiol. 2011 Sep;50(9):621-31. doi: 10.3109/14992027.2011.582166. Epub 2011 Jun 30.