• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用合成语音的矩阵测试测量语音识别。

Measuring Speech Recognition With a Matrix Test Using Synthetic Speech.

机构信息

1 Institute of Hearing Technology and Audiology, Jade University of Applied Sciences, Oldenburg, Germany.

2 Cluster of Excellence "Hearing4All", Oldenburg, Germany.

出版信息

Trends Hear. 2019 Jan-Dec;23:2331216519862982. doi: 10.1177/2331216519862982.

DOI:10.1177/2331216519862982
PMID:31322032
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6643172/
Abstract

Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could be reduced by using synthetic speech in speech audiometry, especially in a matrix test for speech recognition. For this purpose, the speech material of the German matrix test was replicated using a preselected commercial system to generate the synthetic speech files. In contrast to the conventional matrix test, no level adjustments or optimization tests were performed while producing the synthetic speech material. Evaluation measurements were conducted by presenting both versions of the German matrix test (with natural or synthetic speech), alternately and at three different signal-to-noise ratios, to 48 young, normal-hearing participants. Psychometric functions were fitted to the empirical data. Speech recognition thresholds were 0.5 dB signal-to-noise ratio higher (worse) for the synthetic speech, while slopes were equal for both speech types. Nevertheless, speech recognition scores were comparable with the literature and the threshold difference lay within the same range as recordings of two different natural speakers. Although no optimization was applied, the synthetic-speech signals led to equivalent recognition of the different test lists and word categories. The outcomes of this study indicate that the application of synthetic speech in speech recognition tests could considerably reduce the development costs and evaluation time. This offers the opportunity to increase the speech corpus for speech recognition tests with acceptable effort.

摘要

言语测听是听力学诊断和临床测量的重要组成部分。言语识别测试的开发时间相当长,具体取决于言语语料库的大小和优化的必要性。本研究的目的是检验在言语测听中使用合成语音是否可以减少这种开发工作,特别是在言语识别的矩阵测试中。为此,使用预选的商业系统复制了德语矩阵测试的语音材料,以生成合成语音文件。与传统的矩阵测试不同,在生成合成语音材料时,没有进行电平调整或优化测试。通过以三种不同的信噪比交替呈现自然语音和合成语音的两种版本的德语矩阵测试,对 48 名年轻、正常听力的参与者进行了评估测量。将心理测量函数拟合到经验数据上。对于合成语音,言语识别阈值比自然语音高 0.5dB(更差),而两种语音类型的斜率相等。尽管没有进行优化,但对于不同的测试列表和单词类别,合成语音信号的言语识别得分与文献中的结果相当,且阈值差异在两个不同自然语音录音的范围内。尽管没有应用优化,但在言语识别测试中应用合成语音可以显著降低开发成本和评估时间。这为增加具有可接受工作量的言语识别测试的言语语料库提供了机会。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/9fadeab617ae/10.1177_2331216519862982-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/f26b654da0cb/10.1177_2331216519862982-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/6e89a29d2e83/10.1177_2331216519862982-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/215586ac57ae/10.1177_2331216519862982-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/7a8022305174/10.1177_2331216519862982-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/1753a354fd14/10.1177_2331216519862982-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/9fadeab617ae/10.1177_2331216519862982-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/f26b654da0cb/10.1177_2331216519862982-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/6e89a29d2e83/10.1177_2331216519862982-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/215586ac57ae/10.1177_2331216519862982-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/7a8022305174/10.1177_2331216519862982-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/1753a354fd14/10.1177_2331216519862982-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/41ba/6643172/9fadeab617ae/10.1177_2331216519862982-fig6.jpg

相似文献

1
Measuring Speech Recognition With a Matrix Test Using Synthetic Speech.使用合成语音的矩阵测试测量语音识别。
Trends Hear. 2019 Jan-Dec;23:2331216519862982. doi: 10.1177/2331216519862982.
2
The development and evaluation of the Finnish Matrix Sentence Test for speech intelligibility assessment.用于言语可懂度评估的芬兰语矩阵句子测试的开发与评估。
Acta Otolaryngol. 2014 Jul;134(7):728-37. doi: 10.3109/00016489.2014.898185. Epub 2014 May 7.
3
Development of the Russian matrix sentence test.俄罗斯矩阵句子测试的开发。
Int J Audiol. 2015;54 Suppl 2:35-43. doi: 10.3109/14992027.2015.1020969. Epub 2015 Apr 6.
4
An Italian matrix sentence test for the evaluation of speech intelligibility in noise.一种用于评估噪声环境下言语可懂度的意大利语矩阵句子测试。
Int J Audiol. 2015;54 Suppl 2:44-50. doi: 10.3109/14992027.2015.1061709. Epub 2015 Sep 15.
5
Development and evaluation of the Turkish matrix sentence test.土耳其语矩阵句子测试的开发与评估
Int J Audiol. 2015;54 Suppl 2:51-61. doi: 10.3109/14992027.2015.1074735. Epub 2015 Oct 7.
6
Development of a Phrase-Based Speech-Recognition Test Using Synthetic Speech.基于短语的语音识别测试的合成语音开发。
Trends Hear. 2024 Jan-Dec;28:23312165241261490. doi: 10.1177/23312165241261490.
7
Do you hear the noise? The German matrix sentence test with a fixed noise level in subjects with normal hearing and hearing impairment.你听到那个噪音了吗?针对听力正常和听力受损受试者进行的固定噪音水平下的德语矩阵句子测试。
Int J Audiol. 2015;54 Suppl 2:71-9. doi: 10.3109/14992027.2015.1079929. Epub 2015 Nov 10.
8
Construction and evaluation of the Mandarin Chinese matrix (CMNmatrix) sentence test for the assessment of speech recognition in noise.用于评估噪声环境下语音识别能力的汉语矩阵(CMNmatrix)句子测试的构建与评估。
Int J Audiol. 2018 Nov;57(11):838-850. doi: 10.1080/14992027.2018.1483083. Epub 2018 Sep 4.
9
Influence of noise type on speech reception thresholds across four languages measured with matrix sentence tests.通过矩阵句子测试测量的噪声类型对四种语言语音接受阈值的影响。
Int J Audiol. 2015;54 Suppl 2:62-70. doi: 10.3109/14992027.2015.1046502. Epub 2015 Jun 22.
10
Development and evaluation of the Cantonese matrix sentence test.粤语句式测试的编制与评估。
Int J Audiol. 2024 Jan;63(1):8-20. doi: 10.1080/14992027.2022.2142683. Epub 2022 Nov 28.

引用本文的文献

1
Toward an Extended Classification of Noise-Distortion Preferences by Modeling Longitudinal Dynamics of Listening Choices.通过对听力选择的纵向动态进行建模实现噪声-失真偏好的扩展分类
Trends Hear. 2025 Jan-Dec;29:23312165251362018. doi: 10.1177/23312165251362018. Epub 2025 Aug 7.
2
Automatic development of speech-in-noise hearing tests using machine learning.利用机器学习自动开展噪声环境下言语听力测试
Sci Rep. 2025 Apr 15;15(1):12878. doi: 10.1038/s41598-025-96312-z.
3
Measuring Speech Intelligibility with Romanian Synthetic Unpredictable Sentences in Normal Hearing.

本文引用的文献

1
Development and evaluation of the Turkish matrix sentence test.土耳其语矩阵句子测试的开发与评估
Int J Audiol. 2015;54 Suppl 2:51-61. doi: 10.3109/14992027.2015.1074735. Epub 2015 Oct 7.
2
The multilingual matrix test: Principles, applications, and comparison across languages: A review.多语言矩阵测试:原理、应用及跨语言比较:综述
Int J Audiol. 2015;54 Suppl 2:3-16. doi: 10.3109/14992027.2015.1020971. Epub 2015 Sep 18.
3
Matrix sentence intelligibility prediction using an automatic speech recognition system.使用自动语音识别系统进行矩阵句子可懂度预测。
使用罗马尼亚语合成不可预测句子测量正常听力者的言语可懂度。
Audiol Res. 2024 Dec 1;14(6):1028-1044. doi: 10.3390/audiolres14060085.
4
Development of a Phrase-Based Speech-Recognition Test Using Synthetic Speech.基于短语的语音识别测试的合成语音开发。
Trends Hear. 2024 Jan-Dec;28:23312165241261490. doi: 10.1177/23312165241261490.
5
Investigating the Utility of a Compact Loudspeaker Array for Audiometric Testing.研究紧凑型扬声器阵列在测听中的实用性。
Am J Audiol. 2024 Jun 4;33(2):476-491. doi: 10.1044/2024_AJA-23-00199. Epub 2024 Apr 26.
6
Functional changes in the auditory cortex and associated regions caused by different acoustic stimuli in patients with presbycusis and tinnitus.老年性聋和耳鸣患者中不同声学刺激引起的听觉皮层及相关区域的功能变化。
Front Neurosci. 2022 Oct 19;16:921873. doi: 10.3389/fnins.2022.921873. eCollection 2022.
7
Speech Recognition and Listening Effort of Meaningful Sentences Using Synthetic Speech.使用合成语音识别和理解有意义的句子的听力努力。
Trends Hear. 2022 Jan-Dec;26:23312165221130656. doi: 10.1177/23312165221130656.
8
The Concurrent OLSA Test: A Method for Speech Recognition in Multi-talker Situations at Fixed SNR.同时性 OLSA 测试:在固定 SNR 下多说话人情况下的语音识别方法。
Trends Hear. 2022 Jan-Dec;26:23312165221108257. doi: 10.1177/23312165221108257.
9
Evaluation of the Bonebridge BCI 602 active bone conductive implant in adults: efficacy and stability of audiological, surgical, and functional outcomes.成人骨桥 BCI 602 主动骨导植入物的评估:听力学、手术和功能结果的疗效和稳定性。
Eur Arch Otorhinolaryngol. 2022 Jul;279(7):3525-3534. doi: 10.1007/s00405-022-07265-2. Epub 2022 Feb 19.
10
A novel method for peanut variety identification and classification by Improved VGG16.一种利用改进 VGG16 进行花生品种识别和分类的新方法。
Sci Rep. 2021 Aug 3;11(1):15756. doi: 10.1038/s41598-021-95240-y.
Int J Audiol. 2015;54 Suppl 2:100-7. doi: 10.3109/14992027.2015.1061708. Epub 2015 Sep 18.
4
An Italian matrix sentence test for the evaluation of speech intelligibility in noise.一种用于评估噪声环境下言语可懂度的意大利语矩阵句子测试。
Int J Audiol. 2015;54 Suppl 2:44-50. doi: 10.3109/14992027.2015.1061709. Epub 2015 Sep 15.
5
Development of the Russian matrix sentence test.俄罗斯矩阵句子测试的开发。
Int J Audiol. 2015;54 Suppl 2:35-43. doi: 10.3109/14992027.2015.1020969. Epub 2015 Apr 6.
6
Development and evaluation of a linguistically and audiologically controlled sentence intelligibility test.言语可懂度测试的语言学和听力学控制的开发和评估。
J Acoust Soc Am. 2013 Oct;134(4):3039-56. doi: 10.1121/1.4818760.
7
Perception of synthetic speech produced automatically by rule: Intelligibility of eight text-to-speech systems.对通过规则自动生成的合成语音的感知:八个文本转语音系统的可懂度。
Behav Res Methods Instrum Comput. 1986 Mar;18(2):100-107. doi: 10.3758/BF03201008.
8
A Spanish matrix sentence test for assessing speech reception thresholds in noise.一种用于评估噪声中语音接受阈的西班牙矩阵句子测试。
Int J Audiol. 2012 Jul;51(7):536-44. doi: 10.3109/14992027.2012.670731. Epub 2012 Apr 26.
9
Development and analysis of an International Speech Test Signal (ISTS).国际语音测试信号(ISTS)的开发与分析。
Int J Audiol. 2010 Dec;49(12):891-903. doi: 10.3109/14992027.2010.506889.
10
Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests.用于心理物理学和言语可懂度测试的阈值和并发斜率估计的高效自适应程序。
J Acoust Soc Am. 2002 Jun;111(6):2801-10. doi: 10.1121/1.1479152.