Institute of Hearing Technology and Audiology, Jade University of Applied Sciences, Oldenburg, Germany.
Medizinische Physik, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.
Trends Hear. 2024 Jan-Dec;28:23312165241261490. doi: 10.1177/23312165241261490.
Speech-recognition tests are widely used in both clinical and research audiology. The purpose of this study was the development of a novel speech-recognition test that combines concepts of different speech-recognition tests to reduce training effects and allows for a large set of speech material. The new test consists of four different words per trial in a meaningful construct with a fixed structure, the so-called phrases. Various free databases were used to select the words and to determine their frequency. Highly frequent nouns were grouped into thematic categories and combined with related adjectives and infinitives. After discarding inappropriate and unnatural combinations, and eliminating duplications of (sub-)phrases, a total number of 772 phrases remained. Subsequently, the phrases were synthesized using a text-to-speech system. The synthesis significantly reduces the effort compared to recordings with a real speaker. After excluding outliers, measured speech-recognition scores for the phrases with 31 normal-hearing participants at fixed signal-to-noise ratios (SNR) revealed speech-recognition thresholds (SRT) for each phrase varying up to 4 dB. The median SRT was -9.1 dB SNR and thus comparable to existing sentence tests. The psychometric function's slope of 15 percentage points per dB is also comparable and enables efficient use in audiology. Summarizing, the principle of creating speech material in a modular system has many potential applications.
语音识别测试在临床和研究听力学中都得到了广泛应用。本研究的目的是开发一种新的语音识别测试,该测试结合了不同语音识别测试的概念,以减少训练效应,并允许使用大量的语音材料。新测试由每个试验中的四个不同单词组成,这些单词具有固定结构的有意义的构词,即所谓的短语。各种免费数据库被用于选择单词并确定它们的频率。高频率名词被分为主题类别,并与相关形容词和不定式结合使用。在剔除不适当和不自然的组合,以及消除(子)短语的重复之后,剩下了总共 772 个短语。随后,使用文本到语音系统对这些短语进行合成。与使用真实说话者录制相比,这种合成方法大大减少了工作量。在排除离群值后,在固定信噪比(SNR)下,31 名正常听力参与者对短语进行的语音识别测试,得到了每个短语的语音识别阈值(SRT),范围在 4dB 以内。中位数 SRT 为-9.1dB SNR,与现有的句子测试相当。15 个百分点每分贝的心理物理函数斜率也相当,可在听力学中有效使用。综上所述,在模块化系统中创建语音材料的原理具有许多潜在的应用。