Suppr超能文献

电声联合刺激下的言语感知:模拟与模型比较

Speech Perception With Combined Electric-Acoustic Stimulation: A Simulation and Model Comparison.

作者信息

Rader Tobias, Adel Youssef, Fastl Hugo, Baumann Uwe

机构信息

1Department of Audiological Acoustics, ENT Department, University Hospital Frankfurt, Frankfurt, Germany; and 2Arbeitsgruppe Technische Akustik, Lehrstuhl für Mensch-Maschine-Kommunikation, Technische Universität München, Munich, Germany.

出版信息

Ear Hear. 2015 Nov-Dec;36(6):e314-25. doi: 10.1097/AUD.0000000000000178.

Abstract

OBJECTIVE

The aim of this study is to simulate speech perception with combined electric-acoustic stimulation (EAS), verify the advantage of combined stimulation in normal-hearing (NH) subjects, and then compare it with cochlear implant (CI) and EAS user results from the authors' previous study. Furthermore, an automatic speech recognition (ASR) system was built to examine the impact of low-frequency information and is proposed as an applied model to study different hypotheses of the combined-stimulation advantage. Signal-detection-theory (SDT) models were applied to assess predictions of subject performance without the need to assume any synergistic effects.

DESIGN

Speech perception was tested using a closed-set matrix test (Oldenburg sentence test), and its speech material was processed to simulate CI and EAS hearing. A total of 43 NH subjects and a customized ASR system were tested. CI hearing was simulated by an aurally adequate signal spectrum analysis and representation, the part-tone-time-pattern, which was vocoded at 12 center frequencies according to the MED-EL DUET speech processor. Residual acoustic hearing was simulated by low-pass (LP)-filtered speech with cutoff frequencies 200 and 500 Hz for NH subjects and in the range from 100 to 500 Hz for the ASR system. Speech reception thresholds were determined in amplitude-modulated noise and in pseudocontinuous noise. Previously proposed SDT models were lastly applied to predict NH subject performance with EAS simulations.

RESULTS

NH subjects tested with EAS simulations demonstrated the combined-stimulation advantage. Increasing the LP cutoff frequency from 200 to 500 Hz significantly improved speech reception thresholds in both noise conditions. In continuous noise, CI and EAS users showed generally better performance than NH subjects tested with simulations. In modulated noise, performance was comparable except for the EAS at cutoff frequency 500 Hz where NH subject performance was superior. The ASR system showed similar behavior to NH subjects despite a positive signal-to-noise ratio shift for both noise conditions, while demonstrating the synergistic effect for cutoff frequencies ≥300 Hz. One SDT model largely predicted the combined-stimulation results in continuous noise, while falling short of predicting performance observed in modulated noise.

CONCLUSIONS

The presented simulation was able to demonstrate the combined-stimulation advantage for NH subjects as observed in EAS users. Only NH subjects tested with EAS simulations were able to take advantage of the gap listening effect, while CI and EAS user performance was consistently degraded in modulated noise compared with performance in continuous noise. The application of ASR systems seems feasible to assess the impact of different signal processing strategies on speech perception with CI and EAS simulations. In continuous noise, SDT models were largely able to predict the performance gain without assuming any synergistic effects, but model amendments are required to explain the gap listening effect in modulated noise.

摘要

目的

本研究旨在模拟电声联合刺激(EAS)下的言语感知,验证联合刺激在正常听力(NH)受试者中的优势,并将其与作者之前研究中的人工耳蜗(CI)和EAS使用者的结果进行比较。此外,构建了一个自动语音识别(ASR)系统来研究低频信息的影响,并将其作为一个应用模型来研究联合刺激优势的不同假设。应用信号检测理论(SDT)模型来评估受试者的表现预测,而无需假设任何协同效应。

设计

使用闭集矩阵测试(奥尔登堡句子测试)测试言语感知,并对其言语材料进行处理以模拟CI和EAS听力。共测试了43名NH受试者和一个定制的ASR系统。通过听觉上合适的信号频谱分析和表示(部分音调时间模式)来模拟CI听力,该模式根据MED-EL DUET言语处理器在12个中心频率上进行声码转换。对于NH受试者,通过截止频率为200和500 Hz的低通(LP)滤波语音模拟残余听觉,对于ASR系统,截止频率范围为100至500 Hz。在调幅噪声和伪连续噪声中确定言语接受阈值。最后应用先前提出的SDT模型来预测EAS模拟下NH受试者的表现。

结果

用EAS模拟测试的NH受试者表现出联合刺激优势。将LP截止频率从200 Hz提高到500 Hz可显著改善两种噪声条件下的言语接受阈值。在连续噪声中,CI和EAS使用者的表现总体上优于用模拟测试的NH受试者。在调幅噪声中,除了截止频率为500 Hz的EAS外,表现相当,此时NH受试者的表现更优。尽管两种噪声条件下信噪比均有正向偏移,但ASR系统表现出与NH受试者相似的行为,同时在截止频率≥300 Hz时表现出协同效应。一个SDT模型在很大程度上预测了连续噪声中的联合刺激结果,但未能预测调幅噪声中观察到的表现。

结论

所呈现的模拟能够证明NH受试者具有如EAS使用者中观察到的联合刺激优势。只有用EAS模拟测试的NH受试者能够利用间隙聆听效应,而与连续噪声中的表现相比,CI和EAS使用者在调幅噪声中的表现持续下降。应用ASR系统来评估不同信号处理策略对CI和EAS模拟言语感知的影响似乎是可行的。在连续噪声中,SDT模型在很大程度上能够预测性能增益,而无需假设任何协同效应,但需要修正模型来解释调幅噪声中的间隙聆听效应。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验