Suppr超能文献

方向性、压缩和工作记忆对语音识别的影响。

Effects of Directionality, Compression, and Working Memory on Speech Recognition.

作者信息

Rallapalli Varsha, Ellis Gregory, Souza Pamela

机构信息

Department of Communication Sciences & Disorders, Northwestern University, Evanston, Illinois, USA.

Knowles Hearing Center, Northwestern University, Evanston, Illinois, USA.

出版信息

Ear Hear. 2021 May/Jun;42(3):492-505. doi: 10.1097/AUD.0000000000000970.

Abstract

OBJECTIVES

Previous research has shown that the association between hearing aid-processed speech recognition and individual working memory ability becomes stronger in more challenging conditions (e.g., higher background noise levels) and with stronger hearing aid processing (e.g., fast-acting wide dynamic range compression, WDRC). To date, studies have assumed omnidirectional microphone settings and collocated speech and noise conditions to study such relationships. Such conditions fail to recognize that most hearing aids are fit with directional processing that may improve the signal to noise ratio (SNR) and speech recognition in spatially separated speech and noise conditions. Here, we considered the possibility that directional processing may reduce the signal distortion arising from fast-acting WDRC and in turn influence the relationship between working memory ability and speech recognition with WDRC processing. The combined effects of hearing aid processing (WDRC and directionality) and SNR were quantified using a signal modification metric (cepstral correlation), which measures temporal envelope changes in the processed signal with respect to a linearly amplified reference. It was hypothesized that there will be a weaker association between working memory ability and speech recognition for hearing aid processing conditions that result in overall less signal modification (i.e., fewer changes to the processed envelope).

DESIGN

Twenty-three individuals with bilateral, mild to moderately severe sensorineural hearing loss participated in the study. Participants were fit with a commercially available hearing aid, and signal processing was varied in two dimensions: (1) Directionality (omnidirectional [OMNI] versus fixed-directional [DIR]), and (2) WDRC speed (fast-acting [FAST] versus slow-acting [SLOW]). Sentence recognition in spatially separated multi-talker babble was measured across a range of SNRs: 0 dB, 5 dB, 10 dB, and quiet. Cumulative signal modification was measured with individualized hearing aid settings, for all experimental conditions. A linear mixed-effects model was used to determine the relationship between speech recognition, working memory ability, and cumulative signal modification.

RESULTS

Signal modification results showed a complex relationship between directionality and WDRC speed, which varied by SNR. At 0 and 5 dB SNRs, signal modification was lower for SLOW than FAST regardless of directionality. However, at 10 dB SNR and in the DIR listening condition, there was no signal modification difference between FAST and SLOW. Consistent with previous studies, the association of speech recognition in noise with working memory ability depended on the level of signal modification. Contrary to the hypothesis above, however, there was a significant association of speech recognition with working memory only at lower levels of signal modification, and speech recognition increased at a faster rate for individuals with better working memory as signal modification decreased with DIR and SLOW.

CONCLUSIONS

This research suggests that working memory ability remains a significant predictor of speech recognition when WDRC and directionality are applied. Our findings revealed that directional processing can reduce the detrimental effect of fast-acting WDRC on speech cues at higher SNRs, which affects speech recognition ability. Contrary to some previous research, this study showed that individuals with better working memory ability benefitted more from a decrease in signal modification than individuals with poorer working memory ability.

摘要

目的

先前的研究表明,在更具挑战性的条件下(例如,更高的背景噪声水平)以及更强的助听器处理(例如,快速作用的宽动态范围压缩,WDRC)时,经助听器处理的语音识别与个体工作记忆能力之间的关联会更强。迄今为止,研究在研究此类关系时假定为全向麦克风设置以及语音和噪声并置的条件。此类条件未认识到大多数助听器是采用定向处理进行适配的,这在空间上分离的语音和噪声条件下可能会提高信噪比(SNR)和语音识别能力。在此,我们考虑了定向处理可能会减少快速作用的WDRC引起的信号失真,进而影响工作记忆能力与经WDRC处理的语音识别之间关系的可能性。使用一种信号修改指标(倒谱相关性)对助听器处理(WDRC和方向性)和SNR的综合影响进行了量化,该指标测量经处理信号相对于线性放大参考信号的时间包络变化。据推测,对于导致总体信号修改较少(即,经处理包络的变化较少)的助听器处理条件,工作记忆能力与语音识别之间的关联会较弱。

设计

23名双侧轻度至中度重度感音神经性听力损失的个体参与了该研究。为参与者适配了一款商用助听器,并且在两个维度上改变信号处理方式:(1)方向性(全向[OMNI]与固定方向[DIR]),以及(2)WDRC速度(快速作用[FAST]与慢速作用[SLOW])。在一系列SNR水平:0 dB、5 dB、10 dB和安静环境下,测量了在空间上分离的多说话者嘈杂环境中的句子识别情况。针对所有实验条件,使用个性化的助听器设置测量累积信号修改情况。使用线性混合效应模型来确定语音识别、工作记忆能力和累积信号修改之间的关系。

结果

信号修改结果显示方向性和WDRC速度之间存在复杂关系,该关系随SNR而变化。在0和5 dB SNR时,无论方向性如何,慢速作用的信号修改都低于快速作用的。然而,在10 dB SNR且处于DIR聆听条件下,快速作用和慢速作用之间没有信号修改差异。与先前的研究一致,噪声中的语音识别与工作记忆能力的关联取决于信号修改水平。然而,与上述假设相反,仅在较低的信号修改水平下,语音识别与工作记忆之间存在显著关联,并且随着DIR和慢速作用导致信号修改减少,工作记忆较好的个体的语音识别以更快的速率增加。

结论

本研究表明,当应用WDRC和方向性时,工作记忆能力仍然是语音识别的重要预测指标。我们的研究结果表明,定向处理可以减少快速作用的WDRC在较高SNR时对语音线索的有害影响,这会影响语音识别能力。与一些先前的研究相反,本研究表明,工作记忆能力较好的个体比工作记忆能力较差的个体从信号修改减少中获益更多。

相似文献

8
Preference for Combinations of Hearing Aid Signal Processing.助听器信号处理组合的偏好。
J Speech Lang Hear Res. 2022 Aug 17;65(8):3100-3116. doi: 10.1044/2022_JSLHR-22-00018. Epub 2022 Jul 26.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验