使用自动语音识别优化助听器时间常数。

Using Automatic Speech Recognition to Optimize Hearing-Aid Time Constants.

作者信息

Fontan Lionel, Gonçalves Braz Libio, Pinquier Julien, Stone Michael A, Füllgrabe Christian

机构信息

Archean LABS, Montauban, France.

IRIT, CNRS, Université Paul Sabatier, Toulouse, France.

出版信息

Front Neurosci. 2022 Mar 17;16:779062. doi: 10.3389/fnins.2022.779062. eCollection 2022.

DOI:10.3389/fnins.2022.779062

PMID:35368250

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8969748/

Abstract

Automatic speech recognition (ASR), when combined with hearing-aid (HA) and hearing-loss (HL) simulations, can predict aided speech-identification performances of persons with age-related hearing loss. ASR can thus be used to evaluate different HA configurations, such as combinations of insertion-gain functions and compression thresholds, in order to optimize HA fitting for a given person. The present study investigated whether, after fixing compression thresholds and insertion gains, a random-search algorithm could be used to optimize time constants (i.e., attack and release times) for 12 audiometric profiles. The insertion gains were either those recommended by the CAM2 prescription rule or those optimized using ASR, while compression thresholds were always optimized using ASR. For each audiometric profile, the random-search algorithm was used to vary time constants with the aim to maximize ASR performance. A HA simulator and a HL simulator simulator were used, respectively, to amplify and to degrade speech stimuli according to the input audiogram. The resulting speech signals were fed to an ASR system for recognition. For each audiogram, 1,000 iterations of the random-search algorithm were used to find the time-constant configuration yielding the highest ASR score. To assess the reproducibility of the results, the random search algorithm was run twice. Optimizing the time constants significantly improved the ASR scores when CAM2 insertion gains were used, but not when using ASR-based gains. Repeating the random search yielded similar ASR scores, but different time-constant configurations.

摘要

自动语音识别（ASR）与助听器（HA）及听力损失（HL）模拟相结合时，能够预测年龄相关性听力损失患者的助听语音识别表现。因此，ASR可用于评估不同的HA配置，如插入增益函数与压缩阈值的组合，以便为特定患者优化HA适配。本研究调查了在固定压缩阈值和插入增益后，随机搜索算法是否可用于为12种听力测试配置优化时间常数（即起始时间和释放时间）。插入增益要么是CAM2处方规则推荐的增益，要么是使用ASR优化的增益，而压缩阈值始终使用ASR进行优化。对于每种听力测试配置，使用随机搜索算法改变时间常数，以最大化ASR性能。分别使用HA模拟器和HL模拟器，根据输入的听力图放大和降解语音刺激。将得到的语音信号输入ASR系统进行识别。对于每种听力图，使用随机搜索算法进行1000次迭代，以找到产生最高ASR分数的时间常数配置。为评估结果的可重复性，随机搜索算法运行两次。当使用CAM2插入增益时，优化时间常数显著提高了ASR分数，但使用基于ASR的增益时则不然。重复随机搜索产生了相似的ASR分数，但时间常数配置不同。