Shahidi Lidea K, Collins Leslie M, Mainsah Boyla O
Electrical and Computer Engineering, Duke University, Durham, USA.
Proc Int Conf Mach Learn Appl. 2018 Dec;2018:847-852. doi: 10.1109/ICMLA.2018.00136. Epub 2019 Jan 17.
Individuals with cochlear implants (CIs) experience more difficulty understanding speech in reverberant environ-ments than normal hearing listeners. As a result, recent research has targeted mitigating the effects of late reverberant signal reflections in CIs by using a machine learning approach to detect and delete affected segments in the CI stimulus pattern. Previous work has trained electrode-specific classification models to mitigate late reverberant signal reflections based on features extracted from only the acoustic activity within the electrode of interest. Since adjacent CI electrodes tend to be activated concurrently during speech, we hypothesized that incorporating additional information from the other electrode channels, termed , as features could improve classification performance. Cross-channel information extracted in real-world conditions will likely contain errors that will impact classification performance. To simulate extracting cross-channel information in realistic conditions, we developed a graphical model based on the Ising model to systematically introduce errors to specific types of cross-channel information. The Ising-like model allows us to add errors while maintaining the important geometric information contained in cross-channel information, which is due to the spectro-temporal structure of speech. Results suggest the potential utility of leveraging cross-channel information to improve the performance of the reverberation mitigation algorithm from the baseline channel-based features, even when the cross-channel information contains errors.
与正常听力的听众相比,人工耳蜗(CI)使用者在混响环境中理解语音时会遇到更多困难。因此,最近的研究旨在通过使用机器学习方法来检测和删除CI刺激模式中受影响的部分,以减轻CI中晚期混响信号反射的影响。先前的工作已经训练了特定电极的分类模型,以基于仅从感兴趣电极内的声学活动中提取的特征来减轻晚期混响信号反射。由于在语音过程中相邻的CI电极往往会同时被激活,我们假设将来自其他电极通道的额外信息(称为 )作为特征纳入可以提高分类性能。在现实世界条件下提取的跨通道信息可能会包含影响分类性能的错误。为了模拟在现实条件下提取跨通道信息,我们基于伊辛模型开发了一个图形模型,以系统地将错误引入特定类型的跨通道信息。类似伊辛的模型使我们能够在添加错误的同时保留跨通道信息中包含的重要几何信息,这是由于语音的频谱-时间结构所致。结果表明,即使跨通道信息包含错误,利用跨通道信息从基于基线通道的特征提高混响减轻算法性能的潜在效用。