Haro Stephanie, Smalt Christopher J, Ciccarelli Gregory A, Quatieri Thomas F
Human Health and Performance Systems, Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, United States.
Speech and Hearing Biosciences and Technology, Harvard Medical School, Boston, MA, United States.
Front Neurosci. 2020 Dec 15;14:588448. doi: 10.3389/fnins.2020.588448. eCollection 2020.
Many individuals struggle to understand speech in listening scenarios that include reverberation and background noise. An individual's ability to understand speech arises from a combination of peripheral auditory function, central auditory function, and general cognitive abilities. The interaction of these factors complicates the prescription of treatment or therapy to improve hearing function. Damage to the auditory periphery can be studied in animals; however, this method alone is not enough to understand the impact of hearing loss on speech perception. Computational auditory models bridge the gap between animal studies and human speech perception. Perturbations to the modeled auditory system permit mechanism-based investigation of observed human behavior. In this study, we propose a computational model that accounts for the complex interactions between different hearing damage mechanisms and simulates human speech-in-noise perception. The model performs a digit classification task as a human would, with only acoustic sound pressure as input; thus, we can use the model's performance as a proxy for human performance. This two-stage model consists of a biophysical cochlear-nerve spike generator followed by a deep neural network (DNN) classifier. We hypothesize that sudden damage to the periphery affects speech perception and that central nervous system adaptation over time may compensate for peripheral hearing damage. Our model achieved human-like performance across signal-to-noise ratios (SNRs) under normal-hearing (NH) cochlear settings, reaching 50% digit-recognition accuracy at -20.7 dB SNR. These results were comparable to those of eight NH participants on the same task, who reached 50% behavioral performance at -22 dB SNR. We also simulated medial olivocochlear reflex (MOCR) and auditory nerve fiber (ANF) loss, both of which degraded digit-recognition accuracy more at lower SNRs than at higher SNRs.
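The 50%-accuracy SNR thresholds quoted above are typically obtained by fitting a psychometric function to accuracy-vs-SNR data and solving for the 50% crossing. A minimal sketch of that procedure, using illustrative accuracy values (not the paper's data) for a 10-class digit task with 10% chance performance:

```python
import numpy as np
from scipy.optimize import curve_fit

# Hypothetical accuracy-vs-SNR measurements (illustrative only,
# not the paper's results); chance level for 10 digits is 0.1.
snr_db = np.array([-30.0, -25.0, -20.0, -15.0, -10.0, -5.0, 0.0])
accuracy = np.array([0.12, 0.22, 0.55, 0.82, 0.93, 0.97, 0.99])

def psychometric(snr, midpoint, slope, chance=0.1):
    """Logistic psychometric function rising from chance to 1.0."""
    return chance + (1.0 - chance) / (1.0 + np.exp(-slope * (snr - midpoint)))

# Fit midpoint and slope; chance stays fixed at its default.
(midpoint, slope), _ = curve_fit(psychometric, snr_db, accuracy, p0=[-20.0, 0.5])

# Solve psychometric(snr) = 0.5 analytically for the 50% threshold.
chance = 0.1
snr_50 = midpoint - np.log((1.0 - chance) / (0.5 - chance) - 1.0) / slope
print(f"50% threshold: {snr_50:.1f} dB SNR")
```

With denser SNR sampling and per-SNR trial counts, the same fit can be weighted by binomial variance; the analytic solve simply inverts the logistic at 0.5.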
Our simulated performance following ANF loss is consistent with the hypothesis that cochlear synaptopathy impairs communication in background noise more than in quiet. Following each simulated cochlear degradation, we implemented both extreme and conservative central adaptation by retraining the DNN. At the lowest SNRs (<0 dB), neither adapted model fully recovered NH performance, even with hundreds of thousands of training samples. This implies a limit on performance recovery following peripheral damage in our human-inspired DNN architecture.
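The two-stage pipeline and the ANF-loss manipulation can be caricatured in a few lines. The sketch below is a toy surrogate, not the paper's biophysical model: a band-energy "spike rate" front end stands in for the cochlear-nerve spike generator, and fiber loss is modeled by silencing a random fraction of fibers before the features reach a downstream classifier.

```python
import numpy as np

rng = np.random.default_rng(0)

def cochlear_spike_rates(pressure, n_fibers=32):
    """Toy stand-in for the biophysical auditory-nerve front end:
    map a sound-pressure waveform to one mean 'spike rate' per fiber
    by averaging spectral magnitude within n_fibers frequency bands."""
    spectrum = np.abs(np.fft.rfft(pressure))
    bands = np.array_split(spectrum, n_fibers)
    return np.array([band.mean() for band in bands])

def simulate_anf_loss(rates, survival=0.5):
    """Model auditory-nerve-fiber loss by zeroing a random subset of
    fibers; only `survival` of them (on average) keep responding."""
    mask = rng.random(rates.shape) < survival
    return rates * mask

# A toy 'digit' stimulus: a tone embedded in noise at -10 dB SNR.
t = np.linspace(0.0, 0.5, 8000, endpoint=False)
signal = np.sin(2.0 * np.pi * 440.0 * t)
noise = rng.normal(size=t.shape)
noise *= np.std(signal) / (np.std(noise) * 10.0 ** (-10.0 / 20.0))

rates = cochlear_spike_rates(signal + noise)
degraded = simulate_anf_loss(rates)
```

In the paper's setup, the feature vector produced by the (intact or degraded) front end feeds the DNN classifier; "adaptation" then corresponds to retraining that classifier on features from the degraded periphery.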