• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于实时应用的人类耳蜗力学和滤波器调谐的卷积神经网络模型。

A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications.

作者信息

Baby Deepak, Van Den Broucke Arthur, Verhulst Sarah

机构信息

Hearing Technology @ WAVES, Dept. of Information Technology, Ghent University, 9000 Ghent, Belgium.

出版信息

Nat Mach Intell. 2021 Feb;3(2):134-143. doi: 10.1038/s42256-020-00286-8. Epub 2021 Feb 8.

DOI:10.1038/s42256-020-00286-8
PMID:33629031
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7116797/
Abstract

Auditory models are commonly used as feature extractors for automatic speech-recognition systems or as front-ends for robotics, machine-hearing and hearing-aid applications. Although auditory models can capture the biophysical and nonlinear properties of human hearing in great detail, these biophysical models are computationally expensive and cannot be used in real-time applications. We present a hybrid approach where convolutional neural networks are combined with computational neuroscience to yield a real-time end-to-end model for human cochlear mechanics, including level-dependent filter tuning (CoNNear). The CoNNear model was trained on acoustic speech material and its performance and applicability were evaluated using (unseen) sound stimuli commonly employed in cochlear mechanics research. The CoNNear model accurately simulates human cochlear frequency selectivity and its dependence on sound intensity, an essential quality for robust speech intelligibility at negative speech-to-background-noise ratios. The CoNNear architecture is based on parallel and differentiable computations and has the power to achieve real-time human performance. These unique CoNNear features will enable the next generation of human-like machine-hearing applications.

摘要

听觉模型通常用作自动语音识别系统的特征提取器,或用作机器人技术、机器听觉和助听器应用的前端。尽管听觉模型可以非常详细地捕捉人类听觉的生物物理和非线性特性,但这些生物物理模型计算成本高昂,无法用于实时应用。我们提出了一种混合方法,将卷积神经网络与计算神经科学相结合,以产生一种用于人类耳蜗力学的实时端到端模型,包括与电平相关的滤波器调谐(CoNNear)。CoNNear模型在声学语音材料上进行训练,并使用耳蜗力学研究中常用的(未见)声音刺激来评估其性能和适用性。CoNNear模型准确地模拟了人类耳蜗频率选择性及其对声音强度的依赖性,这是在负语音与背景噪声比下实现稳健语音可懂度的一项基本特性。CoNNear架构基于并行和可微计算,有能力实现实时人类性能。这些独特的CoNNear特性将推动下一代类人机器听觉应用的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/6f8e4177c25c/EMS114746-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/3f30ced0cc3f/EMS114746-f007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/fdc859d6a9c9/EMS114746-f008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/1e6eafc39c56/EMS114746-f009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/2bb707513201/EMS114746-f011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/6f8e4177c25c/EMS114746-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/3f30ced0cc3f/EMS114746-f007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/fdc859d6a9c9/EMS114746-f008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/1e6eafc39c56/EMS114746-f009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/2bb707513201/EMS114746-f011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8be/7116797/6f8e4177c25c/EMS114746-f002.jpg

相似文献

1
A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications.一种用于实时应用的人类耳蜗力学和滤波器调谐的卷积神经网络模型。
Nat Mach Intell. 2021 Feb;3(2):134-143. doi: 10.1038/s42256-020-00286-8. Epub 2021 Feb 8.
2
A physiologically-inspired model reproducing the speech intelligibility benefit in cochlear implant listeners with residual acoustic hearing.一种受生理启发的模型,再现了具有残余听觉的人工耳蜗聆听者的言语可懂度优势。
Hear Res. 2017 Feb;344:50-61. doi: 10.1016/j.heares.2016.10.023. Epub 2016 Nov 9.
3
Deep Neural Network Model of Hearing-Impaired Speech-in-Noise Perception.听力受损者噪声环境下言语感知的深度神经网络模型
Front Neurosci. 2020 Dec 15;14:588448. doi: 10.3389/fnins.2020.588448. eCollection 2020.
4
Auditory models of suprathreshold distortion and speech intelligibility in persons with impaired hearing.听力受损者的超阈值失真与言语可懂度的听觉模型。
J Am Acad Audiol. 2013 Apr;24(4):307-28. doi: 10.3766/jaaa.24.4.6.
5
Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users.基于神经网络的语音增强技术可提高人工耳蜗使用者在噪声环境中的语音清晰度。
Hear Res. 2017 Feb;344:183-194. doi: 10.1016/j.heares.2016.11.012. Epub 2016 Nov 30.
6
Effects of lifetime noise exposure on the middle-age human auditory brainstem response, tinnitus and speech-in-noise intelligibility.终生噪声暴露对中年人大脑听觉脑干反应、耳鸣和噪声中言语可懂度的影响。
Hear Res. 2018 Aug;365:36-48. doi: 10.1016/j.heares.2018.06.003. Epub 2018 Jun 12.
7
Electric and acoustic harmonic integration predicts speech-in-noise performance in hybrid cochlear implant users.电声谐波整合可预测混合式人工耳蜗使用者在噪声环境中的言语表现。
Hear Res. 2018 Sep;367:223-230. doi: 10.1016/j.heares.2018.06.016. Epub 2018 Jun 28.
8
Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants.实验研究优化人工耳蜗植入中可懂度的声学特征。
Sensors (Basel). 2023 Aug 31;23(17):7553. doi: 10.3390/s23177553.
9
The effect of different cochlear implant microphones on acoustic hearing individuals' binaural benefits for speech perception in noise.不同人工耳蜗麦克风对听力正常者噪声环境下言语感知的双侧获益的影响。
Ear Hear. 2011 Jul-Aug;32(4):468-84. doi: 10.1097/AUD.0b013e31820dd3f0.
10
Cochlea-inspired speech recognition interface.耳蜗启发式语音识别接口。
Med Biol Eng Comput. 2019 Jun;57(6):1393-1403. doi: 10.1007/s11517-019-01963-6. Epub 2019 Mar 4.

引用本文的文献

1
Piezoelectric nanofiber-based intelligent hearing system.基于压电纳米纤维的智能听力系统。
Sci Adv. 2025 May 9;11(19):eadl2741. doi: 10.1126/sciadv.adl2741. Epub 2025 May 7.
2
Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.针对现实世界任务进行优化的模型揭示了听觉中精确时间编码的任务依赖性必要性。
Nat Commun. 2024 Dec 4;15(1):10590. doi: 10.1038/s41467-024-54700-5.
3
Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.

本文引用的文献

1
An efficient analytical reduction of detailed nonlinear neuron models.一种高效的详细非线性神经元模型的分析简化方法。
Nat Commun. 2020 Jan 15;11(1):288. doi: 10.1038/s41467-019-13932-6.
2
A deep learning framework for neuroscience.深度学习在神经科学中的应用框架。
Nat Neurosci. 2019 Nov;22(11):1761-1770. doi: 10.1038/s41593-019-0520-2. Epub 2019 Oct 28.
3
Deep neural network models of sensory systems: windows onto the role of task constraints.感觉系统的深度神经网络模型:揭示任务约束作用的窗口。
针对现实世界任务进行优化的模型揭示了听力中精确时间编码在任务依赖方面的必要性。
bioRxiv. 2024 Sep 16:2024.04.21.590435. doi: 10.1101/2024.04.21.590435.
4
Large-scale electrophysiology and deep learning reveal distorted neural signal dynamics after hearing loss.大规模电生理学和深度学习揭示听力损失后神经信号动力学的扭曲。
Elife. 2023 May 10;12:e85108. doi: 10.7554/eLife.85108.
5
A comparative study of eight human auditory models of monaural processing.八种单耳听觉处理的人体听觉模型的比较研究。
Acta Acust (2020). 2022;6. doi: 10.1051/aacus/2022008. Epub 2022 May 4.
6
Periodicity Pitch Perception Part III: Sensibility and Pachinko Volatility.周期性音高感知第三部分:敏感性与弹珠机波动性
Front Neurosci. 2022 Mar 8;16:736642. doi: 10.3389/fnins.2022.736642. eCollection 2022.
7
Deep neural network models of sound localization reveal how perception is adapted to real-world environments.深度神经网络模型的声音定位揭示了感知是如何适应现实世界环境的。
Nat Hum Behav. 2022 Jan;6(1):111-133. doi: 10.1038/s41562-021-01244-z. Epub 2022 Jan 27.
8
An outer hair cell-powered global hydromechanical mechanism for cochlear amplification.外毛细胞驱动的耳蜗放大整体液力学机制。
Hear Res. 2022 Sep 15;423:108407. doi: 10.1016/j.heares.2021.108407. Epub 2021 Dec 1.
9
A convolutional neural-network framework for modelling auditory sensory cells and synapses.用于模拟听觉感觉细胞和突触的卷积神经网络框架。
Commun Biol. 2021 Jul 1;4(1):827. doi: 10.1038/s42003-021-02341-5.
10
Deep Neural Network Model of Hearing-Impaired Speech-in-Noise Perception.听力受损者噪声环境下言语感知的深度神经网络模型
Front Neurosci. 2020 Dec 15;14:588448. doi: 10.3389/fnins.2020.588448. eCollection 2020.
Curr Opin Neurobiol. 2019 Apr;55:121-132. doi: 10.1016/j.conb.2019.02.003. Epub 2019 Mar 15.
4
Towards reconstructing intelligible speech from the human auditory cortex.从人类听觉皮层重建可理解的语音。
Sci Rep. 2019 Jan 29;9(1):874. doi: 10.1038/s41598-018-37359-z.
5
Quadratic distortion in a nonlinear cascade model of the human cochlea.人耳蜗非线性级联模型中的二次失真。
J Acoust Soc Am. 2018 May;143(5):EL418. doi: 10.1121/1.5038595.
6
A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy.任务优化神经网络复制人类听觉行为,预测大脑反应,并揭示皮质处理层次结构。
Neuron. 2018 May 2;98(3):630-644.e16. doi: 10.1016/j.neuron.2018.03.044. Epub 2018 Apr 19.
7
Computational modeling of the human auditory periphery: Auditory-nerve responses, evoked potentials and hearing loss.人类听觉外周的计算模型:听神经反应、诱发电位与听力损失
Hear Res. 2018 Mar;360:55-75. doi: 10.1016/j.heares.2017.12.018. Epub 2017 Dec 28.
8
Dynamics of cochlear nonlinearity: Automatic gain control or instantaneous damping?耳蜗非线性的动力学:自动增益控制还是瞬时阻尼?
J Acoust Soc Am. 2017 Dec;142(6):3510. doi: 10.1121/1.5014039.
9
Otoacoustic emission estimates of human basilar membrane impulse response duration and cochlear filter tuning.人类基底膜冲动反应持续时间和耳蜗滤波器调谐的耳声发射估计
Hear Res. 2016 Dec;342:150-160. doi: 10.1016/j.heares.2016.10.016. Epub 2016 Oct 27.
10
A comparative study of seven human cochlear filter models.七种人类耳蜗滤波器模型的比较研究。
J Acoust Soc Am. 2016 Sep;140(3):1618. doi: 10.1121/1.4960486.