• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对现实世界任务进行优化的模型揭示了听觉中精确时间编码的任务依赖性必要性。

Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.

作者信息

Saddler Mark R, McDermott Josh H

机构信息

Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA.

McGovern Institute for Brain Research, MIT, Cambridge, MA, USA.

出版信息

Nat Commun. 2024 Dec 4;15(1):10590. doi: 10.1038/s41467-024-54700-5.

DOI:10.1038/s41467-024-54700-5
PMID:39632854
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11618365/
Abstract

Neurons encode information in the timing of their spikes in addition to their firing rates. Spike timing is particularly precise in the auditory nerve, where action potentials phase lock to sound with sub-millisecond precision, but its behavioral relevance remains uncertain. We optimized machine learning models to perform real-world hearing tasks with simulated cochlear input, assessing the precision of auditory nerve spike timing needed to reproduce human behavior. Models with high-fidelity phase locking exhibited more human-like sound localization and speech perception than models without, consistent with an essential role in human hearing. However, the temporal precision needed to reproduce human-like behavior varied across tasks, as did the precision that benefited real-world task performance. These effects suggest that perceptual domains incorporate phase locking to different extents depending on the demands of real-world hearing. The results illustrate how optimizing models for realistic tasks can clarify the role of candidate neural codes in perception.

摘要

神经元除了通过放电率来编码信息外,还通过其尖峰的时间来编码信息。尖峰时间在听神经中特别精确,在听神经中动作电位以亚毫秒级精度与声音相位锁定,但其行为相关性仍不确定。我们优化了机器学习模型,以便利用模拟的耳蜗输入执行现实世界中的听力任务,评估再现人类行为所需的听神经尖峰时间精度。与没有高保真相位锁定的模型相比,具有高保真相位锁定的模型表现出更像人类的声音定位和语音感知,这与在人类听力中起重要作用一致。然而,再现类人行为所需的时间精度因任务而异,对现实世界任务表现有益的精度也是如此。这些效应表明,感知领域根据现实世界听力的需求在不同程度上纳入了相位锁定。结果说明了为现实任务优化模型如何能够阐明候选神经编码在感知中的作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/30b78c882901/41467_2024_54700_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/b221e27c33b6/41467_2024_54700_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/4ab22724c05f/41467_2024_54700_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/0dd45626ed00/41467_2024_54700_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/d9fff9c463ba/41467_2024_54700_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/f0b6be1a65dd/41467_2024_54700_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/ea5eb9dd1f38/41467_2024_54700_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/4349f6b98af4/41467_2024_54700_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/30b78c882901/41467_2024_54700_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/b221e27c33b6/41467_2024_54700_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/4ab22724c05f/41467_2024_54700_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/0dd45626ed00/41467_2024_54700_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/d9fff9c463ba/41467_2024_54700_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/f0b6be1a65dd/41467_2024_54700_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/ea5eb9dd1f38/41467_2024_54700_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/4349f6b98af4/41467_2024_54700_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831c/11618365/30b78c882901/41467_2024_54700_Fig8_HTML.jpg

相似文献

1
Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.针对现实世界任务进行优化的模型揭示了听觉中精确时间编码的任务依赖性必要性。
Nat Commun. 2024 Dec 4;15(1):10590. doi: 10.1038/s41467-024-54700-5.
2
Models optimized for real-world tasks reveal the task-dependent necessity of precise temporal coding in hearing.针对现实世界任务进行优化的模型揭示了听力中精确时间编码在任务依赖方面的必要性。
bioRxiv. 2024 Sep 16:2024.04.21.590435. doi: 10.1101/2024.04.21.590435.
3
Linear coding of complex sound spectra by discharge rate in neurons of the medial nucleus of the trapezoid body (MNTB) and its inputs.梯形体内侧核(MNTB)神经元及其输入中通过放电率对复杂声谱进行线性编码。
Front Neural Circuits. 2014 Dec 16;8:144. doi: 10.3389/fncir.2014.00144. eCollection 2014.
4
Hearing of modulation in sounds.声音中的调制听觉
Physiol Rev. 1982 Jul;62(3):894-975. doi: 10.1152/physrev.1982.62.3.894.
5
Psychophysiological analyses demonstrate the importance of neural envelope coding for speech perception in noise.心理生理分析表明,神经包络编码对噪声中言语感知很重要。
J Neurosci. 2012 Feb 1;32(5):1747-56. doi: 10.1523/JNEUROSCI.4493-11.2012.
6
Spectrally specific temporal analyses of spike-train responses to complex sounds: A unifying framework.对复杂声音的尖峰序列反应进行频谱特异性时间分析:一个统一框架。
PLoS Comput Biol. 2021 Feb 22;17(2):e1008155. doi: 10.1371/journal.pcbi.1008155. eCollection 2021 Feb.
7
The impact of temporal fine structure and signal envelope on auditory motion perception.时频结构和信号包络对听觉运动感知的影响。
PLoS One. 2020 Aug 21;15(8):e0238125. doi: 10.1371/journal.pone.0238125. eCollection 2020.
8
Efficient auditory coding.高效听觉编码
Nature. 2006 Feb 23;439(7079):978-82. doi: 10.1038/nature04485.
9
Phase Locking of Auditory-Nerve Fibers Reveals Stereotyped Distortions and an Exponential Transfer Function with a Level-Dependent Slope.听觉神经纤维的锁相揭示了刻板的失真和具有水平相关斜率的指数传递函数。
J Neurosci. 2019 May 22;39(21):4077-4099. doi: 10.1523/JNEUROSCI.1801-18.2019. Epub 2019 Mar 13.
10
Auditory information coding by modeled cochlear nucleus neurons.模拟蜗神经核神经元的听觉信息编码
J Comput Neurosci. 2011 Jun;30(3):529-42. doi: 10.1007/s10827-010-0276-x. Epub 2010 Sep 23.

引用本文的文献

1
A Deep Neural Network Trained on Congruent Audiovisual Speech Reports the McGurk Effect.基于一致视听语音训练的深度神经网络呈现麦格克效应。
bioRxiv. 2025 Aug 24:2025.08.20.671347. doi: 10.1101/2025.08.20.671347.
2
A deep learning framework for understanding cochlear implants.一个用于理解人工耳蜗的深度学习框架。
bioRxiv. 2025 Jul 21:2025.07.16.665227. doi: 10.1101/2025.07.16.665227.
3
Optimized feature gains explain and predict successes and failures of human selective listening.优化后的特征增益能够解释并预测人类选择性听力的成败。

本文引用的文献

1
Many but not all deep neural network audio models capture brain responses and exhibit correspondence between model stages and brain regions.许多(但不是全部)深度神经网络音频模型可以捕捉大脑反应,并在模型阶段和大脑区域之间表现出对应关系。
PLoS Biol. 2023 Dec 13;21(12):e3002366. doi: 10.1371/journal.pbio.3002366. eCollection 2023 Dec.
2
Model metamers reveal divergent invariances between biological and artificial neural networks.模型同型揭示了生物神经网络和人工神经网络之间的不同不变性。
Nat Neurosci. 2023 Nov;26(11):2017-2034. doi: 10.1038/s41593-023-01442-0. Epub 2023 Oct 16.
3
Intermediate acoustic-to-semantic representations link behavioral and neural responses to natural sounds.
bioRxiv. 2025 May 28:2025.05.28.656682. doi: 10.1101/2025.05.28.656682.
4
Neuromorphic algorithms for brain implants: a review.用于脑植入物的神经形态算法:综述
Front Neurosci. 2025 Apr 11;19:1570104. doi: 10.3389/fnins.2025.1570104. eCollection 2025.
5
Individual differences elucidate the perceptual benefits associated with robust temporal fine-structure processing.个体差异阐明了与稳健的时间精细结构处理相关的感知益处。
Proc Natl Acad Sci U S A. 2025 Jan 7;122(1):e2317152121. doi: 10.1073/pnas.2317152121. Epub 2025 Jan 3.
中间声觉-语义表示将自然声音的行为和神经反应联系起来。
Nat Neurosci. 2023 Apr;26(4):664-672. doi: 10.1038/s41593-023-01285-9. Epub 2023 Mar 16.
4
Successes and critical failures of neural networks in capturing human-like speech recognition.神经网络在捕捉类似人类的语音识别方面的成功和关键失败。
Neural Netw. 2023 May;162:199-211. doi: 10.1016/j.neunet.2023.02.032. Epub 2023 Feb 24.
5
From Microphone to Phoneme: An End-to-End Computational Neural Model for Predicting Speech Perception With Cochlear Implants.从麦克风到音素:一种用于预测人工耳蜗语音感知的端到端计算神经模型。
IEEE Trans Biomed Eng. 2022 Nov;69(11):3300-3312. doi: 10.1109/TBME.2022.3167113. Epub 2022 Oct 19.
6
Shared computational principles for language processing in humans and deep language models.人类和深度语言模型语言处理的共享计算原则。
Nat Neurosci. 2022 Mar;25(3):369-380. doi: 10.1038/s41593-022-01026-4. Epub 2022 Mar 7.
7
Human discrimination and modeling of high-frequency complex tones shed light on the neural codes for pitch.人类对高频复音的辨别和建模为音高的神经编码提供了线索。
PLoS Comput Biol. 2022 Mar 3;18(3):e1009889. doi: 10.1371/journal.pcbi.1009889. eCollection 2022 Mar.
8
Deep neural network models of sound localization reveal how perception is adapted to real-world environments.深度神经网络模型的声音定位揭示了感知是如何适应现实世界环境的。
Nat Hum Behav. 2022 Jan;6(1):111-133. doi: 10.1038/s41562-021-01244-z. Epub 2022 Jan 27.
9
Deep neural network models reveal interplay of peripheral coding and stimulus statistics in pitch perception.深度神经网络模型揭示了音高感知中外周编码和刺激统计之间的相互作用。
Nat Commun. 2021 Dec 14;12(1):7278. doi: 10.1038/s41467-021-27366-6.
10
Temporal fine structure influences voicing confusions for consonant identification in multi-talker babble.时频结构对多说话人噪声环境下辅音识别的浊音混淆有影响。
J Acoust Soc Am. 2021 Oct;150(4):2664. doi: 10.1121/10.0006527.