Efficient coding of spectrotemporal binaural sounds leads to emergence of the auditory space representation.

Affiliation

Max-Planck Institute for Mathematics in the Sciences Leipzig, Germany.

Publication information

Front Comput Neurosci. 2014 Mar 7;8:26. doi: 10.3389/fncom.2014.00026. eCollection 2014.

DOI:10.3389/fncom.2014.00026
PMID:24639644
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3945936/
Abstract

To date, a number of studies have shown that receptive field shapes of early sensory neurons can be reproduced by optimizing coding efficiency of natural stimulus ensembles. A still unresolved question is whether the efficient coding hypothesis explains the formation of neurons which explicitly represent environmental features of different functional importance. This paper proposes that the spatial selectivity of higher auditory neurons emerges as a direct consequence of learning efficient codes for natural binaural sounds. Firstly, it is demonstrated that a linear efficient coding transform, Independent Component Analysis (ICA), trained on spectrograms of naturalistic simulated binaural sounds extracts spatial information present in the signal. A simple hierarchical ICA extension allowing for decoding of sound position is proposed. Furthermore, it is shown that units revealing spatial selectivity can be learned from a binaural recording of a natural auditory scene. In both cases a relatively small subpopulation of learned spectrogram features suffices to perform accurate sound localization. Representation of the auditory space is therefore learned in a purely unsupervised way by maximizing the coding efficiency and without any task-specific constraints. These results imply that efficient coding is a useful strategy for learning structures which allow for making behaviorally vital inferences about the environment.
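
The first step named in the abstract, a linear ICA transform applied to binaural spectral data, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not the authors' data or pipeline: the toy generative model (four sparse sources built from two hypothetical spectral profiles and two hypothetical interaural gain patterns), the four frequency bands per ear, the NumPy-only symmetric FastICA with a tanh nonlinearity, and all function names. The hierarchical extension for decoding sound position is not covered.

```python
import numpy as np

def simulate_binaural_spectra(n_samples=20000, seed=0):
    """Toy data (assumed): 4 sparse sources, 4 frequency bands per ear."""
    rng = np.random.default_rng(seed)
    profiles = np.array([[1.0, 0.8, 0.2, 0.1],    # low-frequency source
                         [0.1, 0.2, 0.8, 1.0]])   # high-frequency source
    ear_gains = np.array([[1.0, 0.4],             # source on the left: louder in left ear
                          [0.4, 1.0]])            # source on the right: louder in right ear
    # Each column of A is one source's [left-ear spectrum, right-ear spectrum].
    A = np.column_stack([np.concatenate([gl * p, gr * p])
                         for gl, gr in ear_gains for p in profiles])
    S = rng.laplace(size=(n_samples, A.shape[1]))  # sparse source activations
    X = S @ A.T + 0.01 * rng.standard_normal((n_samples, A.shape[0]))
    return X, S, A

def whiten(X, n_components):
    """Center X (samples x features) and project onto whitened top components."""
    Xc = X - X.mean(axis=0)
    eigval, eigvec = np.linalg.eigh(Xc.T @ Xc / len(Xc))
    top = np.argsort(eigval)[::-1][:n_components]
    K = eigvec[:, top] / np.sqrt(eigval[top])      # (n_features, n_components)
    return Xc @ K, K

def fast_ica(Z, n_iter=200, seed=0):
    """Symmetric FastICA (tanh nonlinearity) on whitened data Z."""
    rng = np.random.default_rng(seed)
    n = Z.shape[1]
    W = np.linalg.qr(rng.standard_normal((n, n)))[0]   # random orthogonal start
    for _ in range(n_iter):
        G = np.tanh(Z @ W.T)
        W_new = (G.T @ Z) / len(Z) - np.diag((1.0 - G**2).mean(axis=0)) @ W
        U, _, Vt = np.linalg.svd(W_new)                # symmetric decorrelation
        W = U @ Vt
    return W

def ear_dominance(basis, n_freq=4):
    """(|left| - |right|) / (|left| + |right|) for each basis column."""
    left = np.linalg.norm(basis[:n_freq], axis=0)
    right = np.linalg.norm(basis[n_freq:], axis=0)
    return (left - right) / (left + right)

def run_demo():
    X, S, _ = simulate_binaural_spectra()
    Z, K = whiten(X, n_components=4)
    W = fast_ica(Z)
    unmixing = W @ K.T                  # (4, 8): spectrum -> component activations
    basis = np.linalg.pinv(unmixing)    # (8, 4): learned binaural spectral features
    recovered = (X - X.mean(axis=0)) @ unmixing.T
    return S, recovered, ear_dominance(basis)

if __name__ == "__main__":
    _, _, dominance = run_demo()
    print("ear dominance of learned features:", np.round(dominance, 2))
```

In this toy setting the sign of each learned feature's ear-dominance index indicates the side of the source it represents, which is one simple way to see spatial information appear in an unsupervised code. The real study works with spectrotemporal patches of simulated and recorded binaural sounds, so this linear model should be read only as an intuition aid.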

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/fba8f1611671/fncom-08-00026-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/2501633f4d90/fncom-08-00026-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/a4b73d0bd36a/fncom-08-00026-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/927cbb7fc807/fncom-08-00026-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/54167b516d52/fncom-08-00026-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/cbc363bda53f/fncom-08-00026-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/c53363204ceb/fncom-08-00026-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/abdda51b739d/fncom-08-00026-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c0d/3945936/0786033bbac4/fncom-08-00026-g0009.jpg

Similar articles

1. Efficient coding of spectrotemporal binaural sounds leads to emergence of the auditory space representation.
   Front Comput Neurosci. 2014 Mar 7;8:26. doi: 10.3389/fncom.2014.00026. eCollection 2014.
2. The opponent channel population code of sound location is an efficient representation of natural binaural sounds.
   PLoS Comput Biol. 2015 May 21;11(5):e1004294. doi: 10.1371/journal.pcbi.1004294. eCollection 2015 May.
3. Learning Midlevel Auditory Codes from Natural Sound Statistics.
   Neural Comput. 2018 Mar;30(3):631-669. doi: 10.1162/neco_a_01048. Epub 2017 Dec 8.
4. Statistics of natural binaural sounds.
   PLoS One. 2014 Oct 6;9(10):e108968. doi: 10.1371/journal.pone.0108968. eCollection 2014.
5. Emergent categorical representation of natural, complex sounds resulting from the early post-natal sound environment.
   Neuroscience. 2013 Sep 17;248:30-42. doi: 10.1016/j.neuroscience.2013.05.056. Epub 2013 Jun 6.
6. Effect of Stimulus-Dependent Spike Timing on Population Coding of Sound Location in the Owl's Auditory Midbrain.
   eNeuro. 2020 Apr 23;7(2). doi: 10.1523/ENEURO.0244-19.2020. Print 2020 Mar/Apr.
7. Naturalistic auditory contrast improves spectrotemporal coding in the cat inferior colliculus.
   J Neurosci. 2003 Dec 17;23(37):11489-504. doi: 10.1523/JNEUROSCI.23-37-11489.2003.
8. Across Species "Natural Ablation" Reveals the Brainstem Source of a Noninvasive Biomarker of Binaural Hearing.
   J Neurosci. 2018 Oct 3;38(40):8563-8573. doi: 10.1523/JNEUROSCI.1211-18.2018. Epub 2018 Aug 20.
9. Monaural and binaural spectrum level cues in the ferret: acoustics and the neural representation of auditory space.
   J Neurophysiol. 1994 Feb;71(2):785-801. doi: 10.1152/jn.1994.71.2.785.
10. Binaural SoundNet: Predicting Semantics, Depth and Motion With Binaural Sounds.
    IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):123-136. doi: 10.1109/TPAMI.2022.3155643. Epub 2022 Dec 5.

Cited by

1. A multi-stage auditory model for binaural sound localization using the locally competitive algorithm.
   Sci Rep. 2025 Jul 25;15(1):27048. doi: 10.1038/s41598-025-11613-7.
2. Multiple timescales of context influence perceptual sensitivity to common pairings of musical pitch and timbre.
   PLoS One. 2025 Jul 18;20(7):e0328490. doi: 10.1371/journal.pone.0328490. eCollection 2025.
3. Efficient coding theory of dynamic attentional modulation.
   PLoS Biol. 2022 Dec 21;20(12):e3001889. doi: 10.1371/journal.pbio.3001889. eCollection 2022 Dec.
4. Cooperative population coding facilitates efficient sound-source separability by adaptation to input statistics.
   PLoS Biol. 2019 Jul 29;17(7):e3000150. doi: 10.1371/journal.pbio.3000150. eCollection 2019 Jul.
5. Efficient Neural Coding in Auditory and Speech Perception.
   Trends Neurosci. 2019 Jan;42(1):56-65. doi: 10.1016/j.tins.2018.09.004. Epub 2018 Oct 5.
6. Optimum neural tuning curves for information efficiency with rate coding and finite-time window.
   Front Comput Neurosci. 2015 Jun 3;9:67. doi: 10.3389/fncom.2015.00067. eCollection 2015.
7. The opponent channel population code of sound location is an efficient representation of natural binaural sounds.
   PLoS Comput Biol. 2015 May 21;11(5):e1004294. doi: 10.1371/journal.pcbi.1004294. eCollection 2015 May.

References

1. Sparse coding can predict primary visual cortex receptive field changes induced by abnormal visual input.
   PLoS Comput Biol. 2013;9(5):e1003005. doi: 10.1371/journal.pcbi.1003005. Epub 2013 May 9.
2. Linear processing of interaural level difference underlies spatial tuning in the nucleus of the brachium of the inferior colliculus.
   J Neurosci. 2013 Feb 27;33(9):3891-904. doi: 10.1523/JNEUROSCI.3437-12.2013.
3. Sparse codes for speech predict spectrotemporal receptive fields in the inferior colliculus.
   PLoS Comput Biol. 2012;8(7):e1002594. doi: 10.1371/journal.pcbi.1002594. Epub 2012 Jul 12.
4. Adaptive coding is constrained to midline locations in a spatial listening task.
   J Neurophysiol. 2012 Oct;108(7):1856-68. doi: 10.1152/jn.00652.2011. Epub 2012 Jul 5.
5. Mechanisms of sound localization in mammals.
   Physiol Rev. 2010 Jul;90(3):983-1012. doi: 10.1152/physrev.00026.2009.
6. Adaptation to stimulus statistics in the perception and neural representation of auditory space.
   Neuron. 2010 Jun 24;66(6):937-48. doi: 10.1016/j.neuron.2010.05.018.
7. On hearing with more than one ear: lessons from evolution.
   Nat Neurosci. 2009 Jun;12(6):692-7. doi: 10.1038/nn.2325. Epub 2009 May 26.
8. The consequences of response nonlinearities for interpretation of spectrotemporal receptive fields.
   J Neurosci. 2008 Jan 9;28(2):446-55. doi: 10.1523/JNEUROSCI.1775-07.2007.
9. A network that uses few active neurones to code visual input predicts the diverse shapes of cortical receptive fields.
   J Comput Neurosci. 2007 Apr;22(2):135-46. doi: 10.1007/s10827-006-0003-9.
10. Reduction of information redundancy in the ascending auditory pathway.
    Neuron. 2006 Aug 3;51(3):359-68. doi: 10.1016/j.neuron.2006.06.030.