两个带宽扩展阶段促进了自然声音的高效神经编码。

Two stages of bandwidth scaling drives efficient neural coding of natural sounds.

机构信息

Biomedical Engineering, University of Connecticut, Storrs, Connecticut, United States of America.

Psychological Sciences, University of Connecticut, Storrs, Connecticut, United States of America.

出版信息

PLoS Comput Biol. 2023 Feb 14;19(2):e1010862. doi: 10.1371/journal.pcbi.1010862. eCollection 2023 Feb.

DOI:10.1371/journal.pcbi.1010862

PMID:36787338

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9970106/

Abstract

Theories of efficient coding propose that the auditory system is optimized for the statistical structure of natural sounds, yet the transformations underlying optimal acoustic representations are not well understood. Using a database of natural sounds including human speech and a physiologically-inspired auditory model, we explore the consequences of peripheral (cochlear) and mid-level (auditory midbrain) filter tuning transformations on the representation of natural sound spectra and modulation statistics. Whereas Fourier-based sound decompositions have constant time-frequency resolution at all frequencies, cochlear and auditory midbrain filters bandwidths increase proportional to the filter center frequency. This form of bandwidth scaling produces a systematic decrease in spectral resolution and increase in temporal resolution with increasing frequency. Here we demonstrate that cochlear bandwidth scaling produces a frequency-dependent gain that counteracts the tendency of natural sound power to decrease with frequency, resulting in a whitened output representation. Similarly, bandwidth scaling in mid-level auditory filters further enhances the representation of natural sounds by producing a whitened modulation power spectrum (MPS) with higher modulation entropy than both the cochlear outputs and the conventional Fourier MPS. These findings suggest that the tuning characteristics of the peripheral and mid-level auditory system together produce a whitened output representation in three dimensions (frequency, temporal and spectral modulation) that reduces redundancies and allows for a more efficient use of neural resources. This hierarchical multi-stage tuning strategy is thus likely optimized to extract available information and may underlies perceptual sensitivity to natural sounds.

摘要

有效编码理论提出，听觉系统是针对自然声音的统计结构进行优化的，然而，最佳声学表示的基础转换尚不清楚。我们使用包括人类语音在内的自然声音数据库和一种基于生理学的听觉模型，探索了外围（耳蜗）和中层次（听觉中脑）滤波器调谐转换对自然声音频谱和调制统计表示的影响。虽然基于傅里叶的声音分解在所有频率上具有恒定的时频分辨率，但耳蜗和听觉中脑滤波器的带宽与滤波器中心频率成正比增加。这种带宽缩放形式会导致随着频率的增加，频谱分辨率系统性降低，时间分辨率增加。在这里，我们证明了耳蜗带宽缩放产生了一种与自然声音功率随频率降低的趋势相反的频率相关增益，从而产生了白化输出表示。类似地，中层次听觉滤波器中的带宽缩放通过产生具有比耳蜗输出和传统傅里叶 MPS 更高调制熵的白化调制功率谱（MPS），进一步增强了自然声音的表示。这些发现表明，外围和中层次听觉系统的调谐特性共同产生了一个在三个维度（频率、时间和频谱调制）上白化的输出表示，减少了冗余，并允许更有效地利用神经资源。因此，这种分层多阶段调谐策略可能是为了提取可用信息而优化的，并且可能是对自然声音的感知敏感性的基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0291/9970106/b24ecd89d9df/pcbi.1010862.g001.jpg

相似文献

Two stages of bandwidth scaling drives efficient neural coding of natural sounds.两个带宽扩展阶段促进了自然声音的高效神经编码。

PLoS Comput Biol. 2023 Feb 14;19(2):e1010862. doi: 10.1371/journal.pcbi.1010862. eCollection 2023 Feb.

Neural modulation tuning characteristics scale to efficiently encode natural sound statistics.神经调节调谐特性可按比例有效地对自然声音统计进行编码。

J Neurosci. 2010 Nov 24;30(47):15969-80. doi: 10.1523/JNEUROSCI.0966-10.2010.

Efficient auditory coding.高效听觉编码

Nature. 2006 Feb 23;439(7079):978-82. doi: 10.1038/nature04485.

Efficient coding of natural sounds.自然声音的高效编码。

Nat Neurosci. 2002 Apr;5(4):356-63. doi: 10.1038/nn831.

Stimulus-dependent auditory tuning results in synchronous population coding of vocalizations in the songbird midbrain.刺激依赖的听觉调谐导致鸣禽中脑发声的同步群体编码。

J Neurosci. 2006 Mar 1;26(9):2499-512. doi: 10.1523/JNEUROSCI.3731-05.2006.

Human-Like Modulation Sensitivity Emerging through Optimization to Natural Sound Recognition.通过优化自然声音识别实现类人调制敏感性。

J Neurosci. 2023 May 24;43(21):3876-3894. doi: 10.1523/JNEUROSCI.2002-22.2023. Epub 2023 Apr 25.

Modulation spectra of natural sounds and ethological theories of auditory processing.自然声音的调制频谱与听觉处理的行为学理论

J Acoust Soc Am. 2003 Dec;114(6 Pt 1):3394-411. doi: 10.1121/1.1624067.

Encoding of natural sounds at multiple spectral and temporal resolutions in the human auditory cortex.人类听觉皮层中自然声音在多个频谱和时间分辨率下的编码。

PLoS Comput Biol. 2014 Jan;10(1):e1003412. doi: 10.1371/journal.pcbi.1003412. Epub 2014 Jan 2.

Processing of natural sounds in human auditory cortex: tonotopy, spectral tuning, and relation to voice sensitivity.人类听觉皮层对自然声音的处理：音调拓扑、频谱调谐以及与语音敏感性的关系。

J Neurosci. 2012 Oct 10;32(41):14205-16. doi: 10.1523/JNEUROSCI.1388-12.2012.

Representation of temporal sound features in the human auditory cortex.人类听觉皮层中时间声音特征的表示。

Rev Neurosci. 2011;22(2):187-203. doi: 10.1515/RNS.2011.016.

引用本文的文献

Interference of mid-level speech and noise statistics underlies human speech recognition sensitivity in natural environmental noise.中级语音和噪声统计特性的干扰是自然环境噪声中人类语音识别敏感性的基础。

J Neurosci. 2025 Jul 8. doi: 10.1523/JNEUROSCI.1751-24.2025.

Interference of mid-level sound statistics underlie human speech recognition sensitivity in natural noise.中等水平声音统计信息的干扰是自然噪声中人类语音识别敏感性的基础。

bioRxiv. 2024 Oct 4:2024.02.13.579526. doi: 10.1101/2024.02.13.579526.

Human Auditory Ecology: Extending Hearing Research to the Perception of Natural Soundscapes by Humans in Rapidly Changing Environments.人类听觉生态学：将听觉研究扩展到人类在快速变化的环境中对自然声音景观的感知。

Trends Hear. 2023 Jan-Dec;27:23312165231212032. doi: 10.1177/23312165231212032.

本文引用的文献

Distinct neural ensemble response statistics are associated with recognition and discrimination of natural sound textures.不同的神经集合反应统计数据与自然声音纹理的识别和区分有关。

Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31482-31493. doi: 10.1073/pnas.2005644117. Epub 2020 Nov 20.

Simple transformations capture auditory input to cortex.简单的转换可以捕捉到听觉输入到大脑皮层。

Proc Natl Acad Sci U S A. 2020 Nov 10;117(45):28442-28451. doi: 10.1073/pnas.1922033117. Epub 2020 Oct 23.

Spiking network optimized for word recognition in noise predicts auditory system hierarchy.用于噪声中单词识别的尖峰网络预测听觉系统层级。

PLoS Comput Biol. 2020 Jun 19;16(6):e1007558. doi: 10.1371/journal.pcbi.1007558. eCollection 2020 Jun.

Bottom-up and top-down neural signatures of disordered multi-talker speech perception in adults with normal hearing.正常听力成人中紊乱多说话人语音感知的自下而上和自上而下的神经特征。

Elife. 2020 Jan 21;9:e51419. doi: 10.7554/eLife.51419.

A neural ensemble correlation code for sound category identification.用于声音类别识别的神经集合相关码。

PLoS Biol. 2019 Oct 1;17(10):e3000449. doi: 10.1371/journal.pbio.3000449. eCollection 2019 Oct.

A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy.任务优化神经网络复制人类听觉行为，预测大脑反应，并揭示皮质处理层次结构。

Neuron. 2018 May 2;98(3):630-644.e16. doi: 10.1016/j.neuron.2018.03.044. Epub 2018 Apr 19.

Origins of scale invariance in vocalization sequences and speech.发声序列和言语中尺度不变性的起源。

PLoS Comput Biol. 2018 Apr 16;14(4):e1005996. doi: 10.1371/journal.pcbi.1005996. eCollection 2018 Apr.

Power spectral entropy as an information-theoretic correlate of manner of articulation in American English.功率谱熵作为美式英语发音方式的信息论关联指标。

J Acoust Soc Am. 2017 Feb;141(2):EL127. doi: 10.1121/1.4976109.

Temporal modulations in speech and music.语音和音乐中的时间调制。

Neurosci Biobehav Rev. 2017 Oct;81(Pt B):181-187. doi: 10.1016/j.neubiorev.2017.02.011. Epub 2017 Feb 14.

Selective and efficient neural coding of communication signals depends on early acoustic and social environment.通讯信号的选择性和高效神经编码取决于早期的声学和社会环境。

PLoS One. 2013 Apr 22;8(4):e61417. doi: 10.1371/journal.pone.0061417. Print 2013.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

两个带宽扩展阶段促进了自然声音的高效神经编码。

Two stages of bandwidth scaling drives efficient neural coding of natural sounds.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献