A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex.

Authors

Rupp Kyle M, Hect Jasmine L, Harford Emily E, Holt Lori L, Ghuman Avniel Singh, Abel Taylor J

Affiliations

Department of Neurological Surgery, University of Pittsburgh, PA 15213.

Department of Psychology, The University of Texas at Austin, TX 78712.

Publication details

Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2412243122. doi: 10.1073/pnas.2412243122. Epub 2025 Apr 28.

DOI:10.1073/pnas.2412243122
PMID:40294254
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12067213/
Abstract

Efficient behavior is supported by humans' ability to rapidly recognize acoustically distinct sounds as members of a common category. Within the auditory cortex, critical unanswered questions remain regarding the organization and dynamics of sound categorization. We performed intracerebral recordings during epilepsy surgery evaluation as 20 patient-participants listened to natural sounds. We then built encoding models to predict neural responses using sound representations extracted from different layers within a deep neural network (DNN) pretrained to categorize sounds from acoustics. This approach yielded accurate models of neural responses throughout the auditory cortex. The complexity of a cortical site's representation (measured by the depth of the DNN layer that produced the best model) was closely related to its anatomical location, with shallow, middle, and deep layers associated with core (primary auditory cortex), lateral belt, and parabelt regions, respectively. Smoothly varying gradients of representational complexity existed within these regions, with complexity increasing along a posteromedial-to-anterolateral direction in core and lateral belt and along posterior-to-anterior and dorsal-to-ventral dimensions in parabelt. We then characterized the time (relative to sound onset) when feature representations emerged; this measure of temporal dynamics increased across the auditory hierarchy. Finally, we found separable effects of region and temporal dynamics on representational complexity: sites that took longer to begin encoding stimulus features had higher representational complexity independent of region, and downstream regions encoded more complex features independent of temporal dynamics. These findings suggest that hierarchies of timescales and complexity represent a functional organizational principle of the auditory stream underlying our ability to rapidly categorize sounds.
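
The layer-selection idea in the abstract (fit one encoding model per DNN layer, then take the depth of the best-predicting layer as a site's representational complexity) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not state the regression method, the cross-validation scheme, or the DNN used, so this sketch swaps in closed-form ridge regression, contiguous 5-fold cross-validation, and random synthetic "layer features" in place of real activations and recordings. The names `ridge_fit`, `cv_score`, and `best_layer` are hypothetical.

```python
import numpy as np


def ridge_fit(X, y, alpha):
    """Closed-form ridge weights; assumes X and y are already centered."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n_features), X.T @ y)


def cv_score(X, y, alpha=1.0, n_folds=5):
    """Mean held-out Pearson correlation between predicted and actual responses."""
    n = len(y)
    folds = np.array_split(np.arange(n), n_folds)
    scores = []
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(n), test_idx)
        # Center with training statistics only, so no test data leaks into the fit.
        x_mean = X[train_idx].mean(axis=0)
        y_mean = y[train_idx].mean()
        w = ridge_fit(X[train_idx] - x_mean, y[train_idx] - y_mean, alpha)
        pred = (X[test_idx] - x_mean) @ w + y_mean
        scores.append(np.corrcoef(pred, y[test_idx])[0, 1])
    return float(np.mean(scores))


def best_layer(layer_features, response, alpha=1.0):
    """Return (index of the best-predicting layer, per-layer scores)."""
    scores = [cv_score(X, response, alpha) for X in layer_features]
    return int(np.argmax(scores)), scores


# Synthetic demo (assumption: one feature matrix per layer, rows = sounds).
rng = np.random.default_rng(0)
n_sounds, n_units, n_layers = 200, 10, 4
layer_features = [rng.standard_normal((n_sounds, n_units)) for _ in range(n_layers)]

# Simulate a recording site whose response is driven by layer 2 plus noise.
true_layer = 2
weights = rng.standard_normal(n_units)
response = layer_features[true_layer] @ weights + 0.5 * rng.standard_normal(n_sounds)

layer_idx, scores = best_layer(layer_features, response)
print("best layer:", layer_idx)
print("per-layer scores:", [round(s, 2) for s in scores])
```

In this toy setup the selected layer index plays the role of the complexity measure for a single site; repeating the procedure per recording site and relating the index to anatomical position would mirror the analysis described above, though the actual study's pipeline may differ in every detail.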

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4d/12067213/0375dc04c81c/pnas.2412243122fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4d/12067213/89a20f295a0b/pnas.2412243122fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4d/12067213/011c1a3ab2ad/pnas.2412243122fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4d/12067213/a2a588592a07/pnas.2412243122fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cf4d/12067213/ae6c522dc898/pnas.2412243122fig05.jpg

Similar articles

1
A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex.
Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2412243122. doi: 10.1073/pnas.2412243122. Epub 2025 Apr 28.
2
A hierarchy of processing complexity and timescales for natural sounds in human auditory cortex.
bioRxiv. 2024 May 26:2024.05.24.595822. doi: 10.1101/2024.05.24.595822.
3
Cortical representation of natural complex sounds: effects of acoustic features and auditory object category.
J Neurosci. 2010 Jun 2;30(22):7604-12. doi: 10.1523/JNEUROSCI.0296-10.2010.
4
Encoding of natural sounds at multiple spectral and temporal resolutions in the human auditory cortex.
PLoS Comput Biol. 2014 Jan;10(1):e1003412. doi: 10.1371/journal.pcbi.1003412. Epub 2014 Jan 2.
5
Cortical processing of pitch: Model-based encoding and decoding of auditory fMRI responses to real-life sounds.
Neuroimage. 2018 Oct 15;180(Pt A):291-300. doi: 10.1016/j.neuroimage.2017.11.020. Epub 2017 Nov 13.
6
Atypical processing of auditory temporal complexity in autistics.
Neuropsychologia. 2011 Feb;49(3):546-55. doi: 10.1016/j.neuropsychologia.2010.12.033. Epub 2010 Dec 28.
7
Neural dynamics underlying the acquisition of distinct auditory category structures.
Neuroimage. 2021 Dec 1;244:118565. doi: 10.1016/j.neuroimage.2021.118565. Epub 2021 Sep 17.
8
Keeping track of sound objects in space: The contribution of early-stage auditory areas.
Hear Res. 2018 Sep;366:17-31. doi: 10.1016/j.heares.2018.03.027. Epub 2018 Apr 1.
9
Hemispheric asymmetries in the auditory cortex reflect discriminative responses to temporal details or summary statistics of stationary sounds.
Cortex. 2025 Mar;184:79-95. doi: 10.1016/j.cortex.2024.09.020. Epub 2025 Jan 7.
10
Evaluating the Columnar Stability of Acoustic Processing in the Human Auditory Cortex.
J Neurosci. 2018 Sep 5;38(36):7822-7832. doi: 10.1523/JNEUROSCI.3576-17.2018. Epub 2018 Aug 1.