人类听觉皮层中自然声音的处理复杂性和时间尺度层次结构。

A hierarchy of processing complexity and timescales for natural sounds in human auditory cortex.

作者信息

Rupp Kyle M, Hect Jasmine L, Harford Emily E, Holt Lori L, Ghuman Avniel Singh, Abel Taylor J

机构信息

Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America.

Department of Psychology, The University of Texas at Austin, Austin, Texas, United States of America.

出版信息

bioRxiv. 2024 May 26:2024.05.24.595822. doi: 10.1101/2024.05.24.595822.

DOI:10.1101/2024.05.24.595822

PMID:38826304

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11142240/

Abstract

Efficient behavior is supported by humans' ability to rapidly recognize acoustically distinct sounds as members of a common category. Within auditory cortex, there are critical unanswered questions regarding the organization and dynamics of sound categorization. Here, we performed intracerebral recordings in the context of epilepsy surgery as 20 patient-participants listened to natural sounds. We built encoding models to predict neural responses using features of these sounds extracted from different layers within a sound-categorization deep neural network (DNN). This approach yielded highly accurate models of neural responses throughout auditory cortex. The complexity of a cortical site's representation (measured by the depth of the DNN layer that produced the best model) was closely related to its anatomical location, with shallow, middle, and deep layers of the DNN associated with core (primary auditory cortex), lateral belt, and parabelt regions, respectively. Smoothly varying gradients of representational complexity also existed within these regions, with complexity increasing along a posteromedial-to-anterolateral direction in core and lateral belt, and along posterior-to-anterior and dorsal-to-ventral dimensions in parabelt. When we estimated the time window over which each recording site integrates information, we found shorter integration windows in core relative to lateral belt and parabelt. Lastly, we found a relationship between the length of the integration window and the complexity of information processing within core (but not lateral belt or parabelt). These findings suggest hierarchies of timescales and processing complexity, and their interrelationship, represent a functional organizational principle of the auditory stream that underlies our perception of complex, abstract auditory information.

摘要

人类能够迅速将声学上不同的声音识别为同一类别中的成员，这有助于高效行为的产生。在听觉皮层中，关于声音分类的组织和动态存在一些关键的未解决问题。在这里，我们在癫痫手术的背景下进行了脑内记录，20名患者参与者聆听自然声音。我们构建了编码模型，使用从声音分类深度神经网络（DNN）的不同层中提取的这些声音的特征来预测神经反应。这种方法产生了整个听觉皮层神经反应的高度准确模型。皮层位点表征的复杂性（通过产生最佳模型的DNN层的深度来衡量）与其解剖位置密切相关，DNN的浅层、中层和深层分别与核心（初级听觉皮层）、外侧带和旁带区域相关。在这些区域内也存在表征复杂性的平滑变化梯度，在核心和外侧带中，复杂性沿后内侧到前外侧方向增加，在旁带中沿后到前和背到腹的维度增加。当我们估计每个记录位点整合信息的时间窗口时，我们发现核心区域的整合窗口比外侧带和旁带更短。最后，我们发现整合窗口的长度与核心区域内信息处理的复杂性之间存在关系（但在外侧带或旁带中不存在）。这些发现表明，时间尺度和处理复杂性的层次结构及其相互关系代表了听觉信息流的一种功能组织原则，它是我们对复杂抽象听觉信息感知的基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef37/11142240/887c47ed0ef6/nihpp-2024.05.24.595822v1-f0001.jpg

相似文献

A hierarchy of processing complexity and timescales for natural sounds in human auditory cortex.

bioRxiv. 2024 May 26:2024.05.24.595822. doi: 10.1101/2024.05.24.595822.

A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex.

Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2412243122. doi: 10.1073/pnas.2412243122. Epub 2025 Apr 28.

Cortical connections of auditory cortex in marmoset monkeys: lateral belt and parabelt regions.

Anat Rec (Hoboken). 2012 May;295(5):800-21. doi: 10.1002/ar.22451. Epub 2012 Mar 28.

Functional organization of human auditory cortex: investigation of response latencies through direct recordings.

Neuroimage. 2014 Nov 1;101:598-609. doi: 10.1016/j.neuroimage.2014.07.004. Epub 2014 Jul 12.

Subdivisions of auditory cortex and ipsilateral cortical connections of the parabelt auditory cortex in macaque monkeys.

J Comp Neurol. 1998 May 18;394(4):475-95. doi: 10.1002/(sici)1096-9861(19980518)394:4<475::aid-cne6>3.0.co;2-z.

Feedforward and feedback projections of caudal belt and parabelt areas of auditory cortex: refining the hierarchical model.

Front Neurosci. 2014 Apr 22;8:72. doi: 10.3389/fnins.2014.00072. eCollection 2014.

Sensitivity to Vocalization Pitch in the Caudal Auditory Cortex of the Marmoset: Comparison of Core and Belt Areas.

Front Syst Neurosci. 2019 Feb 1;13:5. doi: 10.3389/fnsys.2019.00005. eCollection 2019.

Auditory belt and parabelt projections to the prefrontal cortex in the rhesus monkey.

J Comp Neurol. 1999 Jan 11;403(2):141-57. doi: 10.1002/(sici)1096-9861(19990111)403:2<141::aid-cne1>3.0.co;2-v.

Sound Frequency Representation in the Auditory Cortex of the Common Marmoset Visualized Using Optical Intrinsic Signal Imaging.

eNeuro. 2018 May 7;5(2). doi: 10.1523/ENEURO.0078-18.2018. eCollection 2018 Mar-Apr.

Functional correlates of the anterolateral processing hierarchy in human auditory cortex.

J Neurosci. 2011 Jun 22;31(25):9345-52. doi: 10.1523/JNEUROSCI.1448-11.2011.

本文引用的文献

A sparse code for natural sound context in auditory cortex.

Curr Res Neurobiol. 2023 Nov 29;6:100118. doi: 10.1016/j.crneur.2023.100118. eCollection 2024.

Many but not all deep neural network audio models capture brain responses and exhibit correspondence between model stages and brain regions.

PLoS Biol. 2023 Dec 13;21(12):e3002366. doi: 10.1371/journal.pbio.3002366. eCollection 2023 Dec.

Dissecting neural computations in the human auditory pathway using deep neural networks for speech.

Nat Neurosci. 2023 Dec;26(12):2213-2225. doi: 10.1038/s41593-023-01468-4. Epub 2023 Oct 30.

Elastic Net Regularization Paths for All Generalized Linear Models.

J Stat Softw. 2023;106. doi: 10.18637/jss.v106.i01. Epub 2023 Mar 23.

Intermediate acoustic-to-semantic representations link behavioral and neural responses to natural sounds.

Nat Neurosci. 2023 Apr;26(4):664-672. doi: 10.1038/s41593-023-01285-9. Epub 2023 Mar 16.

Neural responses in human superior temporal cortex support coding of voice representations.

PLoS Biol. 2022 Jul 28;20(7):e3001675. doi: 10.1371/journal.pbio.3001675. eCollection 2022 Jul.

Multiscale temporal integration organizes hierarchical computation in human auditory cortex.

Nat Hum Behav. 2022 Mar;6(3):455-469. doi: 10.1038/s41562-021-01261-y. Epub 2022 Feb 10.

Functionally homologous representation of vocalizations in the auditory cortex of humans and macaques.

Curr Biol. 2021 Nov 8;31(21):4839-4844.e4. doi: 10.1016/j.cub.2021.08.043. Epub 2021 Sep 9.

Cortical response to naturalistic stimuli is largely predictable with deep neural networks.

Sci Adv. 2021 May 28;7(22). doi: 10.1126/sciadv.abe7547. Print 2021 May.

Representational Content of Oscillatory Brain Activity during Object Recognition: Contrasting Cortical and Deep Neural Network Hierarchies.

eNeuro. 2021 May 25;8(3). doi: 10.1523/ENEURO.0362-20.2021. Print 2021 May-Jun.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

人类听觉皮层中自然声音的处理复杂性和时间尺度层次结构。

A hierarchy of processing complexity and timescales for natural sounds in human auditory cortex.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献