• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大脑优化提取驱动连续听觉感知的复杂声音特征。

Brain-optimized extraction of complex sound features that drive continuous auditory perception.

机构信息

Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands.

Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands.

出版信息

PLoS Comput Biol. 2020 Jul 2;16(7):e1007992. doi: 10.1371/journal.pcbi.1007992. eCollection 2020 Jul.

DOI:10.1371/journal.pcbi.1007992
PMID:32614826
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7363106/
Abstract

Understanding how the human brain processes auditory input remains a challenge. Traditionally, a distinction between lower- and higher-level sound features is made, but their definition depends on a specific theoretical framework and might not match the neural representation of sound. Here, we postulate that constructing a data-driven neural model of auditory perception, with a minimum of theoretical assumptions about the relevant sound features, could provide an alternative approach and possibly a better match to the neural responses. We collected electrocorticography recordings from six patients who watched a long-duration feature film. The raw movie soundtrack was used to train an artificial neural network model to predict the associated neural responses. The model achieved high prediction accuracy and generalized well to a second dataset, where new participants watched a different film. The extracted bottom-up features captured acoustic properties that were specific to the type of sound and were associated with various response latency profiles and distinct cortical distributions. Specifically, several features encoded speech-related acoustic properties with some features exhibiting shorter latency profiles (associated with responses in posterior perisylvian cortex) and others exhibiting longer latency profiles (associated with responses in anterior perisylvian cortex). Our results support and extend the current view on speech perception by demonstrating the presence of temporal hierarchies in the perisylvian cortex and involvement of cortical sites outside of this region during audiovisual speech perception.

摘要

理解人类大脑如何处理听觉输入仍然是一个挑战。传统上,人们对低层次和高层次的声音特征进行区分,但它们的定义取决于特定的理论框架,可能与声音的神经表示不匹配。在这里,我们假设构建一个数据驱动的听觉感知神经模型,对相关声音特征的理论假设最少,可能提供一种替代方法,并可能更好地与神经反应相匹配。我们从六名观看长时间特征电影的患者中收集了脑电记录。原始电影配乐被用来训练一个人工神经网络模型,以预测相关的神经反应。该模型实现了很高的预测准确性,并很好地推广到第二个数据集,其中新的参与者观看了不同的电影。提取的自下而上的特征捕获了与声音类型特定的声学特性,并与各种反应潜伏期分布和不同的皮质分布相关。具体来说,一些特征编码与语音相关的声学特性,其中一些特征具有较短的潜伏期分布(与后颞叶皮层的反应相关),而另一些特征具有较长的潜伏期分布(与前颞叶皮层的反应相关)。我们的结果通过证明颞叶皮层中存在时间层次结构以及在视听语音感知期间涉及该区域以外的皮质部位,支持并扩展了当前对语音感知的观点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/da9f95755c0e/pcbi.1007992.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/aa371f83d264/pcbi.1007992.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/b2361cbe3667/pcbi.1007992.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/2aa8f136f646/pcbi.1007992.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/670aab5b5f6f/pcbi.1007992.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/8497f5c975e9/pcbi.1007992.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/20ce84325b9f/pcbi.1007992.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/81f2ed728ac4/pcbi.1007992.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/da9f95755c0e/pcbi.1007992.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/aa371f83d264/pcbi.1007992.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/b2361cbe3667/pcbi.1007992.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/2aa8f136f646/pcbi.1007992.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/670aab5b5f6f/pcbi.1007992.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/8497f5c975e9/pcbi.1007992.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/20ce84325b9f/pcbi.1007992.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/81f2ed728ac4/pcbi.1007992.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/246c/7363106/da9f95755c0e/pcbi.1007992.g008.jpg

相似文献

1
Brain-optimized extraction of complex sound features that drive continuous auditory perception.大脑优化提取驱动连续听觉感知的复杂声音特征。
PLoS Comput Biol. 2020 Jul 2;16(7):e1007992. doi: 10.1371/journal.pcbi.1007992. eCollection 2020 Jul.
2
Neural Tuning to Low-Level Features of Speech throughout the Perisylvian Cortex.整个外侧裂周皮层对语音低层次特征的神经调谐。
J Neurosci. 2017 Aug 16;37(33):7906-7920. doi: 10.1523/JNEUROSCI.0238-17.2017. Epub 2017 Jul 17.
3
Developmental organization of neural dynamics supporting auditory perception.支持听觉感知的神经动力学的发育组织。
Neuroimage. 2022 Sep;258:119342. doi: 10.1016/j.neuroimage.2022.119342. Epub 2022 May 30.
4
Generalizable dimensions of human cortical auditory processing of speech in natural soundscapes: A data-driven ultra high field fMRI approach.人类在自然环境中对言语的皮质听觉处理的可推广维度:一种数据驱动的超高场 fMRI 方法。
Neuroimage. 2021 Aug 15;237:118106. doi: 10.1016/j.neuroimage.2021.118106. Epub 2021 May 12.
5
Mapping phonemic processing zones along human perisylvian cortex: an electro-corticographic investigation.绘制人类外侧裂周围皮质的音素加工区域:一项皮层脑电图研究。
Brain Struct Funct. 2014 Jul;219(4):1369-83. doi: 10.1007/s00429-013-0574-y. Epub 2013 May 26.
6
Cortical representation of natural complex sounds: effects of acoustic features and auditory object category.自然复杂声音的皮质代表:声音特征和听觉对象类别的影响。
J Neurosci. 2010 Jun 2;30(22):7604-12. doi: 10.1523/JNEUROSCI.0296-10.2010.
7
Task-dependent decoding of speaker and vowel identity from auditory cortical response patterns.基于听觉皮层反应模式的说话人及元音身份的任务相关解码
J Neurosci. 2014 Mar 26;34(13):4548-57. doi: 10.1523/JNEUROSCI.4339-13.2014.
8
Speech sound representation, perception, and plasticity: a neurophysiologic perceptive.语音表征、感知与可塑性:一种神经生理学视角
Audiol Neurootol. 1998 Mar-Jun;3(2-3):168-82. doi: 10.1159/000013788.
9
Distributed neural signatures of natural audiovisual speech and music in the human auditory cortex.人类听觉皮层中自然视听语音和音乐的分布式神经特征。
Neuroimage. 2017 Aug 15;157:108-117. doi: 10.1016/j.neuroimage.2016.12.005. Epub 2016 Dec 6.
10
Decoding four different sound-categories in the auditory cortex using functional near-infrared spectroscopy.使用功能近红外光谱技术在听觉皮层中解码四种不同的声音类别。
Hear Res. 2016 Mar;333:157-166. doi: 10.1016/j.heares.2016.01.009. Epub 2016 Jan 29.

引用本文的文献

1
Anatomically distinct cortical tracking of music and speech by slow (1-8Hz) and fast (70-120Hz) oscillatory activity.通过慢(1 - 8Hz)和快(70 - 120Hz)振荡活动对音乐和语音进行解剖学上不同的皮层追踪。
PLoS One. 2025 May 8;20(5):e0320519. doi: 10.1371/journal.pone.0320519. eCollection 2025.
2
A hierarchy of processing complexity and timescales for natural sounds in the human auditory cortex.人类听觉皮层中自然声音的处理复杂性和时间尺度层次结构。
Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2412243122. doi: 10.1073/pnas.2412243122. Epub 2025 Apr 28.
3
A hierarchy of processing complexity and timescales for natural sounds in human auditory cortex.

本文引用的文献

1
Recurrence is required to capture the representational dynamics of the human visual system.为了捕捉人类视觉系统的表示动态,需要进行再现。
Proc Natl Acad Sci U S A. 2019 Oct 22;116(43):21854-21863. doi: 10.1073/pnas.1905544116. Epub 2019 Oct 7.
2
Elucidating relations between fMRI, ECoG, and EEG through a common natural stimulus.通过常见的自然刺激来阐明 fMRI、ECoG 和 EEG 之间的关系。
Neuroimage. 2018 Oct 1;179:79-91. doi: 10.1016/j.neuroimage.2018.06.016. Epub 2018 Jun 15.
3
A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy.
人类听觉皮层中自然声音的处理复杂性和时间尺度层次结构。
bioRxiv. 2024 May 26:2024.05.24.595822. doi: 10.1101/2024.05.24.595822.
4
Crossmixed convolutional neural network for digital speech recognition.交叉混合卷积神经网络在数字语音识别中的应用。
PLoS One. 2024 Apr 26;19(4):e0302394. doi: 10.1371/journal.pone.0302394. eCollection 2024.
5
Representational maps in the brain: concepts, approaches, and applications.大脑中的表征图谱:概念、方法及应用。
Front Cell Neurosci. 2024 Mar 22;18:1366200. doi: 10.3389/fncel.2024.1366200. eCollection 2024.
6
Dissecting neural computations in the human auditory pathway using deep neural networks for speech.利用用于语音的深度神经网络解析人类听觉通路中的神经计算。
Nat Neurosci. 2023 Dec;26(12):2213-2225. doi: 10.1038/s41593-023-01468-4. Epub 2023 Oct 30.
7
Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models.基于优化深度学习模型的感觉运动脑活动的直接语音重建。
J Neural Eng. 2023 Sep 20;20(5):056010. doi: 10.1088/1741-2552/ace8be.
8
Multifractal test for nonlinearity of interactions across scales in time series.多标度分形检验时间序列中跨尺度相互作用的非线性。
Behav Res Methods. 2023 Aug;55(5):2249-2282. doi: 10.3758/s13428-022-01866-9. Epub 2022 Jul 19.
9
Open multimodal iEEG-fMRI dataset from naturalistic stimulation with a short audiovisual film.开放自然刺激下多模态 iEEG-fMRI 数据集,包含一部短的视听影片。
Sci Data. 2022 Mar 21;9(1):91. doi: 10.1038/s41597-022-01173-0.
10
Perceiving and remembering speech depend on multifractal nonlinearity in movements producing and exploring speech.感知和记忆言语依赖于产生和探索言语的运动中的多重分形非线性。
J R Soc Interface. 2021 Aug;18(181):20210272. doi: 10.1098/rsif.2021.0272. Epub 2021 Aug 4.
任务优化神经网络复制人类听觉行为,预测大脑反应,并揭示皮质处理层次结构。
Neuron. 2018 May 2;98(3):630-644.e16. doi: 10.1016/j.neuron.2018.03.044. Epub 2018 Apr 19.
4
ALICE: A tool for automatic localization of intra-cranial electrodes for clinical and high-density grids.ALICE:一种用于颅内电极自动定位的工具,适用于临床和高密度栅格。
J Neurosci Methods. 2018 May 1;301:43-51. doi: 10.1016/j.jneumeth.2017.10.022. Epub 2017 Nov 1.
5
Neural Encoding of Auditory Features during Music Perception and Imagery.听觉特征在音乐感知和想象过程中的神经编码。
Cereb Cortex. 2018 Dec 1;28(12):4222-4233. doi: 10.1093/cercor/bhx277.
6
Recurrent Convolutional Neural Networks: A Better Model of Biological Object Recognition.循环卷积神经网络:生物物体识别的更佳模型。
Front Psychol. 2017 Sep 12;8:1551. doi: 10.3389/fpsyg.2017.01551. eCollection 2017.
7
Discovering Event Structure in Continuous Narrative Perception and Memory.在连续叙事感知与记忆中发现事件结构
Neuron. 2017 Aug 2;95(3):709-721.e5. doi: 10.1016/j.neuron.2017.06.041.
8
Neural Tuning to Low-Level Features of Speech throughout the Perisylvian Cortex.整个外侧裂周皮层对语音低层次特征的神经调谐。
J Neurosci. 2017 Aug 16;37(33):7906-7920. doi: 10.1523/JNEUROSCI.0238-17.2017. Epub 2017 Jul 17.
9
Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.将深度神经网络与人类视觉物体识别的时空皮层动力学进行比较,揭示了层级对应关系。
Sci Rep. 2016 Jun 10;6:27755. doi: 10.1038/srep27755.
10
Using goal-driven deep learning models to understand sensory cortex.利用目标驱动的深度学习模型理解感觉皮层。
Nat Neurosci. 2016 Mar;19(3):356-65. doi: 10.1038/nn.4244.