• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

代数听觉结构的检测通过自监督学习得以实现。

The detection of algebraic auditory structures emerges with self-supervised learning.

作者信息

Orhan Pierre, Boubenec Yves, King Jean-Rémi

机构信息

Laboratoire des Systèmes Perceptifs, Département d'études Cognitives, École Normale Supérieure, PSL University, CNRS, Paris, France.

Meta, Paris, France.

出版信息

PLoS Comput Biol. 2025 Sep 5;21(9):e1013271. doi: 10.1371/journal.pcbi.1013271. eCollection 2025 Sep.

DOI:10.1371/journal.pcbi.1013271
PMID:40911653
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12431648/
Abstract

Humans can spontaneously detect complex algebraic structures. Historically, two opposing views explain this ability, at the root of language and music acquisition. Some argue for the existence of an innate and specific mechanism. Others argue that this ability emerges from experience: i.e. when generic learning principles continuously process sensory inputs. These two views, however, remain difficult to test experimentally. Here, we use deep learning models to evaluate the factors that lead to the spontaneous detection of algebraic structures in the auditory modality. Specifically, we use self-supervised learning to train multiple deep-learning models with a variable amount of either natural (environmental sounds) and/or cultural sounds (speech or music) to evaluate the impact of these stimuli. We then expose these models to the experimental paradigms classically used to evaluate the processing of algebraic structures. Like humans, these models spontaneously detect repeated sequences, probabilistic chunks, and complex algebraic structures. Also like humans, this ability diminishes with structure complexity. Importantly, this ability can emerge from experience alone: the more the models are exposed to natural sounds, the more they spontaneously detect increasingly complex structures. Finally, this ability does not emerge in models pretrained only on speech, and emerges more rapidly in models pretrained with music than environmental sounds. Overall, our study provides an operational framework to clarify sufficient built-in and acquired principles that model human's advanced capacity to detect algebraic structures in sounds.

摘要

人类能够自发地检测复杂的代数结构。从历史上看,有两种对立的观点解释了这种能力,它是语言和音乐习得的根源。一些人主张存在一种天生的特定机制。另一些人则认为这种能力源于经验,即当通用学习原则持续处理感官输入时。然而,这两种观点在实验上仍然难以验证。在这里,我们使用深度学习模型来评估导致在听觉模态中自发检测代数结构的因素。具体而言,我们使用自监督学习来训练多个深度学习模型,这些模型使用不同数量的自然声音(环境声音)和/或文化声音(语音或音乐),以评估这些刺激的影响。然后,我们将这些模型暴露于经典用于评估代数结构处理的实验范式中。与人类一样,这些模型会自发地检测重复序列、概率块和复杂的代数结构。同样与人类一样,这种能力会随着结构复杂性的增加而减弱。重要的是,这种能力可以仅从经验中产生:模型接触自然声音的次数越多,它们就越能自发地检测出越来越复杂的结构。最后,这种能力在仅用语音进行预训练的模型中不会出现,并且在使用音乐而不是环境声音进行预训练的模型中出现得更快。总体而言,我们的研究提供了一个操作框架,以阐明足够的内在和习得原则,这些原则可以模拟人类在声音中检测代数结构的高级能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/3402317a033a/pcbi.1013271.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/99d3ba695fc9/pcbi.1013271.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/94a745b0b748/pcbi.1013271.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/5595729ca9eb/pcbi.1013271.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/3402317a033a/pcbi.1013271.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/99d3ba695fc9/pcbi.1013271.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/94a745b0b748/pcbi.1013271.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/5595729ca9eb/pcbi.1013271.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c7db/12431648/3402317a033a/pcbi.1013271.g004.jpg

相似文献

1
The detection of algebraic auditory structures emerges with self-supervised learning.代数听觉结构的检测通过自监督学习得以实现。
PLoS Comput Biol. 2025 Sep 5;21(9):e1013271. doi: 10.1371/journal.pcbi.1013271. eCollection 2025 Sep.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
5
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
8
Comparison of speech and music input in North American infants' home environment over the first 2 years of life.北美婴儿在生命的头 2 年家庭环境中言语和音乐输入的比较。
Dev Sci. 2024 Sep;27(5):e13528. doi: 10.1111/desc.13528. Epub 2024 May 21.
9
Music-based therapeutic interventions for people with dementia.针对痴呆症患者的基于音乐的治疗干预措施。
Cochrane Database Syst Rev. 2025 Mar 7;3(3):CD003477. doi: 10.1002/14651858.CD003477.pub5.
10
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.影响重症成人和儿童机械通气撤机方案使用的因素:一项定性证据综合分析
Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2.

本文引用的文献

1
Parallel mechanisms signal a hierarchy of sequence structure violations in the auditory cortex.并行机制表明听觉皮层中序列结构违反的层次结构。
Elife. 2024 Dec 5;13:RP102702. doi: 10.7554/eLife.102702.
2
How Well Do Unsupervised Learning Algorithms Model Human Real-time and Life-long Learning?无监督学习算法在模拟人类实时学习和终身学习方面的表现如何?
Adv Neural Inf Process Syst. 2022;35:22628-22642.
3
Spontaneous emergence of rudimentary music detectors in deep neural networks.深度神经网络中原始音乐探测器的自发出现。
Nat Commun. 2024 Jan 2;15(1):148. doi: 10.1038/s41467-023-44516-0.
4
Many but not all deep neural network audio models capture brain responses and exhibit correspondence between model stages and brain regions.许多(但不是全部)深度神经网络音频模型可以捕捉大脑反应,并在模型阶段和大脑区域之间表现出对应关系。
PLoS Biol. 2023 Dec 13;21(12):e3002366. doi: 10.1371/journal.pbio.3002366. eCollection 2023 Dec.
5
Brain-imaging evidence for compression of binary sound sequences in human memory.人脑记忆中对二进制声音序列压缩的脑成像证据。
Elife. 2023 Nov 1;12:e84376. doi: 10.7554/eLife.84376.
6
Dissecting neural computations in the human auditory pathway using deep neural networks for speech.利用用于语音的深度神经网络解析人类听觉通路中的神经计算。
Nat Neurosci. 2023 Dec;26(12):2213-2225. doi: 10.1038/s41593-023-01468-4. Epub 2023 Oct 30.
7
Human-like systematic generalization through a meta-learning neural network.通过元学习神经网络实现类人系统泛化。
Nature. 2023 Nov;623(7985):115-121. doi: 10.1038/s41586-023-06668-3. Epub 2023 Oct 25.
8
Humans parsimoniously represent auditory sequences by pruning and completing the underlying network structure.人类通过修剪和完善潜在的网络结构来精简地表示听觉序列。
Elife. 2023 May 2;12:e86430. doi: 10.7554/eLife.86430.
9
Symbols and mental programs: a hypothesis about human singularity.符号与心理程序:关于人类独特性的一种假说。
Trends Cogn Sci. 2022 Sep;26(9):751-766. doi: 10.1016/j.tics.2022.06.010. Epub 2022 Aug 3.
10
Constructing the hierarchy of predictive auditory sequences in the marmoset brain.构建狨猴大脑中预测性听觉序列的层次结构。
Elife. 2022 Feb 17;11:e74653. doi: 10.7554/eLife.74653.