Flexible task abstractions emerge in linear networks with fast and bounded units.

Authors

Sandbrink Kai, Bauer Jan P, Proca Alexandra M, Saxe Andrew M, Summerfield Christopher, Hummos Ali

Affiliations

Exp. Psychology, Oxford Brain Mind Institute, EPFL.

ELSC, HebrewU Gatsby Unit, UCL.

Publication

ArXiv. 2025 Jan 16:arXiv:2411.03840v2.

PMID: 39876939
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11774440/
Abstract

Animals survive in dynamic environments changing at arbitrary timescales, but such data distribution shifts are a challenge to neural networks. To adapt to change, neural systems may change a large number of parameters, which is a slow process involving past information. In contrast, animals leverage distribution changes to segment their stream of experience into tasks and associate them with internal task abstractions. Animals can then respond by selecting the appropriate task abstraction. However, how such flexible task abstractions may arise in neural systems remains unknown. Here, we analyze a linear gated network where the weights and gates are jointly optimized via gradient descent, but with neuron-like constraints on the gates including a faster timescale, nonnegativity, and bounded activity. We observe that the weights self-organize into modules specialized for tasks or sub-tasks encountered, while the gates layer forms unique representations that switch the appropriate weight modules (task abstractions). We analytically reduce the learning dynamics to an effective eigenspace, revealing a virtuous cycle: fast adapting gates drive weight specialization by protecting previous knowledge, while weight specialization in turn increases the update rate of the gating layer. Task switching in the gating layer accelerates as a function of curriculum block size and task training, mirroring key findings in cognitive neuroscience. We show that the discovered task abstractions support generalization through both task and subtask composition, and we extend our findings to a non-linear network switching between two tasks. Overall, our work offers a theory of cognitive flexibility in animals as arising from joint gradient descent on synaptic and neural gating in a neural network architecture.
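The joint update described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: it assumes a squared-error loss, an output of the form y = sum_p c_p W_p x, and it enforces nonnegativity and boundedness by clipping the gates to [0, 1], which is a stand-in for whatever constraint mechanism the paper actually uses. The names `gated_forward`, `joint_step`, `lr_w`, and `lr_c` are hypothetical, and the faster gate timescale is modeled simply as `lr_c` being larger than `lr_w`.

```python
import numpy as np


def gated_forward(weights, gates, x):
    """Output of a linear gated network: y = sum_p gates[p] * (weights[p] @ x).

    weights: array of shape (P, d_out, d_in), one weight matrix per pathway
    gates:   array of shape (P,), one scalar gate per pathway
    x:       array of shape (d_in,)
    """
    return np.einsum("p,pij,j->i", gates, weights, x)


def joint_step(weights, gates, x, y_target, lr_w=0.01, lr_c=0.1):
    """One gradient-descent step on both weights and gates.

    Assumptions (not taken from the paper): loss L = 0.5 * ||y - y_target||^2,
    gates use a larger learning rate than weights (lr_c > lr_w), and the
    gate constraints are imposed by projecting (clipping) onto [0, 1].
    """
    err = gated_forward(weights, gates, x) - y_target   # dL/dy
    per_path = weights @ x                              # (P, d_out): W_p x
    grad_c = per_path @ err                             # dL/dc_p = (W_p x) . err
    grad_w = gates[:, None, None] * np.outer(err, x)    # dL/dW_p = c_p * err x^T
    new_weights = weights - lr_w * grad_w
    new_gates = np.clip(gates - lr_c * grad_c, 0.0, 1.0)
    loss = 0.5 * float(err @ err)
    return new_weights, new_gates, loss
```

In a toy setting with two already specialized pathways, `weights = np.stack([np.eye(2), -np.eye(2)])`, and gates `[1, 0]`, presenting a sample from the second task (`x = [1, 2]`, `y_target = -x`) moves the gates to `[0, 1]` in a single step while the weights change only slightly. That is the kind of gate-driven switching with protected weights that the abstract describes, shown here under the assumptions above.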
