• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类与神经网络中情境学习和权重学习之间的动态相互作用。

The dynamic interplay between in-context and in-weight learning in humans and neural networks.

作者信息

Russin Jacob, Pavlick Ellie, Frank Michael J

机构信息

Department of Computer Science, Department of Cognitive and Psychological Sciences, Brown University.

Department of Computer Science, Brown University.

出版信息

ArXiv. 2025 Apr 25:arXiv:2402.08674v4.

PMID:38410645
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10896373/
Abstract

Human learning embodies a striking duality: sometimes, we appear capable of following logical, compositional rules and benefit from structured curricula (e.g., in formal education), while other times, we rely on an incremental approach or trial-and-error, learning better from curricula that are randomly interleaved. Influential psychological theories explain this seemingly disparate behavioral evidence by positing two qualitatively different learning systems-one for rapid, rule-based inferences and another for slow, incremental adaptation. It remains unclear how to reconcile such theories with neural networks, which learn via incremental weight updates and are thus a natural model for the latter type of learning, but are not obviously compatible with the former. However, recent evidence suggests that metalearning neural networks and large language models are capable of "in-context learning" (ICL)-the ability to flexibly grasp the structure of a new task from a few examples. Here, we show that the dynamic interplay between ICL and default in-weight learning (IWL) naturally captures a broad range of learning phenomena observed in humans, reproducing curriculum effects on category-learning and compositional tasks, and recapitulating a tradeoff between flexibility and retention. Our work shows how emergent ICL can equip neural networks with fundamentally different learning properties that can coexist with their native IWL, thus offering a novel perspective on dual-process theories and human cognitive flexibility.

摘要

人类学习体现出一种显著的二元性

有时,我们似乎能够遵循逻辑的、组合性的规则,并从结构化课程(如正规教育中)受益;而在其他时候,我们则依赖渐进式方法或试错法,从随机交错的课程中能学得更好。有影响力的心理学理论通过假定两种质的不同的学习系统来解释这种看似不同的行为证据——一种用于快速的、基于规则的推理,另一种用于缓慢的、渐进式的适应。目前尚不清楚如何将这些理论与神经网络协调起来,神经网络通过渐进式权重更新进行学习,因此是后一种学习类型的自然模型,但与前一种模型显然不兼容。然而,最近的证据表明,元学习神经网络和大语言模型能够进行“上下文学习”(ICL)——即从几个例子中灵活掌握新任务结构的能力。在这里,我们表明ICL与默认的权重内学习(IWL)之间的动态相互作用自然地捕捉了在人类中观察到的广泛学习现象,再现了课程对类别学习和组合任务的影响,并概括了灵活性与记忆之间的权衡。我们的工作展示了新兴的ICL如何使神经网络具备与它们原生的IWL共存的根本不同的学习特性,从而为双过程理论和人类认知灵活性提供了一个新视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/9ccd2141ce1e/nihpp-2402.08674v4-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/7b6f1ef799fc/nihpp-2402.08674v4-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/35921041213d/nihpp-2402.08674v4-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/eb3679225fdf/nihpp-2402.08674v4-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/6f3fa6b5e9e8/nihpp-2402.08674v4-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/8c2172f08528/nihpp-2402.08674v4-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/3e8191202670/nihpp-2402.08674v4-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/f23d90e201e4/nihpp-2402.08674v4-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/cb1305c6a6bc/nihpp-2402.08674v4-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/d80e471b6729/nihpp-2402.08674v4-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/bd0c95982e78/nihpp-2402.08674v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/3a4c4ea8d54e/nihpp-2402.08674v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/50efd7642311/nihpp-2402.08674v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/94b922962263/nihpp-2402.08674v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/9ccd2141ce1e/nihpp-2402.08674v4-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/7b6f1ef799fc/nihpp-2402.08674v4-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/35921041213d/nihpp-2402.08674v4-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/eb3679225fdf/nihpp-2402.08674v4-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/6f3fa6b5e9e8/nihpp-2402.08674v4-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/8c2172f08528/nihpp-2402.08674v4-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/3e8191202670/nihpp-2402.08674v4-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/f23d90e201e4/nihpp-2402.08674v4-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/cb1305c6a6bc/nihpp-2402.08674v4-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/d80e471b6729/nihpp-2402.08674v4-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/bd0c95982e78/nihpp-2402.08674v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/3a4c4ea8d54e/nihpp-2402.08674v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/50efd7642311/nihpp-2402.08674v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/94b922962263/nihpp-2402.08674v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d0a/12281880/9ccd2141ce1e/nihpp-2402.08674v4-f0005.jpg

相似文献

1
The dynamic interplay between in-context and in-weight learning in humans and neural networks.人类与神经网络中情境学习和权重学习之间的动态相互作用。
ArXiv. 2025 Apr 25:arXiv:2402.08674v4.
2
Parallel trade-offs in human cognition and neural networks: The dynamic interplay between in-context and in-weight learning.人类认知与神经网络中的平行权衡:情境学习与权重学习之间的动态相互作用。
Proc Natl Acad Sci U S A. 2025 Sep 2;122(35):e2510270122. doi: 10.1073/pnas.2510270122. Epub 2025 Aug 28.
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Short-Term Memory Impairment短期记忆障碍
5
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
6
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
7
Remote and digital services in UK general practice 2021-2023: the Remote by Default 2 longitudinal qualitative study synopsis.2021 - 2023年英国全科医疗中的远程和数字服务:“默认远程”2纵向定性研究概要
Health Soc Care Deliv Res. 2025 Sep;13(31):1-49. doi: 10.3310/QQTT4411.
8
Sexual Harassment and Prevention Training性骚扰与预防培训
9
Systemic Inflammatory Response Syndrome全身炎症反应综合征
10
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

本文引用的文献

1
Modelling cognitive flexibility with deep neural networks.使用深度神经网络对认知灵活性进行建模。
Curr Opin Behav Sci. 2024 Jun;57:101361. doi: 10.1016/j.cobeha.2024.101361.
2
Is human compositionality meta-learned?人类的组合性是元学习的吗?
Behav Brain Sci. 2024 Sep 23;47:e162. doi: 10.1017/S0140525X24000189.
3
Blocked training facilitates learning of multiple schemas.阻断训练有助于多种模式的学习。
Commun Psychol. 2024 Apr 9;2(1):28. doi: 10.1038/s44271-024-00079-4.
4
Toward the Emergence of Intelligent Control: Episodic Generalization and Optimization.迈向智能控制的出现:情景泛化与优化。
Open Mind (Camb). 2024 May 10;8:688-722. doi: 10.1162/opmi_a_00143. eCollection 2024.
5
Space as a scaffold for rotational generalisation of abstract concepts.空间作为抽象概念旋转泛化的支架。
Elife. 2024 Apr 3;13:RP93636. doi: 10.7554/eLife.93636.
6
Meta-learned models of cognition.元认知学习模型。
Behav Brain Sci. 2023 Nov 23;47:e147. doi: 10.1017/S0140525X23003266.
7
Human-like systematic generalization through a meta-learning neural network.通过元学习神经网络实现类人系统泛化。
Nature. 2023 Nov;623(7985):115-121. doi: 10.1038/s41586-023-06668-3. Epub 2023 Oct 25.
8
Distribution-dependent representations in auditory category learning and generalization.听觉类别学习与泛化中依赖于分布的表征
Front Psychol. 2023 Sep 27;14:1132570. doi: 10.3389/fpsyg.2023.1132570. eCollection 2023.
9
Generative replay underlies compositional inference in the hippocampal-prefrontal circuit.生成重放是海马-前额叶回路中组合推理的基础。
Cell. 2023 Oct 26;186(22):4885-4897.e14. doi: 10.1016/j.cell.2023.09.004. Epub 2023 Oct 6.
10
Bridging the data gap between children and large language models.弥合儿童与大语言模型之间的数据差距。
Trends Cogn Sci. 2023 Nov;27(11):990-992. doi: 10.1016/j.tics.2023.08.007. Epub 2023 Aug 31.