• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

学习潜在结构:在关节处雕刻自然。

Learning latent structure: carving nature at its joints.

机构信息

Princeton Neuroscience Institute & Psychology Department, Princeton University, USA.

出版信息

Curr Opin Neurobiol. 2010 Apr;20(2):251-6. doi: 10.1016/j.conb.2010.02.008. Epub 2010 Mar 11.

DOI:10.1016/j.conb.2010.02.008
PMID:20227271
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2862793/
Abstract

Reinforcement learning (RL) algorithms provide powerful explanations for simple learning and decision-making behaviors and the functions of their underlying neural substrates. Unfortunately, in real-world situations that involve many stimuli and actions, these algorithms learn pitifully slowly, exposing their inferiority in comparison to animal and human learning. Here we suggest that one reason for this discrepancy is that humans and animals take advantage of structure that is inherent in real-world tasks to simplify the learning problem. We survey an emerging literature on 'structure learning'--using experience to infer the structure of a task--and how this can be of service to RL, with an emphasis on structure in perception and action.

摘要

强化学习 (RL) 算法为简单的学习和决策行为及其潜在神经基质的功能提供了强大的解释。不幸的是,在涉及许多刺激和动作的现实情况下,这些算法的学习速度非常缓慢,这暴露了它们与动物和人类学习相比的劣势。在这里,我们认为造成这种差异的一个原因是人类和动物利用了现实任务中固有的结构来简化学习问题。我们调查了关于“结构学习”的新兴文献——使用经验来推断任务的结构——以及这如何对 RL 有帮助,重点是感知和行动中的结构。

相似文献

1
Learning latent structure: carving nature at its joints.学习潜在结构:在关节处雕刻自然。
Curr Opin Neurobiol. 2010 Apr;20(2):251-6. doi: 10.1016/j.conb.2010.02.008. Epub 2010 Mar 11.
2
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.
3
One-shot learning and behavioral eligibility traces in sequential decision making.序列决策中的单次学习和行为资格痕迹。
Elife. 2019 Nov 11;8:e47463. doi: 10.7554/eLife.47463.
4
Reinforcement learning on slow features of high-dimensional input streams.高维输入流的慢特征的强化学习。
PLoS Comput Biol. 2010 Aug 19;6(8):e1000894. doi: 10.1371/journal.pcbi.1000894.
5
Computational advantages of reverberating loops for sensorimotor learning.回旋环路在感觉运动学习中的计算优势。
Neural Comput. 2012 Mar;24(3):611-34. doi: 10.1162/NECO_a_00237. Epub 2011 Nov 17.
6
Reinforcement Learning and Episodic Memory in Humans and Animals: An Integrative Framework.人类和动物中的强化学习与情景记忆:一个综合框架
Annu Rev Psychol. 2017 Jan 3;68:101-128. doi: 10.1146/annurev-psych-122414-033625. Epub 2016 Sep 2.
7
States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.状态与奖励:基于模型和无模型强化学习的分离神经预测误差信号。
Neuron. 2010 May 27;66(4):585-95. doi: 10.1016/j.neuron.2010.04.016.
8
How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis.强化学习中有多少是工作记忆,而不是强化学习?一项行为、计算和神经遗传学分析。
Eur J Neurosci. 2012 Apr;35(7):1024-35. doi: 10.1111/j.1460-9568.2011.07980.x.
9
Neural basis of reinforcement learning and decision making.强化学习和决策的神经基础。
Annu Rev Neurosci. 2012;35:287-308. doi: 10.1146/annurev-neuro-062111-150512. Epub 2012 Mar 29.
10
Deep Reinforcement Learning and Its Neuroscientific Implications.深度强化学习及其神经科学意义。
Neuron. 2020 Aug 19;107(4):603-616. doi: 10.1016/j.neuron.2020.06.014. Epub 2020 Jul 13.

引用本文的文献

1
Goals and the Structure of Experience.目标与经验的结构
ArXiv. 2025 Aug 20:arXiv:2508.15013v1.
2
Error-driven changes in hippocampal representations accompany flexible re-learning.海马体表征中由错误驱动的变化伴随着灵活的重新学习。
bioRxiv. 2025 May 21:2025.05.20.655046. doi: 10.1101/2025.05.20.655046.
3
What dopamine teaches depends on what the brain believes.多巴胺所传达的信息取决于大脑所相信的内容。
Nat Neurosci. 2025 May 28. doi: 10.1038/s41593-025-01980-9.
4
Prefrontal encoding of an internal model for emotional inference.用于情绪推理的内部模型的前额叶编码。
Nature. 2025 May 14. doi: 10.1038/s41586-025-09001-2.
5
Theory Change in Cognitive Neurobiology: The Case of the Orbitofrontal Cortex.认知神经生物学中的理论变革:眶额叶皮层的案例
Wiley Interdiscip Rev Cogn Sci. 2025 May-Jun;16(3):e70003. doi: 10.1002/wcs.70003.
6
Hippocampal output suppresses orbitofrontal cortex schema cell formation.海马体输出抑制眶额皮质模式细胞的形成。
Nat Neurosci. 2025 May;28(5):1048-1060. doi: 10.1038/s41593-025-01928-z. Epub 2025 Apr 14.
7
Flexible task abstractions emerge in linear networks with fast and bounded units.灵活的任务抽象出现在具有快速且有界单元的线性网络中。
ArXiv. 2025 Jan 16:arXiv:2411.03840v2.
8
Network structure influences the strength of learned neural representations.网络结构影响学习到的神经表征的强度。
Nat Commun. 2025 Jan 24;16(1):994. doi: 10.1038/s41467-024-55459-5.
9
Schemas, reinforcement learning and the medial prefrontal cortex.图式、强化学习与内侧前额叶皮质
Nat Rev Neurosci. 2025 Mar;26(3):141-157. doi: 10.1038/s41583-024-00893-z. Epub 2025 Jan 7.
10
Stimulus-repetition effects on macaque V1 and V4 microcircuits explain gamma-synchronization increase.对猕猴初级视觉皮层和V4微电路的刺激重复效应解释了伽马同步的增加。
bioRxiv. 2024 Dec 11:2024.12.06.627165. doi: 10.1101/2024.12.06.627165.

本文引用的文献

1
Context, learning, and extinction.情境、学习和遗忘。
Psychol Rev. 2010 Jan;117(1):197-209. doi: 10.1037/a0017808.
2
Human reinforcement learning subdivides structured action spaces by learning effector-specific values.人类强化学习通过学习特定效应器的值来细分结构化动作空间。
J Neurosci. 2009 Oct 28;29(43):13524-31. doi: 10.1523/JNEUROSCI.2469-09.2009.
3
Brain hemispheres selectively track the expected value of contralateral options.大脑半球选择性地追踪对侧选项的预期价值。
J Neurosci. 2009 Oct 28;29(43):13465-72. doi: 10.1523/JNEUROSCI.1500-09.2009.
4
Structure learning in action.行动中的结构学习。
Behav Brain Res. 2010 Jan 20;206(2):157-65. doi: 10.1016/j.bbr.2009.08.031. Epub 2009 Aug 29.
5
Discrimination learning, reversal, and set-shifting in first-episode schizophrenia: stability over six years and specific associations with medication type and disorganization syndrome.首发精神分裂症的辨别学习、逆转和定势转换:六年稳定性及与药物类型和紊乱综合征的特定关联
Biol Psychiatry. 2009 Sep 15;66(6):586-93. doi: 10.1016/j.biopsych.2009.05.016. Epub 2009 Jul 3.
6
Motor task variation induces structural learning.运动任务变化会引发结构学习。
Curr Biol. 2009 Feb 24;19(4):352-7. doi: 10.1016/j.cub.2009.01.036. Epub 2009 Feb 12.
7
Structured statistical models of inductive reasoning.归纳推理的结构化统计模型。
Psychol Rev. 2009 Jan;116(1):20-58. doi: 10.1037/a0014282.
8
Bilinearity, rules, and prefrontal cortex.双线性、规则与前额叶皮质。
Front Comput Neurosci. 2007 Nov 2;1:1. doi: 10.3389/neuro.10.001.2007. eCollection 2007.
9
Set-shifting ability and schizophrenia: a marker of clinical illness or an intermediate phenotype?转换能力与精神分裂症:临床疾病的标志物还是中间表型?
Biol Psychiatry. 2008 Nov 1;64(9):782-8. doi: 10.1016/j.biopsych.2008.05.009. Epub 2008 Jul 2.
10
Modeling the role of working memory and episodic memory in behavioral tasks.模拟工作记忆和情景记忆在行为任务中的作用。
Hippocampus. 2008;18(2):193-209. doi: 10.1002/hipo.20382.