• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强化学习的理性与机制视角

Rational and mechanistic perspectives on reinforcement learning.

作者信息

Chater Nick

机构信息

Division of Psychology and Language Sciences, Centre for Economic Learning and Social Evolution (ELSE), UCL, London, WC1E 6BT, United Kingdom.

出版信息

Cognition. 2009 Dec;113(3):350-364. doi: 10.1016/j.cognition.2008.06.014. Epub 2008 Aug 22.

DOI:10.1016/j.cognition.2008.06.014
PMID:18722597
Abstract

This special issue describes important recent developments in applying reinforcement learning models to capture neural and cognitive function. But reinforcement learning, as a theoretical framework, can apply at two very different levels of description: mechanistic and rational. Reinforcement learning is often viewed in mechanistic terms--as describing the operation of aspects of an agent's cognitive and neural machinery. Yet it can also be viewed as a rational level of description, specifically, as describing a class of methods for learning from experience, using minimal background knowledge. This paper considers how rational and mechanistic perspectives differ, and what types of evidence distinguish between them. Reinforcement learning research in the cognitive and brain sciences is often implicitly committed to the mechanistic interpretation. Here the opposite view is put forward: that accounts of reinforcement learning should apply at the rational level, unless there is strong evidence for a mechanistic interpretation. Implications of this viewpoint for reinforcement-based theories in the cognitive and brain sciences are discussed.

摘要

本期特刊描述了应用强化学习模型来捕捉神经和认知功能方面的重要近期进展。但是,强化学习作为一个理论框架,可以在两个非常不同的描述层面上应用:机制层面和理性层面。强化学习通常从机制角度来看待——即描述智能体认知和神经机制各方面的运作。然而,它也可以被视为一种理性层面的描述,具体而言,是描述一类利用最少背景知识从经验中学习的方法。本文探讨了理性视角与机制视角有何不同,以及哪些类型的证据可以区分它们。认知和脑科学中的强化学习研究常常隐含地倾向于机制性解释。这里提出相反的观点:除非有强有力的证据支持机制性解释,否则强化学习的解释应适用于理性层面。本文还讨论了这一观点对认知和脑科学中基于强化学习的理论的影响。

相似文献

1
Rational and mechanistic perspectives on reinforcement learning.强化学习的理性与机制视角
Cognition. 2009 Dec;113(3):350-364. doi: 10.1016/j.cognition.2008.06.014. Epub 2008 Aug 22.
2
Learning: from association to cognition.学习:从关联到认知。
Annu Rev Psychol. 2010;61:273-301. doi: 10.1146/annurev.psych.093008.100519.
3
Conditioning and cognition.条件作用与认知
Neurosci Biobehav Rev. 2004 Nov;28(7):651-61. doi: 10.1016/j.neubiorev.2004.09.003.
4
[Learning and memory].[学习与记忆]
Rev Neurol. 2001;32(4):373-81.
5
Neuronal circuits involved in learning: the use of modifiable synapses in the simulation of behavior.参与学习的神经回路:可修饰突触在行为模拟中的应用。
Int J Neurosci. 1976;7(1):9-18. doi: 10.3109/00207457609147194.
6
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
7
Behavior systems and reinforcement: an integrative approach.行为系统与强化:一种综合方法。
J Exp Anal Behav. 1993 Jul;60(1):105-28. doi: 10.1901/jeab.1993.60-105.
8
Brain nicotinic receptors: structure and regulation, role in learning and reinforcement.脑烟碱受体:结构与调节,在学习和强化中的作用
Brain Res Brain Res Rev. 1998 May;26(2-3):198-216. doi: 10.1016/s0165-0173(97)00040-4.
9
Action understanding as inverse planning.作为反向规划的动作理解
Cognition. 2009 Dec;113(3):329-349. doi: 10.1016/j.cognition.2009.07.005. Epub 2009 Sep 2.
10
Goal-directed control and its antipodes.目标导向控制及其对立面。
Neural Netw. 2009 Apr;22(3):213-9. doi: 10.1016/j.neunet.2009.03.004. Epub 2009 Mar 24.

引用本文的文献

1
Learning words in space and time: Contrasting models of the suspicious coincidence effect.在时空中学单词:可疑巧合效应的对比模型。
Cognition. 2021 May;210:104576. doi: 10.1016/j.cognition.2020.104576. Epub 2021 Feb 1.
2
Artificial Development by Reinforcement Learning Can Benefit From Multiple Motivations.通过强化学习进行的人工开发可以从多种动机中受益。
Front Robot AI. 2019 Feb 14;6:6. doi: 10.3389/frobt.2019.00006. eCollection 2019.
3
A framework for building cognitive process models.构建认知过程模型的框架。
Psychon Bull Rev. 2020 Dec;27(6):1218-1229. doi: 10.3758/s13423-020-01747-2.
4
Transitive reasoning distorts induction in causal chains.传递性推理会扭曲因果链中的归纳。
Mem Cognit. 2016 Apr;44(3):469-87. doi: 10.3758/s13421-015-0568-5.
5
Learning What to Want: Context-Sensitive Preference Learning.学习想要什么:上下文敏感偏好学习
PLoS One. 2015 Oct 23;10(10):e0141129. doi: 10.1371/journal.pone.0141129. eCollection 2015.
6
Planning for uncertainty.为不确定性做规划。
Front Psychol. 2012 Nov 29;3:541. doi: 10.3389/fpsyg.2012.00541. eCollection 2012.
7
How the number of learning trials affects placebo and nocebo responses.学习试验次数如何影响安慰剂和反安慰剂反应。
Pain. 2010 Nov;151(2):430-439. doi: 10.1016/j.pain.2010.08.007.