Suppr超能文献

考虑强化信号出现概率的强化学习算法对大鼠行为的模拟

Simulation of rat behavior by a reinforcement learning algorithm in consideration of appearance probabilities of reinforcement signals.

作者信息

Murakoshi Kazushi, Noguchi Takuya

机构信息

Department of Knowledge-based Information Engineering, Toyohashi University of Technology, 1-1 Hibarigaoka, Tenpaku-cho, Toyohashi 441-8580, Japan.

出版信息

Biosystems. 2005 Apr;80(1):83-90. doi: 10.1016/j.biosystems.2004.10.005. Epub 2004 Dec 8.

Abstract

Brown and Wanger [Brown, R.T., Wanger, A.R., 1964. Resistance to punishment and extinction following training with shock or nonreinforcement. J. Exp. Psychol. 68, 503-507] investigated rat behaviors with the following features: (1) rats were exposed to reward and punishment at the same time, (2) environment changed and rats relearned, and (3) rats were stochastically exposed to reward and punishment. The results are that exposure to nonreinforcement produces resistance to the decremental effects of behavior after stochastic reward schedule and that exposure to both punishment and reinforcement produces resistance to the decremental effects of behavior after stochastic punishment schedule. This paper aims to simulate the rat behaviors by a reinforcement learning algorithm in consideration of appearance probabilities of reinforcement signals. The former algorithms of reinforcement learning were unable to simulate the behavior of the feature (3). We improve the former reinforcement learning algorithms by controlling learning parameters in consideration of the acquisition probabilities of reinforcement signals. The proposed algorithm qualitatively simulates the result of the animal experiment of Brown and Wanger.

摘要

布朗和万格[布朗,R.T.,万格,A.R.,1964年。电击或无强化训练后对惩罚和消退的抵抗。《实验心理学杂志》68卷,第503 - 507页]研究了具有以下特征的大鼠行为:(1)大鼠同时接受奖励和惩罚;(2)环境改变且大鼠重新学习;(3)大鼠随机接受奖励和惩罚。结果表明,无强化暴露会产生对随机奖励程序后行为递减效应的抵抗,而同时接受惩罚和强化暴露会产生对随机惩罚程序后行为递减效应的抵抗。本文旨在通过强化学习算法考虑强化信号的出现概率来模拟大鼠行为。以前的强化学习算法无法模拟特征(3)的行为。我们通过考虑强化信号的获取概率来控制学习参数,从而改进以前的强化学习算法。所提出的算法定性地模拟了布朗和万格动物实验的结果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验