Extending unified-theory-of-reinforcement neural networks to steady-state operant behavior.

Author information

Calvin Olivia L, McDowell J J

Affiliation

Department of Psychology, Emory University, Atlanta, Georgia.

Publication information

Behav Processes. 2016 Jun;127:52-61. doi: 10.1016/j.beproc.2016.03.016. Epub 2016 Mar 24.

DOI:10.1016/j.beproc.2016.03.016

PMID:27018201

Abstract

The unified theory of reinforcement has been used to develop models of behavior over the last 20 years (Donahoe et al., 1993). Previous research has focused on the theory's concordance with the respondent behavior of humans and animals. In this experiment, neural networks were developed from the theory to extend the unified theory of reinforcement to operant behavior on single-alternative variable-interval schedules. This area of operant research was selected because previously developed neural networks could be applied to it without significant alteration. Previous research with humans and animals indicates that the pattern of their steady-state behavior is hyperbolic when plotted against the obtained rate of reinforcement (Herrnstein, 1970). A genetic algorithm was used in the first part of the experiment to determine parameter values for the neural networks, because values that were used in previous research did not result in a hyperbolic pattern of behavior. After finding these parameters, hyperbolic and other similar functions were fitted to the behavior produced by the neural networks. The form of the neural network's behavior was best described by an exponentiated hyperbola (McDowell, 1986; McLean and White, 1983; Wearden, 1981), which was derived from the generalized matching law (Baum, 1974). In post-hoc analyses the addition of a baseline rate of behavior significantly improved the fit of the exponentiated hyperbola and removed systematic residuals. The form of this function was consistent with human and animal behavior, but the estimated parameter values were not.
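
For reference, the function forms named in the abstract can be sketched as follows. The notation is an assumption and is not taken from the abstract: B is response rate, r is obtained reinforcement rate, k is the asymptotic response rate, r_e is the reinforcement rate at which B reaches k/2, a is the exponent carried over from the generalized matching law, c is a lumped constant, and B_0 is a baseline response rate. The additive placement of B_0 is only one plausible reading of "the addition of a baseline rate of behavior"; the abstract does not state the exact form that was fitted.

```latex
% Hyperbola (Herrnstein, 1970)
B = \frac{k\,r}{r + r_e}

% Exponentiated hyperbola (McDowell, 1986; McLean and White, 1983; Wearden, 1981)
B = \frac{k\,r^{a}}{r^{a} + c}

% Exponentiated hyperbola with a baseline rate (assumed additive form)
B = \frac{k\,r^{a}}{r^{a} + c} + B_0
```

Setting a = 1 and c = r_e in the second form recovers the first, which is why the exponentiated hyperbola is treated as a generalization of Herrnstein's hyperbola.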

Similar articles

Unified-theory-of-reinforcement neural networks do not simulate the blocking effect.

Behav Processes. 2015 Nov;120:54-63. doi: 10.1016/j.beproc.2015.08.008. Epub 2015 Aug 28.

Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit.

IEEE Trans Neural Netw. 2009 Aug;20(8):1368-71. doi: 10.1109/TNN.2009.2025588. Epub 2009 Jul 10.

All Behavior is choice: Revisiting an evolutionary theory's account of behavior on single schedules.

J Exp Anal Behav. 2020 Nov;114(3):430-446. doi: 10.1002/jeab.630. Epub 2020 Oct 6.

Steady-state choice between four alternatives obeys the constant-ratio rule.

J Exp Anal Behav. 2015 Jul;104(1):7-19. doi: 10.1002/jeab.157. Epub 2015 May 18.

Pre-asymptotic response rates as a function of the delay-of-reinforcement gradient summation for Catania's Operant Reserve: A reply to Berg & McDowell (2011).

Behav Processes. 2017 Mar;136:11-19. doi: 10.1016/j.beproc.2017.01.002. Epub 2017 Jan 4.

Relative and absolute reinforcement frequency as determinants of choice in concurrent variable interval schedules.

Q J Exp Psychol B. 1991 Feb;43(1):25-38.

Undermatching is an emergent property of selection by consequences.

Behav Processes. 2007 Jun;75(2):97-106. doi: 10.1016/j.beproc.2007.02.017. Epub 2007 Mar 1.

On the falsifiability of matching theory.

J Exp Anal Behav. 1986 Jan;45(1):63-74. doi: 10.1901/jeab.1986.45-63.

Background activities, induction, and behavioral allocation in operant performance.

J Exp Anal Behav. 2014 Sep;102(2):213-30. doi: 10.1002/jeab.100. Epub 2014 Aug 8.

Cited by

Behavioral Research with Planaria.

Perspect Behav Sci. 2018 Nov 9;41(2):447-464. doi: 10.1007/s40614-018-00176-w. eCollection 2018 Nov.
