Suppr超能文献

将强化神经网络统一理论扩展至稳态操作性行为。

Extending unified-theory-of-reinforcement neural networks to steady-state operant behavior.

作者信息

Calvin Olivia L, McDowell J J

机构信息

Department of Psychology, Emory University, Atlanta, Georgia.

Department of Psychology, Emory University, Atlanta, Georgia.

出版信息

Behav Processes. 2016 Jun;127:52-61. doi: 10.1016/j.beproc.2016.03.016. Epub 2016 Mar 24.

Abstract

The unified theory of reinforcement has been used to develop models of behavior over the last 20 years (Donahoe et al., 1993). Previous research has focused on the theory's concordance with the respondent behavior of humans and animals. In this experiment, neural networks were developed from the theory to extend the unified theory of reinforcement to operant behavior on single-alternative variable-interval schedules. This area of operant research was selected because previously developed neural networks could be applied to it without significant alteration. Previous research with humans and animals indicates that the pattern of their steady-state behavior is hyperbolic when plotted against the obtained rate of reinforcement (Herrnstein, 1970). A genetic algorithm was used in the first part of the experiment to determine parameter values for the neural networks, because values that were used in previous research did not result in a hyperbolic pattern of behavior. After finding these parameters, hyperbolic and other similar functions were fitted to the behavior produced by the neural networks. The form of the neural network's behavior was best described by an exponentiated hyperbola (McDowell, 1986; McLean and White, 1983; Wearden, 1981), which was derived from the generalized matching law (Baum, 1974). In post-hoc analyses the addition of a baseline rate of behavior significantly improved the fit of the exponentiated hyperbola and removed systematic residuals. The form of this function was consistent with human and animal behavior, but the estimated parameter values were not.

摘要

在过去20年中,强化统一理论已被用于构建行为模型(多纳霍等人,1993年)。先前的研究主要关注该理论与人类和动物应答性行为的一致性。在本实验中,基于该理论开发了神经网络,以将强化统一理论扩展到单替代可变间隔程序的操作性行为。选择这一操作性研究领域是因为先前开发的神经网络可直接应用于此,无需重大改动。先前对人类和动物的研究表明,当根据获得的强化率绘制时,它们的稳态行为模式呈双曲线(赫尔斯坦,1970年)。在实验的第一部分使用了遗传算法来确定神经网络的参数值,因为先前研究中使用的值并未产生双曲线行为模式。找到这些参数后,将双曲线及其他类似函数拟合到神经网络产生的行为上。神经网络行为的形式最好用指数双曲线来描述(麦克道尔,1986年;麦克林和怀特,1983年;韦尔登,1981年),它源自广义匹配定律(鲍姆,1974年)。在事后分析中,添加行为的基线率显著改善了指数双曲线的拟合效果并消除了系统残差。该函数的形式与人类和动物行为一致,但估计的参数值并非如此。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验