• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强调正强化中的“积极面”:在认知任务训练中对猴子使用非二元奖励。

Emphasizing the "positive" in positive reinforcement: using nonbinary rewarding for training monkeys on cognitive tasks.

作者信息

Fischer Benjamin, Wegener Detlef

机构信息

Brain Research Institute, Center for Cognitive Sciences, University of Bremen , Bremen , Germany.

出版信息

J Neurophysiol. 2018 Jul 1;120(1):115-128. doi: 10.1152/jn.00572.2017. Epub 2018 Apr 4.

DOI:10.1152/jn.00572.2017
PMID:29617217
Abstract

Nonhuman primates constitute an indispensable model system for studying higher brain functions at the neurophysiological level. Studies involving these animals elucidated the neuronal mechanisms of various cognitive and executive functions, such as visual attention, working memory, and decision-making. Positive reinforcement training (PRT) constitutes the gold standard for training animals on the cognitive tasks employed in these studies. In the laboratory, PRT is usually based on application of a liquid reward as the reinforcer to strengthen the desired behavior and absence of the reward if the animal's response is wrong. By trial and error, the monkey may adapt its behavior and successfully reduce the number of error trials, and eventually learn even very sophisticated tasks. However, progress and success of the training strongly depend on reasonable error rates. If errors get too frequent, they may cause a drop in the animal's motivation to cooperate or its adaptation to high error rates and poor overall performance. We introduce in this report an alternative training regime to minimize errors and base the critical information for learning on graded rewarding. For every new task rule, the feedback to the animal is provided by different amounts of reward to distinguish the desired, optimal behavior from less optimal behavior. We applied this regime in different situations during training of visual attention tasks and analyzed behavioral performance and reaction times to evaluate its effectiveness. For both simple and complex behaviors, graded rewarding was found to constitute a powerful technique allowing for effective training without trade-off in accessible task difficulty or task performance. NEW & NOTEWORTHY Laboratory training of monkeys usually builds on providing a fixed amount of reward for the desired behavior, and no reward otherwise. We present a nonbinary, graded reward schedule to emphasize the positive, desired behavior and to keep errors on a moderate level. Using data from typical training situations, we demonstrate that graded rewards help to effectively guide the animal by success rather than errors and provide a powerful new tool for positive reinforcement training.

摘要

非人灵长类动物是在神经生理学水平上研究高等脑功能不可或缺的模型系统。涉及这些动物的研究阐明了各种认知和执行功能的神经元机制,如视觉注意力、工作记忆和决策。正强化训练(PRT)是在这些研究中用于训练动物完成认知任务的黄金标准。在实验室中,PRT通常基于使用液体奖励作为强化物来强化期望的行为,如果动物的反应错误则不给予奖励。通过反复试验,猴子可能会调整其行为并成功减少错误试验的次数,最终学会甚至非常复杂的任务。然而,训练的进展和成功很大程度上取决于合理的错误率。如果错误过于频繁,可能会导致动物合作动机下降,或者使其适应高错误率并导致整体表现不佳。在本报告中,我们引入了一种替代训练方案,以尽量减少错误,并将学习的关键信息基于分级奖励。对于每一个新的任务规则,通过给予不同数量的奖励来向动物提供反馈,以区分期望的、最佳的行为与次优行为。我们在视觉注意力任务训练的不同情况下应用了这种方案,并分析了行为表现和反应时间以评估其有效性。对于简单和复杂行为,发现分级奖励是一种强大的技术,能够在不影响可及任务难度或任务表现的情况下进行有效训练。新内容及值得注意之处猴子的实验室训练通常基于对期望行为给予固定数量的奖励,否则不给予奖励。我们提出了一种非二元的分级奖励计划,以强调积极的、期望的行为,并将错误保持在适度水平。利用典型训练情况的数据,我们证明分级奖励有助于通过成功而非错误有效地引导动物,并为正强化训练提供了一种强大的新工具。

相似文献

1
Emphasizing the "positive" in positive reinforcement: using nonbinary rewarding for training monkeys on cognitive tasks.强调正强化中的“积极面”:在认知任务训练中对猴子使用非二元奖励。
J Neurophysiol. 2018 Jul 1;120(1):115-128. doi: 10.1152/jn.00572.2017. Epub 2018 Apr 4.
2
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.一种具有类似多巴胺强化信号的神经网络模型,用于学习空间延迟反应任务。
Neuroscience. 1999;91(3):871-90. doi: 10.1016/s0306-4522(98)00697-6.
3
Neural signals in the monkey ventral striatum related to motivation for juice and cocaine rewards.猴子腹侧纹状体中与果汁和可卡因奖励动机相关的神经信号。
J Neurophysiol. 1996 Mar;75(3):1061-73. doi: 10.1152/jn.1996.75.3.1061.
4
Behavioral Management Programs to Promote Laboratory Animal Welfare促进实验动物福利的行为管理计划
5
Positive reinforcement training in squirrel monkeys using clicker training.使用响片训练对松鼠猴进行正强化训练。
Am J Primatol. 2012 Aug;74(8):712-20. doi: 10.1002/ajp.22015. Epub 2012 May 2.
6
Response differences in monkey TE and perirhinal cortex: stimulus association related to reward schedules.猴子颞下回和嗅周皮层的反应差异:与奖励计划相关的刺激关联
J Neurophysiol. 2000 Mar;83(3):1677-92. doi: 10.1152/jn.2000.83.3.1677.
7
Reward-based training of recurrent neural networks for cognitive and value-based tasks.用于认知和基于价值任务的循环神经网络的基于奖励的训练。
Elife. 2017 Jan 13;6:e21492. doi: 10.7554/eLife.21492.
8
Post-training self administration of sugar facilitates cognitive performance of male C57BL/6J mice in two spatial learning tasks.训练后自行摄入糖分可促进雄性C57BL/6J小鼠在两项空间学习任务中的认知表现。
Behav Brain Res. 2009 Mar 2;198(1):98-104. doi: 10.1016/j.bbr.2008.10.016. Epub 2008 Oct 18.
9
Effect of reward type on object discrimination learning in socially monogamous coppery titi monkeys (Callicebus cupreus).奖励类型对社会性一夫一妻制铜长尾猴(Callicebus cupreus)物体辨别学习的影响。
Am J Primatol. 2018 Jun;80(6):e22868. doi: 10.1002/ajp.22868. Epub 2018 May 14.
10
No Effect of Commercial Cognitive Training on Brain Activity, Choice Behavior, or Cognitive Performance.商业认知训练对大脑活动、选择行为或认知表现无影响。
J Neurosci. 2017 Aug 2;37(31):7390-7402. doi: 10.1523/JNEUROSCI.2832-16.2017. Epub 2017 Jul 10.

引用本文的文献

1
Eccentricity-dependent saccadic reaction time: The roles of foveal magnification and attentional orienting.视偏心率依赖的扫视反应时间:中央凹放大率和注意力定向的作用。
iScience. 2025 Jul 1;28(8):113042. doi: 10.1016/j.isci.2025.113042. eCollection 2025 Aug 15.
2
Home-Cage Training for Non-Human Primates: An Opportunity to Reduce Stress and Study Natural Behavior in Neurophysiology Experiments.非人灵长类动物的笼内训练:在神经生理学实验中减轻压力并研究自然行为的契机。
Animals (Basel). 2025 May 6;15(9):1340. doi: 10.3390/ani15091340.
3
Potential Food Inclination of Crab-Eating Macaques in Laboratory Environments: Enhancing Positive Reinforcement Training and Health Optimization.
实验室环境中食蟹猕猴的潜在食物偏好:加强正强化训练与健康优化
Animals (Basel). 2024 Apr 7;14(7):1123. doi: 10.3390/ani14071123.
4
Why workshops work: Examining the efficacy of training trainers to train goats.工作坊为何有效:审视培训培训师以培训山羊的成效。
Anim Welf. 2023 Nov 21;32:e76. doi: 10.1017/awf.2023.94. eCollection 2023.
5
Handling and Training of Wild Animals: Evidence and Ethics-Based Approaches and Best Practices in the Modern Zoo.野生动物的管理与训练:现代动物园基于证据和伦理的方法及最佳实践
Animals (Basel). 2023 Jul 9;13(14):2247. doi: 10.3390/ani13142247.
6
Generalised exponential-Gaussian distribution: a method for neural reaction time analysis.广义指数高斯分布:一种用于神经反应时间分析的方法。
Cogn Neurodyn. 2023 Feb;17(1):221-237. doi: 10.1007/s11571-022-09813-2. Epub 2022 May 17.
7
Blood Analysis of Laboratory Used for Neuroscience Research: Investigation of Long-Term and Cumulative Effects of Implants, Fluid Control, and Laboratory Procedures.实验室血液分析在神经科学研究中的应用:植入物、流体控制和实验室程序的长期和累积效应研究。
eNeuro. 2021 Oct 19;8(5). doi: 10.1523/ENEURO.0284-21.2021. Print 2021 Sep-Oct.
8
Macaque monkeys learn and perform a non-match-to-goal task using an automated home cage training procedure.食蟹猴通过自动化的家笼训练程序学习和执行非匹配至目标任务。
Sci Rep. 2021 Jan 29;11(1):2700. doi: 10.1038/s41598-021-82021-w.
9
Animal cognition in the field: performance of wild vervet monkeys (Chlorocebus pygerythrus) on a reversal learning task.野外动物认知:绿长尾猴(Chlorocebus pygerythrus)在反转学习任务中的表现。
Anim Cogn. 2020 May;23(3):523-534. doi: 10.1007/s10071-020-01356-5. Epub 2020 Feb 5.
10
Positive Reinforcement-Based Training for Self-Loading of Meat Horses Reduces Loading Time and Stress-Related Behavior.基于正强化的肉用马自动装货训练可减少装货时间和应激相关行为。
Front Vet Sci. 2019 Oct 10;6:350. doi: 10.3389/fvets.2019.00350. eCollection 2019.