
How do real animals account for the passage of time during associative learning?

Affiliation

Department of Neurology.

Publication information

Behav Neurosci. 2022 Oct;136(5):383-391. doi: 10.1037/bne0000516. Epub 2022 Apr 28.

DOI: 10.1037/bne0000516
PMID: 35482634
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9561011/

Abstract

Animals routinely learn to associate environmental stimuli and self-generated actions with their outcomes such as rewards. One of the most popular theoretical models of such learning is the reinforcement learning (RL) framework. The simplest form of RL, model-free RL, is widely applied to explain animal behavior in numerous neuroscientific studies. More complex RL versions assume that animals build and store an explicit model of the world in memory. To apply these approaches to explain animal behavior, typical neuroscientific RL models make implicit assumptions about how real animals represent the passage of time. In this perspective, I explicitly list these assumptions and show that they have several problematic implications. I hope that the explicit discussion of these problems encourages the field to seriously examine the assumptions underlying timing and reinforcement learning. (PsycInfo Database Record (c) 2022 APA, all rights reserved).
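
As an illustration of the kind of model the abstract refers to, the following is a minimal sketch of model-free temporal-difference (TD) learning in which the passage of time within a trial is represented as a chain of discrete states, one per time step since cue onset. This tapped-delay-line style of representation is a common modelling device, but whether it is among the assumptions the article lists is not stated in the abstract. The function name, parameter names, and all numeric values are hypothetical choices made for this sketch.

```python
def td_learning_time_steps(n_steps=10, reward_step=8, n_trials=200,
                           alpha=0.1, gamma=0.95):
    """TD(0) value learning over a trial split into discrete time-step states.

    Assumption for illustration only: elapsed time since cue onset is coded
    as one state per fixed-size time step.
    """
    # values[t] is the learned value of "t steps since cue onset";
    # index n_steps is a terminal state whose value stays at zero.
    values = [0.0] * (n_steps + 1)
    for _ in range(n_trials):
        for t in range(n_steps):
            # Reward is delivered on leaving the state at reward_step.
            reward = 1.0 if t == reward_step else 0.0
            # Reward prediction error for the transition t -> t + 1.
            delta = reward + gamma * values[t + 1] - values[t]
            values[t] += alpha * delta
    return values


if __name__ == "__main__":
    for t, v in enumerate(td_learning_time_steps()):
        print(f"step {t}: value {v:.3f}")
```

In this sketch the learned value rises toward the rewarded time step and is zero afterwards. The result depends on choices the modeller must make about time itself, such as the size of a time step and when a trial begins and ends.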


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcaf/9561011/f95cb998be23/nihms-1825994-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcaf/9561011/41d1b70e5a93/nihms-1825994-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcaf/9561011/c9509e2de65d/nihms-1825994-f0002.jpg
