• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

序列阻断效应:测试时变差分学习神经机制的试验台。

The serial blocking effect: a testbed for the neural mechanisms of temporal-difference learning.

机构信息

Department of Psychology, Center for Studies in Behavioral Neurobiology/Groupe de recherche en neurobiologie comportementale, Concordia University, Montreal, Quebec, Canada.

Department of Psychology, Brooklyn College of the City University of New York, Brooklyn, NY, USA.

出版信息

Sci Rep. 2019 Apr 12;9(1):5962. doi: 10.1038/s41598-019-42244-4.

DOI:10.1038/s41598-019-42244-4
PMID:30979910
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6461709/
Abstract

Temporal-difference (TD) learning models afford the neuroscientist a theory-driven roadmap in the quest for the neural mechanisms of reinforcement learning. The application of these models to understanding the role of phasic midbrain dopaminergic responses in reward prediction learning constitutes one of the greatest success stories in behavioural and cognitive neuroscience. Critically, the classic learning paradigms associated with TD are poorly suited to cast light on its neural implementation, thus hampering progress. Here, we present a serial blocking paradigm in rodents that overcomes these limitations and allows for the simultaneous investigation of two cardinal TD tenets; namely, that learning depends on the computation of a prediction error, and that reinforcing value, whether intrinsic or acquired, propagates back to the onset of the earliest reliable predictor. The implications of this paradigm for the neural exploration of TD mechanisms are highlighted.

摘要

时频差(TD)学习模型为神经科学家提供了一条理论驱动的路线,以探索强化学习的神经机制。将这些模型应用于理解中脑多巴胺能反应的相位在奖励预测学习中的作用,是行为和认知神经科学中最成功的案例之一。关键的是,与 TD 相关的经典学习范式不太适合揭示其神经实现,从而阻碍了进展。在这里,我们在啮齿动物中提出了一个序列阻断范式,克服了这些限制,并允许同时研究 TD 的两个主要原则;即学习取决于预测误差的计算,以及强化价值,无论是内在的还是获得的,都会传播到最早可靠预测器的开始。该范式对 TD 机制的神经探索的影响被强调了。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/455e/6461709/ec806a37d673/41598_2019_42244_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/455e/6461709/00a184d06e16/41598_2019_42244_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/455e/6461709/ec806a37d673/41598_2019_42244_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/455e/6461709/00a184d06e16/41598_2019_42244_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/455e/6461709/ec806a37d673/41598_2019_42244_Fig2_HTML.jpg

相似文献

1
The serial blocking effect: a testbed for the neural mechanisms of temporal-difference learning.序列阻断效应:测试时变差分学习神经机制的试验台。
Sci Rep. 2019 Apr 12;9(1):5962. doi: 10.1038/s41598-019-42244-4.
2
An imperfect dopaminergic error signal can drive temporal-difference learning.不完美的多巴胺能误差信号可以驱动时间差分学习。
PLoS Comput Biol. 2011 May;7(5):e1001133. doi: 10.1371/journal.pcbi.1001133. Epub 2011 May 12.
3
Abnormal temporal difference reward-learning signals in major depression.重度抑郁症中异常的时间差异奖励学习信号。
Brain. 2008 Aug;131(Pt 8):2084-93. doi: 10.1093/brain/awn136. Epub 2008 Jun 25.
4
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策:强化学习预测错误在人类中的快速传播。
J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.
5
NMDA receptor antagonism disrupts acquisition and retention of the context preexposure facilitation effect in adolescent rats.N-甲基-D-天冬氨酸(NMDA)受体拮抗作用会破坏青春期大鼠对情境预暴露促进效应的习得和保持。
Behav Brain Res. 2016 Mar 15;301:168-77. doi: 10.1016/j.bbr.2015.12.025. Epub 2015 Dec 19.
6
Anticipatory reward signals in ventral striatal neurons of behaving rats.行为大鼠腹侧纹状体神经元中的预期奖励信号。
Eur J Neurosci. 2008 Nov;28(9):1849-66. doi: 10.1111/j.1460-9568.2008.06480.x.
7
Differential involvement of the medial prefrontal cortex across variants of contextual fear conditioning.内侧前额叶皮层在情境恐惧条件反射不同变体中的差异性参与。
Learn Mem. 2017 Jul 17;24(8):322-330. doi: 10.1101/lm.045286.117. Print 2017 Aug.
8
The effect of chronic corticosterone on fear learning and memory depends on dose and the testing protocol.慢性皮质酮对恐惧学习和记忆的影响取决于剂量和测试方案。
Neuroscience. 2015 Mar 19;289:324-33. doi: 10.1016/j.neuroscience.2015.01.011. Epub 2015 Jan 14.
9
A Dual Role Hypothesis of the Cortico-Basal-Ganglia Pathways: Opponency and Temporal Difference Through Dopamine and Adenosine.皮质-基底神经节通路的双重作用假说:多巴胺和腺苷介导的对立和时间差分。
Front Neural Circuits. 2019 Jan 7;12:111. doi: 10.3389/fncir.2018.00111. eCollection 2018.
10
Modulatory effect of 17-β estradiol on performance of ovariectomized rats on the Shock-Probe test.17-β雌二醇对去卵巢大鼠在电击探针试验中行为表现的调节作用。
Physiol Behav. 2014 May 28;131:129-35. doi: 10.1016/j.physbeh.2014.04.030. Epub 2014 Apr 24.

引用本文的文献

1
Understanding Associative Learning Through Higher-Order Conditioning.通过高阶条件作用理解联想学习。
Front Behav Neurosci. 2022 Apr 18;16:845616. doi: 10.3389/fnbeh.2022.845616. eCollection 2022.
2
Neural substrates of appetitive and aversive prediction error.奖赏性和厌恶性预测误差的神经基础。
Neurosci Biobehav Rev. 2021 Apr;123:337-351. doi: 10.1016/j.neubiorev.2020.10.029. Epub 2021 Jan 13.
3
Reward foraging task and model-based analysis reveal how fruit flies learn value of available options.奖励觅食任务和基于模型的分析揭示了果蝇如何学习可用选项的价值。

本文引用的文献

1
Causal evidence supporting the proposal that dopamine transients function as temporal difference prediction errors.支持多巴胺瞬时作为时间差分预测误差的因果证据。
Nat Neurosci. 2020 Feb;23(2):176-178. doi: 10.1038/s41593-019-0574-1. Epub 2020 Jan 20.
2
Dopamine transients are sufficient and necessary for acquisition of model-based associations.多巴胺瞬变对于基于模型的联想学习而言既是充分的也是必要的。
Nat Neurosci. 2017 May;20(5):735-742. doi: 10.1038/nn.4538. Epub 2017 Apr 3.
3
Orbitofrontal neurons acquire responses to 'valueless' Pavlovian cues during unblocking.
PLoS One. 2020 Oct 2;15(10):e0239616. doi: 10.1371/journal.pone.0239616. eCollection 2020.
4
Different methods of fear reduction are supported by distinct cortical substrates.不同的恐惧缓解方法受到不同皮质基底的支持。
Elife. 2020 Jun 26;9:e55294. doi: 10.7554/eLife.55294.
5
Causal evidence supporting the proposal that dopamine transients function as temporal difference prediction errors.支持多巴胺瞬时作为时间差分预测误差的因果证据。
Nat Neurosci. 2020 Feb;23(2):176-178. doi: 10.1038/s41593-019-0574-1. Epub 2020 Jan 20.
眶额神经元在解除阻断过程中获得对“无价值”巴甫洛夫线索的反应。
Elife. 2014 Jul 18;3:e02653. doi: 10.7554/eLife.02653.
4
Engineering a memory with LTD and LTP.利用长时程抑制和长时程增强构建记忆。
Nature. 2014 Jul 17;511(7509):348-52. doi: 10.1038/nature13294. Epub 2014 Jun 1.
5
A causal link between prediction errors, dopamine neurons and learning.预测误差、多巴胺神经元和学习之间的因果关系。
Nat Neurosci. 2013 Jul;16(7):966-73. doi: 10.1038/nn.3413. Epub 2013 May 26.
6
Expectancy-related changes in firing of dopamine neurons depend on orbitofrontal cortex.与期望相关的多巴胺神经元放电变化取决于眶额皮质。
Nat Neurosci. 2011 Oct 30;14(12):1590-7. doi: 10.1038/nn.2957.
7
Dissociable roles of prelimbic and infralimbic cortices, ventral hippocampus, and basolateral amygdala in the expression and extinction of conditioned fear.前额皮质和下边缘皮质、腹侧海马体以及外侧杏仁核在条件性恐惧的表达和消退中的可分离作用。
Neuropsychopharmacology. 2011 Jan;36(2):529-38. doi: 10.1038/npp.2010.184. Epub 2010 Oct 20.
8
The orbitofrontal cortex and ventral tegmental area are necessary for learning from unexpected outcomes.眶额皮质和腹侧被盖区对于从不预期结果中学习是必要的。
Neuron. 2009 Apr 30;62(2):269-80. doi: 10.1016/j.neuron.2009.03.005.
9
Dopamine D1 versus D4 receptors differentially modulate the encoding of salient versus nonsalient emotional information in the medial prefrontal cortex.多巴胺 D1 受体与 D4 受体对内侧前额叶皮质中显著与非显著情绪信息的编码有不同调节作用。
J Neurosci. 2009 Apr 15;29(15):4836-45. doi: 10.1523/JNEUROSCI.0178-09.2009.
10
CS-US temporal relations in blocking.阻断中的条件刺激-非条件刺激时间关系
Learn Behav. 2008 May;36(2):92-103. doi: 10.3758/lb.36.2.92.