• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种调节纹状体群体不确定性并优化强化学习的胆碱能反馈回路。

A cholinergic feedback circuit to regulate striatal population uncertainty and optimize reinforcement learning.

作者信息

Franklin Nicholas T, Frank Michael J

机构信息

Department of Cognitive, Linguistic and Psychological Sciences, Brown Institute for Brain Science, Brown University, Providence, United States.

出版信息

Elife. 2015 Dec 25;4:e12029. doi: 10.7554/eLife.12029.

DOI:10.7554/eLife.12029
PMID:26705698
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4764588/
Abstract

Convergent evidence suggests that the basal ganglia support reinforcement learning by adjusting action values according to reward prediction errors. However, adaptive behavior in stochastic environments requires the consideration of uncertainty to dynamically adjust the learning rate. We consider how cholinergic tonically active interneurons (TANs) may endow the striatum with such a mechanism in computational models spanning three Marr's levels of analysis. In the neural model, TANs modulate the excitability of spiny neurons, their population response to reinforcement, and hence the effective learning rate. Long TAN pauses facilitated robustness to spurious outcomes by increasing divergence in synaptic weights between neurons coding for alternative action values, whereas short TAN pauses facilitated stochastic behavior but increased responsiveness to change-points in outcome contingencies. A feedback control system allowed TAN pauses to be dynamically modulated by uncertainty across the spiny neuron population, allowing the system to self-tune and optimize performance across stochastic environments.

摘要

越来越多的证据表明,基底神经节通过根据奖励预测误差调整动作值来支持强化学习。然而,在随机环境中的适应性行为需要考虑不确定性以动态调整学习率。我们探讨了胆碱能紧张性活动中间神经元(TANs)如何在跨越马尔三个分析层次的计算模型中赋予纹状体这样一种机制。在神经模型中,TANs调节棘状神经元的兴奋性、它们对强化的群体反应,从而调节有效学习率。TANs的长时间停顿通过增加编码替代动作值的神经元之间突触权重的差异,促进了对虚假结果的鲁棒性,而TANs的短时间停顿促进了随机行为,但增加了对结果偶然性变化点的反应性。一个反馈控制系统允许TANs的停顿根据棘状神经元群体中的不确定性进行动态调节,使系统能够在随机环境中自我调整并优化性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/bb29792310b3/elife-12029-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/f33d7dd998db/elife-12029-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/3db15a211950/elife-12029-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/8f796ff558b4/elife-12029-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/9357396315c1/elife-12029-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/f64064f3e3b1/elife-12029-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/191d93ef18ee/elife-12029-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/8a7c00fb78e3/elife-12029-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/25a6a923e54c/elife-12029-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/6678154bc869/elife-12029-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/bb29792310b3/elife-12029-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/f33d7dd998db/elife-12029-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/3db15a211950/elife-12029-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/8f796ff558b4/elife-12029-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/9357396315c1/elife-12029-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/f64064f3e3b1/elife-12029-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/191d93ef18ee/elife-12029-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/8a7c00fb78e3/elife-12029-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/25a6a923e54c/elife-12029-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/6678154bc869/elife-12029-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f099/4764588/bb29792310b3/elife-12029-fig10.jpg

相似文献

1
A cholinergic feedback circuit to regulate striatal population uncertainty and optimize reinforcement learning.一种调节纹状体群体不确定性并优化强化学习的胆碱能反馈回路。
Elife. 2015 Dec 25;4:e12029. doi: 10.7554/eLife.12029.
2
Spontaneous firing and evoked pauses in the tonically active cholinergic interneurons of the striatum.纹状体中持续活动的胆碱能中间神经元的自发放电和诱发的停顿。
Neuroscience. 2011 Dec 15;198:27-43. doi: 10.1016/j.neuroscience.2011.08.067. Epub 2011 Sep 8.
3
The Functional Role of Striatal Cholinergic Interneurons in Reinforcement Learning From Computational Perspective.纹状体胆碱能中间神经元在强化学习中的功能作用:从计算角度的分析。
Front Neural Circuits. 2019 Feb 21;13:10. doi: 10.3389/fncir.2019.00010. eCollection 2019.
4
The role of the intrinsic cholinergic system of the striatum: What have we learned from TAN recordings in behaving animals?纹状体固有胆碱能系统的作用:在行为动物的 TAN 记录中我们了解到了什么?
Neuroscience. 2017 Sep 30;360:81-94. doi: 10.1016/j.neuroscience.2017.07.060. Epub 2017 Jul 30.
5
Pauses in cholinergic interneuron firing exert an inhibitory control on striatal output in vivo.乙酰胆碱能中间神经元的发放暂停对体内纹状体的输出施加抑制性控制。
Elife. 2018 Mar 26;7:e32510. doi: 10.7554/eLife.32510.
6
A dopamine-acetylcholine cascade: simulating learned and lesion-induced behavior of striatal cholinergic interneurons.多巴胺 - 乙酰胆碱级联反应:模拟纹状体胆碱能中间神经元的习得性行为和损伤诱导行为。
J Neurophysiol. 2008 Oct;100(4):2409-21. doi: 10.1152/jn.90486.2008. Epub 2008 Aug 20.
7
Roles of centromedian parafascicular nuclei of thalamus and cholinergic interneurons in the dorsal striatum in associative learning of environmental events.丘脑中央中旁核和背侧纹状体胆碱能中间神经元在环境事件联想学习中的作用。
J Neural Transm (Vienna). 2018 Mar;125(3):501-513. doi: 10.1007/s00702-017-1713-z. Epub 2017 Mar 21.
8
A spiking computational model for striatal cholinergic interneurons.一种用于纹状体胆碱能中间神经元的脉冲计算模型。
Brain Struct Funct. 2023 Mar;228(2):589-611. doi: 10.1007/s00429-022-02604-9. Epub 2023 Jan 18.
9
Quantitative Imaging of Cholinergic Interneurons Reveals a Distinctive Spatial Organization and a Functional Gradient across the Mouse Striatum.胆碱能中间神经元的定量成像揭示了小鼠纹状体内独特的空间组织和功能梯度。
PLoS One. 2016 Jun 17;11(6):e0157682. doi: 10.1371/journal.pone.0157682. eCollection 2016.
10
Role of Striatal Cholinergic Interneurons in Set-Shifting in the Rat.纹状体胆碱能中间神经元在大鼠定势转换中的作用。
J Neurosci. 2015 Jun 24;35(25):9424-31. doi: 10.1523/JNEUROSCI.0490-15.2015.

引用本文的文献

1
Cholinergic modulation of dopamine release drives effortful behavior.多巴胺释放的胆碱能调节驱动费力行为。
bioRxiv. 2025 Jun 21:2025.06.18.660394. doi: 10.1101/2025.06.18.660394.
2
Many roads to minimizing regret: A comparison of Wang et al (2024) and OpAL* models of adaptive striatal dopamine.减少遗憾的多种途径:Wang等人(2024年)与适应性纹状体多巴胺的OpAL*模型的比较
PLoS Comput Biol. 2025 May 8;21(5):e1012920. doi: 10.1371/journal.pcbi.1012920. eCollection 2025 May.
3
A TAN-dopamine interaction mechanism based computational model of basal ganglia in action selection.

本文引用的文献

1
Cholinergic interneurons in the dorsal and ventral striatum: anatomical and functional considerations in normal and diseased conditions.背侧和腹侧纹状体中的胆碱能中间神经元:正常和疾病状态下的解剖学与功能考量
Ann N Y Acad Sci. 2015 Sep;1349(1):1-45. doi: 10.1111/nyas.12762. Epub 2015 Apr 15.
2
A critical time window for dopamine actions on the structural plasticity of dendritic spines.多巴胺作用于树突棘结构可塑性的关键时间窗口。
Science. 2014 Sep 26;345(6204):1616-20. doi: 10.1126/science.1255514.
3
Opponent actor learning (OpAL): modeling interactive effects of striatal dopamine on reinforcement learning and choice incentive.
一种基于棕褐色与多巴胺相互作用机制的基底神经节动作选择计算模型。
Cogn Neurodyn. 2024 Oct;18(5):2127-2144. doi: 10.1007/s11571-023-10046-0. Epub 2023 Dec 23.
4
A mismatch between striatal cholinergic pauses and dopaminergic reward prediction errors.纹状体胆碱能停顿与多巴胺能奖励预测误差不匹配。
Proc Natl Acad Sci U S A. 2024 Oct 8;121(41):e2410828121. doi: 10.1073/pnas.2410828121. Epub 2024 Oct 4.
5
The Impact of Continuous and Partial Reinforcement on the Acquisition and Generalization of Human-Conditioned Fear.连续强化与部分强化对人类条件性恐惧习得及泛化的影响
Behav Sci (Basel). 2024 Jul 24;14(8):630. doi: 10.3390/bs14080630.
6
Computational insights on asymmetrical and receptor-mediated chunking: implications for OCD and Schizophrenia.关于不对称和受体介导组块化的计算见解:对强迫症和精神分裂症的启示。
Cogn Neurodyn. 2024 Feb;18(1):217-232. doi: 10.1007/s11571-022-09865-4. Epub 2023 Jan 12.
7
Acetylcholine modulates the precision of prediction error in the auditory cortex.乙酰胆碱调节听觉皮层中预测误差的精度。
Elife. 2024 Jan 19;12:RP91475. doi: 10.7554/eLife.91475.
8
A computational architecture of punishment insensitivity.一种惩罚不敏感的计算架构。
Proc Natl Acad Sci U S A. 2023 Nov 7;120(45):e2312950120. doi: 10.1073/pnas.2312950120. Epub 2023 Oct 31.
9
Reframing dopamine: A controlled controller at the limbic-motor interface.重新定义多巴胺:边缘运动界面的受控控制器。
PLoS Comput Biol. 2023 Oct 17;19(10):e1011569. doi: 10.1371/journal.pcbi.1011569. eCollection 2023 Oct.
10
Competing neural representations of choice shape evidence accumulation in humans.竞争的神经选择表征塑造了人类的证据积累。
Elife. 2023 Oct 11;12:e85223. doi: 10.7554/eLife.85223.
对抗者学习(OpAL):纹状体多巴胺对强化学习和选择动机的交互作用建模
Psychol Rev. 2014 Jul;121(3):337-66. doi: 10.1037/a0037015.
4
Human cognition. Foundations of human reasoning in the prefrontal cortex.人类认知。前额皮质中的人类推理基础。
Science. 2014 Jun 27;344(6191):1481-6. doi: 10.1126/science.1252254. Epub 2014 May 29.
5
Cortical and thalamic excitation mediate the multiphasic responses of striatal cholinergic interneurons to motivationally salient stimuli.皮质和丘脑兴奋介导纹状体胆碱能中间神经元对动机相关刺激的多相反应。
J Neurosci. 2014 Feb 19;34(8):3101-17. doi: 10.1523/JNEUROSCI.4627-13.2014.
6
δ-opioid and dopaminergic processes in accumbens shell modulate the cholinergic control of predictive learning and choice.伏隔核壳部的 δ-阿片受体和多巴胺能过程调节了预测学习和选择的胆碱能控制。
J Neurosci. 2014 Jan 22;34(4):1358-69. doi: 10.1523/JNEUROSCI.4592-13.2014.
7
Hierarchical prediction errors in midbrain and basal forebrain during sensory learning.中脑和基底前脑在感觉学习过程中的分层预测误差。
Neuron. 2013 Oct 16;80(2):519-30. doi: 10.1016/j.neuron.2013.09.009.
8
Prolonged dopamine signalling in striatum signals proximity and value of distant rewards.纹状体中多巴胺信号的延长标志着远距离奖励的接近度和价值。
Nature. 2013 Aug 29;500(7464):575-9. doi: 10.1038/nature12475. Epub 2013 Aug 4.
9
The thalamostriatal pathway and cholinergic control of goal-directed action: interlacing new with existing learning in the striatum.丘脑底核途径和胆碱能控制的目标导向行动:在纹状体中交织新的与现有的学习。
Neuron. 2013 Jul 10;79(1):153-66. doi: 10.1016/j.neuron.2013.04.039. Epub 2013 Jun 13.
10
Vividness of visual imagery and incidental recall of verbal cues, when phenomenological availability reflects long-term memory accessibility.当现象可用性反映长期记忆可及性时,视觉意象的生动性和偶然回忆的言语线索。
Front Psychol. 2013 Feb 4;4:1. doi: 10.3389/fpsyg.2013.00001. eCollection 2013.