
Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum.

Authors

Ito Makoto, Doya Kenji

Affiliation

Okinawa Institute of Science and Technology Graduate University, Onna-son Okinawa, Japan.

Publication information

PLoS Comput Biol. 2015 Nov 3;11(11):e1004540. doi: 10.1371/journal.pcbi.1004540. eCollection 2015 Nov.

DOI:10.1371/journal.pcbi.1004540
PMID:26529522
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4631489/
Abstract

Previous theoretical studies of animal and human behavioral learning have focused on the dichotomy of the value-based strategy using action value functions to predict rewards and the model-based strategy using internal models to predict environmental states. However, animals and humans often take simple procedural behaviors, such as the "win-stay, lose-switch" strategy without explicit prediction of rewards or states. Here we consider another strategy, the finite state-based strategy, in which a subject selects an action depending on its discrete internal state and updates the state depending on the action chosen and the reward outcome. By analyzing choice behavior of rats in a free-choice task, we found that the finite state-based strategy fitted their behavioral choices more accurately than value-based and model-based strategies did. When fitted models were run autonomously with the same task, only the finite state-based strategy could reproduce the key feature of choice sequences. Analyses of neural activity recorded from the dorsolateral striatum (DLS), the dorsomedial striatum (DMS), and the ventral striatum (VS) identified significant fractions of neurons in all three subareas for which activities were correlated with individual states of the finite state-based strategy. The signal of internal states at the time of choice was found in DMS, and for clusters of states was found in VS. In addition, action values and state values of the value-based strategy were encoded in DMS and VS, respectively. These results suggest that both the value-based strategy and the finite state-based strategy are implemented in the striatum.
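
The finite state-based strategy described in the abstract can be illustrated with a small sketch. The code below is not the authors' fitted model: the class name `FiniteStateAgent`, the action labels `"L"` and `"R"`, the two-state layout, and the reward probabilities are all assumptions chosen for illustration. It shows an agent whose choice depends only on a discrete internal state and whose state is updated from the chosen action and the reward outcome, with "win-stay, lose-switch" as the simplest instance.

```python
import random
from dataclasses import dataclass


@dataclass
class FiniteStateAgent:
    """Hypothetical finite state-based agent (illustration only).

    policy:      internal state -> probability of choosing "L"
    transitions: (state, action, reward) -> next internal state
    """
    policy: dict
    transitions: dict
    state: int = 0

    def act(self, rng: random.Random) -> str:
        # The choice depends only on the current discrete internal state.
        return "L" if rng.random() < self.policy[self.state] else "R"

    def update(self, action: str, reward: int) -> None:
        # The state changes according to the chosen action and the outcome;
        # no reward or environmental state is predicted explicitly.
        self.state = self.transitions[(self.state, action, reward)]


def make_wsls() -> FiniteStateAgent:
    """Win-stay, lose-switch as a two-state machine (assumed encoding).

    State 0 means "choose L next", state 1 means "choose R next".
    """
    policy = {0: 1.0, 1: 0.0}
    transitions = {}
    for s in (0, 1):
        transitions[(s, "L", 1)] = 0  # win on L: stay with L
        transitions[(s, "L", 0)] = 1  # loss on L: switch to R
        transitions[(s, "R", 1)] = 1  # win on R: stay with R
        transitions[(s, "R", 0)] = 0  # loss on R: switch to L
    return FiniteStateAgent(policy, transitions)


def simulate(agent, reward_prob, n_trials, seed=0):
    """Run the agent autonomously on a two-choice task with fixed
    reward probabilities (hypothetical values supplied by the caller)."""
    rng = random.Random(seed)
    history = []
    for _ in range(n_trials):
        action = agent.act(rng)
        reward = int(rng.random() < reward_prob[action])
        agent.update(action, reward)
        history.append((action, reward))
    return history


if __name__ == "__main__":
    print(simulate(make_wsls(), {"L": 0.9, "R": 0.1}, n_trials=10))
```

Richer variants of this sketch would use more internal states and probabilistic policies. A value-based agent would instead keep a running estimate of the reward expected from each action and choose by comparing those estimates.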

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/15746828912b/pcbi.1004540.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/c2dad471c2c8/pcbi.1004540.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/bc4ea3decc5c/pcbi.1004540.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/fc9177b5fafa/pcbi.1004540.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/f98834540219/pcbi.1004540.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/b38bf461841b/pcbi.1004540.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/c333090e3e32/pcbi.1004540.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/48c7/4631489/dc91df66fb2d/pcbi.1004540.g008.jpg

Similar articles

1
Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum.
PLoS Comput Biol. 2015 Nov 3;11(11):e1004540. doi: 10.1371/journal.pcbi.1004540. eCollection 2015 Nov.
2
Lesions of dorsal striatum eliminate lose-switch responding but not mixed-response strategies in rats.
Eur J Neurosci. 2014 May;39(10):1655-63. doi: 10.1111/ejn.12518. Epub 2014 Mar 6.
3
Distinct neural representation in the dorsolateral, dorsomedial, and ventral parts of the striatum during fixed- and free-choice tasks.
J Neurosci. 2015 Feb 25;35(8):3499-514. doi: 10.1523/JNEUROSCI.1962-14.2015.
4
Neuronal basis for evaluating selected action in the primate striatum.
Eur J Neurosci. 2011 Aug;34(3):489-506. doi: 10.1111/j.1460-9568.2011.07771.x. Epub 2011 Jul 22.
5
Neuronal activity in dorsomedial and dorsolateral striatum under the requirement for temporal credit assignment.
Sci Rep. 2016 Jun 1;6:27056. doi: 10.1038/srep27056.
6
Dorsolateral Striatal Task-initiation Bursts Represent Past Experiences More than Future Action Plans.
J Neurosci. 2021 Sep 22;41(38):8051-8064. doi: 10.1523/JNEUROSCI.3080-20.2021. Epub 2021 Aug 10.
7
Neuronal Representation of a Working Memory-Based Decision Strategy in the Motor and Prefrontal Cortico-Basal Ganglia Loops.
eNeuro. 2023 Jun 20;10(6). doi: 10.1523/ENEURO.0413-22.2023. Print 2023 Jun.
8
Validation of decision-making models and analysis of decision variables in the rat basal ganglia.
J Neurosci. 2009 Aug 5;29(31):9861-74. doi: 10.1523/JNEUROSCI.6157-08.2009.
9
Reward prediction error signaling in posterior dorsomedial striatum is action specific.
J Neurosci. 2012 Jul 25;32(30):10296-305. doi: 10.1523/JNEUROSCI.0832-12.2012.
10
Brain mechanism of reward prediction under predictable and unpredictable environmental dynamics.
Neural Netw. 2006 Oct;19(8):1233-41. doi: 10.1016/j.neunet.2006.05.039. Epub 2006 Sep 18.

Cited by

1
Dorsal raphe nucleus controls motivation-state transitions in monkeys.
Sci Adv. 2025 Jun 27;11(26):eads1236. doi: 10.1126/sciadv.ads1236.
2
Distinct basal ganglia contributions to learning from implicit and explicit value signals in perceptual decision-making.
Nat Commun. 2024 Jun 22;15(1):5317. doi: 10.1038/s41467-024-49538-w.
3
Neuronal Representation of a Working Memory-Based Decision Strategy in the Motor and Prefrontal Cortico-Basal Ganglia Loops.
eNeuro. 2023 Jun 20;10(6). doi: 10.1523/ENEURO.0413-22.2023. Print 2023 Jun.
4
Choice-selective sequences dominate in cortical relative to thalamic inputs to NAc to support reinforcement learning.
Cell Rep. 2022 May 17;39(7):110756. doi: 10.1016/j.celrep.2022.110756.
5
Context coding in the mouse nucleus accumbens modulates motivationally relevant information.
PLoS Biol. 2022 Apr 29;20(4):e3001338. doi: 10.1371/journal.pbio.3001338. eCollection 2022 Apr.
6
A Genetically Defined Compartmentalized Striatal Direct Pathway for Negative Reinforcement.
Cell. 2020 Oct 1;183(1):211-227.e20. doi: 10.1016/j.cell.2020.08.032. Epub 2020 Sep 15.
7
Striatal action-value neurons reconsidered.
Elife. 2018 May 31;7:e34248. doi: 10.7554/eLife.34248.
8
Reward-Predictive Neural Activities in Striatal Striosome Compartments.
eNeuro. 2018 Feb 5;5(1). doi: 10.1523/ENEURO.0367-17.2018. eCollection 2018 Jan-Feb.
9
Striatal Activity and Reward Relativity: Neural Signals Encoding Dynamic Outcome Valuation.
eNeuro. 2016 Nov 1;3(5). doi: 10.1523/ENEURO.0022-16.2016. eCollection 2016 Sep-Oct.

References

1
Distinct neural representation in the dorsolateral, dorsomedial, and ventral parts of the striatum during fixed- and free-choice tasks.
J Neurosci. 2015 Feb 25;35(8):3499-514. doi: 10.1523/JNEUROSCI.1962-14.2015.
2
An analysis of rat prefrontal cortex in mediating executive function.
Neurobiol Learn Mem. 2011 Oct;96(3):417-31. doi: 10.1016/j.nlm.2011.07.002. Epub 2011 Aug 9.
3
Neuronal basis for evaluating selected action in the primate striatum.
Eur J Neurosci. 2011 Aug;34(3):489-506. doi: 10.1111/j.1460-9568.2011.07771.x. Epub 2011 Jul 22.
4
Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit.
Curr Opin Neurobiol. 2011 Jun;21(3):368-73. doi: 10.1016/j.conb.2011.04.001. Epub 2011 Apr 29.
5
Model-based influences on humans' choices and striatal prediction errors.
Neuron. 2011 Mar 24;69(6):1204-15. doi: 10.1016/j.neuron.2011.02.027.
6
States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.
Neuron. 2010 May 27;66(4):585-95. doi: 10.1016/j.neuron.2010.04.016.
7
Switching from automatic to controlled behavior: cortico-basal ganglia mechanisms.
Trends Cogn Sci. 2010 Apr;14(4):154-61. doi: 10.1016/j.tics.2010.01.006. Epub 2010 Feb 22.
8
The dorsomedial striatum reflects response bias during learning.
J Neurosci. 2009 Nov 25;29(47):14891-902. doi: 10.1523/JNEUROSCI.4060-09.2009.
9
Role of striatum in updating values of chosen actions.
J Neurosci. 2009 Nov 25;29(47):14701-12. doi: 10.1523/JNEUROSCI.2728-09.2009.
10
Ventral striatal neurons encode the value of the chosen action in rats deciding between differently delayed or sized rewards.
J Neurosci. 2009 Oct 21;29(42):13365-76. doi: 10.1523/JNEUROSCI.2572-09.2009.