• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

奖赏预期指导学习并驱动操作性匹配。

Reward expectations direct learning and drive operant matching in .

机构信息

Janelia Research Campus, HHMI, Ashburn, VA 20147.

Solomon H. Snyder Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD 21205.

出版信息

Proc Natl Acad Sci U S A. 2023 Sep 26;120(39):e2221415120. doi: 10.1073/pnas.2221415120. Epub 2023 Sep 21.

DOI:10.1073/pnas.2221415120
PMID:37733736
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10523640/
Abstract

Foraging animals must use decision-making strategies that dynamically adapt to the changing availability of rewards in the environment. A wide diversity of animals do this by distributing their choices in proportion to the rewards received from each option, Herrnstein's operant matching law. Theoretical work suggests an elegant mechanistic explanation for this ubiquitous behavior, as operant matching follows automatically from simple synaptic plasticity rules acting within behaviorally relevant neural circuits. However, no past work has mapped operant matching onto plasticity mechanisms in the brain, leaving the biological relevance of the theory unclear. Here, we discovered operant matching in and showed that it requires synaptic plasticity that acts in the mushroom body and incorporates the expectation of reward. We began by developing a dynamic foraging paradigm to measure choices from individual flies as they learn to associate odor cues with probabilistic rewards. We then built a model of the fly mushroom body to explain each fly's sequential choice behavior using a family of biologically realistic synaptic plasticity rules. As predicted by past theoretical work, we found that synaptic plasticity rules could explain fly matching behavior by incorporating stimulus expectations, reward expectations, or both. However, by optogenetically bypassing the representation of reward expectation, we abolished matching behavior and showed that the plasticity rule must specifically incorporate reward expectations. Altogether, these results reveal the first synapse-level mechanisms of operant matching and provide compelling evidence for the role of reward expectation signals in the fly brain.

摘要

觅食动物必须使用决策策略,这些策略要能根据环境中奖励的变化情况做出动态调整。许多动物通过赫恩斯坦操作性匹配定律来实现这一点,该定律将动物的选择分配比例与从每种选择中获得的奖励成正比。理论研究为这种普遍存在的行为提供了一种优雅的机械解释,因为操作性匹配是由在行为相关神经回路中起作用的简单突触可塑性规则自动产生的。然而,过去的研究工作并未将操作性匹配映射到大脑中的可塑性机制上,因此该理论的生物学相关性尚不清楚。在这里,我们在果蝇中发现了操作性匹配,并证明它需要在蘑菇体中起作用的突触可塑性,并且要整合对奖励的期望。我们首先开发了一个动态觅食范式,以测量个体果蝇在将气味线索与概率奖励相关联的过程中的选择情况。然后,我们构建了一个果蝇蘑菇体模型,使用一系列具有生物学现实意义的突触可塑性规则来解释每只果蝇的顺序选择行为。正如过去的理论工作所预测的那样,我们发现,通过整合刺激期望、奖励期望或两者,突触可塑性规则可以解释果蝇的匹配行为。然而,通过光遗传学绕过奖励期望的表示,我们消除了匹配行为,并表明可塑性规则必须具体地整合奖励期望。总之,这些结果揭示了操作性匹配的第一个突触水平机制,并为奖励期望信号在果蝇大脑中的作用提供了有力证据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/d6239e58868a/pnas.2221415120fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/67a452cc8cf1/pnas.2221415120fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/8fb15e15500e/pnas.2221415120fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/e3d7ef1278e6/pnas.2221415120fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/f6f38676d3b6/pnas.2221415120fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/d6239e58868a/pnas.2221415120fig05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/67a452cc8cf1/pnas.2221415120fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/8fb15e15500e/pnas.2221415120fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/e3d7ef1278e6/pnas.2221415120fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/f6f38676d3b6/pnas.2221415120fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1c8/10523640/d6239e58868a/pnas.2221415120fig05.jpg

相似文献

1
Reward expectations direct learning and drive operant matching in .奖赏预期指导学习并驱动操作性匹配。
Proc Natl Acad Sci U S A. 2023 Sep 26;120(39):e2221415120. doi: 10.1073/pnas.2221415120. Epub 2023 Sep 21.
2
Operant matching is a generic outcome of synaptic plasticity based on the covariance between reward and neural activity.操作性匹配是基于奖励与神经活动之间的协方差的突触可塑性的一般结果。
Proc Natl Acad Sci U S A. 2006 Oct 10;103(41):15224-9. doi: 10.1073/pnas.0505220103. Epub 2006 Sep 28.
3
Robustness of learning that is based on covariance-driven synaptic plasticity.基于协方差驱动突触可塑性的学习的稳健性。
PLoS Comput Biol. 2008 Mar 7;4(3):e1000007. doi: 10.1371/journal.pcbi.1000007.
4
Operant matching as a Nash equilibrium of an intertemporal game.作为跨期博弈纳什均衡的操作性匹配
Neural Comput. 2009 Oct;21(10):2755-73. doi: 10.1162/neco.2009.09-08-854.
5
Bayesian deterministic decision making: a normative account of the operant matching law and heavy-tailed reward history dependency of choices.贝叶斯确定性决策:对操作性匹配律和选择中重尾奖励历史依赖性的规范解释。
Front Comput Neurosci. 2014 Mar 4;8:18. doi: 10.3389/fncom.2014.00018. eCollection 2014.
6
Normative decision rules in changing environments.规范决策规则在不断变化的环境中。
Elife. 2022 Oct 25;11:e79824. doi: 10.7554/eLife.79824.
7
Switch-like and persistent memory formation in individual .个体. 中类晶体管式和持久记忆的形成
Elife. 2021 Oct 12;10:e70317. doi: 10.7554/eLife.70317.
8
Statistical mechanics of reward-modulated learning in decision-making networks.决策网络中受奖励调节的学习的统计力学。
Neural Comput. 2012 May;24(5):1230-70. doi: 10.1162/NECO_a_00264. Epub 2012 Feb 1.
9
Rest Is Required to Learn an Appetitively-Reinforced Operant Task in .在……中学习一项由食欲强化的操作性任务需要休息。 (你提供的原文似乎不完整,最后的“in.”后面应该还有具体内容)
Front Behav Neurosci. 2021 Jun 18;15:681593. doi: 10.3389/fnbeh.2021.681593. eCollection 2021.
10
Rewarding Capacity of Optogenetically Activating a Giant GABAergic Central-Brain Interneuron in Larval .光遗传学激活幼虫大脑中央 GABA 能中间神经元的奖赏能力。
J Neurosci. 2023 Nov 1;43(44):7393-7428. doi: 10.1523/JNEUROSCI.2310-22.2023. Epub 2023 Sep 21.

引用本文的文献

1
Model-based inference of synaptic plasticity rules.基于模型的突触可塑性规则推理。
Adv Neural Inf Process Syst. 2024;37:48519-48540.
2
The role of dopamine in foraging decisions in social insects.多巴胺在社会性昆虫觅食决策中的作用。
Front Insect Sci. 2025 Apr 17;5:1581307. doi: 10.3389/finsc.2025.1581307. eCollection 2025.
3
Neuronal circuit mechanisms of competitive interaction between action-based and coincidence learning.基于动作学习与同步学习之间竞争性相互作用的神经元回路机制

本文引用的文献

1
Brain mechanism of foraging: Reward-dependent synaptic plasticity versus neural integration of values.觅食的大脑机制:奖赏依赖型突触可塑性与价值的神经整合。
Proc Natl Acad Sci U S A. 2024 Apr 2;121(14):e2318521121. doi: 10.1073/pnas.2318521121. Epub 2024 Mar 29.
2
Enhanced olfactory memory detection in trap-design Y-mazes allows the study of imperceptible memory traces in .在陷阱设计的Y迷宫中增强嗅觉记忆检测,有助于研究难以察觉的记忆痕迹。
Learn Mem. 2022 Sep 30;29(10):355-366. doi: 10.1101/lm.053545.121. Print 2022 Oct.
3
Inferring learning rules from animal decision-making.
Sci Adv. 2024 Dec 6;10(49):eadq3016. doi: 10.1126/sciadv.adq3016.
4
Brain mechanism of foraging: Reward-dependent synaptic plasticity versus neural integration of values.觅食的大脑机制:奖赏依赖型突触可塑性与价值的神经整合。
Proc Natl Acad Sci U S A. 2024 Apr 2;121(14):e2318521121. doi: 10.1073/pnas.2318521121. Epub 2024 Mar 29.
5
Prediction error drives associative learning and conditioned behavior in a spiking model of Drosophila larva.预测误差驱动果蝇幼虫尖峰模型中的联想学习和条件行为。
iScience. 2023 Dec 26;27(1):108640. doi: 10.1016/j.isci.2023.108640. eCollection 2024 Jan 19.
6
Input density tunes Kenyon cell sensory responses in the Drosophila mushroom body.输入密度调节果蝇蘑菇体中的肯扬细胞感觉反应。
Curr Biol. 2023 Jul 10;33(13):2742-2760.e12. doi: 10.1016/j.cub.2023.05.064. Epub 2023 Jun 21.
7
Hacking brain development to test models of sensory coding.通过干预大脑发育来测试感觉编码模型。
bioRxiv. 2023 Jan 26:2023.01.25.525425. doi: 10.1101/2023.01.25.525425.
从动物决策中推断学习规则。
Adv Neural Inf Process Syst. 2020;33:3442-3453.
4
Differential coding of absolute and relative aversive value in the Drosophila brain.果蝇脑中绝对和相对厌恶值的差异编码。
Curr Biol. 2022 Nov 7;32(21):4576-4592.e5. doi: 10.1016/j.cub.2022.08.058. Epub 2022 Sep 13.
5
Mice exhibit stochastic and efficient action switching during probabilistic decision making.在进行概率决策时,老鼠表现出随机且有效的动作转换。
Proc Natl Acad Sci U S A. 2022 Apr 12;119(15):e2113961119. doi: 10.1073/pnas.2113961119. Epub 2022 Apr 6.
6
An incentive circuit for memory dynamics in the mushroom body of .蘑菇体中记忆动态的激励回路。
Elife. 2022 Apr 1;11:e75611. doi: 10.7554/eLife.75611.
7
Idiosyncratic learning performance in flies.果蝇的特殊学习表现。
Biol Lett. 2022 Feb;18(2):20210424. doi: 10.1098/rsbl.2021.0424. Epub 2022 Feb 2.
8
Context-dependent representations of movement in Drosophila dopaminergic reinforcement pathways.果蝇多巴胺强化途径中运动的上下文相关表示。
Nat Neurosci. 2021 Nov;24(11):1555-1566. doi: 10.1038/s41593-021-00929-y. Epub 2021 Oct 25.
9
Switch-like and persistent memory formation in individual .个体. 中类晶体管式和持久记忆的形成
Elife. 2021 Oct 12;10:e70317. doi: 10.7554/eLife.70317.
10
Models of heterogeneous dopamine signaling in an insect learning and memory center.昆虫学习记忆中心的异质多巴胺信号模型。
PLoS Comput Biol. 2021 Aug 10;17(8):e1009205. doi: 10.1371/journal.pcbi.1009205. eCollection 2021 Aug.