在果蝇蘑菇体模型中进行基于强化预测误差的学习。

Learning with reinforcement prediction errors in a model of the Drosophila mushroom body.

机构信息

Department of Informatics, University of Sussex, Brighton, UK.

出版信息

Nat Commun. 2021 May 7;12(1):2569. doi: 10.1038/s41467-021-22592-4.

DOI:10.1038/s41467-021-22592-4

PMID:33963189

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8105414/

Abstract

Effective decision making in a changing environment demands that accurate predictions are learned about decision outcomes. In Drosophila, such learning is orchestrated in part by the mushroom body, where dopamine neurons signal reinforcing stimuli to modulate plasticity presynaptic to mushroom body output neurons. Building on previous mushroom body models, in which dopamine neurons signal absolute reinforcement, we propose instead that dopamine neurons signal reinforcement prediction errors by utilising feedback reinforcement predictions from output neurons. We formulate plasticity rules that minimise prediction errors, verify that output neurons learn accurate reinforcement predictions in simulations, and postulate connectivity that explains more physiological observations than an experimentally constrained model. The constrained and augmented models reproduce a broad range of conditioning and blocking experiments, and we demonstrate that the absence of blocking does not imply the absence of prediction error dependent learning. Our results provide five predictions that can be tested using established experimental methods.

摘要

在不断变化的环境中做出有效决策需要对决策结果进行准确预测。在果蝇中，这种学习部分由蘑菇体协调，多巴胺神经元向蘑菇体输出神经元的突触前传递信号，以增强刺激的可塑性。基于之前的蘑菇体模型，其中多巴胺神经元信号表示绝对增强，我们提出相反的观点，即多巴胺神经元通过利用来自输出神经元的反馈增强预测来表示增强预测误差。我们制定了最小化预测误差的可塑性规则，在模拟中验证了输出神经元学习准确增强预测，并假设了连接方式，该连接方式比实验约束模型解释了更多的生理观察结果。受约束和增强的模型再现了广泛的条件作用和阻断实验，我们证明了缺乏阻断并不意味着不存在依赖于预测误差的学习。我们的结果提供了五个可以使用既定实验方法进行测试的预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2de8/8105414/07e5e008cdee/41467_2021_22592_Fig1_HTML.jpg

相似文献

Learning with reinforcement prediction errors in a model of the Drosophila mushroom body.在果蝇蘑菇体模型中进行基于强化预测误差的学习。

Nat Commun. 2021 May 7;12(1):2569. doi: 10.1038/s41467-021-22592-4.

Models of heterogeneous dopamine signaling in an insect learning and memory center.昆虫学习记忆中心的异质多巴胺信号模型。

PLoS Comput Biol. 2021 Aug 10;17(8):e1009205. doi: 10.1371/journal.pcbi.1009205. eCollection 2021 Aug.

Input Connectivity Reveals Additional Heterogeneity of Dopaminergic Reinforcement in Drosophila.输入连接揭示了果蝇多巴胺强化的额外异质性。

Curr Biol. 2020 Aug 17;30(16):3200-3211.e8. doi: 10.1016/j.cub.2020.05.077. Epub 2020 Jul 2.

Reciprocal synapses between mushroom body and dopamine neurons form a positive feedback loop required for learning.蘑菇体与多巴胺神经元之间的相互突触形成了学习所需的正反馈回路。

Elife. 2017 May 10;6:e23789. doi: 10.7554/eLife.23789.

Re-evaluation of learned information in Drosophila.对果蝇中习得信息的重新评估。

Nature. 2017 Apr 13;544(7649):240-244. doi: 10.1038/nature21716. Epub 2017 Apr 5.

Dopaminergic modulation of cAMP drives nonlinear plasticity across the Drosophila mushroom body lobes.多巴胺对环磷酸腺苷（cAMP）的调节驱动果蝇蘑菇体叶中的非线性可塑性。

Curr Biol. 2014 Apr 14;24(8):822-31. doi: 10.1016/j.cub.2014.03.021. Epub 2014 Mar 27.

Olfactory learning skews mushroom body output pathways to steer behavioral choice in Drosophila.嗅觉学习使果蝇的蘑菇体输出通路发生偏向，以引导行为选择。

Curr Opin Neurobiol. 2015 Dec;35:178-84. doi: 10.1016/j.conb.2015.10.002. Epub 2015 Nov 3.

Four Individually Identified Paired Dopamine Neurons Signal Reward in Larval Drosophila.四个经个体识别的成对多巴胺能神经元在果蝇幼虫中传递奖励信号。

Curr Biol. 2016 Mar 7;26(5):661-9. doi: 10.1016/j.cub.2016.01.012. Epub 2016 Feb 11.

Reinforcement signalling in Drosophila; dopamine does it all after all.果蝇中的强化信号；多巴胺毕竟无所不能。

Curr Opin Neurobiol. 2013 Jun;23(3):324-9. doi: 10.1016/j.conb.2013.01.005. Epub 2013 Feb 5.

Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time.学习表达类似于奖励预测误差的多巴胺能活动需要时间的可塑性表示。

Nat Commun. 2024 Jul 12;15(1):5856. doi: 10.1038/s41467-024-50205-3.

引用本文的文献

Path integration and optic flow in flying insects: a review of current evidence.飞行昆虫的路径整合与光流：当前证据综述

J Comp Physiol A Neuroethol Sens Neural Behav Physiol. 2025 May;211(3):375-401. doi: 10.1007/s00359-025-01734-9. Epub 2025 Mar 7.

Non-Immune Functions of Innate Immunity Acting on Physiological Processes: Insights from .先天免疫对生理过程的非免疫功能：来自……的见解

Int J Mol Sci. 2025 Jan 27;26(3):1087. doi: 10.3390/ijms26031087.

How bumblebees manage conflicting information seen on arrival and departure from flowers.大黄蜂如何处理在抵达和离开花朵时看到的相互矛盾的信息。

Anim Cogn. 2025 Feb 5;28(1):11. doi: 10.1007/s10071-024-01926-x.

A biological model of nonlinear dimensionality reduction.非线性降维的生物学模型。

Sci Adv. 2025 Feb 7;11(6):eadp9048. doi: 10.1126/sciadv.adp9048. Epub 2025 Feb 5.

Reinforcement learning when your life depends on it: A neuro-economic theory of learning.性命攸关时的强化学习：学习的神经经济学理论。

PLoS Comput Biol. 2024 Oct 28;20(10):e1012554. doi: 10.1371/journal.pcbi.1012554. eCollection 2024 Oct.

Reinforcement learning as a robotics-inspired framework for insect navigation: from spatial representations to neural implementation.强化学习作为一种受机器人启发的昆虫导航框架：从空间表征到神经实现。

Front Comput Neurosci. 2024 Sep 9;18:1460006. doi: 10.3389/fncom.2024.1460006. eCollection 2024.

Pavlovian safety learning: An integrative theoretical review.巴甫洛夫式安全学习：一项综合性理论综述。

Psychon Bull Rev. 2025 Feb;32(1):176-202. doi: 10.3758/s13423-024-02559-4. Epub 2024 Aug 21.

Dopamine-mediated interactions between short- and long-term memory dynamics.多巴胺介导的短期记忆和长期记忆动力学之间的相互作用。

Nature. 2024 Oct;634(8036):1141-1149. doi: 10.1038/s41586-024-07819-w. Epub 2024 Jul 22.

Roles of feedback and feed-forward networks of dopamine subsystems: insights from studies.多巴胺亚系统的反馈和前馈网络的作用：研究的启示。

Learn Mem. 2024 Jun 11;31(5). doi: 10.1101/lm.053807.123. Print 2024 May.

Beyond prediction error: 25 years of modeling the associations formed in the insect mushroom body.超越预测误差：昆虫脑的蘑菇体中形成的关联的 25 年建模。

Learn Mem. 2024 Jun 11;31(5). doi: 10.1101/lm.053824.123. Print 2024 May.

本文引用的文献

Recurrent architecture for adaptive regulation of learning in the insect brain.昆虫大脑中用于自适应学习调节的反复结构。

Nat Neurosci. 2020 Apr;23(4):544-555. doi: 10.1038/s41593-020-0607-9. Epub 2020 Mar 23.

Distinct Dopamine Receptor Pathways Underlie the Temporal Sensitivity of Associative Learning.不同的多巴胺受体途径为联想学习的时间敏感性提供了基础。

Cell. 2019 Jun 27;178(1):60-75.e19. doi: 10.1016/j.cell.2019.05.040. Epub 2019 Jun 20.

Learning the payoffs and costs of actions.学习行为的收益和成本。

PLoS Comput Biol. 2019 Feb 28;15(2):e1006285. doi: 10.1371/journal.pcbi.1006285. eCollection 2019 Feb.

Integration of Parallel Opposing Memories Underlies Memory Extinction.并行对立记忆的整合是记忆消除的基础。

Cell. 2018 Oct 18;175(3):709-722.e15. doi: 10.1016/j.cell.2018.08.021. Epub 2018 Sep 20.

Abstract concept learning in a simple neural network inspired by the insect brain.受昆虫大脑启发的简单神经网络中的抽象概念学习。

PLoS Comput Biol. 2018 Sep 17;14(9):e1006435. doi: 10.1371/journal.pcbi.1006435. eCollection 2018 Sep.

Contemporary associative learning theory predicts failures to obtain blocking: Comment on Maes et al. (2016).当代联想学习理论预测阻断失败：评 Maes 等人（2016 年）。

J Exp Psychol Gen. 2018 Apr;147(4):597-602. doi: 10.1037/xge0000341.

Persistent activity in a recurrent circuit underlies courtship memory in .循环回路中的持续活动是[具体物种未给出]求偶记忆的基础。

Elife. 2018 Jan 11;7:e31425. doi: 10.7554/eLife.31425.

Do the right thing: neural network mechanisms of memory formation, expression and update in Drosophila.做正确的事：果蝇中记忆形成、表达和更新的神经网络机制。

Curr Opin Neurobiol. 2018 Apr;49:51-58. doi: 10.1016/j.conb.2017.12.002. Epub 2017 Dec 16.

The complete connectome of a learning and memory centre in an insect brain.昆虫大脑中一个学习与记忆中心的完整连接组。

Nature. 2017 Aug 9;548(7666):175-182. doi: 10.1038/nature23455.

The Biology of Forgetting-A Perspective.遗忘的生物学——一种视角

Neuron. 2017 Aug 2;95(3):490-503. doi: 10.1016/j.neuron.2017.05.039.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

在果蝇蘑菇体模型中进行基于强化预测误差的学习。

Learning with reinforcement prediction errors in a model of the Drosophila mushroom body.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献