• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

经验回放与高效的非局部学习有关。

Experience replay is associated with efficient nonlocal learning.

机构信息

State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China.

Chinese Institute for Brain Research, Beijing, China.

出版信息

Science. 2021 May 21;372(6544). doi: 10.1126/science.abf1357.

DOI:10.1126/science.abf1357
PMID:34016753
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7610948/
Abstract

To make effective decisions, people need to consider the relationship between actions and outcomes. These are often separated by time and space. The neural mechanisms by which disjoint actions and outcomes are linked remain unknown. One promising hypothesis involves neural replay of nonlocal experience. Using a task that segregates direct from indirect value learning, combined with magnetoencephalography, we examined the role of neural replay in human nonlocal learning. After receipt of a reward, we found significant backward replay of nonlocal experience, with a 160-millisecond state-to-state time lag, which was linked to efficient learning of action values. Backward replay and behavioral evidence of nonlocal learning were more pronounced for experiences of greater benefit for future behavior. These findings support nonlocal replay as a neural mechanism for solving complex credit assignment problems during learning.

摘要

为了做出有效的决策,人们需要考虑行为和结果之间的关系。这些通常是时间和空间上分离的。将不相关的行为和结果联系起来的神经机制尚不清楚。一个有希望的假设涉及非局部经验的神经重放。我们使用一种将直接价值学习和间接价值学习分开的任务,结合脑磁图,研究了神经重放在人类非局部学习中的作用。在收到奖励后,我们发现了非局部经验的显著回溯重放,状态到状态的时间滞后为 160 毫秒,这与行为值的有效学习有关。对于未来行为更有利的经验,回溯重放和非局部学习的行为证据更为明显。这些发现支持非局部重放作为学习过程中解决复杂信用分配问题的神经机制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/4e4a6855f0b2/EMS126439-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/9cc85cb6e4ce/EMS126439-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/2aad407aa4dc/EMS126439-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/04049a74fc95/EMS126439-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/c97f60bd157f/EMS126439-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/e0c782de1bc6/EMS126439-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/4e4a6855f0b2/EMS126439-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/9cc85cb6e4ce/EMS126439-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/2aad407aa4dc/EMS126439-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/04049a74fc95/EMS126439-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/c97f60bd157f/EMS126439-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/e0c782de1bc6/EMS126439-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c1f/7610948/4e4a6855f0b2/EMS126439-f006.jpg

相似文献

1
Experience replay is associated with efficient nonlocal learning.经验回放与高效的非局部学习有关。
Science. 2021 May 21;372(6544). doi: 10.1126/science.abf1357.
2
The roles of online and offline replay in planning.在线和离线回放在规划中的作用。
Elife. 2020 Jun 17;9:e56911. doi: 10.7554/eLife.56911.
3
Causal Inference Gates Corticostriatal Learning.因果推理门控皮质纹状体学习。
J Neurosci. 2021 Aug 11;41(32):6892-6904. doi: 10.1523/JNEUROSCI.2796-20.2021. Epub 2021 Jul 9.
4
Offline replay supports planning in human reinforcement learning.离线重放支持人类强化学习中的规划。
Elife. 2018 Dec 14;7:e32548. doi: 10.7554/eLife.32548.
5
Human Replay Spontaneously Reorganizes Experience.人类的重放会自发地重组经验。
Cell. 2019 Jul 25;178(3):640-652.e14. doi: 10.1016/j.cell.2019.06.012. Epub 2019 Jul 4.
6
Prioritizing replay when future goals are unknown.当未来目标不明时,优先进行重演。
bioRxiv. 2024 Mar 4:2024.02.29.582822. doi: 10.1101/2024.02.29.582822.
7
Post-learning Hippocampal Replay Selectively Reinforces Spatial Memory for Highly Rewarded Locations.学习后海马体重放选择性地加强了对高奖励位置的空间记忆。
Curr Biol. 2019 May 6;29(9):1436-1444.e5. doi: 10.1016/j.cub.2019.03.048. Epub 2019 Apr 25.
8
Credit Assignment in a Motor Decision Making Task Is Influenced by Agency and Not Sensory Prediction Errors.在一项运动决策任务中,信用分配受机构影响,而不受感官预测误差影响。
J Neurosci. 2018 May 9;38(19):4521-4530. doi: 10.1523/JNEUROSCI.3601-17.2018. Epub 2018 Apr 12.
9
Learning offline: memory replay in biological and artificial reinforcement learning.离线学习:生物强化学习和人工强化学习中的记忆重放。
Trends Neurosci. 2021 Oct;44(10):808-821. doi: 10.1016/j.tins.2021.07.007. Epub 2021 Sep 1.
10
A Hybrid Online Off-Policy Reinforcement Learning Agent Framework Supported by Transformers.基于 Transformer 的混合在线非策略强化学习代理框架。
Int J Neural Syst. 2023 Dec;33(12):2350065. doi: 10.1142/S012906572350065X. Epub 2023 Oct 20.

引用本文的文献

1
Trait anxiety is associated with reduced reward-related replay at rest.特质焦虑与静息时与奖励相关的重演减少有关。
Nat Commun. 2025 Aug 26;16(1):7975. doi: 10.1038/s41467-025-63281-w.
2
Replay in the human visual cortex during brief task pauses is linked to implicit learning of successor representations.在短暂的任务停顿期间,人类视觉皮层中的重放与后继表征的内隐学习相关联。
Proc Natl Acad Sci U S A. 2025 Aug 26;122(34):e2507516122. doi: 10.1073/pnas.2507516122. Epub 2025 Aug 22.
3
A Rational Information Gathering Account of Infant Habituation.

本文引用的文献

1
Phase precession in the human hippocampus and entorhinal cortex.人类海马体和内嗅皮层的相位进动。
Cell. 2021 Jun 10;184(12):3242-3255.e10. doi: 10.1016/j.cell.2021.04.017. Epub 2021 May 11.
2
Prioritized experience replays on a hippocampal predictive map for learning.优先体验回放对海马预测图的学习作用。
Proc Natl Acad Sci U S A. 2021 Jan 5;118(1). doi: 10.1073/pnas.2011266118. Epub 2020 Dec 18.
3
Alternating sequences of future and past behavior encoded within hippocampal theta oscillations.海马体θ节律中编码的未来和过去行为的交替序列。
婴儿习惯化的理性信息收集解释
bioRxiv. 2025 Jul 18:2025.01.09.629775. doi: 10.1101/2025.01.09.629775.
4
Goal-directed navigation in humans and deep reinforcement learning agents relies on an adaptive mix of vector-based and transition-based strategies.人类和深度强化学习智能体中的目标导向导航依赖于基于向量和基于转换的策略的自适应混合。
PLoS Biol. 2025 Jul 29;23(7):e3003296. doi: 10.1371/journal.pbio.3003296. eCollection 2025 Jul.
5
Non-feature-specific elevated responses and feature-specific backward replay in human brain induced by visual sequence exposure.视觉序列暴露诱导的人类大脑中无特征特异性的增强反应和特征特异性的反向重演
Elife. 2025 May 8;13:RP101511. doi: 10.7554/eLife.101511.
6
Reinforcement learning increasingly relates to memory specificity from childhood to adulthood.从童年到成年,强化学习与记忆特异性的关联日益紧密。
Nat Commun. 2025 Apr 30;16(1):4074. doi: 10.1038/s41467-025-59379-w.
7
Exploring replay.探索重演。
Nat Commun. 2025 Feb 15;16(1):1657. doi: 10.1038/s41467-025-56731-y.
8
Proactive and reactive construction of memory-based preferences.基于记忆的偏好的主动和被动构建。
Nat Commun. 2025 Feb 13;16(1):1618. doi: 10.1038/s41467-025-56183-4.
9
Schemas, reinforcement learning and the medial prefrontal cortex.图式、强化学习与内侧前额叶皮质
Nat Rev Neurosci. 2025 Mar;26(3):141-157. doi: 10.1038/s41583-024-00893-z. Epub 2025 Jan 7.
10
Sleep microstructure organizes memory replay.睡眠微观结构组织记忆重演。
Nature. 2025 Jan;637(8048):1161-1169. doi: 10.1038/s41586-024-08340-w. Epub 2025 Jan 1.
Science. 2020 Oct 9;370(6513):247-250. doi: 10.1126/science.abb4151.
4
Neuronal Computation Underlying Inferential Reasoning in Humans and Mice.人类和小鼠推理思维的神经计算。
Cell. 2020 Oct 1;183(1):228-243.e21. doi: 10.1016/j.cell.2020.08.035. Epub 2020 Sep 17.
5
Episodic memory retrieval success is associated with rapid replay of episode content.情节记忆提取的成功与情节内容的快速回放有关。
Nat Neurosci. 2020 Aug;23(8):1025-1033. doi: 10.1038/s41593-020-0649-z. Epub 2020 Jun 8.
6
Assembly-Specific Disruption of Hippocampal Replay Leads to Selective Memory Deficit.特定于集合的海马体重放破坏导致选择性记忆缺陷。
Neuron. 2020 Apr 22;106(2):291-300.e6. doi: 10.1016/j.neuron.2020.01.021. Epub 2020 Feb 17.
7
Advantages and detection of phase coding in the absence of rhythmicity.相位编码在无节律性情况下的优势和检测。
Hippocampus. 2020 Jul;30(7):745-762. doi: 10.1002/hipo.23199. Epub 2020 Feb 17.
8
Theta Oscillations in Human Memory.人类记忆中的θ波震荡。
Trends Cogn Sci. 2020 Mar;24(3):208-227. doi: 10.1016/j.tics.2019.12.006. Epub 2020 Feb 3.
9
Constant Sub-second Cycling between Representations of Possible Futures in the Hippocampus.海马体中可能未来的表示之间的恒次秒循环。
Cell. 2020 Feb 6;180(3):552-567.e25. doi: 10.1016/j.cell.2020.01.014. Epub 2020 Jan 30.
10
Human Replay Spontaneously Reorganizes Experience.人类的重放会自发地重组经验。
Cell. 2019 Jul 25;178(3):640-652.e14. doi: 10.1016/j.cell.2019.06.012. Epub 2019 Jul 4.