• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于特征的学习提高了适应性,同时又不影响精度。

Feature-based learning improves adaptability without compromising precision.

机构信息

Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, 03755, USA.

Department of Neuroscience, Yale School of Medicine, New Haven, CT, 06510, USA.

出版信息

Nat Commun. 2017 Nov 24;8(1):1768. doi: 10.1038/s41467-017-01874-w.

DOI:10.1038/s41467-017-01874-w
PMID:29170381
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5700946/
Abstract

Learning from reward feedback is essential for survival but can become extremely challenging with myriad choice options. Here, we propose that learning reward values of individual features can provide a heuristic for estimating reward values of choice options in dynamic, multi-dimensional environments. We hypothesize that this feature-based learning occurs not just because it can reduce dimensionality, but more importantly because it can increase adaptability without compromising precision of learning. We experimentally test this hypothesis and find that in dynamic environments, human subjects adopt feature-based learning even when this approach does not reduce dimensionality. Even in static, low-dimensional environments, subjects initially adopt feature-based learning and gradually switch to learning reward values of individual options, depending on how accurately objects' values can be predicted by combining feature values. Our computational models reproduce these results and highlight the importance of neurons coding feature values for parallel learning of values for features and objects.

摘要

从奖励反馈中学习对于生存至关重要,但在面临无数选择时,这可能变得极具挑战性。在这里,我们提出,学习单个特征的奖励值可以为在动态、多维环境中估计选择选项的奖励值提供一种启发式方法。我们假设这种基于特征的学习不仅因为它可以降低维度,而且更重要的是,因为它可以在不影响学习精度的情况下提高适应性。我们通过实验检验了这一假设,结果发现,在动态环境中,人类受试者采用基于特征的学习,即使这种方法不能降低维度。即使在静态、低维环境中,受试者最初也采用基于特征的学习,然后根据通过组合特征值来准确预测对象值的能力,逐渐切换到学习各个选项的奖励值。我们的计算模型再现了这些结果,并强调了编码特征值的神经元对于特征和对象值的并行学习的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/9e5e88d9a470/41467_2017_1874_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/d9df962e92b9/41467_2017_1874_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/d5f4f86a746d/41467_2017_1874_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/4591ae03c6d7/41467_2017_1874_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/5d0467770cbe/41467_2017_1874_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/23fc8cd2efb8/41467_2017_1874_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/280a2f1b48a0/41467_2017_1874_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/9e5e88d9a470/41467_2017_1874_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/d9df962e92b9/41467_2017_1874_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/d5f4f86a746d/41467_2017_1874_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/4591ae03c6d7/41467_2017_1874_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/5d0467770cbe/41467_2017_1874_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/23fc8cd2efb8/41467_2017_1874_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/280a2f1b48a0/41467_2017_1874_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b74f/5700946/9e5e88d9a470/41467_2017_1874_Fig7_HTML.jpg

相似文献

1
Feature-based learning improves adaptability without compromising precision.基于特征的学习提高了适应性,同时又不影响精度。
Nat Commun. 2017 Nov 24;8(1):1768. doi: 10.1038/s41467-017-01874-w.
2
Influence of learning strategy on response time during complex value-based learning and choice.学习策略对基于复杂价值的学习和选择过程中反应时的影响。
PLoS One. 2018 May 22;13(5):e0197263. doi: 10.1371/journal.pone.0197263. eCollection 2018.
3
Learning arbitrary stimulus-reward associations for naturalistic stimuli involves transition from learning about features to learning about objects.学习自然刺激的任意刺激-奖励关联涉及从学习特征到学习对象的转变。
Cognition. 2020 Dec;205:104425. doi: 10.1016/j.cognition.2020.104425. Epub 2020 Sep 19.
4
Computing reward-prediction error: an integrated account of cortical timing and basal-ganglia pathways for appetitive and aversive learning.计算奖励预测误差:关于皮层时间和基底神经节通路在食欲性和厌恶性学习中的综合阐述
Eur J Neurosci. 2015 Aug;42(4):2003-21. doi: 10.1111/ejn.12994. Epub 2015 Jul 25.
5
Hold it! The influence of lingering rewards on choice diversification and persistence.等等!延迟奖励对选择多样化和坚持性的影响。
J Exp Psychol Learn Mem Cogn. 2017 Nov;43(11):1752-1767. doi: 10.1037/xlm0000407. Epub 2017 Apr 6.
6
Optimal structure of metaplasticity for adaptive learning.适应性学习的元可塑性最佳结构。
PLoS Comput Biol. 2017 Jun 28;13(6):e1005630. doi: 10.1371/journal.pcbi.1005630. eCollection 2017 Jun.
7
Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum.腹侧纹状体和背侧纹状体中基于价值和有限状态策略的并行表征。
PLoS Comput Biol. 2015 Nov 3;11(11):e1004540. doi: 10.1371/journal.pcbi.1004540. eCollection 2015 Nov.
8
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策:强化学习预测错误在人类中的快速传播。
J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.
9
Separate mesocortical and mesolimbic pathways encode effort and reward learning signals.分离的中皮质和中边缘通路编码努力和奖励学习信号。
Proc Natl Acad Sci U S A. 2017 Aug 29;114(35):E7395-E7404. doi: 10.1073/pnas.1705643114. Epub 2017 Aug 14.
10
Reward-dependent learning in neuronal networks for planning and decision making.用于规划和决策的神经网络中基于奖励的学习。
Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0.

引用本文的文献

1
How working memory and reinforcement learning interact when avoiding punishment and pursuing reward concurrently.当同时避免惩罚和追求奖励时,工作记忆与强化学习是如何相互作用的。
J Exp Psychol Gen. 2025 Sep 1. doi: 10.1037/xge0001817.
2
Harnessing multi-output machine learning approach and dynamical observables from network structure to optimize COVID-19 intervention strategies.利用多输出机器学习方法和网络结构中的动态可观测量来优化新冠肺炎干预策略。
Biol Methods Protoc. 2025 Jun 5;10(1):bpaf039. doi: 10.1093/biomethods/bpaf039. eCollection 2025.
3
Schemas, reinforcement learning and the medial prefrontal cortex.

本文引用的文献

1
Optimal structure of metaplasticity for adaptive learning.适应性学习的元可塑性最佳结构。
PLoS Comput Biol. 2017 Jun 28;13(6):e1005630. doi: 10.1371/journal.pcbi.1005630. eCollection 2017 Jun.
2
Metaplasticity as a Neural Substrate for Adaptive Learning and Choice under Uncertainty.作为不确定性下适应性学习与选择的神经基础的元可塑性
Neuron. 2017 Apr 19;94(2):401-414.e6. doi: 10.1016/j.neuron.2017.03.044.
3
Dynamic Interaction between Reinforcement Learning and Attention in Multidimensional Environments.多维环境中强化学习与注意力之间的动态交互
图式、强化学习与内侧前额叶皮质
Nat Rev Neurosci. 2025 Mar;26(3):141-157. doi: 10.1038/s41583-024-00893-z. Epub 2025 Jan 7.
4
Contributions of Attention to Learning in Multidimensional Reward Environments.在多维奖励环境中注意力对学习的贡献。
J Neurosci. 2025 Feb 12;45(7):e2300232024. doi: 10.1523/JNEUROSCI.2300-23.2024.
5
Noradrenergic alpha-2a receptor stimulation enhances prediction error signaling and updating of attention sets in anterior cingulate cortex and striatum.去甲肾上腺素能α-2a 受体刺激增强了前扣带回皮层和纹状体中预测误差信号和注意力集的更新。
Nat Commun. 2024 Nov 15;15(1):9905. doi: 10.1038/s41467-024-54395-8.
6
Neural Mechanisms Underlying Robust Target Selection in Response to Microstimulation of the Oculomotor System.动眼系统微刺激反应中稳健目标选择的神经机制
J Neurosci. 2025 Jan 15;45(3):e2356232024. doi: 10.1523/JNEUROSCI.2356-23.2024.
7
Integration of Theory and Practice in Medical Morphology Curriculum in Postgraduate Training: A Flipped Classroom and Case-based Learning Exercise.研究生医学形态学课程理论与实践的整合:翻转课堂与基于案例的学习。
Curr Med Sci. 2023 Aug;43(4):741-748. doi: 10.1007/s11596-023-2759-9. Epub 2023 Jul 17.
8
Mechanisms of adjustments to different types of uncertainty in the reward environment across mice and monkeys.在老鼠和猴子中,对奖励环境中不同类型不确定性的调整机制。
Cogn Affect Behav Neurosci. 2023 Jun;23(3):600-619. doi: 10.3758/s13415-022-01059-z. Epub 2023 Feb 23.
9
Humans combine value learning and hypothesis testing strategically in multi-dimensional probabilistic reward learning.人类在多维概率奖励学习中策略性地结合价值学习和假设检验。
PLoS Comput Biol. 2022 Nov 23;18(11):e1010699. doi: 10.1371/journal.pcbi.1010699. eCollection 2022 Nov.
10
Explicit knowledge of task structure is a primary determinant of human model-based action.任务结构的明确知识是人类基于模型行为的主要决定因素。
Nat Hum Behav. 2022 Aug;6(8):1126-1141. doi: 10.1038/s41562-022-01346-2. Epub 2022 May 19.
Neuron. 2017 Jan 18;93(2):451-463. doi: 10.1016/j.neuron.2016.12.040.
4
Neural substrates of cognitive biases during probabilistic inference.概率推理过程中认知偏差的神经基础。
Nat Commun. 2016 Apr 26;7:11393. doi: 10.1038/ncomms11393.
5
Reward-Guided Learning with and without Causal Attribution.有无因果归因的奖励引导学习
Neuron. 2016 Apr 6;90(1):177-90. doi: 10.1016/j.neuron.2016.02.018. Epub 2016 Mar 10.
6
Why neurons mix: high dimensionality for higher cognition.神经元为何混合:高维结构助力更高认知
Curr Opin Neurobiol. 2016 Apr;37:66-74. doi: 10.1016/j.conb.2016.01.010. Epub 2016 Feb 4.
7
Dopamine regulates stimulus generalization in the human hippocampus.多巴胺调节人类海马体中的刺激泛化。
Elife. 2016 Feb 2;5:e12678. doi: 10.7554/eLife.12678.
8
Neural Mechanisms for Undoing the "Curse of Dimensionality".消除“维度灾难”的神经机制
J Neurosci. 2015 Sep 2;35(35):12083-4. doi: 10.1523/JNEUROSCI.2428-15.2015.
9
Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream.深度神经网络揭示了腹侧流中神经表征复杂性的梯度变化。
J Neurosci. 2015 Jul 8;35(27):10005-14. doi: 10.1523/JNEUROSCI.5023-14.2015.
10
Reinforcement learning in multidimensional environments relies on attention mechanisms.多维环境中的强化学习依赖于注意力机制。
J Neurosci. 2015 May 27;35(21):8145-57. doi: 10.1523/JNEUROSCI.2978-14.2015.