• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自适应调谐曲线宽度可提高样本高效学习。

Adaptive Tuning Curve Widths Improve Sample Efficient Learning.

作者信息

Meier Florian, Dang-Nhu Raphaël, Steger Angelika

机构信息

Department of Computer Science, ETH Zürich, Zurich, Switzerland.

出版信息

Front Comput Neurosci. 2020 Feb 18;14:12. doi: 10.3389/fncom.2020.00012. eCollection 2020.

DOI:10.3389/fncom.2020.00012
PMID:32132915
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7041413/
Abstract

Natural brains perform miraculously well in learning new tasks from a small number of samples, whereas sample efficient learning is still a major open problem in the field of machine learning. Here, we raise the question, how the neural coding scheme affects sample efficiency, and make first progress on this question by proposing and analyzing a learning algorithm that uses a simple reinforce-type plasticity mechanism and does not require any gradients to learn low dimensional mappings. It harnesses three bio-plausible mechanisms, namely, population codes with bell shaped tuning curves, continous attractor mechanisms and probabilistic synapses, to achieve sample efficient learning. We show both theoretically and by simulations that population codes with broadly tuned neurons lead to high sample efficiency, whereas codes with sharply tuned neurons account for high final precision. Moreover, a dynamic adaptation of the tuning width during learning gives rise to both, high sample efficiency and high final precision. We prove a sample efficiency guarantee for our algorithm that lies within a logarithmic factor from the information theoretical optimum. Our simulations show that for low dimensional mappings, our learning algorithm achieves comparable sample efficiency to multi-layer perceptrons trained by gradient descent, although it does not use any gradients. Furthermore, it achieves competitive sample efficiency in low dimensional reinforcement learning tasks. From a machine learning perspective, these findings may inspire novel approaches to improve sample efficiency. From a neuroscience perspective, these findings suggest sample efficiency as a yet unstudied functional role of adaptive tuning curve width.

摘要

自然大脑在从少量样本中学习新任务方面表现得极其出色,而样本高效学习仍是机器学习领域的一个主要开放性问题。在此,我们提出一个问题,即神经编码方案如何影响样本效率,并通过提出和分析一种学习算法在这个问题上取得了初步进展,该算法使用一种简单的强化型可塑性机制,且无需任何梯度来学习低维映射。它利用了三种具有生物学合理性的机制,即具有钟形调谐曲线的群体编码、连续吸引子机制和概率性突触,以实现样本高效学习。我们通过理论分析和模拟表明,具有广泛调谐神经元的群体编码会带来高样本效率,而具有尖锐调谐神经元的编码则能实现高最终精度。此外,在学习过程中对调谐宽度进行动态调整会同时产生高样本效率和高最终精度。我们证明了我们算法的样本效率保证,其与信息理论最优值相差一个对数因子。我们的模拟表明,对于低维映射,我们的学习算法虽然不使用任何梯度,但能实现与通过梯度下降训练的多层感知器相当的样本效率。此外,它在低维强化学习任务中实现了具有竞争力的样本效率。从机器学习的角度来看,这些发现可能会启发提高样本效率的新方法。从神经科学的角度来看,这些发现表明样本效率是自适应调谐曲线宽度尚未被研究的功能作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/b5e4a2c4a6c6/fncom-14-00012-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/625b87d9a92a/fncom-14-00012-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/d3fa88256423/fncom-14-00012-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/6f60c101e7b7/fncom-14-00012-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/eab9236142b0/fncom-14-00012-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/b5e4a2c4a6c6/fncom-14-00012-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/625b87d9a92a/fncom-14-00012-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/d3fa88256423/fncom-14-00012-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/6f60c101e7b7/fncom-14-00012-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/eab9236142b0/fncom-14-00012-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21bb/7041413/b5e4a2c4a6c6/fncom-14-00012-g0005.jpg

相似文献

1
Adaptive Tuning Curve Widths Improve Sample Efficient Learning.自适应调谐曲线宽度可提高样本高效学习。
Front Comput Neurosci. 2020 Feb 18;14:12. doi: 10.3389/fncom.2020.00012. eCollection 2020.
2
A learning rule for very simple universal approximators consisting of a single layer of perceptrons.一种由单层感知器组成的非常简单的通用逼近器的学习规则。
Neural Netw. 2008 Jun;21(5):786-95. doi: 10.1016/j.neunet.2007.12.036. Epub 2007 Dec 31.
3
Evolving interpretable plasticity for spiking networks.用于脉冲神经网络的不断发展的可解释可塑性。
Elife. 2021 Oct 28;10:e66273. doi: 10.7554/eLife.66273.
4
Inferring neural circuit structure from datasets of heterogeneous tuning curves.从异质调谐曲线的数据集推断神经回路结构。
PLoS Comput Biol. 2019 Apr 19;15(4):e1006816. doi: 10.1371/journal.pcbi.1006816. eCollection 2019 Apr.
5
A multivariate adaptive gradient algorithm with reduced tuning efforts.一种具有较少调谐工作的多元自适应梯度算法。
Neural Netw. 2022 Aug;152:499-509. doi: 10.1016/j.neunet.2022.05.016. Epub 2022 May 21.
6
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
7
Variational Information Bottleneck Regularized Deep Reinforcement Learning for Efficient Robotic Skill Adaptation.变分信息瓶颈正则化深度强化学习在机器人高效技能自适应中的应用。
Sensors (Basel). 2023 Jan 9;23(2):762. doi: 10.3390/s23020762.
8
Population codes enable learning from few examples by shaping inductive bias.群体编码通过塑造归纳偏差来实现从少数例子中学习。
Elife. 2022 Dec 16;11:e78606. doi: 10.7554/eLife.78606.
9
Neuronal population codes and the perception of object distance in weakly electric fish.弱电鱼的神经元群体编码与物体距离感知
J Neurosci. 2001 Apr 15;21(8):2842-50. doi: 10.1523/JNEUROSCI.21-08-02842.2001.
10
V1 orientation plasticity is explained by broadly tuned feedforward inputs and intracortical sharpening.V1方向可塑性可通过广泛调谐的前馈输入和皮质内锐化来解释。
Vis Neurosci. 2010 Mar;27(1-2):57-73. doi: 10.1017/S0952523810000039. Epub 2010 Apr 16.

引用本文的文献

1
Population codes enable learning from few examples by shaping inductive bias.群体编码通过塑造归纳偏差来实现从少数例子中学习。
Elife. 2022 Dec 16;11:e78606. doi: 10.7554/eLife.78606.

本文引用的文献

1
Human-level performance in 3D multiplayer games with population-based reinforcement learning.基于群体强化学习的 3D 多人游戏中的人类水平表现。
Science. 2019 May 31;364(6443):859-865. doi: 10.1126/science.aau6249.
2
Reinforcement Learning, Fast and Slow.强化学习:快与慢。
Trends Cogn Sci. 2019 May;23(5):408-422. doi: 10.1016/j.tics.2019.02.006. Epub 2019 Apr 16.
3
Computational roles of plastic probabilistic synapses.计算可塑性概率性突触的作用。
Curr Opin Neurobiol. 2019 Feb;54:90-97. doi: 10.1016/j.conb.2018.09.002. Epub 2018 Oct 8.
4
A Dynamic Connectome Supports the Emergence of Stable Computational Function of Neural Circuits through Reward-Based Learning.动态连接组支持基于奖励的学习使神经网络电路的稳定计算功能得以出现。
eNeuro. 2018 Apr 24;5(2). doi: 10.1523/ENEURO.0301-17.2018. eCollection 2018 Mar-Apr.
5
Ring attractor dynamics in the central brain.中脑的环吸引子动力学。
Science. 2017 May 26;356(6340):849-853. doi: 10.1126/science.aal4835. Epub 2017 May 4.
6
Multiassociative Memory: Recurrent Synapses Increase Storage Capacity.
Neural Comput. 2017 May;29(5):1375-1405. doi: 10.1162/NECO_a_00954. Epub 2017 Mar 23.
7
Reinforcement Learning and Episodic Memory in Humans and Animals: An Integrative Framework.人类和动物中的强化学习与情景记忆:一个综合框架
Annu Rev Psychol. 2017 Jan 3;68:101-128. doi: 10.1146/annurev-psych-122414-033625. Epub 2016 Sep 2.
8
Network Plasticity as Bayesian Inference.作为贝叶斯推理的网络可塑性
PLoS Comput Biol. 2015 Nov 6;11(11):e1004485. doi: 10.1371/journal.pcbi.1004485. eCollection 2015 Nov.
9
Neural dynamics for landmark orientation and angular path integration.地标定向和角度路径整合的神经动力学
Nature. 2015 May 14;521(7551):186-91. doi: 10.1038/nature14446.
10
The influence of population size, noise strength and behavioral task on best-encoded stimulus for neurons with unimodal or monotonic tuning curves.种群大小、噪声强度和行为任务对具有单峰或单调调谐曲线的神经元的最佳编码刺激的影响。
Front Comput Neurosci. 2015 Feb 17;9:18. doi: 10.3389/fncom.2015.00018. eCollection 2015.