• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过具身学习稀疏且有意义的表示。

Learning sparse and meaningful representations through embodiment.

机构信息

Institute of Cognitive Science, University of Osnabrück, Wachsbleiche 27, 49090 Osnabrück, Germany.

Institute of Cognitive Science, University of Osnabrück, Wachsbleiche 27, 49090 Osnabrück, Germany; Institute of Neurophysiology and Pathophysiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.

出版信息

Neural Netw. 2021 Feb;134:23-41. doi: 10.1016/j.neunet.2020.11.004. Epub 2020 Nov 23.

DOI:10.1016/j.neunet.2020.11.004
PMID:33279863
Abstract

How do humans acquire a meaningful understanding of the world with little to no supervision or semantic labels provided by the environment? Here we investigate embodiment with a closed loop between action and perception as one key component in this process. We take a close look at the representations learned by a deep reinforcement learning agent that is trained with high-dimensional visual observations collected in a 3D environment with very sparse rewards. We show that this agent learns stable representations of meaningful concepts such as doors without receiving any semantic labels. Our results show that the agent learns to represent the action relevant information, extracted from a simulated camera stream, in a wide variety of sparse activation patterns. The quality of the representations learned shows the strength of embodied learning and its advantages over fully supervised approaches.

摘要

人类如何在几乎没有环境提供的监督或语义标签的情况下,获得对世界的有意义的理解?在这里,我们研究了行动和感知之间的闭环作为这个过程中的一个关键组成部分的体现。我们仔细研究了一个深度强化学习代理所学到的表示,该代理是使用在具有非常稀疏奖励的 3D 环境中收集的高维视觉观察进行训练的。我们表明,该代理在没有收到任何语义标签的情况下,学会了对有意义的概念(如门)的稳定表示。我们的结果表明,代理学会了以各种稀疏激活模式来表示从模拟相机流中提取的与动作相关的信息。所学到的表示的质量显示了体现学习的优势,以及它相对于完全监督方法的优势。

相似文献

1
Learning sparse and meaningful representations through embodiment.通过具身学习稀疏且有意义的表示。
Neural Netw. 2021 Feb;134:23-41. doi: 10.1016/j.neunet.2020.11.004. Epub 2020 Nov 23.
2
Development of Few-Shot Learning Capabilities in Artificial Neural Networks When Learning Through Self-Supervised Interaction.
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):209-219. doi: 10.1109/TPAMI.2023.3323040. Epub 2023 Dec 5.
3
Modality independent adversarial network for generalized zero shot image classification.模态无关对抗网络的广义零样本图像分类。
Neural Netw. 2021 Feb;134:11-22. doi: 10.1016/j.neunet.2020.11.007. Epub 2020 Nov 21.
4
DAPath: Distance-aware knowledge graph reasoning based on deep reinforcement learning.DAPath:基于深度强化学习的距离感知知识图推理。
Neural Netw. 2021 Mar;135:1-12. doi: 10.1016/j.neunet.2020.11.012. Epub 2020 Dec 5.
5
Multiview Semantic Representation for Visual Recognition.多视图语义表示在视觉识别中的应用。
IEEE Trans Cybern. 2020 May;50(5):2038-2049. doi: 10.1109/TCYB.2018.2875728. Epub 2018 Nov 6.
6
Multi-label zero-shot human action recognition via joint latent ranking embedding.基于联合潜在排序嵌入的多标签零镜头人体动作识别。
Neural Netw. 2020 Feb;122:1-23. doi: 10.1016/j.neunet.2019.09.029. Epub 2019 Oct 21.
7
Combining STDP and binary networks for reinforcement learning from images and sparse rewards.结合 STDP 和二进制网络,从图像和稀疏奖励中进行强化学习。
Neural Netw. 2021 Dec;144:496-506. doi: 10.1016/j.neunet.2021.09.010. Epub 2021 Sep 17.
8
Representation in natural and artificial agents: an embodied cognitive science perspective.自然与人工智能体中的表征:一种具身认知科学视角
Z Naturforsch C J Biosci. 1998 Jul-Aug;53(7-8):480-503. doi: 10.1515/znc-1998-7-804.
9
Active Predictive Coding: A Unifying Neural Model for Active Perception, Compositional Learning, and Hierarchical Planning.主动预测编码:主动感知、组合学习和分层规划的统一神经模型。
Neural Comput. 2023 Dec 12;36(1):1-32. doi: 10.1162/neco_a_01627.
10
STACoRe: Spatio-temporal and action-based contrastive representations for reinforcement learning in Atari.STACoRe:用于雅达利强化学习的基于时空和动作对比的表示方法。
Neural Netw. 2023 Mar;160:1-11. doi: 10.1016/j.neunet.2022.12.018. Epub 2022 Dec 29.

引用本文的文献

1
Parallel development of social behavior in biological and artificial fish.生物鱼和人工鱼社会行为的并行发展
Nat Commun. 2024 Dec 5;15(1):10613. doi: 10.1038/s41467-024-52307-4.
2
The neuroconnectionist research programme.神经连接主义研究计划。
Nat Rev Neurosci. 2023 Jul;24(7):431-450. doi: 10.1038/s41583-023-00705-w. Epub 2023 May 30.
3
Mutual influence between language and perception in multi-agent communication games.多主体交流游戏中语言和感知的相互影响。
PLoS Comput Biol. 2022 Oct 31;18(10):e1010658. doi: 10.1371/journal.pcbi.1010658. eCollection 2022 Oct.