• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用主动推理的以对象为中心的场景表示

Object-Centric Scene Representations Using Active Inference.

作者信息

Van de Maele Toon, Verbelen Tim, Mazzaglia Pietro, Ferraro Stefano, Dhoedt Bart

机构信息

Ghent University, 9000 Ghent, Belgium

VERSES AI Research Lab, Los Angeles, CA 90016, U.S.A.

出版信息

Neural Comput. 2024 Mar 21;36(4):677-704. doi: 10.1162/neco_a_01637.

DOI:10.1162/neco_a_01637
PMID:38457764
Abstract

Representing a scene and its constituent objects from raw sensory data is a core ability for enabling robots to interact with their environment. In this letter, we propose a novel approach for scene understanding, leveraging an object-centric generative model that enables an agent to infer object category and pose in an allocentric reference frame using active inference, a neuro-inspired framework for action and perception. For evaluating the behavior of an active vision agent, we also propose a new benchmark where, given a target viewpoint of a particular object, the agent needs to find the best matching viewpoint given a workspace with randomly positioned objects in 3D. We demonstrate that our active inference agent is able to balance epistemic foraging and goal-driven behavior, and quantitatively outperforms both supervised and reinforcement learning baselines by more than a factor of two in terms of success rate.

摘要

从原始感官数据中表征一个场景及其组成对象是使机器人能够与环境交互的核心能力。在这封信中,我们提出了一种用于场景理解的新方法,利用以对象为中心的生成模型,该模型使智能体能够使用主动推理(一种受神经启发的行动和感知框架)在以自我为中心的参考系中推断对象类别和姿态。为了评估主动视觉智能体的行为,我们还提出了一个新的基准测试,在给定特定对象的目标视点的情况下,智能体需要在一个三维空间中随机放置对象的工作空间中找到最佳匹配视点。我们证明,我们的主动推理智能体能够平衡认知觅食和目标驱动行为,并且在成功率方面,在数量上比监督学习和强化学习基线高出两倍多。

相似文献

1
Object-Centric Scene Representations Using Active Inference.使用主动推理的以对象为中心的场景表示
Neural Comput. 2024 Mar 21;36(4):677-704. doi: 10.1162/neco_a_01637.
2
Embodied Object Representation Learning and Recognition.具身物体表征学习与识别
Front Neurorobot. 2022 Apr 14;16:840658. doi: 10.3389/fnbot.2022.840658. eCollection 2022.
3
Symmetry and complexity in object-centric deep active inference models.以对象为中心的深度主动推理模型中的对称性与复杂性。
Interface Focus. 2023 Apr 14;13(3):20220077. doi: 10.1098/rsfs.2022.0077. eCollection 2023 Jun 6.
4
Unsupervised Object-Centric Learning From Multiple Unspecified Viewpoints.从多个未指定视角进行无监督的以对象为中心的学习。
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3897-3909. doi: 10.1109/TPAMI.2023.3349174. Epub 2024 Apr 3.
5
Investigating object compositionality in Generative Adversarial Networks.研究生成对抗网络中的对象组合性。
Neural Netw. 2020 Oct;130:309-325. doi: 10.1016/j.neunet.2020.07.007. Epub 2020 Jul 13.
6
Deep Active Inference and Scene Construction.深度主动推理与场景构建
Front Artif Intell. 2020 Oct 28;3:509354. doi: 10.3389/frai.2020.509354. eCollection 2020.
7
Learning efficient haptic shape exploration with a rigid tactile sensor array.使用刚性触觉传感器阵列学习高效的触觉形状探索。
PLoS One. 2020 Jan 2;15(1):e0226880. doi: 10.1371/journal.pone.0226880. eCollection 2020.
8
How does the brain rapidly learn and reorganize view-invariant and position-invariant object representations in the inferotemporal cortex?大脑如何在后颞叶皮层中快速学习和重新组织不变视图和不变位置的物体表示?
Neural Netw. 2011 Dec;24(10):1050-61. doi: 10.1016/j.neunet.2011.04.004. Epub 2011 Apr 22.
9
Active Vision for Robot Manipulators Using the Free Energy Principle.基于自由能原理的机器人操纵器主动视觉
Front Neurorobot. 2021 Mar 5;15:642780. doi: 10.3389/fnbot.2021.642780. eCollection 2021.
10
Integration of egocentric and allocentric information during memory-guided reaching to images of a natural environment.在记忆引导下,对自然环境的图像进行到达操作时,自我中心和他心信息的整合。
Front Hum Neurosci. 2014 Aug 25;8:636. doi: 10.3389/fnhum.2014.00636. eCollection 2014.

引用本文的文献

1
Deep Hybrid Models: Infer and Plan in a Dynamic World.深度混合模型:在动态世界中进行推理与规划。
Entropy (Basel). 2025 May 27;27(6):570. doi: 10.3390/e27060570.
2
Slow but flexible or fast but rigid? Discrete and continuous processes compared.慢而灵活还是快而刻板?离散与连续过程之比较。
Heliyon. 2024 Oct 18;10(20):e39129. doi: 10.1016/j.heliyon.2024.e39129. eCollection 2024 Oct 30.