Multimodal Hierarchical Dirichlet Process-Based Active Perception by a Robot.

Authors

Taniguchi Tadahiro, Yoshino Ryo, Takano Toshiaki

Affiliations

Emergent Systems Laboratory, College of Information Science and Engineering, Ritsumeikan University, Kusatsu, Japan.

Adaptive Systems Laboratory, Department of Computer Science, Shizuoka Institute of Science and Technology, Fukuroi, Japan.

Publication information

Front Neurorobot. 2018 May 22;12:22. doi: 10.3389/fnbot.2018.00022. eCollection 2018.

DOI:10.3389/fnbot.2018.00022

PMID:29872389

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5972223/

Abstract

In this paper, we propose an active perception method for recognizing object categories based on the multimodal hierarchical Dirichlet process (MHDP). The MHDP enables a robot to form object categories using multimodal information, e.g., visual, auditory, and haptic information, which can be observed by performing actions on an object. However, performing many actions on a target object takes a long time. In a real-time scenario, i.e., when time is limited, the robot has to determine the set of actions that is most effective for recognizing a target object. We propose an active perception method for the MHDP that uses the information gain (IG) maximization criterion and the lazy greedy algorithm. We show that the IG maximization criterion is optimal in the sense that it is equivalent to minimizing the expected Kullback-Leibler divergence between the final recognition state and the recognition state after the next set of actions. However, a straightforward calculation of IG is practically impossible. Therefore, we derive a Monte Carlo approximation method for IG by making use of a property of the MHDP. We also show that the IG is submodular and non-decreasing as a set function because of the structure of the graphical model of the MHDP. Therefore, the IG maximization problem reduces to a submodular maximization problem. This means that the greedy and lazy greedy algorithms are effective and have a theoretical justification for their performance. We conducted one experiment using an upper-torso humanoid robot and a second using synthetic data. The experimental results show that the method enables the robot to select a set of actions that allows it to recognize target objects quickly and accurately. The numerical experiment using the synthetic data shows that the proposed method works appropriately even when the number of actions is large and the set of target objects includes objects categorized into multiple classes. The results support our theoretical outcomes.
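
The lazy greedy selection mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function `lazy_greedy`, the toy `coverage` table, and `toy_gain` are hypothetical names introduced here, and the coverage count merely stands in for the Monte Carlo IG estimate, which the abstract does not specify. The sketch relies only on the property stated in the abstract, namely that the objective is submodular and non-decreasing, so a marginal gain computed earlier remains an upper bound on the current marginal gain and most re-evaluations can be skipped.

```python
import heapq
from typing import Callable, FrozenSet, Hashable, Iterable, List


def lazy_greedy(
    actions: Iterable[Hashable],
    gain: Callable[[FrozenSet[Hashable]], float],
    budget: int,
) -> List[Hashable]:
    """Select up to `budget` actions for a monotone submodular `gain`."""
    selected: List[Hashable] = []
    current = gain(frozenset())
    # Heap of (negated upper bound on marginal gain, tie-breaker, action).
    # Every bound starts at +infinity, so each action is evaluated once.
    heap = [(-float("inf"), i, a) for i, a in enumerate(actions)]
    heapq.heapify(heap)
    while heap and len(selected) < budget:
        _, i, a = heapq.heappop(heap)
        # Refresh the possibly stale bound against the current selection.
        fresh = gain(frozenset(selected) | {a}) - current
        if not heap or fresh >= -heap[0][0]:
            # No other action's upper bound beats this fresh value.
            selected.append(a)
            current += fresh
        else:
            heapq.heappush(heap, (-fresh, i, a))
    return selected


# Hypothetical toy objective: each action "reveals" some feature ids, and
# the gain of a set of actions is the number of distinct features revealed.
coverage = {
    "look": {1, 2, 3},
    "grasp": {3, 4},
    "shake": {4, 5, 6, 7},
    "hit": {1, 7},
}


def toy_gain(chosen: FrozenSet[Hashable]) -> float:
    return float(len(set().union(*(coverage[a] for a in chosen))))


print(lazy_greedy(coverage, toy_gain, budget=2))  # ['shake', 'look']
```

In a setting like the one the abstract describes, `gain` would be replaced by an estimate of the information gain of an action set. For monotone submodular objectives under a cardinality budget, plain greedy selection is known to achieve at least a (1 - 1/e) fraction of the optimum, and the lazy variant returns the same selection with fewer objective evaluations, provided the objective is evaluated exactly.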
