Suppr超能文献

用于主动目标识别的带过渡管理的视点规划

Viewpoint planning with transition management for active object recognition.

作者信息

Sun Haibo, Zhu Feng, Li Yangyang, Zhao Pengfei, Kong Yanzi, Wang Jianyu, Wan Yingcai, Fu Shuangfei

机构信息

Faculty of Robot Science and Engineering, Northeastern University, Shenyang, China.

Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, China.

出版信息

Front Neurorobot. 2023 Feb 24;17:1093132. doi: 10.3389/fnbot.2023.1093132. eCollection 2023.

Abstract

Active object recognition (AOR) provides a paradigm where an agent can capture additional evidence by purposefully changing its viewpoint to improve the quality of recognition. One of the most concerned problems in AOR is viewpoint planning (VP) which refers to developing a policy to determine the next viewpoints of the agent. A research trend is to solve the VP problem with reinforcement learning, namely to use the viewpoint transitions explored by the agent to train the VP policy. However, most research discards the trained transitions, which may lead to an inefficient use of the explored transitions. To solve this challenge, we present a novel VP method with transition management based on reinforcement learning, which can reuse the explored viewpoint transitions. To be specific, a learning framework of the VP policy is first established the deterministic policy gradient theory, which provides an opportunity to reuse the explored transitions. Then, we design a scheme of viewpoint transition management that can store the explored transitions and decide which transitions are used for the policy learning. Finally, within the framework, we develop an algorithm based on twin delayed deep deterministic policy gradient and the designed scheme to train the VP policy. Experiments on the public and challenging dataset GERMS show the effectiveness of our method in comparison with several competing approaches.

摘要

主动目标识别(AOR)提供了一种范例,即智能体可以通过有目的地改变其视角来获取额外证据,以提高识别质量。AOR中最受关注的问题之一是视角规划(VP),它指的是制定一种策略来确定智能体的下一个视角。一种研究趋势是用强化学习来解决VP问题,即利用智能体探索的视角转换来训练VP策略。然而,大多数研究丢弃了训练过的转换,这可能导致对探索到的转换利用效率低下。为了解决这一挑战,我们提出了一种基于强化学习的具有转换管理的新型VP方法,该方法可以重用探索到的视角转换。具体来说,首先基于确定性策略梯度理论建立VP策略的学习框架,这为重用探索到的转换提供了机会。然后,我们设计了一种视角转换管理方案,该方案可以存储探索到的转换,并决定哪些转换用于策略学习。最后,在该框架内,我们开发了一种基于双延迟深度确定性策略梯度和所设计方案的算法来训练VP策略。在公开且具有挑战性的GERMS数据集上进行的实验表明,与几种竞争方法相比,我们的方法是有效的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a10/9998679/a6c7cc5649d7/fnbot-17-1093132-g0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验