• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于Transformer的多机器人自主探索强化学习

Transformer-Based Reinforcement Learning for Multi-Robot Autonomous Exploration.

作者信息

Chen Qihong, Wang Rui, Lyu Ming, Zhang Jie

机构信息

School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China.

出版信息

Sensors (Basel). 2024 Aug 6;24(16):5083. doi: 10.3390/s24165083.

DOI:10.3390/s24165083
PMID:39204780
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11360377/
Abstract

A map of the environment is the basis for the robot's navigation. Multi-robot collaborative autonomous exploration allows for rapidly constructing maps of unknown environments, essential for application areas such as search and rescue missions. Traditional autonomous exploration methods are inefficient due to the repetitive exploration problem. For this reason, we propose a multi-robot autonomous exploration method based on the Transformer model. Our multi-agent deep reinforcement learning method includes a multi-agent learning method to effectively improve exploration efficiency. We conducted experiments comparing our proposed method with existing methods in a simulation environment, and the experimental results showed that our proposed method had a good performance and a specific generalization ability.

摘要

环境地图是机器人导航的基础。多机器人协作自主探索能够快速构建未知环境的地图,这对于搜索和救援任务等应用领域至关重要。由于存在重复探索问题,传统的自主探索方法效率低下。因此,我们提出了一种基于Transformer模型的多机器人自主探索方法。我们的多智能体深度强化学习方法包括一种多智能体学习方法,以有效提高探索效率。我们在模拟环境中进行了实验,将我们提出的方法与现有方法进行比较,实验结果表明我们提出的方法具有良好的性能和特定的泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/3a9850b5528f/sensors-24-05083-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/5194fdb38c16/sensors-24-05083-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/40dae69540a5/sensors-24-05083-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/8dddb40aaa22/sensors-24-05083-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/ed9a7688b70c/sensors-24-05083-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/d6bc4f0f6536/sensors-24-05083-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/081e6ee3d370/sensors-24-05083-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/5939267c7074/sensors-24-05083-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/c40b0bc92c1b/sensors-24-05083-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/f12ffc1d5968/sensors-24-05083-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/3a9850b5528f/sensors-24-05083-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/5194fdb38c16/sensors-24-05083-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/40dae69540a5/sensors-24-05083-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/8dddb40aaa22/sensors-24-05083-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/ed9a7688b70c/sensors-24-05083-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/d6bc4f0f6536/sensors-24-05083-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/081e6ee3d370/sensors-24-05083-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/5939267c7074/sensors-24-05083-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/c40b0bc92c1b/sensors-24-05083-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/f12ffc1d5968/sensors-24-05083-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/11360377/3a9850b5528f/sensors-24-05083-g010.jpg

相似文献

1
Transformer-Based Reinforcement Learning for Multi-Robot Autonomous Exploration.基于Transformer的多机器人自主探索强化学习
Sensors (Basel). 2024 Aug 6;24(16):5083. doi: 10.3390/s24165083.
2
Autonomous Navigation by Mobile Robot with Sensor Fusion Based on Deep Reinforcement Learning.基于深度强化学习的传感器融合移动机器人自主导航
Sensors (Basel). 2024 Jun 16;24(12):3895. doi: 10.3390/s24123895.
3
Autonomous robotic exploration with simultaneous environment and traversability models learning.具有同步环境和可通行性模型学习的自主机器人探索。
Front Robot AI. 2022 Oct 5;9:910113. doi: 10.3389/frobt.2022.910113. eCollection 2022.
4
Deep reinforcement learning-aided autonomous navigation with landmark generators.基于地标生成器的深度强化学习辅助自主导航。
Front Neurorobot. 2023 Aug 22;17:1200214. doi: 10.3389/fnbot.2023.1200214. eCollection 2023.
5
Efficient Autonomous Exploration and Mapping in Unknown Environments.高效的自主探索和未知环境下的地图绘制。
Sensors (Basel). 2023 May 15;23(10):4766. doi: 10.3390/s23104766.
6
Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning.基于地图的深度强化学习实现分布式非通信多机器人避碰
Sensors (Basel). 2020 Aug 27;20(17):4836. doi: 10.3390/s20174836.
7
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment.基于深度强化学习的未知环境导航自动探索。
IEEE Trans Neural Netw Learn Syst. 2020 Jun;31(6):2064-2076. doi: 10.1109/TNNLS.2019.2927869. Epub 2019 Aug 6.
8
A Comparison Study between Traditional and Deep-Reinforcement-Learning-Based Algorithms for Indoor Autonomous Navigation in Dynamic Scenarios.动态场景下基于传统算法与深度强化学习算法的室内自主导航比较研究
Sensors (Basel). 2023 Dec 7;23(24):9672. doi: 10.3390/s23249672.
9
Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning.基于深度强化学习的改进深度确定性策略梯度算法的机器人抓取方法优化
Rev Sci Instrum. 2021 Feb 1;92(2):025114. doi: 10.1063/5.0034101.
10
Rapid, safe, and incremental learning of navigation strategies.快速、安全且逐步学习导航策略。
IEEE Trans Syst Man Cybern B Cybern. 1996;26(3):408-20. doi: 10.1109/3477.499792.

本文引用的文献

1
Scientific exploration of challenging planetary analog environments with a team of legged robots.利用一队腿足机器人探索具有挑战性的行星模拟环境的科学研究。
Sci Robot. 2023 Jul 12;8(80):eade9548. doi: 10.1126/scirobotics.ade9548.
2
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment.基于深度强化学习的未知环境导航自动探索。
IEEE Trans Neural Netw Learn Syst. 2020 Jun;31(6):2064-2076. doi: 10.1109/TNNLS.2019.2927869. Epub 2019 Aug 6.
3
Mastering the game of Go without human knowledge.无需人类知识即可掌握围棋游戏。
Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.
4
Mastering the game of Go with deep neural networks and tree search.用深度神经网络和树搜索掌握围棋游戏。
Nature. 2016 Jan 28;529(7587):484-9. doi: 10.1038/nature16961.
5
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.