• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单连通工作空间中多完整约束智能体的最优分布式导航

Decentralized Navigation with Optimality for Multiple Holonomic Agents in Simply Connected Workspaces.

作者信息

Kotsinis Dimitrios, Bechlioulis Charalampos P

机构信息

Division of Systems and Automatic Control, Department of Electrical and Computer Engineering, University of Patras, Rio, 26504 Patras, Greece.

Athena Research Center, Robotics Institute, Artemidos 6 & Epidavrou, 15125 Maroussi, Greece.

出版信息

Sensors (Basel). 2024 May 15;24(10):3134. doi: 10.3390/s24103134.

DOI:10.3390/s24103134
PMID:38793989
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11125295/
Abstract

Multi-agent systems are utilized more often in the research community and industry, as they can complete tasks faster and more efficiently than single-agent systems. Therefore, in this paper, we are going to present an optimal approach to the multi-agent navigation problem in simply connected workspaces. The task involves each agent reaching its destination starting from an initial position and following an optimal collision-free trajectory. To achieve this, we design a decentralized control protocol, defined by a navigation function, where each agent is equipped with a navigation controller that resolves imminent safety conflicts with the others, as well as the workspace boundary, without requesting knowledge about the goal position of the other agents. Our approach is rendered sub-optimal, since each agent owns a predetermined optimal policy calculated by a novel off-policy iterative method. We use this method because the computational complexity of learning-based methods needed to calculate the global optimal solution becomes unrealistic as the number of agents increases. To achieve our goal, we examine how much the yielded sub-optimal trajectory deviates from the optimal one and how much time the multi-agent system needs to accomplish its task as we increase the number of agents. Finally, we compare our method results with a discrete centralized policy method, also known as a Multi-Agent Poli-RRT* algorithm, to demonstrate the validity of our method when it is attached to other research algorithms.

摘要

多智能体系统在研究领域和工业界的应用越来越频繁,因为它们比单智能体系统能更快、更高效地完成任务。因此,在本文中,我们将提出一种在简单连通工作空间中解决多智能体导航问题的最优方法。该任务要求每个智能体从初始位置出发,沿着最优的无碰撞轨迹到达其目的地。为实现这一目标,我们设计了一种由导航函数定义的分散控制协议,其中每个智能体都配备了一个导航控制器,该控制器能够在不要求了解其他智能体目标位置的情况下,解决与其他智能体以及工作空间边界的紧迫安全冲突。由于每个智能体都拥有通过一种新颖的离策略迭代方法计算出的预定最优策略,我们的方法是次优的。我们使用这种方法是因为随着智能体数量的增加,计算全局最优解所需的基于学习的方法的计算复杂度变得不切实际。为实现我们的目标,我们研究了随着智能体数量的增加,产生的次优轨迹与最优轨迹的偏差程度以及多智能体系统完成任务所需的时间。最后,我们将我们的方法结果与一种离散集中式策略方法(也称为多智能体Poli-RRT*算法)进行比较,以证明我们的方法与其他研究算法结合时的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/4e040d9d2d1f/sensors-24-03134-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/604220171f1a/sensors-24-03134-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/049149e11b5d/sensors-24-03134-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/c8d768efdc5c/sensors-24-03134-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/5bb165ac385d/sensors-24-03134-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/1c081921e247/sensors-24-03134-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/4dd0f2f33754/sensors-24-03134-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/c904ab45055b/sensors-24-03134-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/4e040d9d2d1f/sensors-24-03134-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/604220171f1a/sensors-24-03134-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/049149e11b5d/sensors-24-03134-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/c8d768efdc5c/sensors-24-03134-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/5bb165ac385d/sensors-24-03134-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/1c081921e247/sensors-24-03134-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/4dd0f2f33754/sensors-24-03134-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/c904ab45055b/sensors-24-03134-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be72/11125295/4e040d9d2d1f/sensors-24-03134-g008.jpg

相似文献

1
Decentralized Navigation with Optimality for Multiple Holonomic Agents in Simply Connected Workspaces.单连通工作空间中多完整约束智能体的最优分布式导航
Sensors (Basel). 2024 May 15;24(10):3134. doi: 10.3390/s24103134.
2
Robot Navigation in Complex Workspaces Employing Harmonic Maps and Adaptive Artificial Potential Fields.机器人在复杂工作空间中的导航:谐波映射与自适应人工势场方法。
Sensors (Basel). 2023 May 3;23(9):4464. doi: 10.3390/s23094464.
3
Reactive optimal motion planning for a class of holonomic planar agents using reinforcement learning with provable guarantees.基于具有可证明保证的强化学习的一类完整平面智能体的反应式最优运动规划
Front Robot AI. 2024 Jan 3;10:1255696. doi: 10.3389/frobt.2023.1255696. eCollection 2023.
4
RatInABox, a toolkit for modelling locomotion and neuronal activity in continuous environments.盒子里的老鼠,一个用于模拟连续环境中运动和神经元活动的工具包。
Elife. 2024 Feb 9;13:e85274. doi: 10.7554/eLife.85274.
5
One-Shot Multi-Path Planning Using Fully Convolutional Networks in a Comparison to Other Algorithms.与其他算法相比,使用全卷积网络的一次性多路径规划
Front Neurorobot. 2021 Jan 8;14:600984. doi: 10.3389/fnbot.2020.600984. eCollection 2020.
6
Decentralized multi-agent reinforcement learning based on best-response policies.基于最佳响应策略的分布式多智能体强化学习
Front Robot AI. 2024 Apr 16;11:1229026. doi: 10.3389/frobt.2024.1229026. eCollection 2024.
7
Consensus-based clustering and data aggregation in decentralized network of multi-agent systems.多智能体系统去中心化网络中基于共识的聚类与数据聚合
PeerJ Comput Sci. 2023 Aug 28;9:e1445. doi: 10.7717/peerj-cs.1445. eCollection 2023.
8
Guidance, Navigation and Control for Autonomous Quadrotor Flight in an Agricultural Field: The Case of Vineyards.自主四旋翼飞行器在农业领域的导航与控制:以葡萄园为例。
Sensors (Basel). 2022 Nov 16;22(22):8865. doi: 10.3390/s22228865.
9
Multi-Object Navigation Using Potential Target Position Policy Function.多目标导航使用潜在目标位置策略函数。
IEEE Trans Image Process. 2023;32:2608-2619. doi: 10.1109/TIP.2023.3263110. Epub 2023 May 5.
10
Coordination as inference in multi-agent reinforcement learning.多智能体强化学习中的协调作为推理。
Neural Netw. 2024 Apr;172:106101. doi: 10.1016/j.neunet.2024.106101. Epub 2024 Jan 11.

引用本文的文献

1
Consensus-Based Formation Control with Time Synchronization for a Decentralized Group of Mobile Robots.用于分散移动机器人组的基于共识的带时间同步的编队控制
Sensors (Basel). 2024 Jun 7;24(12):3717. doi: 10.3390/s24123717.

本文引用的文献

1
Multi-Robot Exploration of Unknown Space Using Combined Meta-Heuristic Salp Swarm Algorithm and Deterministic Coordinated Multi-Robot Exploration.多机器人未知空间探索:基于混合启发式沙蚕群算法和确定性协同多机器人探索
Sensors (Basel). 2023 Feb 14;23(4):2156. doi: 10.3390/s23042156.
2
Navigation of Multiple Disk-Shaped Robots with Independent Goals within Obstacle-Cluttered Environments.在障碍物密集环境中具有独立目标的多个盘形机器人的导航
Sensors (Basel). 2022 Dec 25;23(1):221. doi: 10.3390/s23010221.
3
Deep Learning for Efficient and Optimal Motion Planning for AUVs with Disturbances.
深度学习在有干扰的 AUV 高效和最优运动规划中的应用。
Sensors (Basel). 2021 Jul 23;21(15):5011. doi: 10.3390/s21155011.
4
Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems With Disturbances.具有干扰的未知系统最优控制的非策略演员-评论家结构。
IEEE Trans Cybern. 2016 May;46(5):1041-50. doi: 10.1109/TCYB.2015.2421338. Epub 2015 Apr 28.