基于深度强化学习和轨迹优化的分布式多机器人导航

Decentralized Multi-Robot Navigation Based on Deep Reinforcement Learning and Trajectory Optimization.

作者信息

Bi Yifei, Luo Jianing, Zhu Jiwei, Liu Junxiu, Li Wei

机构信息

College of Foreign Languages, University of Shanghai for Science and Technology, Shanghai 200093, China.

College of Intelligent Robotics and Advanced Manufacturing, Fudan University, Shanghai 200433, China.

出版信息

Biomimetics (Basel). 2025 Jun 4;10(6):366. doi: 10.3390/biomimetics10060366.

DOI:10.3390/biomimetics10060366

PMID:40558335

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12190238/

Abstract

Multi-robot systems are significant in decision-making capabilities and applications, but avoiding collisions during movement remains a critical challenge. Existing decentralized obstacle avoidance strategies, while low in computational cost, often fail to ensure safety effectively. To address this issue, this paper leverages graph neural networks (GNNs) and deep reinforcement learning (DRL) to aggregate high-dimensional features as inputs for reinforcement learning (RL) to generate paths. Additionally, it introduces safety constraints through an artificial potential field (APF) to optimize these trajectories. Additionally, a constrained nonlinear optimization method further refines the APF-adjusted paths, resulting in the development of the GNN-RL-APF-Lagrangian algorithm. By combining APF and nonlinear optimization techniques, experimental results demonstrate that this method significantly enhances the safety and obstacle avoidance capabilities of multi-robot systems in complex environments. The proposed GNN-RL-APF-Lagrangian algorithm achieves a 96.43% success rate in sparse obstacle environments and 89.77% in dense obstacle scenarios, representing improvements of 59% and 60%, respectively, over baseline GNN-RL approaches. The method maintains scalability up to 30 robots while preserving distributed execution properties.

摘要

多机器人系统在决策能力和应用方面具有重要意义，但在运动过程中避免碰撞仍然是一项关键挑战。现有的分散式避障策略虽然计算成本低，但往往无法有效确保安全。为了解决这个问题，本文利用图神经网络（GNN）和深度强化学习（DRL）来聚合高维特征，作为强化学习（RL）生成路径的输入。此外，它通过人工势场（APF）引入安全约束来优化这些轨迹。此外，一种约束非线性优化方法进一步细化了经APF调整的路径，从而开发出了GNN-RL-APF-拉格朗日算法。通过结合APF和非线性优化技术，实验结果表明，该方法显著提高了多机器人系统在复杂环境中的安全性和避障能力。所提出的GNN-RL-APF-拉格朗日算法在稀疏障碍物环境中的成功率达到96.43%，在密集障碍物场景中的成功率为89.77%，分别比基线GNN-RL方法提高了59%和60%。该方法在保持分布式执行特性的同时，可扩展性高达30个机器人。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/a057b51ec579/biomimetics-10-00366-g001.jpg

相似文献

Decentralized Multi-Robot Navigation Based on Deep Reinforcement Learning and Trajectory Optimization.基于深度强化学习和轨迹优化的分布式多机器人导航

Biomimetics (Basel). 2025 Jun 4;10(6):366. doi: 10.3390/biomimetics10060366.

Accreditation through the eyes of nurse managers: an infinite staircase or a phenomenon that evaporates like water.护士长眼中的认证：是无尽的阶梯还是如流水般消逝的现象。

J Health Organ Manag. 2025 Jun 30. doi: 10.1108/JHOM-01-2025-0029.

Adaptive Model Predictive Control for 4WD-4WS Mobile Robot: A Multivariate Gaussian Mixture Model-Ant Colony Optimization for Robust Trajectory Tracking and Obstacle Avoidance.四轮驱动-四轮转向移动机器人的自适应模型预测控制：用于鲁棒轨迹跟踪和避障的多元高斯混合模型-蚁群优化算法

Sensors (Basel). 2025 Jun 18;25(12):3805. doi: 10.3390/s25123805.

Decentralized nonlinear model predictive control-based flock navigation with real-time obstacle avoidance in unknown obstructed environments.基于分散式非线性模型预测控制的群体导航，在未知障碍物环境中实现实时避障。

Front Robot AI. 2025 Jun 10;12:1540808. doi: 10.3389/frobt.2025.1540808. eCollection 2025.

Adefovir dipivoxil and pegylated interferon alfa-2a for the treatment of chronic hepatitis B: a systematic review and economic evaluation.阿德福韦酯与聚乙二醇化干扰素α-2a治疗慢性乙型肝炎：系统评价与经济学评估

Health Technol Assess. 2006 Aug;10(28):iii-iv, xi-xiv, 1-183. doi: 10.3310/hta10280.

Psychological interventions for adults who have sexually offended or are at risk of offending.针对有性犯罪行为或有性犯罪风险的成年人的心理干预措施。

Cochrane Database Syst Rev. 2012 Dec 12;12(12):CD007507. doi: 10.1002/14651858.CD007507.pub2.

Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果：一种针对特定个体见解的新型验证方法。

Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.

Deep Reinforcement Learning-Based Self-Optimization of Flow Chemistry.基于深度强化学习的流动化学自优化

ACS Eng Au. 2025 May 13;5(3):247-266. doi: 10.1021/acsengineeringau.5c00004. eCollection 2025 Jun 18.

Optimal design of a wheelchair-mounted robotic arm for activities of daily living.用于日常生活活动的轮椅安装式机器人手臂的优化设计。

Disabil Rehabil Assist Technol. 2025 Jul;20(5):1539-1556. doi: 10.1080/17483107.2025.2459890. Epub 2025 Feb 18.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状Meta分析。

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于深度强化学习和轨迹优化的分布式多机器人导航

Decentralized Multi-Robot Navigation Based on Deep Reinforcement Learning and Trajectory Optimization.

作者信息

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献