• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习和轨迹优化的分布式多机器人导航

Decentralized Multi-Robot Navigation Based on Deep Reinforcement Learning and Trajectory Optimization.

作者信息

Bi Yifei, Luo Jianing, Zhu Jiwei, Liu Junxiu, Li Wei

机构信息

College of Foreign Languages, University of Shanghai for Science and Technology, Shanghai 200093, China.

College of Intelligent Robotics and Advanced Manufacturing, Fudan University, Shanghai 200433, China.

出版信息

Biomimetics (Basel). 2025 Jun 4;10(6):366. doi: 10.3390/biomimetics10060366.

DOI:10.3390/biomimetics10060366
PMID:40558335
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12190238/
Abstract

Multi-robot systems are significant in decision-making capabilities and applications, but avoiding collisions during movement remains a critical challenge. Existing decentralized obstacle avoidance strategies, while low in computational cost, often fail to ensure safety effectively. To address this issue, this paper leverages graph neural networks (GNNs) and deep reinforcement learning (DRL) to aggregate high-dimensional features as inputs for reinforcement learning (RL) to generate paths. Additionally, it introduces safety constraints through an artificial potential field (APF) to optimize these trajectories. Additionally, a constrained nonlinear optimization method further refines the APF-adjusted paths, resulting in the development of the GNN-RL-APF-Lagrangian algorithm. By combining APF and nonlinear optimization techniques, experimental results demonstrate that this method significantly enhances the safety and obstacle avoidance capabilities of multi-robot systems in complex environments. The proposed GNN-RL-APF-Lagrangian algorithm achieves a 96.43% success rate in sparse obstacle environments and 89.77% in dense obstacle scenarios, representing improvements of 59% and 60%, respectively, over baseline GNN-RL approaches. The method maintains scalability up to 30 robots while preserving distributed execution properties.

摘要

多机器人系统在决策能力和应用方面具有重要意义,但在运动过程中避免碰撞仍然是一项关键挑战。现有的分散式避障策略虽然计算成本低,但往往无法有效确保安全。为了解决这个问题,本文利用图神经网络(GNN)和深度强化学习(DRL)来聚合高维特征,作为强化学习(RL)生成路径的输入。此外,它通过人工势场(APF)引入安全约束来优化这些轨迹。此外,一种约束非线性优化方法进一步细化了经APF调整的路径,从而开发出了GNN-RL-APF-拉格朗日算法。通过结合APF和非线性优化技术,实验结果表明,该方法显著提高了多机器人系统在复杂环境中的安全性和避障能力。所提出的GNN-RL-APF-拉格朗日算法在稀疏障碍物环境中的成功率达到96.43%,在密集障碍物场景中的成功率为89.77%,分别比基线GNN-RL方法提高了59%和60%。该方法在保持分布式执行特性的同时,可扩展性高达30个机器人。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/a35806d4152f/biomimetics-10-00366-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/a057b51ec579/biomimetics-10-00366-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/6c05205bbad8/biomimetics-10-00366-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/95111f443c2f/biomimetics-10-00366-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/572737e7f1c4/biomimetics-10-00366-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/1a570339c312/biomimetics-10-00366-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/56a396935290/biomimetics-10-00366-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/d134772e7f12/biomimetics-10-00366-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/a35806d4152f/biomimetics-10-00366-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/a057b51ec579/biomimetics-10-00366-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/6c05205bbad8/biomimetics-10-00366-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/95111f443c2f/biomimetics-10-00366-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/572737e7f1c4/biomimetics-10-00366-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/1a570339c312/biomimetics-10-00366-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/56a396935290/biomimetics-10-00366-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/d134772e7f12/biomimetics-10-00366-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9c0/12190238/a35806d4152f/biomimetics-10-00366-g008.jpg

相似文献

1
Decentralized Multi-Robot Navigation Based on Deep Reinforcement Learning and Trajectory Optimization.基于深度强化学习和轨迹优化的分布式多机器人导航
Biomimetics (Basel). 2025 Jun 4;10(6):366. doi: 10.3390/biomimetics10060366.
2
Accreditation through the eyes of nurse managers: an infinite staircase or a phenomenon that evaporates like water.护士长眼中的认证:是无尽的阶梯还是如流水般消逝的现象。
J Health Organ Manag. 2025 Jun 30. doi: 10.1108/JHOM-01-2025-0029.
3
Adaptive Model Predictive Control for 4WD-4WS Mobile Robot: A Multivariate Gaussian Mixture Model-Ant Colony Optimization for Robust Trajectory Tracking and Obstacle Avoidance.四轮驱动-四轮转向移动机器人的自适应模型预测控制:用于鲁棒轨迹跟踪和避障的多元高斯混合模型-蚁群优化算法
Sensors (Basel). 2025 Jun 18;25(12):3805. doi: 10.3390/s25123805.
4
Decentralized nonlinear model predictive control-based flock navigation with real-time obstacle avoidance in unknown obstructed environments.基于分散式非线性模型预测控制的群体导航,在未知障碍物环境中实现实时避障。
Front Robot AI. 2025 Jun 10;12:1540808. doi: 10.3389/frobt.2025.1540808. eCollection 2025.
5
Adefovir dipivoxil and pegylated interferon alfa-2a for the treatment of chronic hepatitis B: a systematic review and economic evaluation.阿德福韦酯与聚乙二醇化干扰素α-2a治疗慢性乙型肝炎:系统评价与经济学评估
Health Technol Assess. 2006 Aug;10(28):iii-iv, xi-xiv, 1-183. doi: 10.3310/hta10280.
6
Psychological interventions for adults who have sexually offended or are at risk of offending.针对有性犯罪行为或有性犯罪风险的成年人的心理干预措施。
Cochrane Database Syst Rev. 2012 Dec 12;12(12):CD007507. doi: 10.1002/14651858.CD007507.pub2.
7
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
8
Deep Reinforcement Learning-Based Self-Optimization of Flow Chemistry.基于深度强化学习的流动化学自优化
ACS Eng Au. 2025 May 13;5(3):247-266. doi: 10.1021/acsengineeringau.5c00004. eCollection 2025 Jun 18.
9
Optimal design of a wheelchair-mounted robotic arm for activities of daily living.用于日常生活活动的轮椅安装式机器人手臂的优化设计。
Disabil Rehabil Assist Technol. 2025 Jul;20(5):1539-1556. doi: 10.1080/17483107.2025.2459890. Epub 2025 Feb 18.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.