• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多智能体软演员-评论家的移动机器人混合运动规划器

Multiagent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots.

作者信息

He Zichen, Dong Lu, Song Chunwei, Sun Changyin

出版信息

IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10980-10992. doi: 10.1109/TNNLS.2022.3172168. Epub 2023 Nov 30.

DOI:10.1109/TNNLS.2022.3172168
PMID:35552145
Abstract

In this article, a novel hybrid multirobot motion planner that can be applied under no explicit communication and local observable conditions is presented. The planner is model-free and can realize the end-to-end mapping of multirobot state and observation information to final smooth and continuous trajectories. The planner is a front-end and back-end separated architecture. The design of the front-end collaborative waypoints searching module is based on the multiagent soft actor-critic (MASAC) algorithm under the centralized training with decentralized execution (CTDE) diagram. The design of the back-end trajectory optimization module is based on the minimal snap method with safety zone constraints. This module can output the final dynamic-feasible and executable trajectories. Finally, multigroup experimental results verify the effectiveness of the proposed motion planner.

摘要

本文提出了一种新型混合多机器人运动规划器,该规划器可在无明确通信和局部可观测条件下应用。该规划器无需模型,能够实现多机器人状态和观测信息到最终平滑连续轨迹的端到端映射。该规划器采用前端和后端分离架构。前端协作航路点搜索模块基于集中训练分散执行(CTDE)框架下的多智能体软演员-评论家(MASAC)算法进行设计。后端轨迹优化模块基于带安全区约束的最小急动方法进行设计。该模块能够输出最终的动态可行且可执行的轨迹。最后,多组实验结果验证了所提出的运动规划器的有效性。

相似文献

1
Multiagent Soft Actor-Critic Based Hybrid Motion Planner for Mobile Robots.基于多智能体软演员-评论家的移动机器人混合运动规划器
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10980-10992. doi: 10.1109/TNNLS.2022.3172168. Epub 2023 Nov 30.
2
A Local-and-Global Attention Reinforcement Learning Algorithm for Multiagent Cooperative Navigation.一种用于多智能体协作导航的局部与全局注意力强化学习算法
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7767-7777. doi: 10.1109/TNNLS.2022.3220798. Epub 2024 Jun 3.
3
Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination.用于大规模分布式多机器人协调的图软演员-评论家强化学习
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):665-676. doi: 10.1109/TNNLS.2023.3329530. Epub 2025 Jan 7.
4
Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making.基于专家系统的多智能体深度确定性策略梯度用于群体机器人决策
IEEE Trans Cybern. 2024 Mar;54(3):1614-1624. doi: 10.1109/TCYB.2022.3228578. Epub 2024 Feb 9.
5
Trajectory Planner CDT-RRT* for Car-Like Mobile Robots toward Narrow and Cluttered Environments.面向狭窄且杂乱环境的类车移动机器人轨迹规划器 CDT-RRT*
Sensors (Basel). 2021 Jul 15;21(14):4828. doi: 10.3390/s21144828.
6
Adaptive Hybrid Optimization Learning-Based Accurate Motion Planning of Multi-Joint Arm.基于自适应混合优化学习的多关节手臂精确运动规划
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5440-5451. doi: 10.1109/TNNLS.2023.3262109. Epub 2023 Sep 1.
7
An Optimized Trajectory Planner and Motion Controller Framework for Autonomous Driving in Unstructured Environments.一种用于非结构化环境中自动驾驶的优化轨迹规划器和运动控制器框架。
Sensors (Basel). 2021 Jun 27;21(13):4409. doi: 10.3390/s21134409.
8
End-to-End AUV Motion Planning Method Based on Soft Actor-Critic.基于软动作 - 批评家的端到端 AUV 运动规划方法。
Sensors (Basel). 2021 Sep 1;21(17):5893. doi: 10.3390/s21175893.
9
An Optimization-Based Motion Planner for Car-like Logistics Robots on Narrow Roads.基于优化的窄路汽车物流机器人运动规划器。
Sensors (Basel). 2022 Nov 18;22(22):8948. doi: 10.3390/s22228948.
10
Robust Reward-Free Actor-Critic for Cooperative Multiagent Reinforcement Learning.用于合作多智能体强化学习的鲁棒无奖励行动者-评论家算法
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17318-17329. doi: 10.1109/TNNLS.2023.3302131. Epub 2024 Dec 2.

引用本文的文献

1
RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning.RL-QPSO网络:用于高效移动机器人路径规划的深度强化学习增强型量子粒子群优化算法
Front Neurorobot. 2025 Jan 8;18:1464572. doi: 10.3389/fnbot.2024.1464572. eCollection 2024.
2
Decentralized multi-agent reinforcement learning based on best-response policies.基于最佳响应策略的分布式多智能体强化学习
Front Robot AI. 2024 Apr 16;11:1229026. doi: 10.3389/frobt.2024.1229026. eCollection 2024.