基于双目标框架的多机器人编队分布式深度强化学习

Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.

作者信息

Li Jinming, Liu Qingshan, Chi Guoyi

机构信息

School of Mathematics, Southeast University, Nanjing 210096, China.

School of Mathematics, Southeast University, Nanjing 210096, China; Purple Mountain Laboratories, Nanjing 211111, China.

出版信息

Neural Netw. 2024 Mar;171:61-72. doi: 10.1016/j.neunet.2023.11.063. Epub 2023 Dec 1.

DOI:10.1016/j.neunet.2023.11.063

PMID:38091765

Abstract

Improving generalization ability in multi-robot formation can reduce repetitive training and calculation. In this paper, we study the multi-robot formation problem with the ability to generalize the target position. Since the generalization ability of neural network is directly proportional to spatial dimension, we adopt the strategy of using different networks to solve different objectives, so that the network learning can focus on the learning of one objective to obtain better performance. In addition, this paper presents a distributed deep reinforcement learning method based on soft actor-critic algorithm for solving multi-robot formation problem. At the same time, the formation evaluation assignment function is designed to adapt to distributed training. Compared with the original algorithm, the improved algorithm can get higher reward cumulative values. The experimental results show that the proposed algorithm can better maintain the desired formation in the moving process, and the rotation design in the reward function makes the multi-robot system have better flexibility in formation. The comparison of control signal curve shows that the proposed algorithm is more stable. At the end of the experiments, the universality of the proposed algorithm in formation maintenance and formation variations is demonstrated.

摘要

提高多机器人编队中的泛化能力可以减少重复训练和计算。在本文中，我们研究了具有泛化目标位置能力的多机器人编队问题。由于神经网络的泛化能力与空间维度成正比，我们采用使用不同网络解决不同目标的策略，以便网络学习能够专注于一个目标的学习以获得更好的性能。此外，本文提出了一种基于软演员-评论家算法的分布式深度强化学习方法来解决多机器人编队问题。同时，设计了编队评估分配函数以适应分布式训练。与原始算法相比，改进算法能够获得更高的奖励累积值。实验结果表明，所提出的算法能够在移动过程中更好地保持期望的编队，并且奖励函数中的旋转设计使得多机器人系统在编队方面具有更好的灵活性。控制信号曲线的比较表明所提出的算法更稳定。在实验结束时，证明了所提出算法在编队维持和编队变化方面的通用性。

相似文献

Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.

Neural Netw. 2024 Mar;171:61-72. doi: 10.1016/j.neunet.2023.11.063. Epub 2023 Dec 1.

A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation.

Neural Netw. 2024 Aug;176:106359. doi: 10.1016/j.neunet.2024.106359. Epub 2024 May 3.

A Framework and Algorithm for Human-Robot Collaboration Based on Multimodal Reinforcement Learning.

Comput Intell Neurosci. 2022 Sep 28;2022:2341898. doi: 10.1155/2022/2341898. eCollection 2022.

The Intelligent Path Planning System of Agricultural Robot via Reinforcement Learning.

Sensors (Basel). 2022 Jun 7;22(12):4316. doi: 10.3390/s22124316.

A deep reinforcement learning algorithm for the rectangular strip packing problem.

PLoS One. 2023 Mar 16;18(3):e0282598. doi: 10.1371/journal.pone.0282598. eCollection 2023.

Stable Jumping Control Based on Deep Reinforcement Learning for a Locust-Inspired Robot.

Biomimetics (Basel). 2024 Sep 11;9(9):548. doi: 10.3390/biomimetics9090548.

Modular deep reinforcement learning from reward and punishment for robot navigation.

Neural Netw. 2021 Mar;135:115-126. doi: 10.1016/j.neunet.2020.12.001. Epub 2020 Dec 8.

Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning.

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3741-3752. doi: 10.1109/TNNLS.2021.3054402. Epub 2022 Aug 3.

Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning.

Sensors (Basel). 2021 Sep 2;21(17):5907. doi: 10.3390/s21175907.

An approach to solving optimal control problems of nonlinear systems by introducing detail-reward mechanism in deep reinforcement learning.

Math Biosci Eng. 2022 Jun 23;19(9):9258-9290. doi: 10.3934/mbe.2022430.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于双目标框架的多机器人编队分布式深度强化学习

Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.

作者信息

Li Jinming, Liu Qingshan, Chi Guoyi

机构信息

School of Mathematics, Southeast University, Nanjing 210096, China.

School of Mathematics, Southeast University, Nanjing 210096, China; Purple Mountain Laboratories, Nanjing 211111, China.

出版信息

Neural Netw. 2024 Mar;171:61-72. doi: 10.1016/j.neunet.2023.11.063. Epub 2023 Dec 1.

DOI:10.1016/j.neunet.2023.11.063

PMID:38091765

Abstract

摘要

基于双目标框架的多机器人编队分布式深度强化学习

Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.

作者信息

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于双目标框架的多机器人编队分布式深度强化学习

Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.

作者信息

机构信息

出版信息

相似文献