• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度Q网络协作框架的领导者-跟随者无人机编队控制

Leader-follower UAVs formation control based on a deep Q-network collaborative framework.

作者信息

Liu Zhijun, Li Jie, Shen Jian, Wang Xiaoguang, Chen Pengyun

机构信息

Shenzhen MSU-BIT University, Shenzhen, 518172, China.

School of Mechatronical Engineering, Beijing Institute of Technology, Beijing, 100081, China.

出版信息

Sci Rep. 2024 Feb 26;14(1):4674. doi: 10.1038/s41598-024-54531-w.

DOI:10.1038/s41598-024-54531-w
PMID:38409308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11316062/
Abstract

This study examines a collaborative framework that utilizes an intelligent deep Q-network to regulate the formation of leader-follower Unmanned Aerial Vehicles (UAVs). The aim is to tackle the challenges posed by the highly dynamic and uncertain flight environment of UAVs. In the context of UAVs, we have developed a dynamic model that captures the collective state of the system. This model encompasses variables like as the relative positions, heading angle, rolling angle, and velocity of different nodes in the formation. In the subsequent section, we elucidate the operational procedure of UAVs in a collaborative manner, employing the conceptual framework of Markov Decision Process (MDP). Furthermore, we employ the Reinforcement Learning (RL) to facilitate this process. In light of this premise, a fundamental framework is presented for addressing the control problem of UAVs utilizing the DQN scheme. This framework encompasses a technique for action selection known as [Formula: see text]-imitation, as well as algorithmic specifics. Finally, the efficacy and portability of the DQN-based approach are substantiated by numerical simulation validation. The average reward curve demonstrates a satisfactory level of convergence, and kinematic link between the nodes inside the formation satisfies the essential requirements for the creation of a controller.

摘要

本研究考察了一种协作框架,该框架利用智能深度Q网络来调控无人机编队中领机-僚机的编队形式。目的是应对无人机高度动态且不确定的飞行环境所带来的挑战。在无人机的背景下,我们开发了一个动态模型来捕捉系统的集体状态。该模型包含编队中不同节点的相对位置、航向角、滚转角和速度等变量。在接下来的部分,我们采用马尔可夫决策过程(MDP)的概念框架,以协作方式阐明无人机的操作流程。此外,我们运用强化学习(RL)来推动这一过程。基于这一前提,提出了一个利用深度Q网络(DQN)方案解决无人机控制问题的基本框架。该框架包含一种称为[公式:见原文]-模仿的动作选择技术以及算法细节。最后,通过数值模拟验证证实了基于深度Q网络方法的有效性和可移植性。平均奖励曲线显示出令人满意的收敛水平,并且编队中节点之间的运动学联系满足创建控制器的基本要求。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/160c21399deb/41598_2024_54531_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/0e802d4672ce/41598_2024_54531_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/d2f98f0f1f87/41598_2024_54531_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/1359490c3142/41598_2024_54531_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/6408179a5cb9/41598_2024_54531_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/b82b73c8ee22/41598_2024_54531_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/b066b220fc39/41598_2024_54531_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/c5ad2a2f580a/41598_2024_54531_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/85cd370ffef7/41598_2024_54531_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/85abdcf432d0/41598_2024_54531_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/f574988710a7/41598_2024_54531_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/706d764cb82b/41598_2024_54531_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/160c21399deb/41598_2024_54531_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/0e802d4672ce/41598_2024_54531_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/d2f98f0f1f87/41598_2024_54531_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/1359490c3142/41598_2024_54531_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/6408179a5cb9/41598_2024_54531_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/b82b73c8ee22/41598_2024_54531_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/b066b220fc39/41598_2024_54531_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/c5ad2a2f580a/41598_2024_54531_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/85cd370ffef7/41598_2024_54531_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/85abdcf432d0/41598_2024_54531_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/f574988710a7/41598_2024_54531_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/706d764cb82b/41598_2024_54531_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c75a/11316062/160c21399deb/41598_2024_54531_Fig10_HTML.jpg

相似文献

1
Leader-follower UAVs formation control based on a deep Q-network collaborative framework.基于深度Q网络协作框架的领导者-跟随者无人机编队控制
Sci Rep. 2024 Feb 26;14(1):4674. doi: 10.1038/s41598-024-54531-w.
2
A Q-Learning Approach to Flocking With UAVs in a Stochastic Environment.基于 Q 学习的无人机在随机环境中的群体行为控制方法。
IEEE Trans Cybern. 2017 Jan;47(1):186-197. doi: 10.1109/TCYB.2015.2509646. Epub 2016 Jan 5.
3
Distributed Finite-Time Fault-Tolerant Containment Control for Multiple Unmanned Aerial Vehicles.多无人机的分布式有限时间容错包容控制
IEEE Trans Neural Netw Learn Syst. 2020 Jun;31(6):2077-2091. doi: 10.1109/TNNLS.2019.2927887. Epub 2019 Aug 9.
4
Formation control of fixed-wing UAVs with communication delay.具有通信延迟的固定翼无人机编队控制
ISA Trans. 2024 Mar;146:154-164. doi: 10.1016/j.isatra.2023.12.036. Epub 2023 Dec 27.
5
Reinforcement Learning-Based Fractional-Order Adaptive Fault-Tolerant Formation Control of Networked Fixed-Wing UAVs With Prescribed Performance.基于强化学习的具有规定性能的网络化固定翼无人机分数阶自适应容错编队控制
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3365-3379. doi: 10.1109/TNNLS.2023.3281403. Epub 2024 Feb 29.
6
Cooperative Location Method for Leader-Follower UAV Formation Based on Follower UAV's Moving Vector.基于跟随无人机运动矢量的领导者-跟随者无人机编队协同定位方法
Sensors (Basel). 2022 Sep 20;22(19):7125. doi: 10.3390/s22197125.
7
Development of Multiple UAV Collaborative Driving Systems for Improving Field Phenotyping.多架无人机协同驾驶系统的开发,用于改善田间表型分析。
Sensors (Basel). 2022 Feb 12;22(4):1423. doi: 10.3390/s22041423.
8
Virtual Electric Dipole Field Applied to Autonomous Formation Flight Control of Unmanned Aerial Vehicles.虚拟电偶极子场应用于无人机自主编队飞行控制
Sensors (Basel). 2021 Jul 1;21(13):4540. doi: 10.3390/s21134540.
9
Relative Localization and Circumnavigation of a UGV0 Based on Mixed Measurements of Multi-UAVs by Employing Intelligent Sensors.基于多无人机混合测量并采用智能传感器的UGV0相对定位与环绕导航
Sensors (Basel). 2024 Apr 7;24(7):2347. doi: 10.3390/s24072347.
10
Relational Maneuvering of Leader-Follower Unmanned Aerial Vehicles for Flexible Formation.用于灵活编队的领导者-跟随者无人飞行器的关系操纵
IEEE Trans Cybern. 2024 Oct;54(10):5598-5609. doi: 10.1109/TCYB.2024.3435029. Epub 2024 Oct 9.

引用本文的文献

1
Trajectory optimization of UAV-IRS assisted 6G THz network using deep reinforcement learning approach.基于深度强化学习方法的无人机-智能反射面辅助6G太赫兹网络轨迹优化
Sci Rep. 2024 Aug 9;14(1):18501. doi: 10.1038/s41598-024-68459-8.

本文引用的文献

1
Cooperative USV-UAV marine search and rescue with visual navigation and reinforcement learning-based control.基于视觉导航和强化学习控制的协同 USV-UAV 海上搜索与救援。
ISA Trans. 2023 Jun;137:222-235. doi: 10.1016/j.isatra.2023.01.007. Epub 2023 Jan 26.
2
Swarm of micro flying robots in the wild.野外的微型飞行机器人群。
Sci Robot. 2022 May 4;7(66):eabm5954. doi: 10.1126/scirobotics.abm5954.
3
GrpAvoid: Multigroup Collision-Avoidance Control and Optimization for UAV Swarm.GrpAvoid:无人机群的多群组避碰控制与优化
IEEE Trans Cybern. 2023 Mar;53(3):1776-1789. doi: 10.1109/TCYB.2021.3132044. Epub 2023 Feb 15.
4
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.