• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于延迟感知的无人机辅助物联网应用智能编队控制。

Delay-Informed Intelligent Formation Control for UAV-Assisted IoT Application.

机构信息

School of Statistics and Data Science, Beijing Wuzi University, Beijing 101149, China.

Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China.

出版信息

Sensors (Basel). 2023 Jul 6;23(13):6190. doi: 10.3390/s23136190.

DOI:10.3390/s23136190
PMID:37448039
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10347050/
Abstract

Multiple unmanned aerial vehicles (UAVs) have a greater potential to be widely used in UAV-assisted IoT applications. UAV formation, as an effective way to improve surveillance and security, has been extensively of concern. The leader-follower approach is efficient for UAV formation, as the whole formation system needs to find only the leader's trajectory. This paper studies the leader-follower surveillance system. Owing to different scenarios and assignments, the leading velocity is dynamic. The inevitable communication time delays resulting from information sending, communicating and receiving process bring challenges in the design of real-time UAV formation control. In this paper, the design of UAV formation tracking based on deep reinforcement learning (DRL) is investigated for high mobility scenarios in the presence of communication delay. To be more specific, the optimization UAV formation problem is firstly formulated to be a state error minimization problem by using the quadratic cost function when the communication delay is considered. Then, the delay-informed Markov decision process (DIMDP) is developed by including the previous actions in order to compensate the performance degradation induced by the time delay. Subsequently, an extended-delay informed deep deterministic policy gradient (DIDDPG) algorithm is proposed. Finally, some issues, such as computational complexity analysis and the effect of the time delay are discussed, and then the proposed intelligent algorithm is further extended to the arbitrary communication delay case. Numerical experiments demonstrate that the proposed DIDDPG algorithm can significantly alleviate the performance degradation caused by time delays.

摘要

多架无人机(UAV)在无人机辅助物联网应用中具有更大的应用潜力。无人机编队作为提高监控和安全性的有效手段,受到了广泛关注。领导者-跟随者方法是无人机编队的有效方法,因为整个编队系统只需要找到领导者的轨迹。本文研究了领导者-跟随者监控系统。由于不同的场景和任务分配,领导速度是动态的。信息发送、通信和接收过程中不可避免的通信时间延迟给实时无人机编队控制的设计带来了挑战。在本文中,研究了在存在通信延迟的情况下,基于深度强化学习(DRL)的无人机编队跟踪设计,用于高机动性场景。更具体地说,通过使用二次代价函数,将通信延迟考虑在内,首先将优化的无人机编队问题公式化为状态误差最小化问题。然后,通过包括以前的动作来开发包含延迟的马尔可夫决策过程(DIMDP),以补偿由延迟引起的性能下降。随后,提出了一种扩展延迟信息深度确定性策略梯度(DIDDPG)算法。最后,讨论了一些问题,如计算复杂度分析和延迟的影响,然后进一步将所提出的智能算法扩展到任意通信延迟情况。数值实验表明,所提出的 DIDDPG 算法可以显著减轻由延迟引起的性能下降。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/43ff5a35418a/sensors-23-06190-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/670583522a5d/sensors-23-06190-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/ead5449287d6/sensors-23-06190-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/518b9300f5a2/sensors-23-06190-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/a67d10376e26/sensors-23-06190-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/16e74bc38f96/sensors-23-06190-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/b9e5d19efe9d/sensors-23-06190-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/5cc5bfc0c1dc/sensors-23-06190-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/b9fb034058d4/sensors-23-06190-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/a102fd8c66d8/sensors-23-06190-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/3d3d828af99b/sensors-23-06190-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/05a879d19799/sensors-23-06190-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/51a917d33c81/sensors-23-06190-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/36fb8b5cb315/sensors-23-06190-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/43ff5a35418a/sensors-23-06190-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/670583522a5d/sensors-23-06190-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/ead5449287d6/sensors-23-06190-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/518b9300f5a2/sensors-23-06190-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/a67d10376e26/sensors-23-06190-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/16e74bc38f96/sensors-23-06190-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/b9e5d19efe9d/sensors-23-06190-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/5cc5bfc0c1dc/sensors-23-06190-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/b9fb034058d4/sensors-23-06190-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/a102fd8c66d8/sensors-23-06190-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/3d3d828af99b/sensors-23-06190-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/05a879d19799/sensors-23-06190-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/51a917d33c81/sensors-23-06190-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/36fb8b5cb315/sensors-23-06190-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6769/10347050/43ff5a35418a/sensors-23-06190-g014.jpg

相似文献

1
Delay-Informed Intelligent Formation Control for UAV-Assisted IoT Application.基于延迟感知的无人机辅助物联网应用智能编队控制。
Sensors (Basel). 2023 Jul 6;23(13):6190. doi: 10.3390/s23136190.
2
Optimal UAV Formation Tracking Control with Dynamic Leading Velocity and Network-Induced Delays.具有动态引导速度和网络诱导延迟的无人机最优编队跟踪控制
Entropy (Basel). 2022 Feb 21;24(2):305. doi: 10.3390/e24020305.
3
Task Offloading Strategy for Unmanned Aerial Vehicle Power Inspection Based on Deep Reinforcement Learning.基于深度强化学习的无人机电力巡检任务卸载策略
Sensors (Basel). 2024 Mar 24;24(7):2070. doi: 10.3390/s24072070.
4
Trajectory optimization of UAV-IRS assisted 6G THz network using deep reinforcement learning approach.基于深度强化学习方法的无人机-智能反射面辅助6G太赫兹网络轨迹优化
Sci Rep. 2024 Aug 9;14(1):18501. doi: 10.1038/s41598-024-68459-8.
5
Relational Maneuvering of Leader-Follower Unmanned Aerial Vehicles for Flexible Formation.用于灵活编队的领导者-跟随者无人飞行器的关系操纵
IEEE Trans Cybern. 2024 Oct;54(10):5598-5609. doi: 10.1109/TCYB.2024.3435029. Epub 2024 Oct 9.
6
Cooperative Location Method for Leader-Follower UAV Formation Based on Follower UAV's Moving Vector.基于跟随无人机运动矢量的领导者-跟随者无人机编队协同定位方法
Sensors (Basel). 2022 Sep 20;22(19):7125. doi: 10.3390/s22197125.
7
Proactive Handover Decision for UAVs with Deep Reinforcement Learning.基于深度强化学习的无人机主动交接决策
Sensors (Basel). 2022 Feb 5;22(3):1200. doi: 10.3390/s22031200.
8
Biological Intelligence Inspired Trajectory Design for Energy Harvesting UAV Networks.受生物智能启发的能量收集无人机网络轨迹设计。
Sensors (Basel). 2023 Jan 11;23(2):863. doi: 10.3390/s23020863.
9
Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments.动态多障碍物环境下基于深度强化学习的多无人机同步目标分配与路径规划
Front Neurorobot. 2024 Jan 22;17:1302898. doi: 10.3389/fnbot.2023.1302898. eCollection 2023.
10
Multi-UAV Path Planning in GPS and Communication Denial Environment.多无人机在 GPS 和通信干扰环境下的路径规划。
Sensors (Basel). 2023 Mar 10;23(6):2997. doi: 10.3390/s23062997.

引用本文的文献

1
Multi-UAV Collaborative Search and Attack Mission Decision-Making in Unknown Environments.未知环境下多无人机协同搜索与攻击任务决策
Sensors (Basel). 2023 Aug 24;23(17):7398. doi: 10.3390/s23177398.

本文引用的文献

1
USV Formation and Path-Following Control via Deep Reinforcement Learning With Random Braking.基于随机制动的深度强化学习的无人水面航行器编队形成与路径跟踪控制
IEEE Trans Neural Netw Learn Syst. 2021 Dec;32(12):5468-5478. doi: 10.1109/TNNLS.2021.3068762. Epub 2021 Nov 30.
2
Leader-Follower Output Synchronization of Linear Heterogeneous Systems With Active Leader Using Reinforcement Learning.使用强化学习的主动领导者的线性异类系统的领导者-跟随者输出同步。
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2139-2153. doi: 10.1109/TNNLS.2018.2803059.
3
Human-level control through deep reinforcement learning.
通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
4
Congested traffic states in empirical observations and microscopic simulations.实证观察和微观模拟中的拥堵交通状态。
Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 2000 Aug;62(2 Pt A):1805-24. doi: 10.1103/physreve.62.1805.