• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过分层强化学习增强多无人机空战决策

Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning.

作者信息

Wang Huan, Wang Jintao

机构信息

College of Artificial Intelligence and Automation, Hohai University, Changzhou, 213200, China.

College of information and Network Engineering, Anhui Science and Technology University, Chuzhou, 233030, China.

出版信息

Sci Rep. 2024 Feb 23;14(1):4458. doi: 10.1038/s41598-024-54938-5.

DOI:10.1038/s41598-024-54938-5
PMID:38396185
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10891071/
Abstract

In the realm of air combat, autonomous decision-making in regard to Unmanned Aerial Vehicle (UAV) has emerged as a critical force. However, prevailing autonomous decision-making algorithms in this domain predominantly rely on rule-based methods, proving challenging to design and implement optimal solutions in complex multi-UAV combat environments. This paper proposes a novel approach to multi-UAV air combat decision-making utilizing hierarchical reinforcement learning. First, a hierarchical decision-making network is designed based on tactical action types to streamline the complexity of the maneuver decision-making space. Second, the high-quality combat experience gained from training is decomposed, with the aim of augmenting the quantity of valuable experiences and alleviating the intricacies of strategy learning. Finally, the performance of the algorithm is validated using the advanced UAV simulation platform JSBSim. Through comparisons with various baseline algorithms, our experiments demonstrate the superior performance of the proposed method in both even and disadvantaged air combat environments.

摘要

在空战领域,无人机的自主决策已成为一股关键力量。然而,该领域现有的自主决策算法主要依赖基于规则的方法,在复杂的多无人机作战环境中设计和实施最优解决方案具有挑战性。本文提出了一种利用分层强化学习的多无人机空战决策新方法。首先,基于战术行动类型设计了一个分层决策网络,以简化机动决策空间的复杂性。其次,对训练中获得的高质量作战经验进行分解,目的是增加有价值经验的数量并减轻策略学习的复杂性。最后,使用先进的无人机仿真平台JSBSim验证了算法的性能。通过与各种基线算法的比较,我们的实验证明了所提方法在均势和劣势空战环境中的优越性能。

相似文献

1
Enhancing multi-UAV air combat decision making via hierarchical reinforcement learning.通过分层强化学习增强多无人机空战决策
Sci Rep. 2024 Feb 23;14(1):4458. doi: 10.1038/s41598-024-54938-5.
2
A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning.一种基于深度强化学习的无人机自主空投机动决策算法
Sensors (Basel). 2021 Mar 23;21(6):2233. doi: 10.3390/s21062233.
3
UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy.基于深度强化学习策略的无人机自主跟踪与着陆
Sensors (Basel). 2020 Oct 1;20(19):5630. doi: 10.3390/s20195630.
4
Autonomous maneuver decision-making method based on reinforcement learning and Monte Carlo tree search.基于强化学习和蒙特卡洛树搜索的自主机动决策方法
Front Neurorobot. 2022 Oct 25;16:996412. doi: 10.3389/fnbot.2022.996412. eCollection 2022.
5
A Bio-Inspired Decision-Making Method of UAV Swarm for Attack-Defense Confrontation via Multi-Agent Reinforcement Learning.一种基于多智能体强化学习的无人机集群攻防对抗生物启发式决策方法。
Biomimetics (Basel). 2023 May 25;8(2):222. doi: 10.3390/biomimetics8020222.
6
Hierarchical Reinforcement Learning Framework in Geographic Coordination for Air Combat Tactical Pursuit.用于空战战术追击地理协调的分层强化学习框架
Entropy (Basel). 2023 Oct 1;25(10):1409. doi: 10.3390/e25101409.
7
Research on UCAV Maneuvering Decision Method Based on Heuristic Reinforcement Learning.基于启发式强化学习的无人机机动决策方法研究。
Comput Intell Neurosci. 2022 Mar 3;2022:1477078. doi: 10.1155/2022/1477078. eCollection 2022.
8
A Channel Rendezvous Algorithm for Multi-Unmanned Aerial Vehicle Networks Based on Average Consensus.一种基于平均一致性的多无人机网络信道交会算法
Sensors (Basel). 2023 Sep 25;23(19):8076. doi: 10.3390/s23198076.
9
Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles.基于强化学习的多无人机安全决策方法研究
Front Neurorobot. 2023 Jan 10;16:1105480. doi: 10.3389/fnbot.2022.1105480. eCollection 2022.
10
Multi-UAV Collaborative Search and Attack Mission Decision-Making in Unknown Environments.未知环境下多无人机协同搜索与攻击任务决策
Sensors (Basel). 2023 Aug 24;23(17):7398. doi: 10.3390/s23177398.

引用本文的文献

1
Research on Real-Time Roundup and Dynamic Allocation Methods for Multi-Dynamic Target Unmanned Aerial Vehicles.多动态目标无人机实时综述与动态分配方法研究
Sensors (Basel). 2024 Oct 12;24(20):6565. doi: 10.3390/s24206565.

本文引用的文献

1
Situation assessment in air combat considering incomplete frame of discernment in the generalized evidence theory.在广义证据理论中考虑不完备分辨框架的空战态势评估
Sci Rep. 2022 Dec 31;12(1):22639. doi: 10.1038/s41598-022-27076-z.
2
A hierarchical reinforcement learning method for missile evasion and guidance.一种用于导弹规避与制导的分层强化学习方法。
Sci Rep. 2022 Nov 7;12(1):18888. doi: 10.1038/s41598-022-21756-6.
3
Dynamic-boundary-based lateral motion synergistic control of distributed drive autonomous vehicle.基于动态边界的分布式驱动自动驾驶车辆横向运动协同控制
Sci Rep. 2021 Nov 22;11(1):22644. doi: 10.1038/s41598-021-01947-3.