Chi Pei, Wei Jiahong, Wu Kun, Di Bin, Wang Yingxun
Institute of Unmanned System, Beihang University, Beijing 100191, China.
School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China.
Biomimetics (Basel). 2023 May 25;8(2):222. doi: 10.3390/biomimetics8020222.
The unmanned aerial vehicle (UAV) swarm is regarded as having a significant role in modern warfare. The demand for UAV swarms with the capability of attack-defense confrontation is urgent. The existing decision-making methods of UAV swarm confrontation, such as multi-agent reinforcement learning (MARL), suffer from an exponential increase in training time as the size of the swarm increases. Inspired by group hunting behavior in nature, this paper presents a new bio-inspired decision-making method for UAV swarms for attack-defense confrontation via MARL. Firstly, a UAV swarm decision-making framework for confrontation based on grouping mechanisms is established. Secondly, a bio-inspired action space is designed, and a dense reward is added to the reward function to accelerate the convergence speed of training. Finally, numerical experiments are conducted to evaluate the performance of our method. The experiment results show that the proposed method can be applied to a swarm of 12 UAVs, and when the maximum acceleration of the enemy UAV is within 2.5 times ours, the swarm can well intercept the enemy, and the success rate is above 91%.
无人机集群在现代战争中被视为具有重要作用。对具备攻防对抗能力的无人机集群的需求十分迫切。现有的无人机集群对抗决策方法,如多智能体强化学习(MARL),随着集群规模的增大,训练时间会呈指数级增长。受自然界群体狩猎行为的启发,本文提出一种新的受生物启发的无人机集群攻防对抗决策方法,通过多智能体强化学习实现。首先,建立基于分组机制的无人机集群对抗决策框架。其次,设计受生物启发的动作空间,并在奖励函数中添加密集奖励以加快训练收敛速度。最后,进行数值实验以评估我们方法的性能。实验结果表明,所提方法可应用于12架无人机的集群,当敌方无人机的最大加速度在我方的2.5倍以内时,该集群能够很好地拦截敌方,成功率在91%以上。