基于优先经验回放的深度双Q网络优化地理路由协议，用于考虑无人机自组织网络链路稳定性和能量预测的情况。

DDQN with Prioritized Experience Replay-Based Optimized Geographical Routing Protocol of Considering Link Stability and Energy Prediction for UANET.

作者信息

Zhang Yanan, Qiu Hongbing

机构信息

School of Information and Communications, Guilin University of Electronic Technology, Guilin 541004, China.

出版信息

Sensors (Basel). 2022 Jul 3;22(13):5020. doi: 10.3390/s22135020.

DOI:10.3390/s22135020

PMID:35808515

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9269789/

Abstract

Unmanned aerial vehicles (UAVs) are important equipment for efficiently executing search and rescue missions in disaster or air-crash scenarios. Each node can communicate with the others by a routing protocol in UAV ad hoc networks (UANETs). However, UAV routing protocols are faced with the challenges of high mobility and limited node energy, which hugely lead to unstable link and sparse network topology due to premature node death. Eventually, this severely affects network performance. In order to solve these problems, we proposed the deep-reinforcement-learning-based geographical routing protocol of considering link stability and energy prediction (DSEGR) for UANETs. First of all, we came up with the link stability evaluation indicator and utilized the autoregressive integrated moving average (ARIMA) model to predict the residual energy of neighbor nodes. Then, the packet forward process was modeled as a Markov Decision Process, and according to a deep double Q network with prioritized experience replay to learn the routing-decision process. Meanwhile, a reward function was designed to obtain a better convergence rate, and the analytic hierarchy process (AHP) was used to analyze the weights of the considered factors in the reward function. Finally, to verify the effectiveness of DSEGR, we conducted simulation experiments to analyze network performance. The simulation results demonstrate that our proposed routing protocol remarkably outperforms others in packet delivery ratio and has a faster convergence rate.

摘要

无人机是在灾难或空难场景中高效执行搜索和救援任务的重要设备。在无人机自组织网络（UANETs）中，每个节点可以通过路由协议与其他节点进行通信。然而，无人机路由协议面临着高移动性和节点能量有限的挑战，这会因节点过早死亡而极大地导致链路不稳定和网络拓扑稀疏，最终严重影响网络性能。为了解决这些问题，我们提出了一种基于深度强化学习的、考虑链路稳定性和能量预测的UANETs地理路由协议（DSEGR）。首先，我们提出了链路稳定性评估指标，并利用自回归积分移动平均（ARIMA）模型预测邻居节点的剩余能量。然后，将数据包转发过程建模为马尔可夫决策过程，并根据带有优先经验回放的深度双Q网络来学习路由决策过程。同时，设计了一个奖励函数以获得更好的收敛速度，并使用层次分析法（AHP）来分析奖励函数中所考虑因素的权重。最后，为了验证DSEGR的有效性，我们进行了模拟实验来分析网络性能。模拟结果表明，我们提出的路由协议在数据包交付率方面显著优于其他协议，并且具有更快的收敛速度。