基于意图推理和深度强化学习的城市逃避目标长期跟踪

Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning.

作者信息

Yan Peng, Guo Jifeng, Su Xiaojie, Bai Chengchao

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16886-16900. doi: 10.1109/TNNLS.2023.3298944. Epub 2024 Oct 29.

DOI:10.1109/TNNLS.2023.3298944

Abstract

Unmanned aerial vehicles (UAVs) have been widely used in urban target-tracking tasks, where long-term tracking of evasive targets is of great significance for public safety. However, the tracked targets are easily lost due to the evasive behavior of the targets and the unstructured characteristics of the urban environment. To address this issue, this article proposes a hybrid target-tracking approach based on target intention inference and deep reinforcement learning (DRL). First, a target intention inference model based on convolution neural networks (CNNs) is built to infer target intentions by fusing urban environment information and observed target trajectory. Then, the prediction of the target trajectory can be inspired by the inferred target intentions, which can further provide effective guidance to the target search process. In order to fully explore the policy space, the target search policy is developed under a DRL framework, where the search policy is modeled as a deep neural network (DNN) and trained by interacting with the task environment. The simulation results show that the inference of the target intentions can effectively guide the UAV to search for the target and significantly improve the target-tracking performance. Meanwhile, the generalization results indicate that the proposed DRL-based search policy has high robustness to the uncertainty of the target behavior.

摘要

无人机已广泛应用于城市目标跟踪任务，其中对规避目标的长期跟踪对公共安全具有重要意义。然而，由于目标的规避行为和城市环境的非结构化特征，被跟踪目标很容易丢失。为了解决这个问题，本文提出了一种基于目标意图推理和深度强化学习（DRL）的混合目标跟踪方法。首先，构建基于卷积神经网络（CNN）的目标意图推理模型，通过融合城市环境信息和观测到的目标轨迹来推断目标意图。然后，目标轨迹的预测可以受到推断出的目标意图的启发，这可以进一步为目标搜索过程提供有效指导。为了充分探索策略空间，在DRL框架下开发目标搜索策略，其中搜索策略被建模为深度神经网络（DNN）并通过与任务环境交互进行训练。仿真结果表明，目标意图的推理可以有效地引导无人机搜索目标，并显著提高目标跟踪性能。同时，泛化结果表明，所提出的基于DRL的搜索策略对目标行为的不确定性具有很高的鲁棒性。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于意图推理和深度强化学习的城市逃避目标长期跟踪

Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning.

作者信息

出版信息

相似文献

基于意图推理和深度强化学习的城市逃避目标长期跟踪

Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning.

作者信息

出版信息

相似文献