• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于意图推理和深度强化学习的城市逃避目标长期跟踪

Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning.

作者信息

Yan Peng, Guo Jifeng, Su Xiaojie, Bai Chengchao

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16886-16900. doi: 10.1109/TNNLS.2023.3298944. Epub 2024 Oct 29.

DOI:10.1109/TNNLS.2023.3298944
PMID:37566499
Abstract

Unmanned aerial vehicles (UAVs) have been widely used in urban target-tracking tasks, where long-term tracking of evasive targets is of great significance for public safety. However, the tracked targets are easily lost due to the evasive behavior of the targets and the unstructured characteristics of the urban environment. To address this issue, this article proposes a hybrid target-tracking approach based on target intention inference and deep reinforcement learning (DRL). First, a target intention inference model based on convolution neural networks (CNNs) is built to infer target intentions by fusing urban environment information and observed target trajectory. Then, the prediction of the target trajectory can be inspired by the inferred target intentions, which can further provide effective guidance to the target search process. In order to fully explore the policy space, the target search policy is developed under a DRL framework, where the search policy is modeled as a deep neural network (DNN) and trained by interacting with the task environment. The simulation results show that the inference of the target intentions can effectively guide the UAV to search for the target and significantly improve the target-tracking performance. Meanwhile, the generalization results indicate that the proposed DRL-based search policy has high robustness to the uncertainty of the target behavior.

摘要

无人机已广泛应用于城市目标跟踪任务,其中对规避目标的长期跟踪对公共安全具有重要意义。然而,由于目标的规避行为和城市环境的非结构化特征,被跟踪目标很容易丢失。为了解决这个问题,本文提出了一种基于目标意图推理和深度强化学习(DRL)的混合目标跟踪方法。首先,构建基于卷积神经网络(CNN)的目标意图推理模型,通过融合城市环境信息和观测到的目标轨迹来推断目标意图。然后,目标轨迹的预测可以受到推断出的目标意图的启发,这可以进一步为目标搜索过程提供有效指导。为了充分探索策略空间,在DRL框架下开发目标搜索策略,其中搜索策略被建模为深度神经网络(DNN)并通过与任务环境交互进行训练。仿真结果表明,目标意图的推理可以有效地引导无人机搜索目标,并显著提高目标跟踪性能。同时,泛化结果表明,所提出的基于DRL的搜索策略对目标行为的不确定性具有很高的鲁棒性。

相似文献

1
Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning.基于意图推理和深度强化学习的城市逃避目标长期跟踪
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16886-16900. doi: 10.1109/TNNLS.2023.3298944. Epub 2024 Oct 29.
2
Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking.基于深度强化学习的无人机动态目标跟踪端到端控制
Biomimetics (Basel). 2022 Nov 11;7(4):197. doi: 10.3390/biomimetics7040197.
3
Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging.搜索和跟踪未知数量的目标:一种基于学习并通过地图合并增强的方法。
Sensors (Basel). 2021 Feb 4;21(4):1076. doi: 10.3390/s21041076.
4
MNNMs Integrated Control for UAV Autonomous Tracking Randomly Moving Target Based on Learning Method.基于学习方法的无人机自主跟踪随机移动目标的多神经网络集成控制
Sensors (Basel). 2021 Nov 2;21(21):7307. doi: 10.3390/s21217307.
5
Deep Reinforcement Learning Approach with Multiple Experience Pools for UAV's Autonomous Motion Planning in Complex Unknown Environments.深度强化学习方法与多经验池在复杂未知环境中无人机自主运动规划。
Sensors (Basel). 2020 Mar 29;20(7):1890. doi: 10.3390/s20071890.
6
Proactive Handover Decision for UAVs with Deep Reinforcement Learning.基于深度强化学习的无人机主动交接决策
Sensors (Basel). 2022 Feb 5;22(3):1200. doi: 10.3390/s22031200.
7
UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy.基于深度强化学习策略的无人机自主跟踪与着陆
Sensors (Basel). 2020 Oct 1;20(19):5630. doi: 10.3390/s20195630.
8
Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles.基于强化学习的多无人机安全决策方法研究
Front Neurorobot. 2023 Jan 10;16:1105480. doi: 10.3389/fnbot.2022.1105480. eCollection 2022.
9
Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments.动态多障碍物环境下基于深度强化学习的多无人机同步目标分配与路径规划
Front Neurorobot. 2024 Jan 22;17:1302898. doi: 10.3389/fnbot.2023.1302898. eCollection 2023.
10
Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance.深度强化学习在用于地面监视的无人机群中的应用。
Sensors (Basel). 2023 Oct 27;23(21):8766. doi: 10.3390/s23218766.