• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

搜索和跟踪未知数量的目标:一种基于学习并通过地图合并增强的方法。

Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging.

作者信息

Yan Peng, Jia Tao, Bai Chengchao

机构信息

School of Astronautics, Harbin Institute of Technology, Harbin 150001, China.

Aerospace Technology Research Institute, China Aerodynamics Research and Development Center, Mianyang 621000, China.

出版信息

Sensors (Basel). 2021 Feb 4;21(4):1076. doi: 10.3390/s21041076.

DOI:10.3390/s21041076
PMID:33557359
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7915622/
Abstract

Unmanned aerial vehicles (UAVs) have been widely used in search and rescue (SAR) missions due to their high flexibility. A key problem in SAR missions is to search and track moving targets in an area of interest. In this paper, we focus on the problem of Cooperative Multi-UAV Observation of Multiple Moving Targets (CMUOMMT). In contrast to the existing literature, we not only optimize the average observation rate of the discovered targets, but we also emphasize the fairness of the observation of the discovered targets and the continuous exploration of the undiscovered targets, under the assumption that the total number of targets is unknown. To achieve this objective, a deep reinforcement learning (DRL)-based method is proposed under the Partially Observable Markov Decision Process (POMDP) framework, where each UAV maintains four observation history maps, and maps from different UAVs within a communication range can be merged to enhance UAVs' awareness of the environment. A deep convolutional neural network (CNN) is used to process the merged maps and generate the control commands to UAVs. The simulation results show that our policy can enable UAVs to balance between giving the discovered targets a fair observation and exploring the search region compared with other methods.

摘要

由于具有高度灵活性,无人机已在搜索和救援(SAR)任务中得到广泛应用。SAR任务中的一个关键问题是在感兴趣区域内搜索和跟踪移动目标。在本文中,我们关注多移动目标协同多无人机观测(CMUOMMT)问题。与现有文献不同,在目标总数未知的假设下,我们不仅优化已发现目标的平均观测率,还强调已发现目标观测的公平性以及对未发现目标的持续探索。为实现这一目标,在部分可观测马尔可夫决策过程(POMDP)框架下提出了一种基于深度强化学习(DRL)的方法,其中每架无人机维护四张观测历史地图,通信范围内不同无人机的地图可合并以增强无人机对环境的感知。使用深度卷积神经网络(CNN)处理合并后的地图并生成无人机的控制指令。仿真结果表明,与其他方法相比,我们的策略能使无人机在公平观测已发现目标和探索搜索区域之间取得平衡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/153f4e148f3c/sensors-21-01076-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/b6d13b0e1177/sensors-21-01076-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/bf4df99a878c/sensors-21-01076-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/9d0e0c52d273/sensors-21-01076-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/60b54fe0d9ae/sensors-21-01076-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/0d6c99418b98/sensors-21-01076-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/90ab353ddc62/sensors-21-01076-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/469ae364c879/sensors-21-01076-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/aa2e83635397/sensors-21-01076-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/6c4241e304fb/sensors-21-01076-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/153f4e148f3c/sensors-21-01076-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/b6d13b0e1177/sensors-21-01076-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/bf4df99a878c/sensors-21-01076-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/9d0e0c52d273/sensors-21-01076-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/60b54fe0d9ae/sensors-21-01076-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/0d6c99418b98/sensors-21-01076-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/90ab353ddc62/sensors-21-01076-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/469ae364c879/sensors-21-01076-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/aa2e83635397/sensors-21-01076-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/6c4241e304fb/sensors-21-01076-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/611f/7915622/153f4e148f3c/sensors-21-01076-g010.jpg

相似文献

1
Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging.搜索和跟踪未知数量的目标:一种基于学习并通过地图合并增强的方法。
Sensors (Basel). 2021 Feb 4;21(4):1076. doi: 10.3390/s21041076.
2
Profit-Driven Adaptive Moving Targets Search with UAV Swarms.基于利润的无人机群自适应动目标搜索。
Sensors (Basel). 2019 Mar 30;19(7):1545. doi: 10.3390/s19071545.
3
UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy.基于深度强化学习策略的无人机自主跟踪与着陆
Sensors (Basel). 2020 Oct 1;20(19):5630. doi: 10.3390/s20195630.
4
Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments.动态多障碍物环境下基于深度强化学习的多无人机同步目标分配与路径规划
Front Neurorobot. 2024 Jan 22;17:1302898. doi: 10.3389/fnbot.2023.1302898. eCollection 2023.
5
Proactive Handover Decision for UAVs with Deep Reinforcement Learning.基于深度强化学习的无人机主动交接决策
Sensors (Basel). 2022 Feb 5;22(3):1200. doi: 10.3390/s22031200.
6
Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles.基于强化学习的多无人机安全决策方法研究
Front Neurorobot. 2023 Jan 10;16:1105480. doi: 10.3389/fnbot.2022.1105480. eCollection 2022.
7
Joint UAVs' Load Balancing and UEs' Data Rate Fairness Optimization by Diffusion UAV Deployment Algorithm in Multi-UAV Networks.多无人机网络中基于扩散无人机部署算法的联合无人机负载均衡与用户设备数据速率公平性优化
Entropy (Basel). 2021 Nov 7;23(11):1470. doi: 10.3390/e23111470.
8
Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning.基于意图推理和深度强化学习的城市逃避目标长期跟踪
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16886-16900. doi: 10.1109/TNNLS.2023.3298944. Epub 2024 Oct 29.
9
Enabling UAV Navigation with Sensor and Environmental Uncertainty in Cluttered and GPS-Denied Environments.在复杂且无GPS信号的环境中利用传感器和环境不确定性实现无人机导航
Sensors (Basel). 2016 May 10;16(5):666. doi: 10.3390/s16050666.
10
Autonomous Unmanned Aerial Vehicles in Search and Rescue Missions Using Real-Time Cooperative Model Predictive Control.自主式无人机在搜救任务中的应用——基于实时协同模型预测控制
Sensors (Basel). 2019 Sep 20;19(19):4067. doi: 10.3390/s19194067.

引用本文的文献

1
Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance.深度强化学习在用于地面监视的无人机群中的应用。
Sensors (Basel). 2023 Oct 27;23(21):8766. doi: 10.3390/s23218766.
2
Balancing Collective Exploration and Exploitation in Multi-Agent and Multi-Robot Systems: A Review.多智能体与多机器人系统中集体探索与利用的平衡:综述
Front Robot AI. 2022 Feb 1;8:771520. doi: 10.3389/frobt.2021.771520. eCollection 2021.

本文引用的文献

1
A Framework for Multi-Agent UAV Exploration and Target-Finding in GPS-Denied and Partially Observable Environments.多智能体无人机在 GPS 拒止和部分可观测环境中的探索和目标发现框架。
Sensors (Basel). 2020 Aug 21;20(17):4739. doi: 10.3390/s20174739.
2
Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications.用于多智能体系统的深度强化学习:挑战、解决方案及应用综述
IEEE Trans Cybern. 2020 Sep;50(9):3826-3839. doi: 10.1109/TCYB.2020.2977374. Epub 2020 Mar 20.
3
Autonomous Unmanned Aerial Vehicles in Search and Rescue Missions Using Real-Time Cooperative Model Predictive Control.
自主式无人机在搜救任务中的应用——基于实时协同模型预测控制
Sensors (Basel). 2019 Sep 20;19(19):4067. doi: 10.3390/s19194067.
4
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment.基于深度强化学习的未知环境导航自动探索。
IEEE Trans Neural Netw Learn Syst. 2020 Jun;31(6):2064-2076. doi: 10.1109/TNNLS.2019.2927869. Epub 2019 Aug 6.
5
Profit-Driven Adaptive Moving Targets Search with UAV Swarms.基于利润的无人机群自适应动目标搜索。
Sensors (Basel). 2019 Mar 30;19(7):1545. doi: 10.3390/s19071545.
6
Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm.基于改进共生生物搜索算法的多无人机异构目标侦察任务分配
Sensors (Basel). 2019 Feb 12;19(3):734. doi: 10.3390/s19030734.
7
Mastering the game of Go without human knowledge.无需人类知识即可掌握围棋游戏。
Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.
8
Cooperative Robots to Observe Moving Targets: Review.协作机器人对移动目标的观测:综述。
IEEE Trans Cybern. 2018 Jan;48(1):187-198. doi: 10.1109/TCYB.2016.2628161. Epub 2016 Dec 1.
9
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.
10
Multi-agent cooperative target search.多智能体协作目标搜索
Sensors (Basel). 2014 May 26;14(6):9408-28. doi: 10.3390/s140609408.