• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于无人机路径规划问题的节能多智能体深度强化学习算法

Energy-Saving Multi-Agent Deep Reinforcement Learning Algorithm for Drone Routing Problem.

作者信息

Shu Xiulan, Lin Anping, Wen Xupeng

机构信息

School of Intelligent Manufacturing Engineering, Zhanjiang University of Science and Technology, Zhanjiang 524000, China.

School of Physics and Electronic Electrical Engineering, Xiangnan University, Chenzhou 423000, China.

出版信息

Sensors (Basel). 2024 Oct 18;24(20):6698. doi: 10.3390/s24206698.

DOI:10.3390/s24206698
PMID:39460178
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11511266/
Abstract

With the rapid advancement of drone technology, the efficient distribution of drones has garnered significant attention. Central to this discourse is the energy consumption of drones, a critical metric for assessing energy-efficient distribution strategies. Accordingly, this study delves into the energy consumption factors affecting drone distribution. A primary challenge in drone distribution lies in devising optimal, energy-efficient routes for drones. However, traditional routing algorithms, predominantly heuristic-based, exhibit certain limitations. These algorithms often rely on heuristic rules and expert knowledge, which can constrain their ability to escape local optima. Motivated by these shortcomings, we propose a novel multi-agent deep reinforcement learning algorithm that integrates a drone energy consumption model, namely EMADRL. The EMADRL algorithm first formulates the drone routing problem within a multi-agent reinforcement learning framework. It subsequently designs a strategy network model comprising multiple agent networks, tailored to address the node adjacency and masking complexities typical of multi-depot vehicle routing problem. Training utilizes strategy gradient algorithms and attention mechanisms. Furthermore, local and sampling search strategies are introduced to enhance solution quality. Extensive experimentation demonstrates that EMADRL consistently achieves high-quality solutions swiftly. A comparative analysis against contemporary algorithms reveals EMADRL's superior energy efficiency, with average energy savings of 5.96% and maximum savings reaching 12.45%. Thus, this approach offers a promising new avenue for optimizing energy consumption in last-mile distribution scenarios.

摘要

随着无人机技术的迅速发展,无人机的高效配送已引起广泛关注。在这一讨论的核心是无人机的能量消耗,这是评估节能配送策略的关键指标。因此,本研究深入探讨了影响无人机配送的能量消耗因素。无人机配送的一个主要挑战在于为无人机设计最优的节能路线。然而,传统的路由算法主要基于启发式算法,存在一定的局限性。这些算法通常依赖于启发式规则和专家知识,这可能限制它们逃离局部最优的能力。受这些缺点的启发,我们提出了一种新颖的多智能体深度强化学习算法,该算法集成了无人机能量消耗模型,即EMADRL。EMADRL算法首先在多智能体强化学习框架内制定无人机路由问题。随后,它设计了一个由多个智能体网络组成的策略网络模型,以解决多仓库车辆路由问题中典型的节点邻接和掩码复杂性。训练使用策略梯度算法和注意力机制。此外,还引入了局部和采样搜索策略以提高解决方案质量。大量实验表明,EMADRL能够迅速持续地实现高质量的解决方案。与当代算法的对比分析表明,EMADRL具有更高的能源效率,平均节能5.96%,最大节能达到12.45%。因此,这种方法为优化最后一英里配送场景中的能量消耗提供了一条有前景的新途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f747/11511266/4e6841c0b591/sensors-24-06698-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f747/11511266/4e6841c0b591/sensors-24-06698-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f747/11511266/4e6841c0b591/sensors-24-06698-g001.jpg

相似文献

1
Energy-Saving Multi-Agent Deep Reinforcement Learning Algorithm for Drone Routing Problem.用于无人机路径规划问题的节能多智能体深度强化学习算法
Sensors (Basel). 2024 Oct 18;24(20):6698. doi: 10.3390/s24206698.
2
Cooperatively Routing a Truck and Multiple Drones for Target Surveillance.协同规划卡车和多架无人机进行目标监测。
Sensors (Basel). 2022 Apr 10;22(8):2909. doi: 10.3390/s22082909.
3
Countering a Drone in a 3D Space: Analyzing Deep Reinforcement Learning Methods.在三维空间中对抗无人机:分析深度强化学习方法。
Sensors (Basel). 2022 Nov 16;22(22):8863. doi: 10.3390/s22228863.
4
A Coordinated Vehicle-Drone Arc Routing Approach Based on Improved Adaptive Large Neighborhood Search.基于改进型自适应大邻域搜索的车-无人机协同弧路由方法
Sensors (Basel). 2022 May 12;22(10):3702. doi: 10.3390/s22103702.
5
Intelligent Drone Swarms to Search for Victims in Post-Disaster Areas.智能无人机群将在灾区搜寻遇难者。
Sensors (Basel). 2023 Nov 30;23(23):9540. doi: 10.3390/s23239540.
6
Toward Energy-Efficient Routing of Multiple AGVs with Multi-Agent Reinforcement Learning.多 AGV 的多智能体强化学习能量高效路由
Sensors (Basel). 2023 Jun 15;23(12):5615. doi: 10.3390/s23125615.
7
Drone routing problem model for last-mile delivery using the public transportation capacity as moving charging stations.以公共交通运力作为移动充电站的最后一英里配送无人机路径规划问题模型
Sci Rep. 2022 Apr 15;12(1):6361. doi: 10.1038/s41598-022-10408-4.
8
Multi-Objective Optimization of Energy Saving and Throughput in Heterogeneous Networks Using Deep Reinforcement Learning.使用深度强化学习进行异构网络中的节能和吞吐量的多目标优化。
Sensors (Basel). 2021 Nov 27;21(23):7925. doi: 10.3390/s21237925.
9
Drone Routing for Drone-Based Delivery Systems: A Review of Trajectory Planning, Charging, and Security.基于无人机的投递系统的无人机路径规划:轨迹规划、充电和安全综述。
Sensors (Basel). 2023 Jan 28;23(3):1463. doi: 10.3390/s23031463.
10
Algorithms for Delivery of Data by Drones in an Isolated Area Divided into Squares.在一个划分为正方形的孤立区域中,使用无人机传输数据的算法。
Sensors (Basel). 2021 Aug 13;21(16):5472. doi: 10.3390/s21165472.

本文引用的文献

1
Intelligent Drone Swarms to Search for Victims in Post-Disaster Areas.智能无人机群将在灾区搜寻遇难者。
Sensors (Basel). 2023 Nov 30;23(23):9540. doi: 10.3390/s23239540.
2
A Novel Vectorized Curved Road Representation Based Aerial Guided Unmanned Vehicle Trajectory Planning.一种基于新型矢量化曲线道路表示的空中引导无人驾驶车辆轨迹规划
Sensors (Basel). 2023 Aug 21;23(16):7305. doi: 10.3390/s23167305.
3
Drone Routing for Drone-Based Delivery Systems: A Review of Trajectory Planning, Charging, and Security.基于无人机的投递系统的无人机路径规划:轨迹规划、充电和安全综述。
Sensors (Basel). 2023 Jan 28;23(3):1463. doi: 10.3390/s23031463.
4
Cooperatively Routing a Truck and Multiple Drones for Target Surveillance.协同规划卡车和多架无人机进行目标监测。
Sensors (Basel). 2022 Apr 10;22(8):2909. doi: 10.3390/s22082909.
5
Flight Planning Optimization of Multiple UAVs for Internet of Things.用于物联网的多架无人机飞行计划优化
Sensors (Basel). 2021 Nov 20;21(22):7735. doi: 10.3390/s21227735.
6
Learning Improvement Heuristics for Solving Routing Problems.用于解决路由问题的学习改进启发式算法
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):5057-5069. doi: 10.1109/TNNLS.2021.3068828. Epub 2022 Aug 31.
7
Mastering the game of Go without human knowledge.无需人类知识即可掌握围棋游戏。
Nature. 2017 Oct 18;550(7676):354-359. doi: 10.1038/nature24270.
8
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.