• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基于奖励的多智能体学习实现无人机通信网络中的分布式资源分配

Decentralized resource allocation in UAV communication networks through reward based multi agent learning.

作者信息

Shoaib Muhammad, Husnain Ghassan, Khan Muhsin, Ghadi Yazeed Yasin, Lim Sangsoon

机构信息

Department of Computer Science, CECOS University of IT and Emerging Sciences, Peshawar, 25100, Pakistan.

Department of Computer Science, Iqra National University, Peshawar, 25100, Pakistan.

出版信息

Sci Rep. 2025 Sep 26;15(1):33122. doi: 10.1038/s41598-025-18353-8.

DOI:10.1038/s41598-025-18353-8
PMID:41006638
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12475095/
Abstract

Unmanned aerial vehicles (UAVs) used as aerial base stations (ABS) can provide economical, on-demand wireless access. This research investigates dynamic resource allocation in multi-UAV-enabled communication systems with the aim of maximizing long-term rewards. More specifically, without exchanging information with other UAVs, every UAV chooses its communicating users, power levels, and sub-channels to establish communication with a ground user. In the proposed work, the dynamic scheme-based resource allocation is investigated of communication networks made possible by many UAVs to achieve the highest possible performance level over time. Specifically, each UAV selects its connected users, battery power, and communication channel independently, without exchanging information across multiple UAVs. This allows each UAV to connect with ground users. To model the unpredictability of the environment, we present the problem of long-term allocation of system resources as a stochastic game to maximize the anticipated reward. Each UAV in this game plays the role of a learnable agent, and the system solution for resource allocation matches the actions made by the UAV. Afterward, we built a framework called reward-based multi-agent learning (RMAL), in which each agent uses learning to identify its best strategies based on local observations. RMAL is an acronym for ″reward-based multi-agent learning″. We specifically offer an agent-independent strategy where each agent decides algorithms separately but cooperates on a common Q-learning-based framework. The performance of the suggested RMAL-based resource allocation method may be enhanced by employing the right development and exploration parameters, according to the simulation findings. Secondly, the proposed RMAL algorithm provides acceptable performance over full information exchange between UAVs. Doing so achieves a satisfactory compromise between the increase in performance and the additional burden of information transmission.

摘要

用作空中基站(ABS)的无人机(UAV)可以提供经济、按需的无线接入。本研究调查了多无人机通信系统中的动态资源分配,旨在最大化长期奖励。更具体地说,在不与其他无人机交换信息的情况下,每架无人机选择其通信用户、功率水平和子信道,以与地面用户建立通信。在这项拟议的工作中,研究了由多架无人机实现的通信网络基于动态方案的资源分配,以便随着时间的推移实现尽可能高的性能水平。具体而言,每架无人机独立选择其连接的用户、电池电量和通信信道,而不跨多架无人机交换信息。这使得每架无人机能够与地面用户建立连接。为了模拟环境的不可预测性,我们将系统资源的长期分配问题表示为一个随机博弈,以最大化预期奖励。此博弈中的每架无人机都扮演一个可学习智能体的角色,资源分配的系统解决方案与无人机采取的行动相匹配。之后,我们构建了一个名为基于奖励的多智能体学习(RMAL)的框架,其中每个智能体基于局部观测使用学习来识别其最佳策略。RMAL是“基于奖励的多智能体学习”的首字母缩写。我们特别提供了一种与智能体无关的策略,其中每个智能体分别决定算法,但在一个基于Q学习的通用框架上进行协作。根据仿真结果,通过采用合适的开发和探索参数,可以提高所建议的基于RMAL的资源分配方法的性能。其次,所提出的RMAL算法在无人机之间进行完全信息交换时提供了可接受的性能。这样做在性能提升和信息传输的额外负担之间达成了令人满意的折衷。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/973848ef1ccd/41598_2025_18353_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/89559cd29bf0/41598_2025_18353_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/4c01853c96ab/41598_2025_18353_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/000812efb1e7/41598_2025_18353_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/2af599170d6c/41598_2025_18353_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/ed147d0cf4c8/41598_2025_18353_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/ea534b653ed2/41598_2025_18353_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/4e8b9c8e4fa2/41598_2025_18353_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/973848ef1ccd/41598_2025_18353_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/89559cd29bf0/41598_2025_18353_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/4c01853c96ab/41598_2025_18353_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/000812efb1e7/41598_2025_18353_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/2af599170d6c/41598_2025_18353_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/ed147d0cf4c8/41598_2025_18353_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/ea534b653ed2/41598_2025_18353_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/4e8b9c8e4fa2/41598_2025_18353_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9aae/12475095/973848ef1ccd/41598_2025_18353_Fig8_HTML.jpg

相似文献

1
Decentralized resource allocation in UAV communication networks through reward based multi agent learning.通过基于奖励的多智能体学习实现无人机通信网络中的分布式资源分配
Sci Rep. 2025 Sep 26;15(1):33122. doi: 10.1038/s41598-025-18353-8.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Shoulder Arthrogram肩关节造影
4
Short-Term Memory Impairment短期记忆障碍
5
Vesicoureteral Reflux膀胱输尿管反流
6
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
7
Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划:一项混合方法研究。
Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.
8
Mid Forehead Brow Lift额中眉提升术
9
Energy-Efficient Multi-Agent Deep Reinforcement Learning Task Offloading and Resource Allocation for UAV Edge Computing.用于无人机边缘计算的高效节能多智能体深度强化学习任务卸载与资源分配
Sensors (Basel). 2025 May 28;25(11):3403. doi: 10.3390/s25113403.
10
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.

本文引用的文献

1
Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks.用于无人机支持的物联网网络中地空通信规划的多智能体深度强化学习
Sensors (Basel). 2024 Oct 10;24(20):6535. doi: 10.3390/s24206535.