Xu Shu, Liu Qingjie, Gong Chengye, Wen Xupeng
China Nanhu Academy of Electronic and Information Technology, Jiaxing 314001, China.
Sensors (Basel). 2025 May 28;25(11):3403. doi: 10.3390/s25113403.
The integration of Unmanned Aerial Vehicles (UAVs) into Mobile Edge Computing (MEC) systems has emerged as a transformative solution for latency-sensitive applications, leveraging UAVs' unique advantages in mobility, flexible deployment, and on-demand service provisioning. This paper proposes a novel multi-agent reinforcement learning framework, termed Multi-Agent Twin Delayed Deep Deterministic Policy Gradient for Task Offloading and Resource Allocation (MATD3-TORA), to optimize task offloading and resource allocation in UAV-assisted MEC networks. The framework enables collaborative decision making among multiple UAVs to efficiently serve sparsely distributed ground mobile devices (MDs) and establish an integrated mobility, communication, and computational offloading model, which formulates a joint optimization problem aimed at minimizing the weighted sum of task processing latency and UAV energy consumption. Extensive experiments demonstrate that the algorithm achieves improvements in system latency and energy efficiency compared to conventional approaches. The results highlight MATD3-TORA's effectiveness in addressing UAV-MEC challenges, including mobility-energy tradeoffs, distributed decision making, and real-time resource allocation.
将无人机(UAV)集成到移动边缘计算(MEC)系统中,已成为对延迟敏感型应用的一种变革性解决方案,它利用了无人机在移动性、灵活部署和按需服务提供方面的独特优势。本文提出了一种新颖的多智能体强化学习框架,称为用于任务卸载和资源分配的多智能体双延迟深度确定性策略梯度(MATD3-TORA),以优化无人机辅助的MEC网络中的任务卸载和资源分配。该框架能够在多个无人机之间进行协作决策,以高效服务稀疏分布的地面移动设备(MD),并建立一个集成的移动性、通信和计算卸载模型,该模型制定了一个联合优化问题,旨在最小化任务处理延迟和无人机能量消耗的加权总和。大量实验表明,与传统方法相比,该算法在系统延迟和能源效率方面取得了改进。结果突出了MATD3-TORA在应对无人机-MEC挑战方面的有效性,包括移动性-能量权衡、分布式决策和实时资源分配。