• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于联邦多智能体深度强化学习的车载网络通信资源分配方法

Communication resource allocation method in vehicular networks based on federated multi-agent deep reinforcement learning.

作者信息

Liu Qingli, Ma Yongjie

机构信息

Key Laboratory of Communication and Network, Dalian University, Dalian, 116622, China.

School of Information Engineering, Dalian University, Dalian, 116622, China.

出版信息

Sci Rep. 2025 Aug 22;15(1):30866. doi: 10.1038/s41598-025-15982-x.

DOI:10.1038/s41598-025-15982-x
PMID:40846743
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12373776/
Abstract

In highly dynamic vehicular networking scenarios, when Vehicle-to-Infrastructure links and Vehicle-to-Vehicle links share spectrum resources, the traditional distributed resource allocation method lacks global optimization and fails to respond to environmental changes in a timely manner, which leads to low spectral efficiency of the system. A resource allocation method based on federated multi-agent deep reinforcement learning is proposed for Vehicular Networking communication, by fusing Asynchronous Federated Learning (AFL) and Multi-Agent Deep Deterministic Policy Gradient (MADDPG). Synergistic optimization of resource allocation. First, vehicles as agent dynamically optimize spectrum access, power control, and bandwidth allocation based on local channel states through the collaborative policy of MADDPG to reduce cross-link interference. Second, the asynchronous federation architecture is designed, where vehicles independently upload local model parameters to the global server, dynamically adjust the aggregation weights according to the real-time channel quality, and optimize the update of global model parameters. Finally, the global model parameters are fed back to the vehicles to further optimize the local resource allocation strategy, thus improving the system spectrum efficiency. The simulation results show that the system spectrum efficiency is improved by 19.1% on average compared with the centralized DDPG, MADDPG, MAPPO and FL-DuelingDQN algorithms in the Vehicle Networking scenario, while the transmission success rate of the V2V link is improved by 9.3% on average, and the total capacity of the V2I link is increased by 16.1% on average.

摘要

在高度动态的车载网络场景中,当车与基础设施链路和车与车链路共享频谱资源时,传统的分布式资源分配方法缺乏全局优化,无法及时响应环境变化,导致系统频谱效率低下。针对车载网络通信,提出了一种基于联邦多智能体深度强化学习的资源分配方法,该方法融合了异步联邦学习(AFL)和多智能体深度确定性策略梯度(MADDPG),对资源分配进行协同优化。首先,车辆作为智能体基于本地信道状态,通过MADDPG的协同策略动态优化频谱接入、功率控制和带宽分配,以减少交叉链路干扰。其次,设计了异步联邦架构,车辆独立将本地模型参数上传到全局服务器,根据实时信道质量动态调整聚合权重,优化全局模型参数的更新。最后,将全局模型参数反馈给车辆,进一步优化本地资源分配策略,从而提高系统频谱效率。仿真结果表明,在车载网络场景中,与集中式DDPG、MADDPG、MAPPO和FL-DuelingDQN算法相比,该系统频谱效率平均提高了19.1%,同时车对车链路的传输成功率平均提高了9.3%,车对基础设施链路的总容量平均增加了16.1%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/1c8726fc41fa/41598_2025_15982_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/7da50e48c702/41598_2025_15982_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/aa1102f56154/41598_2025_15982_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/04857e49c8b0/41598_2025_15982_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/1a5c9223a94b/41598_2025_15982_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/522a1a87a300/41598_2025_15982_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/33469af15af9/41598_2025_15982_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/9352b42efdf5/41598_2025_15982_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/1c8726fc41fa/41598_2025_15982_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/7da50e48c702/41598_2025_15982_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/aa1102f56154/41598_2025_15982_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/04857e49c8b0/41598_2025_15982_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/1a5c9223a94b/41598_2025_15982_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/522a1a87a300/41598_2025_15982_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/33469af15af9/41598_2025_15982_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/9352b42efdf5/41598_2025_15982_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3393/12373776/1c8726fc41fa/41598_2025_15982_Fig7_HTML.jpg

相似文献

1
Communication resource allocation method in vehicular networks based on federated multi-agent deep reinforcement learning.基于联邦多智能体深度强化学习的车载网络通信资源分配方法
Sci Rep. 2025 Aug 22;15(1):30866. doi: 10.1038/s41598-025-15982-x.
2
Data prioritization aware resource allocation in internet of vehicles using multi-agent deep reinforcement learning.基于多智能体深度强化学习的车联网中数据优先级感知资源分配
Neural Netw. 2025 Oct;190:107671. doi: 10.1016/j.neunet.2025.107671. Epub 2025 Jun 6.
3
Dynamic Allocation of C-V2X Communication Resources Based on Graph Attention Network and Deep Reinforcement Learning.基于图注意力网络和深度强化学习的C-V2X通信资源动态分配
Sensors (Basel). 2025 Aug 21;25(16):5209. doi: 10.3390/s25165209.
4
Research on Computation Offloading and Resource Allocation Strategy Based on MADDPG for Integrated Space-Air-Marine Network.基于多智能体深度确定性策略梯度算法的空天海一体化网络计算卸载与资源分配策略研究
Entropy (Basel). 2025 Jul 28;27(8):803. doi: 10.3390/e27080803.
5
A Lightweight Intrusion Detection System with Dynamic Feature Fusion Federated Learning for Vehicular Network Security.一种用于车载网络安全的具有动态特征融合联邦学习的轻量级入侵检测系统。
Sensors (Basel). 2025 Jul 25;25(15):4622. doi: 10.3390/s25154622.
6
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
7
Research on channel estimation based on joint perception and deep enhancement learning in complex communication scenarios.复杂通信场景下基于联合感知与深度强化学习的信道估计研究
PeerJ Comput Sci. 2025 May 8;11:e2852. doi: 10.7717/peerj-cs.2852. eCollection 2025.
8
Intelligent Priority-Aware Spectrum Access in 5G Vehicular IoT: A Reinforcement Learning Approach.5G车联网物联网中的智能优先级感知频谱接入:一种强化学习方法
Sensors (Basel). 2025 Jul 23;25(15):4554. doi: 10.3390/s25154554.
9
Distributed Collaborative Data Processing Framework for Unmanned Platforms Based on Federated Edge Intelligence.基于联邦边缘智能的无人平台分布式协作数据处理框架
Sensors (Basel). 2025 Aug 1;25(15):4752. doi: 10.3390/s25154752.
10
SFPGCL: Specificity-preserving federated population graph contrastive learning for multi-site ASD identification using rs-fMRI data.SFPGCL:使用静息态功能磁共振成像数据进行多站点自闭症谱系障碍识别的特异性保持联邦群体图对比学习
Comput Med Imaging Graph. 2025 Sep;124:102558. doi: 10.1016/j.compmedimag.2025.102558. Epub 2025 May 16.

引用本文的文献

1
A Concept for Bio-Agentic Visual Communication: Bridging Swarm Intelligence with Biological Analogues.生物智能视觉通信的概念:将群体智能与生物类似物相联系
Biomimetics (Basel). 2025 Sep 9;10(9):605. doi: 10.3390/biomimetics10090605.

本文引用的文献

1
Energy-Efficient Resource Allocation Based on Deep Q-Network in V2V Communications.基于深度 Q 网络的车对车通信中的节能资源分配。
Sensors (Basel). 2023 Jan 23;23(3):1295. doi: 10.3390/s23031295.