• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

移动网络中基于深度确定性策略梯度的考虑网络切片和设备到设备通信的资源分配

Deep Deterministic Policy Gradient-Based Resource Allocation Considering Network Slicing and Device-to-Device Communication in Mobile Networks.

作者信息

de Souza Lopes Hudson Henrique, Ferreira Lima Lucas Jose, de Lima Soares Telma Woerle, Teles Vieira Flávio Henrique

机构信息

Electrical, Mechanical and Computer (EMC) School of Engineering, Federal University of Goias (UFG), Goiânia 74605010, GO, Brazil.

Advanced Knowledge Center for Immersive Technologies (AKCIT), Federal University of Goias (UFG), Goiânia 74605010, GO, Brazil.

出版信息

Sensors (Basel). 2024 Sep 20;24(18):6079. doi: 10.3390/s24186079.

DOI:10.3390/s24186079
PMID:39338825
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11436155/
Abstract

Next-generation mobile networks, such as those beyond the 5th generation (B5G) and 6th generation (6G), have diverse network resource demands. Network slicing (NS) and device-to-device (D2D) communication have emerged as promising solutions for network operators. NS is a candidate technology for this scenario, where a single network infrastructure is divided into multiple (virtual) slices to meet different service requirements. Combining D2D and NS can improve spectrum utilization, providing better performance and scalability. This paper addresses the challenging problem of dynamic resource allocation with wireless network slices and D2D communications using deep reinforcement learning (DRL) techniques. More specifically, we propose an approach named DDPG-KRP based on deep deterministic policy gradient (DDPG) with K-nearest neighbors (KNNs) and reward penalization (RP) for undesirable action elimination to determine the resource allocation policy maximizing long-term rewards. The simulation results show that the DDPG-KRP is an efficient solution for resource allocation in wireless networks with slicing, outperforming other considered DRL algorithms.

摘要

下一代移动网络,如第五代以上(B5G)和第六代(6G)网络,具有多样化的网络资源需求。网络切片(NS)和设备到设备(D2D)通信已成为网络运营商颇具前景的解决方案。NS是这种场景下的一种候选技术,即单个网络基础设施被划分为多个(虚拟)切片以满足不同的服务需求。将D2D和NS相结合可以提高频谱利用率,提供更好的性能和可扩展性。本文使用深度强化学习(DRL)技术解决了无线网络切片和D2D通信中的动态资源分配这一具有挑战性的问题。更具体地说,我们提出了一种名为DDPG-KRP的方法,该方法基于深度确定性策略梯度(DDPG),结合K近邻(KNN)和奖励惩罚(RP)来消除不良行为,以确定使长期奖励最大化的资源分配策略。仿真结果表明,DDPG-KRP是无线网络切片资源分配的一种有效解决方案,优于其他考虑的DRL算法。

相似文献

1
Deep Deterministic Policy Gradient-Based Resource Allocation Considering Network Slicing and Device-to-Device Communication in Mobile Networks.移动网络中基于深度确定性策略梯度的考虑网络切片和设备到设备通信的资源分配
Sensors (Basel). 2024 Sep 20;24(18):6079. doi: 10.3390/s24186079.
2
Two Tier Slicing Resource Allocation Algorithm Based on Deep Reinforcement Learning and Joint Bidding in Wireless Access Networks.基于深度强化学习和联合竞价的无线接入网络双层切片资源分配算法
Sensors (Basel). 2022 May 4;22(9):3495. doi: 10.3390/s22093495.
3
A Power Allocation Scheme for MIMO-NOMA and D2D Vehicular Edge Computing Based on Decentralized DRL.一种基于去中心化 DRL 的 MIMO-NOMA 和 D2D 车联网边缘计算的功率分配方案。
Sensors (Basel). 2023 Mar 25;23(7):3449. doi: 10.3390/s23073449.
4
Deep Reinforcement Learning Based Resource Allocation for D2D Communications Underlay Cellular Networks.基于深度强化学习的蜂窝网络中 D2D 通信的资源分配。
Sensors (Basel). 2022 Dec 3;22(23):9459. doi: 10.3390/s22239459.
5
Deep Reinforcement Learning for Resource Management on Network Slicing: A Survey.深度强化学习在网络切片资源管理中的应用研究综述。
Sensors (Basel). 2022 Apr 15;22(8):3031. doi: 10.3390/s22083031.
6
A Self-Regulating Power-Control Scheme Using Reinforcement Learning for D2D Communication Networks.一种用于D2D通信网络的基于强化学习的自调节功率控制方案。
Sensors (Basel). 2022 Jun 29;22(13):4894. doi: 10.3390/s22134894.
7
Spectrum-efficient user grouping and resource allocation based on deep reinforcement learning for mmWave massive MIMO-NOMA systems.基于深度强化学习的毫米波大规模MIMO-NOMA系统频谱高效用户分组与资源分配
Sci Rep. 2024 Apr 17;14(1):8884. doi: 10.1038/s41598-024-59241-x.
8
Deep-Learning-Based Resource Allocation for Time-Sensitive Device-to-Device Networks.基于深度学习的对时间敏感的设备到设备网络的资源分配
Sensors (Basel). 2022 Feb 17;22(4):1551. doi: 10.3390/s22041551.
9
Deep deterministic policy gradient algorithm: A systematic review.深度确定性策略梯度算法:一项系统综述。
Heliyon. 2024 May 7;10(9):e30697. doi: 10.1016/j.heliyon.2024.e30697. eCollection 2024 May 15.
10
BENS-B5G: Blockchain-Enabled Network Slicing in 5G and Beyond-5G (B5G) Networks.BENS-B5G:5G 及 beyond-5G(B5G)网络中的基于区块链的网络切片。
Sensors (Basel). 2022 Aug 14;22(16):6068. doi: 10.3390/s22166068.

本文引用的文献

1
Scalable Nearest Neighbor Algorithms for High Dimensional Data.高维数据的可扩展最近邻算法。
IEEE Trans Pattern Anal Mach Intell. 2014 Nov;36(11):2227-40. doi: 10.1109/TPAMI.2014.2321376.