• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习的延迟受限非正交多址接入卫星网络用户配对

User Pairing for Delay-Limited NOMA-Based Satellite Networks with Deep Reinforcement Learning.

作者信息

Zhang Qianfeng, An Kang, Yan Xiaojuan, Xi Hongxia, Wang Yuli

机构信息

Guangxi Key Laboratory of Ocean Engineering Equipment and Technology, Qinzhou 535011, China.

Key Laboratory of Beibu Gulf Offshore Engineering Equipment and Technology (Beibu Gulf University), Education Department of Guangxi Zhuang Autonomous Region, Qinzhou 535011, China.

出版信息

Sensors (Basel). 2023 Aug 9;23(16):7062. doi: 10.3390/s23167062.

DOI:10.3390/s23167062
PMID:37631599
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10459489/
Abstract

In this paper, we investigate a user pairing problem in power domain non-orthogonal multiple access (NOMA) scheme-aided satellite networks. In the considered scenario, different satellite applications are assumed with various delay quality-of-service (QoS) requirements, and the concept of effective capacity is employed to characterize the effect of delay QoS limitations on achieved performance. Based on this, our objective was to select users to form a NOMA user pair and utilize resource efficiently. To this end, a power allocation coefficient was firstly obtained by ensuring that the achieved capacity of users with sensitive delay QoS requirements was not less than that achieved with an orthogonal multiple access (OMA) scheme. Then, considering that user selection in a delay-limited NOMA-based satellite network is intractable and non-convex, a deep reinforcement learning (DRL) algorithm was employed for dynamic user selection. Specifically, channel conditions and delay QoS requirements of users were carefully selected as state, and a DRL algorithm was used to search for the optimal user who could achieve the maximum performance with the power allocation factor, to pair with the delay QoS-sensitive user to form a NOMA user pair for each state. Simulation results are provided to demonstrate that the proposed DRL-based user selection scheme can output the optimal action in each time slot and, thus, provide superior performance than that achieved with a random selection strategy and OMA scheme.

摘要

在本文中,我们研究了功率域非正交多址接入(NOMA)方案辅助卫星网络中的用户配对问题。在考虑的场景中,假设不同的卫星应用具有各种延迟服务质量(QoS)要求,并采用有效容量的概念来表征延迟QoS限制对所实现性能的影响。基于此,我们的目标是选择用户形成NOMA用户对并有效利用资源。为此,首先通过确保具有敏感延迟QoS要求的用户所实现的容量不小于通过正交多址接入(OMA)方案所实现的容量来获得功率分配系数。然后,考虑到基于延迟受限的NOMA卫星网络中的用户选择是棘手且非凸的,采用深度强化学习(DRL)算法进行动态用户选择。具体而言,仔细选择用户的信道条件和延迟QoS要求作为状态,并使用DRL算法搜索能够在功率分配因子下实现最大性能的最优用户,以便与延迟QoS敏感用户配对,为每个状态形成一个NOMA用户对。提供的仿真结果表明,所提出的基于DRL的用户选择方案可以在每个时隙输出最优动作,因此,提供比随机选择策略和OMA方案所实现的性能更优的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/77d60087b513/sensors-23-07062-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/966a8de0e263/sensors-23-07062-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/6b95b0634aa5/sensors-23-07062-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/42571f9423d3/sensors-23-07062-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/98b0fb022a0e/sensors-23-07062-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/4b93504dccad/sensors-23-07062-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/77d60087b513/sensors-23-07062-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/966a8de0e263/sensors-23-07062-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/6b95b0634aa5/sensors-23-07062-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/42571f9423d3/sensors-23-07062-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/98b0fb022a0e/sensors-23-07062-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/4b93504dccad/sensors-23-07062-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e8c0/10459489/77d60087b513/sensors-23-07062-g006.jpg

相似文献

1
User Pairing for Delay-Limited NOMA-Based Satellite Networks with Deep Reinforcement Learning.基于深度强化学习的延迟受限非正交多址接入卫星网络用户配对
Sensors (Basel). 2023 Aug 9;23(16):7062. doi: 10.3390/s23167062.
2
DRL-Assisted Resource Allocation for NOMA-MEC Offloading with Hybrid SIC.基于混合连续干扰消除的非正交多址边缘计算卸载的深度强化学习辅助资源分配
Entropy (Basel). 2021 May 14;23(5):613. doi: 10.3390/e23050613.
3
Dual Dynamic Scheduling for Hierarchical QoS in Uplink-NOMA: A Reinforcement Learning Approach.上行非正交多址接入中用于分层QoS的双动态调度:一种强化学习方法
Sensors (Basel). 2021 Jun 27;21(13):4404. doi: 10.3390/s21134404.
4
Reinforcement Learning-Based Joint User Pairing and Power Allocation in MIMO-NOMA Systems.基于强化学习的MIMO-NOMA系统联合用户配对与功率分配
Sensors (Basel). 2020 Dec 11;20(24):7094. doi: 10.3390/s20247094.
5
Simplified Deep Reinforcement Learning Approach for Channel Prediction in Power Domain NOMA System.功率域非正交多址接入(NOMA)系统中用于信道预测的简化深度强化学习方法
Sensors (Basel). 2023 Nov 6;23(21):9010. doi: 10.3390/s23219010.
6
Dynamic Divide Grouping Non-Orthogonal Multiple Access in Terrestrial-Satellite Integrated Network.动态划分群组非正交多址接入技术在天地一体化网络中的应用。
Sensors (Basel). 2021 Sep 16;21(18):6199. doi: 10.3390/s21186199.
7
Spectrum-efficient user grouping and resource allocation based on deep reinforcement learning for mmWave massive MIMO-NOMA systems.基于深度强化学习的毫米波大规模MIMO-NOMA系统频谱高效用户分组与资源分配
Sci Rep. 2024 Apr 17;14(1):8884. doi: 10.1038/s41598-024-59241-x.
8
On the Achievable Max-Min User Rates in Multi-Carrier Centralized NOMA-VLC Networks.多载波集中式非正交多址可见光通信网络中可实现的最大最小用户速率
Sensors (Basel). 2021 May 26;21(11):3705. doi: 10.3390/s21113705.
9
Joint User-Slice Pairing and Association Framework Based on H-NOMA in RAN Slicing.基于 RAN 切片中的 H-NOMA 的联合用户-切片配对和关联框架。
Sensors (Basel). 2022 Sep 27;22(19):7343. doi: 10.3390/s22197343.
10
Deep Reinforcement Learning-Based Resource Allocation for Satellite Internet of Things with Diverse QoS Guarantee.基于深度强化学习的具有多样化QoS保障的卫星物联网资源分配
Sensors (Basel). 2022 Apr 13;22(8):2979. doi: 10.3390/s22082979.

引用本文的文献

1
Joint Optimization of Radio and Computational Resource Allocation in Uplink NOMA-Based Remote State Estimation.基于上行非正交多址接入的远程状态估计中无线和计算资源分配的联合优化
Sensors (Basel). 2025 Jul 29;25(15):4686. doi: 10.3390/s25154686.