• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度强化学习的低地球轨道卫星智能分层准入控制

Intelligent Hierarchical Admission Control for Low-Earth Orbit Satellites Based on Deep Reinforcement Learning.

作者信息

Wei Debin, Guo Chuanqi, Yang Li

机构信息

Communication and Network Laboratory, Dalian University, Dalian 116622, China.

School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China.

出版信息

Sensors (Basel). 2023 Oct 14;23(20):8470. doi: 10.3390/s23208470.

DOI:10.3390/s23208470
PMID:37896563
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10611023/
Abstract

Low-Earth orbit (LEO) satellites have limited on-board resources, user terminals are unevenly distributed in the constantly changing coverage area, and the service requirements vary significantly. It is urgent to optimize resource allocation under the constraint of limited satellite spectrum resources and ensure the fairness of service admission control. Therefore, we propose an intelligent hierarchical admission control (IHAC) strategy based on deep reinforcement learning (DRL). This strategy combines the deep deterministic policy gradient (DDPG) and the deep Q network (DQN) intelligent algorithm to construct upper and lower hierarchical resource allocation and admission control frameworks. The upper controller considers the state features of each ground zone and satellite resources from a global perspective, and determines the beam resource allocation ratio of each ground zone. The lower controller formulates the admission control policy based on the decision of the upper controller and the detailed information of the users' services. At the same time, a designed reward and punishment mechanism is used to optimize the decisions of the upper and lower controllers. The fairness of users' services admissions in each ground zone is achieved as far as possible while ensuring the reasonable allocation of beam resources among zones. Finally, online decision-making and offline learning were combined, so that the controller could make full use of a large number of historical data to learn and generate intelligent strategies with stronger adaptive ability while interacting with the network environment in real time. A large number of simulation results show that IHAC has better performance in terms of a successful service admission rate, service drop rate, and fair resource allocation. Among them, the number of accepted services increased by 20.36% on average, the packet loss rate decreased by 17.56% on average, and the resource fairness increased by 17.16% on average.

摘要

低地球轨道(LEO)卫星的机载资源有限,用户终端在不断变化的覆盖区域内分布不均,且服务需求差异显著。在卫星频谱资源有限的约束下优化资源分配并确保服务准入控制的公平性迫在眉睫。因此,我们提出了一种基于深度强化学习(DRL)的智能分层准入控制(IHAC)策略。该策略结合深度确定性策略梯度(DDPG)和深度Q网络(DQN)智能算法,构建上下分层的资源分配和准入控制框架。上层控制器从全局角度考虑每个地面区域和卫星资源的状态特征,确定每个地面区域的波束资源分配比例。下层控制器根据上层控制器的决策和用户服务的详细信息制定准入控制策略。同时,采用设计的奖惩机制优化上下层控制器的决策。在确保各区域波束资源合理分配的同时,尽可能实现每个地面区域用户服务准入的公平性。最后,将在线决策与离线学习相结合,使控制器能够在与网络环境实时交互的同时,充分利用大量历史数据进行学习,生成具有更强自适应能力的智能策略。大量仿真结果表明,IHAC在成功服务准入率、服务丢弃率和公平资源分配方面具有更好的性能。其中,接受服务的数量平均增加了20.36%,丢包率平均降低了17.56%,资源公平性平均提高了17.16%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/d853d5a860c3/sensors-23-08470-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/4c4da2e88cf3/sensors-23-08470-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/b247b95af7a6/sensors-23-08470-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/7fbcfa8d6500/sensors-23-08470-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/4bd31452e599/sensors-23-08470-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/246e0b23335e/sensors-23-08470-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/0a1d3369b2e2/sensors-23-08470-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/58cecf5d715f/sensors-23-08470-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/d853d5a860c3/sensors-23-08470-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/4c4da2e88cf3/sensors-23-08470-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/b247b95af7a6/sensors-23-08470-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/7fbcfa8d6500/sensors-23-08470-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/4bd31452e599/sensors-23-08470-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/246e0b23335e/sensors-23-08470-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/0a1d3369b2e2/sensors-23-08470-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/58cecf5d715f/sensors-23-08470-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1b0/10611023/d853d5a860c3/sensors-23-08470-g008.jpg

相似文献

1
Intelligent Hierarchical Admission Control for Low-Earth Orbit Satellites Based on Deep Reinforcement Learning.基于深度强化学习的低地球轨道卫星智能分层准入控制
Sensors (Basel). 2023 Oct 14;23(20):8470. doi: 10.3390/s23208470.
2
An Efficient Multi-Dimensional Resource Allocation Mechanism for Beam-Hopping in LEO Satellite Network.一种用于低地球轨道卫星网络中的跳波束的高效多维资源分配机制。
Sensors (Basel). 2022 Nov 29;22(23):9304. doi: 10.3390/s22239304.
3
Computation Offloading and Resource Allocation Based on P-DQN in LEO Satellite Edge Networks.基于深度Q网络的低轨卫星边缘网络计算卸载与资源分配
Sensors (Basel). 2023 Dec 17;23(24):9885. doi: 10.3390/s23249885.
4
BeiDou Short-Message Satellite Resource Allocation Algorithm Based on Deep Reinforcement Learning.基于深度强化学习的北斗短报文卫星资源分配算法
Entropy (Basel). 2021 Jul 22;23(8):932. doi: 10.3390/e23080932.
5
Two Tier Slicing Resource Allocation Algorithm Based on Deep Reinforcement Learning and Joint Bidding in Wireless Access Networks.基于深度强化学习和联合竞价的无线接入网络双层切片资源分配算法
Sensors (Basel). 2022 May 4;22(9):3495. doi: 10.3390/s22093495.
6
Optimization of news dissemination push mode by intelligent edge computing technology for deep learning.基于深度学习的智能边缘计算技术对新闻传播推送模式的优化
Sci Rep. 2024 Mar 20;14(1):6671. doi: 10.1038/s41598-024-53859-7.
7
Matching-Based Resource Allocation for Satellite-Ground Network.基于匹配的卫星-地面网络资源分配
Sensors (Basel). 2022 Nov 2;22(21):8436. doi: 10.3390/s22218436.
8
Spectrum-efficient user grouping and resource allocation based on deep reinforcement learning for mmWave massive MIMO-NOMA systems.基于深度强化学习的毫米波大规模MIMO-NOMA系统频谱高效用户分组与资源分配
Sci Rep. 2024 Apr 17;14(1):8884. doi: 10.1038/s41598-024-59241-x.
9
An LEO Constellation Early Warning System Decision-Making Method Based on Hierarchical Reinforcement Learning.基于分层强化学习的低地球轨道星座预警系统决策方法。
Sensors (Basel). 2023 Feb 16;23(4):2225. doi: 10.3390/s23042225.
10
Cooperative Downloading for LEO Satellite Networks: A DRL-Based Approach.低轨卫星网络中的协同下载:基于 DRL 的方法。
Sensors (Basel). 2022 Sep 10;22(18):6853. doi: 10.3390/s22186853.

引用本文的文献

1
Policy Compression for Intelligent Continuous Control on Low-Power Edge Devices.低功耗边缘设备上智能连续控制的策略压缩
Sensors (Basel). 2024 Jul 27;24(15):4876. doi: 10.3390/s24154876.

本文引用的文献

1
Resource Allocation for Cognitive LEO Satellite Systems: Facilitating IoT Communications.认知型低地球轨道卫星系统的资源分配:促进物联网通信。
Sensors (Basel). 2023 Apr 11;23(8):3875. doi: 10.3390/s23083875.
2
A Reference Architecture for Cloud-Edge Meta-Operating Systems Enabling Cross-Domain, Data-Intensive, ML-Assisted Applications: Architectural Overview and Key Concepts.面向跨域、数据密集、机器学习辅助应用的云边元操作系统参考架构:体系结构概述和关键概念。
Sensors (Basel). 2022 Nov 21;22(22):9003. doi: 10.3390/s22229003.
3
Cooperative Downloading for LEO Satellite Networks: A DRL-Based Approach.
低轨卫星网络中的协同下载:基于 DRL 的方法。
Sensors (Basel). 2022 Sep 10;22(18):6853. doi: 10.3390/s22186853.