利用跳跃马尔可夫线性系统和深度强化学习增强 5G 毫米波移动网络的切换。

Enhancing Handover for 5G mmWave Mobile Networks Using Jump Markov Linear System and Deep Reinforcement Learning.

机构信息

Department of Electrical and Electronic Engineering, Auckland University of Technology, Auckland 1010, New Zealand.

Department of Computer Science and Information Systems, Bradley University, Peoria, IL 61625, USA.

出版信息

Sensors (Basel). 2022 Jan 19;22(3):746. doi: 10.3390/s22030746.

DOI:10.3390/s22030746

PMID:35161492

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8838663/

Abstract

The Fifth Generation (5G) mobile networks use millimeter waves (mmWaves) to offer gigabit data rates. However, unlike microwaves, mmWave links are prone to user and topographic dynamics. They easily get blocked and end up forming irregular cell patterns for 5G. This in turn causes too early, too late, or wrong handoffs (HOs). To mitigate HO challenges, sustain connectivity, and avert unnecessary HO, we propose an HO scheme based on a jump Markov linear system (JMLS) and deep reinforcement learning (DRL). JMLS is widely known to account for abrupt changes in system dynamics. DRL likewise emerges as an artificial intelligence technique for learning highly dimensional and time-varying behaviors. We combine the two techniques to account for time-varying, abrupt, and irregular changes in mmWave link behavior by predicting likely deterioration patterns of target links. The prediction is optimized by meta training techniques that also reduce training sample size. Thus, the JMLS-DRL platform formulates intelligent and versatile HO policies for 5G. When compared to a signal and interference noise ratio (SINR) and DRL-based HO scheme, our HO scheme becomes more reliable in selecting reliable target links. In particular, our proposed scheme is able to reduce wasteful HO to less than 5% within 200 training episodes compared to the DRL-based HO scheme that needs more than 200 training episodes to get to less than 5%. It supports longer dew time between HOs and high sum rates by ably averting unnecessary HOs with almost half the HOs compared to a DRL-based HO scheme.

摘要

第五代（5G）移动网络使用毫米波（mmWave）提供千兆数据速率。然而，与微波不同，mmWave 链路容易受到用户和地形动态的影响。它们很容易被阻塞，最终为 5G 形成不规则的小区模式。这反过来又导致过早、过晚或错误的切换（HO）。为了减轻 HO 挑战、保持连接性并避免不必要的 HO，我们提出了一种基于跳跃马尔可夫线性系统（JMLS）和深度强化学习（DRL）的 HO 方案。众所周知，JMLS 用于说明系统动态的突然变化。DRL 同样作为一种人工智能技术，用于学习高度维数和时变的行为。我们将这两种技术结合起来，通过预测目标链路可能的劣化模式来说明 mmWave 链路行为的时变、突然和不规则变化。预测通过元训练技术进行优化，这些技术还减少了训练样本的大小。因此，JMLS-DRL 平台为 5G 制定了智能和通用的 HO 策略。与信号与干扰噪声比（SINR）和基于 DRL 的 HO 方案相比，我们的 HO 方案在选择可靠的目标链路方面更加可靠。特别是，与需要 200 多个训练回合才能达到小于 5%的 DRL 基于 HO 方案相比，我们提出的方案能够将浪费性 HO 减少到小于 5%，在 200 个训练回合内。它通过几乎一半的 HO 来避免不必要的 HO，支持更长的 HO 之间的延迟时间和更高的和速率，而与基于 DRL 的 HO 方案相比。