Liu Song, Wei Peng
Naval University of Engineering, Wuhan, Hubei, China.
National Key Laboratory on Ship Vibration & Noise, Wuhan, China.
Sci Rep. 2024 Dec 28;14(1):31406. doi: 10.1038/s41598-024-83025-y.
In-band full-duplex communication has the potential to double the wireless channel capacity. However, how to efficiently transform the full-duplex gain at the physical layer into network throughput improvement is still a challenge, especially in dynamic communication environments. This paper presents a reinforcement learning-based full-duplex (RLFD) medium access control (MAC) protocol for wireless local-area networks (WLANs) with full-duplex access points. To solve the channel contention problem and fully utilize the full-duplex transmission opportunities, we first design a two-way handshake transmission mechanism and make an investigation on the effects of transmission scheduling in full-duplex transmission. Then the transmission scheduling problem is theoretically formulated as a non-stationary multi-armed bandit problem in which our objective is to maximize the network throughput. Thus, we develop a Window-Constraint Bayesian (WCB) algorithm to generate optimized scheduling policies online. And full-duplex opportunities are fully utilized by transmitting packets according to the optimized scheduling policies. Besides, an analytical model is developed to characterize the performance of RLFD. The performance of RLFD is evaluated by simulation. And the results show that RLFD can improve the network throughput by 80% compared with the IEEE 802.11 distributed coordination function with Request-To-Send/Clear-To-Send. Moreover, with the proposed WCB algorithm, the network throughput can remain steady as the communication environment dynamically changes.
带内全双工通信有潜力使无线信道容量翻倍。然而,如何将物理层的全双工增益有效转化为网络吞吐量提升仍是一项挑战,尤其是在动态通信环境中。本文针对具有全双工接入点的无线局域网(WLAN),提出了一种基于强化学习的全双工(RLFD)介质访问控制(MAC)协议。为了解决信道争用问题并充分利用全双工传输机会,我们首先设计了一种双向握手传输机制,并研究了全双工传输中传输调度的影响。然后,将传输调度问题理论上表述为一个非平稳多臂老虎机问题,我们的目标是最大化网络吞吐量。因此,我们开发了一种窗口约束贝叶斯(WCB)算法来在线生成优化的调度策略。并且根据优化的调度策略发送数据包,从而充分利用全双工机会。此外,还开发了一个分析模型来表征RLFD的性能。通过仿真评估了RLFD的性能。结果表明,与具有请求发送/清除发送功能的IEEE 802.11分布式协调功能相比,RLFD可将网络吞吐量提高80%。而且,使用所提出的WCB算法,随着通信环境动态变化,网络吞吐量可以保持稳定。