He Yuanhao, Xiao Geyang, Zhu Jun, Zou Tao, Liang Yuan
Intelligent Manufacturing Computing Research Center, Zhejiang Lab, Hangzhou, China.
Front Comput Neurosci. 2024 Apr 29;18:1393025. doi: 10.3389/fncom.2024.1393025. eCollection 2024.
In recent years, with the rapid development of network applications and the increasing demand for high-quality network service, quality-of-service (QoS) routing has emerged as a critical network technology. The application of machine learning techniques, particularly reinforcement learning and graph neural network, has garnered significant attention in addressing this problem. However, existing reinforcement learning methods lack research on the causal impact of agent actions on the interactive environment, and graph neural network fail to effectively represent link features, which are pivotal for routing optimization. Therefore, this study quantifies the causal influence between the intelligent agent and the interactive environment based on causal inference techniques, aiming to guide the intelligent agent in improving the efficiency of exploring the action space. Simultaneously, graph neural network is employed to embed node and link features, and a reward function is designed that comprehensively considers network performance metrics and causality relevance. A centralized reinforcement learning method is proposed to effectively achieve QoS-aware routing in Software-Defined Networking (SDN). Finally, experiments are conducted in a network simulation environment, and metrics such as packet loss, delay, and throughput all outperform the baseline.
近年来,随着网络应用的快速发展以及对高质量网络服务需求的不断增加,服务质量(QoS)路由已成为一项关键的网络技术。机器学习技术的应用,特别是强化学习和图神经网络,在解决这一问题上受到了广泛关注。然而,现有的强化学习方法缺乏对智能体动作对交互环境因果影响的研究,而图神经网络无法有效表示链路特征,而链路特征对于路由优化至关重要。因此,本研究基于因果推理技术量化智能体与交互环境之间的因果影响,旨在指导智能体提高探索动作空间的效率。同时,采用图神经网络来嵌入节点和链路特征,并设计了一个综合考虑网络性能指标和因果相关性的奖励函数。提出了一种集中式强化学习方法,以在软件定义网络(SDN)中有效地实现QoS感知路由。最后,在网络仿真环境中进行了实验,丢包、延迟和吞吐量等指标均优于基线。