• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于因果关系检测和图神经网络的强化学习驱动的软件定义网络路由方案

Reinforcement learning-based SDN routing scheme empowered by causality detection and GNN.

作者信息

He Yuanhao, Xiao Geyang, Zhu Jun, Zou Tao, Liang Yuan

机构信息

Intelligent Manufacturing Computing Research Center, Zhejiang Lab, Hangzhou, China.

出版信息

Front Comput Neurosci. 2024 Apr 29;18:1393025. doi: 10.3389/fncom.2024.1393025. eCollection 2024.

DOI:10.3389/fncom.2024.1393025
PMID:38741707
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11089111/
Abstract

In recent years, with the rapid development of network applications and the increasing demand for high-quality network service, quality-of-service (QoS) routing has emerged as a critical network technology. The application of machine learning techniques, particularly reinforcement learning and graph neural network, has garnered significant attention in addressing this problem. However, existing reinforcement learning methods lack research on the causal impact of agent actions on the interactive environment, and graph neural network fail to effectively represent link features, which are pivotal for routing optimization. Therefore, this study quantifies the causal influence between the intelligent agent and the interactive environment based on causal inference techniques, aiming to guide the intelligent agent in improving the efficiency of exploring the action space. Simultaneously, graph neural network is employed to embed node and link features, and a reward function is designed that comprehensively considers network performance metrics and causality relevance. A centralized reinforcement learning method is proposed to effectively achieve QoS-aware routing in Software-Defined Networking (SDN). Finally, experiments are conducted in a network simulation environment, and metrics such as packet loss, delay, and throughput all outperform the baseline.

摘要

近年来,随着网络应用的快速发展以及对高质量网络服务需求的不断增加,服务质量(QoS)路由已成为一项关键的网络技术。机器学习技术的应用,特别是强化学习和图神经网络,在解决这一问题上受到了广泛关注。然而,现有的强化学习方法缺乏对智能体动作对交互环境因果影响的研究,而图神经网络无法有效表示链路特征,而链路特征对于路由优化至关重要。因此,本研究基于因果推理技术量化智能体与交互环境之间的因果影响,旨在指导智能体提高探索动作空间的效率。同时,采用图神经网络来嵌入节点和链路特征,并设计了一个综合考虑网络性能指标和因果相关性的奖励函数。提出了一种集中式强化学习方法,以在软件定义网络(SDN)中有效地实现QoS感知路由。最后,在网络仿真环境中进行了实验,丢包、延迟和吞吐量等指标均优于基线。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/2d74818cc389/fncom-18-1393025-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/774413271efb/fncom-18-1393025-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/875d343475bb/fncom-18-1393025-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/3892956f01da/fncom-18-1393025-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/2d74818cc389/fncom-18-1393025-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/774413271efb/fncom-18-1393025-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/875d343475bb/fncom-18-1393025-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/3892956f01da/fncom-18-1393025-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/671b/11089111/2d74818cc389/fncom-18-1393025-g0004.jpg

相似文献

1
Reinforcement learning-based SDN routing scheme empowered by causality detection and GNN.基于因果关系检测和图神经网络的强化学习驱动的软件定义网络路由方案
Front Comput Neurosci. 2024 Apr 29;18:1393025. doi: 10.3389/fncom.2024.1393025. eCollection 2024.
2
AQMDRL: Automatic Quality of Service Architecture Based on Multistep Deep Reinforcement Learning in Software-Defined Networking.基于多步深度强化学习的软件定义网络中自动服务质量架构(AQMDRL)。
Sensors (Basel). 2022 Dec 30;23(1):429. doi: 10.3390/s23010429.
3
Intelligent multicast routing method based on multi-agent deep reinforcement learning in SDWN.基于软件定义广域网中多智能体深度强化学习的智能组播路由方法
Math Biosci Eng. 2023 Sep 4;20(9):17158-17196. doi: 10.3934/mbe.2023765.
4
A Software-Defined Directional Q-Learning Grid-Based Routing Platform and Its Two-Hop Trajectory-Based Routing Algorithm for Vehicular Ad Hoc Networks.一种用于车载自组织网络的软件定义定向Q学习网格路由平台及其基于两跳轨迹的路由算法。
Sensors (Basel). 2022 Oct 27;22(21):8222. doi: 10.3390/s22218222.
5
A fuzzy delay-bandwidth guaranteed routing algorithm for video conferencing services over SDN networks.一种用于软件定义网络(SDN)上视频会议服务的模糊延迟 - 带宽保证路由算法。
Multimed Tools Appl. 2023 Jan 23:1-30. doi: 10.1007/s11042-023-14349-6.
6
A Routing Optimization Method for Software-Defined Optical Transport Networks Based on Ensembles and Reinforcement Learning.基于集成学习和强化学习的软件定义光传送网路由优化方法。
Sensors (Basel). 2022 Oct 24;22(21):8139. doi: 10.3390/s22218139.
7
Augmenting Speech Quality Estimation in Software-Defined Networking Using Machine Learning Algorithms.使用机器学习算法增强软件定义网络中的语音质量估计。
Sensors (Basel). 2021 May 17;21(10):3477. doi: 10.3390/s21103477.
8
Delay-Packet-Loss-Optimized Distributed Routing Using Spiking Neural Network in Delay-Tolerant Networking.基于尖峰神经网络的延迟容忍网络中延迟-丢包优化的分布式路由
Sensors (Basel). 2022 Dec 28;23(1):310. doi: 10.3390/s23010310.
9
Traffic Management in IoT Backbone Networks Using GNN and MAB with SDN Orchestration.基于软件定义网络编排的图神经网络和多智能体强化学习在物联网骨干网中的流量管理
Sensors (Basel). 2023 Aug 10;23(16):7091. doi: 10.3390/s23167091.
10
Environment-Aware Adaptive Reinforcement Learning-Based Routing for Vehicular Ad Hoc Networks.用于车载自组织网络的基于环境感知自适应强化学习的路由
Sensors (Basel). 2023 Dec 20;24(1):40. doi: 10.3390/s24010040.

本文引用的文献

1
Convergence Analysis of Value Iteration Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems.连续时间非线性系统的价值迭代自适应动态规划收敛性分析
IEEE Trans Cybern. 2024 Mar;54(3):1639-1649. doi: 10.1109/TCYB.2022.3232599. Epub 2024 Feb 9.
2
Co-Embedding of Nodes and Edges With Graph Neural Networks.节点和边的图神经网络联合嵌入。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7075-7086. doi: 10.1109/TPAMI.2020.3029762. Epub 2023 May 5.