Liao Ling Xia, Zhao Changqing, Lai Roy Xiaorong, Chao Han-Chieh
School of Electronic Information and Automation, Guilin University of Aerospace Technology, Guilin 541004, China.
Confederal Networks Inc., Seattle, WA 98055, USA.
Sensors (Basel). 2024 Feb 1;24(3):963. doi: 10.3390/s24030963.
Accurately and efficiently predicting elephant flows (elephants) is crucial for optimizing network performance and resource utilization. Current prediction approaches for software-defined networks (SDNs) typically rely on complete traffic and statistics moving from switches to controllers. This leads to an extra control channel bandwidth occupation and network delay. To address this issue, this paper proposes a prediction strategy based on incomplete traffic that is sampled by the timeouts for the installation or reactivation of flow entries. The strategy involves assigning a very short hard timeout (Tinitial) to flow entries and then increasing it at a rate of until flows are identified as elephants or out of their lifespans. Predicted elephants are switched to an idle timeout of 5 s. Logistic regression is used to model elephants based on a complete dataset. Bayesian optimization is then used to tune the trained model Tinitial and over the incomplete dataset. The process of feature selection, model learning, and optimization is explained. An extensive evaluation shows that the proposed approach can achieve over 90% generalization accuracy over 7 different datasets, including campus, backbone, and the Internet of Things (IoT). Elephants can be correctly predicted for about half of their lifetime. The proposed approach can significantly reduce the controller-switch interaction in campus and IoT networks, although packet completion approaches may need to be applied in networks with a short mean packet inter-arrival time.
准确且高效地预测大象流(elephants)对于优化网络性能和资源利用至关重要。软件定义网络(SDN)的当前预测方法通常依赖于从交换机到控制器的完整流量和统计信息。这会导致额外的控制信道带宽占用和网络延迟。为了解决这个问题,本文提出了一种基于不完整流量的预测策略,该流量通过流表项安装或重新激活的超时进行采样。该策略包括为流表项分配一个非常短的硬超时(Tinitial),然后以一定速率增加它,直到流被识别为大象流或超出其生命周期。预测出的大象流将被切换到5秒的空闲超时。基于完整数据集,使用逻辑回归对大象流进行建模。然后使用贝叶斯优化在不完整数据集上调整训练模型的Tinitial和。解释了特征选择、模型学习和优化的过程。广泛的评估表明,所提出的方法在包括校园网、骨干网和物联网(IoT)在内的7个不同数据集上可以实现超过90%的泛化准确率。大象流在其生命周期的大约一半时间内可以被正确预测。所提出的方法可以显著减少校园网和物联网网络中控制器与交换机之间的交互,尽管在平均数据包到达间隔时间较短的网络中可能需要应用数据包完成方法。