Zhao Changqing, Liao Ling Xia, Chen Guomin, Chao Han-Chieh
School of Electronic Information and Automation, Guilin University of Aerospace Technology, Guilin 541004, China.
School of Management, Guilin University of Aerospace Technology, Guilin 541004, China.
Sensors (Basel). 2025 Apr 8;25(8):2368. doi: 10.3390/s25082368.
The accurate and efficient classification of network traffic, including malicious traffic, is essential for effective network management, cybersecurity, and resource optimization. However, traffic classification methods in modern, complex, and dynamic networks face significant challenges, particularly at the network edge, where resources are limited and issues such as privacy concerns and concept drift arise. Condensation techniques offer a solution by reducing the data size, simplifying complex models, and transferring knowledge from traffic data. This paper explores data and knowledge condensation methods-such as coreset selection, data compression, knowledge distillation, and dataset distillation-within the context of traffic classification tasks. It clarifies the relationship between these techniques and network traffic classification, introducing each method and its typical applications. This paper also outlines potential scenarios for applying each condensation technique, highlighting the associated challenges and open research issues. To the best of our knowledge, this is the first comprehensive summary of condensation techniques specifically tailored for network traffic classification tasks.
准确且高效地对包括恶意流量在内的网络流量进行分类,对于有效的网络管理、网络安全和资源优化至关重要。然而,现代、复杂且动态的网络中的流量分类方法面临重大挑战,尤其是在网络边缘,那里资源有限,还会出现隐私问题和概念漂移等问题。凝聚技术通过减少数据大小、简化复杂模型以及从流量数据中传递知识来提供一种解决方案。本文在流量分类任务的背景下探索数据和知识凝聚方法,如核心集选择、数据压缩、知识蒸馏和数据集蒸馏。它阐明了这些技术与网络流量分类之间的关系,介绍了每种方法及其典型应用。本文还概述了应用每种凝聚技术的潜在场景,突出了相关挑战和开放研究问题。据我们所知,这是第一篇专门针对网络流量分类任务量身定制的凝聚技术的全面总结。