Lee Ju Hyeon, Ji Il Hwan, Jeon Seung Ho, Seo Jung Taek
Department of Information Security, Gachon University, Seongnam-si 1342, Republic of Korea.
Department of Computer Engineering (Smart Security), Gachon University, Seongnam-si 1342, Republic of Korea.
Sensors (Basel). 2023 Dec 15;23(24):9855. doi: 10.3390/s23249855.
Cyber threats to industrial control systems (ICSs) have increased as information and communications technology (ICT) has been incorporated. In response to these cyber threats, we are implementing a range of security equipment and specialized training programs. Anomaly data stemming from cyber-attacks are crucial for effectively testing security equipment and conducting cyber training exercises. However, securing anomaly data in an ICS environment requires a lot of effort. For this reason, we propose a method for generating anomaly data that reflects cyber-attack characteristics. This method uses systematic sampling and linear regression models in an ICS environment to generate anomaly data reflecting cyber-attack characteristics based on benign data. The method uses statistical analysis to identify features indicative of cyber-attack characteristics and alters their values from benign data through systematic sampling. The transformed data are then used to train a linear regression model. The linear regression model can predict features because it has learned the linear relationships between data features. This experiment used ICS_PCAPS data generated based on Modbus, frequently used in ICS. In this experiment, more than 50,000 new anomaly data pieces were generated. As a result of using some of the new anomaly data generated as training data for the existing model, no significant performance degradation occurred. Additionally, comparing some of the new anomaly data with the original benign and attack data using kernel density estimation confirmed that the new anomaly data pattern was changing from benign data to attack data. In this way, anomaly data that partially reflect the pattern of the attack data were created. The proposed method generates anomaly data like cyber-attack data quickly and logically, free from the constraints of cost, time, and original cyber-attack data required in existing research.
随着信息通信技术(ICT)被融入工业控制系统(ICS),针对ICS的网络威胁不断增加。为应对这些网络威胁,我们正在实施一系列安全设备和专业培训项目。源自网络攻击的异常数据对于有效测试安全设备和开展网络训练演习至关重要。然而,在ICS环境中保护异常数据需要付出很大努力。因此,我们提出一种生成反映网络攻击特征的异常数据的方法。该方法在ICS环境中使用系统抽样和线性回归模型,基于良性数据生成反映网络攻击特征的异常数据。该方法利用统计分析来识别指示网络攻击特征的特征,并通过系统抽样改变其在良性数据中的值。然后将变换后的数据用于训练线性回归模型。线性回归模型能够预测特征,因为它已经学习了数据特征之间的线性关系。本实验使用了基于ICS中常用的Modbus生成的ICS_PCAPS数据。在本实验中,生成了超过50000条新的异常数据。将部分生成的新异常数据用作现有模型的训练数据,结果并未出现显著的性能下降。此外,使用核密度估计将部分新异常数据与原始良性数据和攻击数据进行比较,证实新异常数据模式正在从良性数据向攻击数据转变。通过这种方式,创建了部分反映攻击数据模式的异常数据。所提出的方法能够快速且合乎逻辑地生成类似网络攻击数据的异常数据,不受现有研究中成本、时间和原始网络攻击数据的限制。