一种基于混合k近邻和随机森林算法的分组密码算法识别方案。

A block cipher algorithm identification scheme based on hybrid k-nearest neighbor and random forest algorithm.

作者信息

Yuan Ke, Yu Daoming, Feng Jingkai, Yang Longwei, Jia Chunfu, Huang Yiwang

机构信息

School of Computer and Information Engineering, Henan University, Kaifeng, Henan, China.

Henan Key Laboratory of Big Data Analysis and Processing, Henan University, Kaifeng, Henan, China.

出版信息

PeerJ Comput Sci. 2022 Oct 10;8:e1110. doi: 10.7717/peerj-cs.1110. eCollection 2022.

DOI:10.7717/peerj-cs.1110

PMID:36262148

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9575859/

Abstract

Cryptographic algorithm identification, which refers to analyzing and identifying the encryption algorithm used in cryptographic system, is of great significance to cryptanalysis. In order to improve the accuracy of identification work, this article proposes a new ensemble learning-based model named hybrid k-nearest neighbor and random forest (HKNNRF), and constructs a block cipher algorithm identification scheme. In the ciphertext-only scenario, we use NIST randomness test methods to extract ciphertext features, and carry out binary-classification and five-classification experiments on the block cipher algorithms using proposed scheme. Experiments show that when the ciphertext size and other experimental conditions are the same, compared with the baselines, the HKNNRF model has higher classification accuracy. Specifically, the average binary-classification identification accuracy of HKNNRF is 69.5%, which is 13%, 12.5%, and 10% higher than the single-layer support vector machine (SVM), k-nearest neighbor (KNN), and random forest (RF) respectively. The five-classification identification accuracy can reach 34%, which is higher than the 21% accuracy of KNN, the 22% accuracy of RF and the 23% accuracy of SVM respectively under the same experimental conditions.

摘要

密码算法识别是指对密码系统中使用的加密算法进行分析和识别，对密码分析具有重要意义。为了提高识别工作的准确性，本文提出了一种基于集成学习的新模型——混合k近邻与随机森林（HKNNRF），并构建了一种分组密码算法识别方案。在仅知密文的场景下，我们使用美国国家标准与技术研究院（NIST）的随机性测试方法来提取密文特征，并使用所提方案对分组密码算法进行二分类和五分类实验。实验表明，在密文大小和其他实验条件相同的情况下，与基线模型相比，HKNNRF模型具有更高的分类准确率。具体而言，HKNNRF的平均二分类识别准确率为69.5%，分别比单层支持向量机（SVM）、k近邻（KNN）和随机森林（RF）高13%、12.5%和10%。五分类识别准确率可达34%，在相同实验条件下分别高于KNN的21%、RF的22%和SVM的23%。

相似文献

A block cipher algorithm identification scheme based on hybrid k-nearest neighbor and random forest algorithm.一种基于混合k近邻和随机森林算法的分组密码算法识别方案。

PeerJ Comput Sci. 2022 Oct 10;8:e1110. doi: 10.7717/peerj-cs.1110. eCollection 2022.

Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery.使用哨兵-2影像的随机森林、k近邻和支持向量机分类器用于土地覆盖分类的比较

Sensors (Basel). 2017 Dec 22;18(1):18. doi: 10.3390/s18010018.

Predictive Model for Dyslexia from Fixations and Saccadic Eye Movement Events.基于注视和眼跳事件的阅读障碍预测模型

Comput Methods Programs Biomed. 2020 Oct;195:105538. doi: 10.1016/j.cmpb.2020.105538. Epub 2020 May 30.

Prediction of human breast and colon cancers from imbalanced data using nearest neighbor and support vector machines.基于最近邻算法和支持向量机的不平衡数据在人类乳腺癌和结肠癌预测中的应用。

Comput Methods Programs Biomed. 2014 Mar;113(3):792-808. doi: 10.1016/j.cmpb.2014.01.001. Epub 2014 Jan 10.

GSEA-SDBE: A gene selection method for breast cancer classification based on GSEA and analyzing differences in performance metrics.GSEA-SDBE：一种基于基因集富集分析（GSEA）并分析性能指标差异的乳腺癌分类基因选择方法。

PLoS One. 2022 Apr 26;17(4):e0263171. doi: 10.1371/journal.pone.0263171. eCollection 2022.

A Highly Discriminative Hybrid Feature Selection Algorithm for Cancer Diagnosis.一种用于癌症诊断的高判别混合特征选择算法。

ScientificWorldJournal. 2022 Aug 9;2022:1056490. doi: 10.1155/2022/1056490. eCollection 2022.

AVNM: A Voting based Novel Mathematical Rule for Image Classification.AVNM：一种基于投票的图像分类新数学规则。

Comput Methods Programs Biomed. 2016 Dec;137:195-201. doi: 10.1016/j.cmpb.2016.08.015. Epub 2016 Sep 26.

Improving the Accuracy of Ensemble Machine Learning Classification Models Using a Novel Bit-Fusion Algorithm for Healthcare AI Systems.利用一种新颖的位融合算法提高医疗 AI 系统中集成机器学习分类模型的准确性。

Front Public Health. 2022 May 4;10:858282. doi: 10.3389/fpubh.2022.858282. eCollection 2022.

Classification of Parkinson's disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples.利用语音样本的多编辑最近邻和集成学习算法对帕金森病进行分类。

Biomed Eng Online. 2016 Nov 16;15(1):122. doi: 10.1186/s12938-016-0242-6.

An Enhanced Quantum K-Nearest Neighbor Classification Algorithm Based on Polar Distance.一种基于极距的增强型量子K近邻分类算法

Entropy (Basel). 2023 Jan 8;25(1):127. doi: 10.3390/e25010127.

本文引用的文献

Advances in Sparrow Search Algorithm: A Comprehensive Survey.麻雀搜索算法的研究进展：全面综述

Arch Comput Methods Eng. 2023;30(1):427-455. doi: 10.1007/s11831-022-09804-w. Epub 2022 Aug 22.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种基于混合k近邻和随机森林算法的分组密码算法识别方案。

A block cipher algorithm identification scheme based on hybrid k-nearest neighbor and random forest algorithm.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献