Mughaid Ala, AlZu'bi Shadi, Hnaif Adnan, Taamneh Salah, Alnajjar Asma, Elsoud Esraa Abu
Department of Information Technology, Faculty of prince Al-Hussien bin Abdullah for IT, The Hashemite University, P.O. Box 330127, 13133 Zarqa, Jordan.
Faculty of Science and IT, Al-Zaytoonah University of Jordan, Amman, Jordan.
Cluster Comput. 2022;25(6):3819-3828. doi: 10.1007/s10586-022-03604-4. Epub 2022 May 14.
Recently, phishing attacks have become one of the most prominent social engineering attacks faced by public internet users, governments, and businesses. In response to this threat, this paper proposes to give a complete vision to what Machine learning is, what phishers are using to trick gullible users with different types of phishing attacks techniques and based on our survey that phishing emails is the most effective on the targeted sectors and users which we are going to compare as well. Therefore, more effective phishing detection technology is needed to curb the threat of phishing emails that are growing at an alarming rate in recent years, thus will discuss the techniques of mitigation of phishing by Machine learning algorithms and technical solutions that have been proposed to mitigate the problem of phishing and valuable awareness knowledge users should be aware to detect and prevent from being duped by phishing scams. In this work, we proposed a detection model using machine learning techniques by splitting the dataset to train the detection model and validating the results using the test data , to capture inherent characteristics of the email text, and other features to be classified as phishing or non-phishing using three different data sets, After making a comparison between them, we obtained that the most number of features used the most accurate and efficient results achieved. the best ML algorithm accuracy were 0.88, 1.00, and 0.97 consecutively for boosted decision tree on the applied data sets.
最近,网络钓鱼攻击已成为普通互联网用户、政府和企业面临的最突出的社会工程攻击之一。针对这一威胁,本文旨在全面阐述机器学习是什么,网络钓鱼者如何利用不同类型的网络钓鱼攻击技术欺骗易受骗用户,并基于我们的调查,指出网络钓鱼电子邮件对目标部门和用户最为有效,我们也将对此进行比较。因此,需要更有效的网络钓鱼检测技术来遏制近年来以惊人速度增长的网络钓鱼电子邮件威胁,本文将讨论机器学习算法缓解网络钓鱼的技术以及为缓解网络钓鱼问题而提出的技术解决方案,以及用户应了解的有价值的防范知识,以检测并防止被网络钓鱼诈骗所骗。在这项工作中,我们提出了一种使用机器学习技术的检测模型,通过拆分数据集来训练检测模型,并使用测试数据验证结果,以捕捉电子邮件文本的内在特征,以及使用三个不同数据集将其他特征分类为网络钓鱼或非网络钓鱼的特征。在对它们进行比较后,我们发现使用最多特征的结果最为准确和高效。在所应用的数据集上,增强决策树的最佳机器学习算法准确率分别连续为0.88、1.00和0.97。