• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度卷积神经网络和随机森林集成学习的钓鱼网站检测。

Phishing Website Detection Based on Deep Convolutional Neural Network and Random Forest Ensemble Learning.

机构信息

School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China.

School of Computer Science, Beijing University of Technology, Beijing 100124, China.

出版信息

Sensors (Basel). 2021 Dec 10;21(24):8281. doi: 10.3390/s21248281.

DOI:10.3390/s21248281
PMID:34960375
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8709380/
Abstract

Phishing has become one of the biggest and most effective cyber threats, causing hundreds of millions of dollars in losses and millions of data breaches every year. Currently, anti-phishing techniques require experts to extract phishing sites features and use third-party services to detect phishing sites. These techniques have some limitations, one of which is that extracting phishing features requires expertise and is time-consuming. Second, the use of third-party services delays the detection of phishing sites. Hence, this paper proposes an integrated phishing website detection method based on convolutional neural networks (CNN) and random forest (RF). The method can predict the legitimacy of URLs without accessing the web content or using third-party services. The proposed technique uses character embedding techniques to convert URLs into fixed-size matrices, extract features at different levels using CNN models, classify multi-level features using multiple RF classifiers, and, finally, output prediction results using a winner-take-all approach. On our dataset, a 99.35% accuracy rate was achieved using the proposed model. An accuracy rate of 99.26% was achieved on the benchmark data, much higher than that of the existing extreme model.

摘要

网络钓鱼已成为最大和最有效的网络威胁之一,每年造成数亿美元的损失和数百万起数据泄露事件。目前,反网络钓鱼技术需要专家提取网络钓鱼网站的特征,并使用第三方服务来检测网络钓鱼网站。这些技术有一些局限性,其中之一是提取网络钓鱼特征需要专业知识且耗时。其次,使用第三方服务会延迟对网络钓鱼网站的检测。因此,本文提出了一种基于卷积神经网络 (CNN) 和随机森林 (RF) 的集成网络钓鱼网站检测方法。该方法可以在不访问网页内容或使用第三方服务的情况下预测 URL 的合法性。所提出的技术使用字符嵌入技术将 URL 转换为固定大小的矩阵,使用 CNN 模型在不同级别提取特征,使用多个 RF 分类器对多级特征进行分类,最后使用“胜者为王”的方法输出预测结果。在我们的数据集上,所提出的模型实现了 99.35%的准确率。在基准数据上,准确率达到了 99.26%,远高于现有的极端模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/c2c06206707d/sensors-21-08281-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/428dcf8fff2d/sensors-21-08281-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/27d168bb6ae5/sensors-21-08281-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/fa38c6331848/sensors-21-08281-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/c058edc27c85/sensors-21-08281-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/d40e461711bd/sensors-21-08281-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/4632e9bf82c2/sensors-21-08281-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/78b94bc1195f/sensors-21-08281-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/cc13023bc0b7/sensors-21-08281-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/d4aa1d79325c/sensors-21-08281-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/470aba705c0f/sensors-21-08281-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/47f3b808181c/sensors-21-08281-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/c2c06206707d/sensors-21-08281-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/428dcf8fff2d/sensors-21-08281-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/27d168bb6ae5/sensors-21-08281-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/fa38c6331848/sensors-21-08281-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/c058edc27c85/sensors-21-08281-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/d40e461711bd/sensors-21-08281-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/4632e9bf82c2/sensors-21-08281-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/78b94bc1195f/sensors-21-08281-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/cc13023bc0b7/sensors-21-08281-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/d4aa1d79325c/sensors-21-08281-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/470aba705c0f/sensors-21-08281-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/47f3b808181c/sensors-21-08281-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c2f/8709380/c2c06206707d/sensors-21-08281-g012.jpg

相似文献

1
Phishing Website Detection Based on Deep Convolutional Neural Network and Random Forest Ensemble Learning.基于深度卷积神经网络和随机森林集成学习的钓鱼网站检测。
Sensors (Basel). 2021 Dec 10;21(24):8281. doi: 10.3390/s21248281.
2
A Deep Learning-Based Innovative Technique for Phishing Detection in Modern Security with Uniform Resource Locators.基于深度学习的现代安全中基于统一资源定位器的网络钓鱼检测创新技术。
Sensors (Basel). 2023 Apr 30;23(9):4403. doi: 10.3390/s23094403.
3
Cyber Threat Intelligence-Based Malicious URL Detection Model Using Ensemble Learning.基于网络威胁情报的集成学习恶意 URL 检测模型。
Sensors (Basel). 2022 Apr 28;22(9):3373. doi: 10.3390/s22093373.
4
A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning.一种使用机器学习的诱人广告网络钓鱼攻击检测混合方法。
Sensors (Basel). 2023 Sep 25;23(19):8070. doi: 10.3390/s23198070.
5
Phishing URLs Detection Using Sequential and Parallel ML Techniques: Comparative Analysis.利用序列和并行 ML 技术检测网络钓鱼 URL:比较分析。
Sensors (Basel). 2023 Mar 26;23(7):3467. doi: 10.3390/s23073467.
6
Improving the phishing website detection using empirical analysis of Function Tree and its variants.通过对功能树及其变体进行实证分析来改进网络钓鱼网站检测
Heliyon. 2021 Jun 29;7(7):e07437. doi: 10.1016/j.heliyon.2021.e07437. eCollection 2021 Jul.
7
CNN-MHSA: A Convolutional Neural Network and multi-head self-attention combined approach for detecting phishing websites.CNN-MHSA:一种用于检测钓鱼网站的卷积神经网络和多头自注意力相结合的方法。
Neural Netw. 2020 May;125:303-312. doi: 10.1016/j.neunet.2020.02.013. Epub 2020 Feb 29.
8
Applications of deep learning for phishing detection: a systematic literature review.深度学习在网络钓鱼检测中的应用:一项系统的文献综述。
Knowl Inf Syst. 2022;64(6):1457-1500. doi: 10.1007/s10115-022-01672-x. Epub 2022 May 23.
9
A Deep-Learning-Driven Light-Weight Phishing Detection Sensor.一种基于深度学习的轻量级钓鱼检测传感器。
Sensors (Basel). 2019 Sep 30;19(19):4258. doi: 10.3390/s19194258.
10
Comparative evaluation of machine learning algorithms for phishing site detection.用于网络钓鱼网站检测的机器学习算法的比较评估。
PeerJ Comput Sci. 2024 Jun 24;10:e2131. doi: 10.7717/peerj-cs.2131. eCollection 2024.

引用本文的文献

1
A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning.一种使用机器学习的诱人广告网络钓鱼攻击检测混合方法。
Sensors (Basel). 2023 Sep 25;23(19):8070. doi: 10.3390/s23198070.

本文引用的文献

1
Datasets for phishing websites detection.用于网络钓鱼网站检测的数据集。
Data Brief. 2020 Oct 23;33:106438. doi: 10.1016/j.dib.2020.106438. eCollection 2020 Dec.