• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度卷积森林:一种用于文本中垃圾邮件检测的动态深度集成方法。

Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text.

作者信息

Shaaban Mai A, Hassan Yasser F, Guirguis Shawkat K

机构信息

Department of Mathematics and Computer Science, Faculty of Science, Alexandria University, Alexandria, Egypt.

Faculty of Computers and Data Science, Alexandria University, Alexandria, Egypt.

出版信息

Complex Intell Systems. 2022;8(6):4897-4909. doi: 10.1007/s40747-022-00741-6. Epub 2022 Apr 26.

DOI:10.1007/s40747-022-00741-6
PMID:35496326
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9039275/
Abstract

The increase in people's use of mobile messaging services has led to the spread of social engineering attacks like phishing, considering that spam text is one of the main factors in the dissemination of phishing attacks to steal sensitive data such as credit cards and passwords. In addition, rumors and incorrect medical information regarding the COVID-19 pandemic are widely shared on social media leading to people's fear and confusion. Thus, filtering spam content is vital to reduce risks and threats. Previous studies relied on machine learning and deep learning approaches for spam classification, but these approaches have two limitations. Machine learning models require manual feature engineering, whereas deep neural networks require a high computational cost. This paper introduces a dynamic deep ensemble model for spam detection that adjusts its complexity and extracts features automatically. The proposed model utilizes convolutional and pooling layers for feature extraction along with base classifiers such as random forests and extremely randomized trees for classifying texts into spam or legitimate ones. Moreover, the model employs ensemble learning procedures like boosting and bagging. As a result, the model achieved high precision, recall, f1-score and accuracy of 98.38%.

摘要

人们对移动消息服务使用的增加导致了网络钓鱼等社会工程攻击的传播,因为垃圾短信是传播网络钓鱼攻击以窃取信用卡和密码等敏感数据的主要因素之一。此外,关于新冠疫情的谣言和错误医疗信息在社交媒体上广泛传播,导致人们恐惧和困惑。因此,过滤垃圾内容对于降低风险和威胁至关重要。以往的研究依赖机器学习和深度学习方法进行垃圾邮件分类,但这些方法有两个局限性。机器学习模型需要人工进行特征工程,而深度神经网络需要高昂的计算成本。本文介绍了一种用于垃圾邮件检测的动态深度集成模型,该模型可自动调整其复杂度并提取特征。所提出的模型利用卷积层和池化层进行特征提取,并使用随机森林和极端随机树等基础分类器将文本分类为垃圾邮件或合法邮件。此外,该模型采用了提升和装袋等集成学习过程。结果,该模型实现了98.38%的高精度、召回率、F1分数和准确率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/5c03cba5ea81/40747_2022_741_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/32cbafe85f2a/40747_2022_741_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/d3b767c3d84e/40747_2022_741_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/a6e79d9bf98f/40747_2022_741_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/5c03cba5ea81/40747_2022_741_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/32cbafe85f2a/40747_2022_741_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/d3b767c3d84e/40747_2022_741_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/a6e79d9bf98f/40747_2022_741_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c0d/9039275/5c03cba5ea81/40747_2022_741_Fig4_HTML.jpg

相似文献

1
Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text.深度卷积森林:一种用于文本中垃圾邮件检测的动态深度集成方法。
Complex Intell Systems. 2022;8(6):4897-4909. doi: 10.1007/s40747-022-00741-6. Epub 2022 Apr 26.
2
An intelligent identification and classification system for malicious uniform resource locators (URLs).一种针对恶意统一资源定位符(URL)的智能识别与分类系统。
Neural Comput Appl. 2023 Apr 20:1-17. doi: 10.1007/s00521-023-08592-z.
3
A systematic literature review on spam content detection and classification.关于垃圾邮件内容检测与分类的系统文献综述。
PeerJ Comput Sci. 2022 Jan 20;8:e830. doi: 10.7717/peerj-cs.830. eCollection 2022.
4
SMS Scam Detection Application Based on Optical Character Recognition for Image Data Using Unsupervised and Deep Semi-Supervised Learning.基于光学字符识别的短信诈骗检测应用:利用无监督和深度半监督学习处理图像数据
Sensors (Basel). 2024 Sep 20;24(18):6084. doi: 10.3390/s24186084.
5
Phishing Website Detection Based on Deep Convolutional Neural Network and Random Forest Ensemble Learning.基于深度卷积神经网络和随机森林集成学习的钓鱼网站检测。
Sensors (Basel). 2021 Dec 10;21(24):8281. doi: 10.3390/s21248281.
6
A Hybrid Approach for Alluring Ads Phishing Attack Detection Using Machine Learning.一种使用机器学习的诱人广告网络钓鱼攻击检测混合方法。
Sensors (Basel). 2023 Sep 25;23(19):8070. doi: 10.3390/s23198070.
7
Evading obscure communication from spam emails.避免垃圾邮件中隐晦的通讯。
Math Biosci Eng. 2022 Jan;19(2):1926-1943. doi: 10.3934/mbe.2022091. Epub 2021 Dec 22.
8
Application of error level analysis in image spam classification using deep learning model.基于深度学习模型的图像垃圾分类中错误水平分析的应用。
PLoS One. 2023 Dec 14;18(12):e0291037. doi: 10.1371/journal.pone.0291037. eCollection 2023.
9
Efficient information theoretic strategies for classifier combination, feature extraction and performance evaluation in improving false positives and false negatives for spam e-mail filtering.用于垃圾邮件过滤中分类器组合、特征提取和性能评估的有效信息论策略,以改善误报和漏报情况。
Neural Netw. 2005 Jun-Jul;18(5-6):799-807. doi: 10.1016/j.neunet.2005.06.045.
10
A comprehensive survey of AI-enabled phishing attacks detection techniques.对人工智能驱动的网络钓鱼攻击检测技术的全面调查。
Telecommun Syst. 2021;76(1):139-154. doi: 10.1007/s11235-020-00733-2. Epub 2020 Oct 23.

引用本文的文献

1
A Hybrid Model with New Word Weighting for Fast Filtering Spam Short Texts.一种用于快速过滤垃圾短信的具有新词加权的混合模型。
Sensors (Basel). 2023 Nov 4;23(21):8975. doi: 10.3390/s23218975.