• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TAP:一种基于令牌和深度学习技术的 PHP 漏洞静态分析模型。

TAP: A static analysis model for PHP vulnerabilities based on token and deep learning technology.

机构信息

College of Cybersecurity, Sichuan University, Chengdu 610065, China.

China Information Technology Security Evaluation Center, Beijing 100085, China.

出版信息

PLoS One. 2019 Nov 18;14(11):e0225196. doi: 10.1371/journal.pone.0225196. eCollection 2019.

DOI:10.1371/journal.pone.0225196
PMID:31738786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6860437/
Abstract

With the widespread usage of Web applications, the security issues of source code are increasing. The exposed vulnerabilities seriously endanger the interests of service providers and customers. There are some models for solving this problem. However, most of them rely on complex graphs generated from source code or regex patterns based on expert experience. In this paper, TAP, which is based on token mechanism and deep learning technology, was proposed as an analysis model to discover the vulnerabilities of PHP: Hypertext Preprocessor (PHP) Web programs conveniently and easily. Based on the token mechanism of PHP language, a custom tokenizer was designed, and it unifies tokens, supports some features of PHP and optimizes the parsing. Besides, the tokenizer also implements parameter iteration to achieve data flow analysis. On the Software Assurance Reference Dataset(SARD) and SQLI-LABS dataset, we trained the deep learning model of TAP by combining the word2vec model with Long Short-Term Memory (LSTM) network algorithm. According to the experiment on the dataset of CWE-89, TAP not only achieves the 0.9941 Area Under the Curve(AUC), which is better than other models, but also achieves the highest accuracy: 0.9787. Further, compared with RIPS, TAP shows much better in multiclass classification with 0.8319 Kappa and 0.0840 hamming distance.

摘要

随着 Web 应用程序的广泛使用,源代码的安全问题日益增多。暴露的漏洞严重危及服务提供商和客户的利益。针对这个问题已经提出了一些模型,但是大多数模型都依赖于从源代码生成的复杂图或基于专家经验的正则表达式模式。本文提出了一种基于令牌机制和深度学习技术的 TAP 分析模型,以便方便、轻松地发现 PHP: Hypertext Preprocessor (PHP) Web 程序的漏洞。基于 PHP 语言的令牌机制,设计了一个自定义的标记器,它统一了标记,支持一些 PHP 的特性并优化了解析。此外,标记器还实现了参数迭代,以实现数据流分析。在 Software Assurance Reference Dataset(SARD)和 SQLI-LABS 数据集上,我们通过结合 word2vec 模型和长短时记忆(LSTM)网络算法对 TAP 的深度学习模型进行了训练。根据 CWE-89 数据集的实验,TAP 不仅实现了 0.9941 的 AUC,优于其他模型,而且达到了最高的准确性:0.9787。此外,与 RIPS 相比,TAP 在多类分类中表现更好,kappa 值为 0.8319,汉明距离为 0.0840。

相似文献

1
TAP: A static analysis model for PHP vulnerabilities based on token and deep learning technology.TAP:一种基于令牌和深度学习技术的 PHP 漏洞静态分析模型。
PLoS One. 2019 Nov 18;14(11):e0225196. doi: 10.1371/journal.pone.0225196. eCollection 2019.
2
Securing web applications against XSS and SQLi attacks using a novel deep learning approach.使用一种新颖的深度学习方法保护Web应用程序免受跨站脚本攻击(XSS)和SQL注入攻击(SQLi)。
Sci Rep. 2024 Jan 20;14(1):1803. doi: 10.1038/s41598-023-48845-4.
3
Deep learning based searching approach for RDF graphs.基于深度学习的 RDF 图搜索方法。
PLoS One. 2020 Mar 23;15(3):e0230500. doi: 10.1371/journal.pone.0230500. eCollection 2020.
4
Comparative Analysis of Open-Source Tools for Conducting Static Code Analysis.用于进行静态代码分析的开源工具的比较分析
Sensors (Basel). 2023 Sep 19;23(18):7978. doi: 10.3390/s23187978.
5
The possibility of the combination of OCT and fundus images for improving the diagnostic accuracy of deep learning for age-related macular degeneration: a preliminary experiment.OCT 与眼底图像相结合提高深度学习诊断年龄相关性黄斑变性准确性的可能性:一项初步实验。
Med Biol Eng Comput. 2019 Mar;57(3):677-687. doi: 10.1007/s11517-018-1915-z. Epub 2018 Oct 22.
6
Direct Prediction of the Toxic Gas Diffusion Rule in a Real Environment Based on LSTM.基于 LSTM 的真实环境下有毒气体扩散规律的直接预测。
Int J Environ Res Public Health. 2019 Jun 17;16(12):2133. doi: 10.3390/ijerph16122133.
7
Fully automated detection of retinal disorders by image-based deep learning.基于图像的深度学习技术对视网膜疾病进行全自动检测。
Graefes Arch Clin Exp Ophthalmol. 2019 Mar;257(3):495-505. doi: 10.1007/s00417-018-04224-8. Epub 2019 Jan 4.
8
An Open Medical Platform to Share Source Code and Various Pre-Trained Weights for Models to Use in Deep Learning Research.一个用于分享源代码和各种预训练权重的开放式医学平台,以供深度学习研究中使用的模型使用。
Korean J Radiol. 2021 Dec;22(12):2073-2081. doi: 10.3348/kjr.2021.0170. Epub 2021 Oct 26.
9
Detection and diagnosis of dental caries using a deep learning-based convolutional neural network algorithm.基于深度学习的卷积神经网络算法在龋齿检测和诊断中的应用。
J Dent. 2018 Oct;77:106-111. doi: 10.1016/j.jdent.2018.07.015. Epub 2018 Jul 26.
10
Multimodal deep representation learning for protein interaction identification and protein family classification.基于多模态深度表示学习的蛋白质相互作用识别和蛋白质家族分类。
BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):531. doi: 10.1186/s12859-019-3084-y.

本文引用的文献

1
Face recognition: a convolutional neural-network approach.人脸识别:一种卷积神经网络方法。
IEEE Trans Neural Netw. 1997;8(1):98-113. doi: 10.1109/72.554195.
2
Framewise phoneme classification with bidirectional LSTM and other neural network architectures.使用双向长短期记忆网络和其他神经网络架构进行逐帧音素分类。
Neural Netw. 2005 Jun-Jul;18(5-6):602-10. doi: 10.1016/j.neunet.2005.06.042.
3
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.
4
The measurement of observer agreement for categorical data.分类数据观察者一致性的测量。
Biometrics. 1977 Mar;33(1):159-74.