• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习技术检测恶意网络爬虫,增强患者门户和网站的安全性。

Enhancing the security of patients' portals and websites by detecting malicious web crawlers using machine learning techniques.

机构信息

Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.

Department of Computer Science, Faculty of Engineering, Payame Noor University (PNU), Iran.

出版信息

Int J Med Inform. 2019 Dec;132:103976. doi: 10.1016/j.ijmedinf.2019.103976. Epub 2019 Sep 25.

DOI:10.1016/j.ijmedinf.2019.103976
PMID:31606554
Abstract

INTRODUCTION

There is increasing demand for access to medical information via patients' portals. However, one of the challenges towards widespread utilisation of such service is maintaining the security of those portals. Recent reports show an alarming increase in cyber-attacks using crawlers. These software programs crawl web pages and are capable of executing various commands such as attacking web servers, cracking passwords, harvesting users' personal information, and testing the vulnerability of servers. The aim of this research is to develop a new effective model for detecting malicious crawlers based on their navigational behavior using machine-learning techniques.

METHOD

In this research, different methods of crawler detection were investigated. Log files of a sample of compromised web sites were analysed and the best features for the detection of crawlers were extracted. Then after testing and comparing several machine learning algorithms including Support Vector Machine (SVM), Bayesian Network and Decision Tree, the best model was developed using the most appropriate features and its accuracy was evaluated.

RESULTS

Our analysis showed the SVM-based models can yield higher accuracy (f-measure = 0.97) comparing to Bayesian Network (f-measure = 0.88) and Decision Tree (f-measure = 0.95) and artificial neural network (ANN) (f-measure = 0.87)for detecting malicious crawlers. However, extracting proper features can increase the performance of the SVM (f-measure = 0.98), the Bayesian network (f-measure = 0.94) and the Decision Tree (f-measure = 0.96) and ANN (f-measure = 0.92).

CONCLUSION

Security concerns are among the potential barriers to widespread utilisation of patient portals. Machine learning algorithms can be accurately used to detect malicious crawlers and enhance the security of sensitive patients' information. Selecting appropriate features for the development of these algorithms can remarkably increase their accuracy.

摘要

简介

人们对通过患者门户获取医学信息的需求日益增加。然而,广泛使用此类服务的挑战之一是维护这些门户的安全性。最近的报告显示,使用爬虫程序的网络攻击数量令人震惊地增加。这些软件程序可以爬取网页,并能够执行各种命令,如攻击 Web 服务器、破解密码、收集用户个人信息以及测试服务器的漏洞。本研究的目的是开发一种新的基于机器学习技术的有效模型,用于检测恶意爬虫程序基于其导航行为。

方法

本研究调查了不同的爬虫检测方法。分析了一组受影响网站的日志文件,并提取了用于检测爬虫程序的最佳特征。然后,在测试和比较了包括支持向量机 (SVM)、贝叶斯网络和决策树在内的几种机器学习算法之后,使用最合适的特征开发了最佳模型,并评估了其准确性。

结果

我们的分析表明,与贝叶斯网络 (f-measure=0.88) 和决策树 (f-measure=0.95) 以及人工神经网络 (ANN) (f-measure=0.87) 相比,基于 SVM 的模型在检测恶意爬虫程序时可以产生更高的准确性 (f-measure=0.97)。然而,提取适当的特征可以提高 SVM 的性能 (f-measure=0.98)、贝叶斯网络 (f-measure=0.94)、决策树 (f-measure=0.96) 和 ANN (f-measure=0.92)。

结论

安全问题是广泛使用患者门户的潜在障碍之一。机器学习算法可用于准确检测恶意爬虫程序,增强敏感患者信息的安全性。为开发这些算法选择适当的特征可以显著提高其准确性。

相似文献

1
Enhancing the security of patients' portals and websites by detecting malicious web crawlers using machine learning techniques.利用机器学习技术检测恶意网络爬虫,增强患者门户和网站的安全性。
Int J Med Inform. 2019 Dec;132:103976. doi: 10.1016/j.ijmedinf.2019.103976. Epub 2019 Sep 25.
2
Cyber Threat Intelligence-Based Malicious URL Detection Model Using Ensemble Learning.基于网络威胁情报的集成学习恶意 URL 检测模型。
Sensors (Basel). 2022 Apr 28;22(9):3373. doi: 10.3390/s22093373.
3
TKRD: Trusted kernel rootkit detection for cybersecurity of VMs based on machine learning and memory forensic analysis.基于机器学习和内存取证分析的虚拟机可信内核 rootkit 检测技术用于网络安全
Math Biosci Eng. 2019 Mar 26;16(4):2650-2667. doi: 10.3934/mbe.2019132.
4
Detecting phishing websites using machine learning technique.利用机器学习技术检测钓鱼网站。
PLoS One. 2021 Oct 11;16(10):e0258361. doi: 10.1371/journal.pone.0258361. eCollection 2021.
5
Detecting Brain Tumor using Machines Learning Techniques Based on Different Features Extracting Strategies.基于不同特征提取策略,使用机器学习技术检测脑肿瘤。
Curr Med Imaging Rev. 2019;15(6):595-606. doi: 10.2174/1573405614666180718123533.
6
Application of supervised machine learning algorithms in the classification of sagittal gait patterns of cerebral palsy children with spastic diplegia.监督机器学习算法在痉挛性双瘫脑瘫儿童矢状面步态模式分类中的应用。
Comput Biol Med. 2019 Mar;106:33-39. doi: 10.1016/j.compbiomed.2019.01.009. Epub 2019 Jan 16.
7
Fetal health status prediction based on maternal clinical history using machine learning techniques.基于机器学习技术的基于产妇临床史的胎儿健康状况预测。
Comput Methods Programs Biomed. 2018 Sep;163:87-100. doi: 10.1016/j.cmpb.2018.06.010. Epub 2018 Jun 14.
8
Prostate cancer detection using machine learning techniques by employing combination of features extracting strategies.基于特征提取策略组合的机器学习技术在前列腺癌检测中的应用
Cancer Biomark. 2018 Feb 6;21(2):393-413. doi: 10.3233/CBM-170643.
9
Seminal quality prediction using data mining methods.使用数据挖掘方法进行精液质量预测。
Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.
10
An investigation and comparison of machine learning approaches for intrusion detection in IoMT network.物联网医疗网络中入侵检测的机器学习方法研究与比较
J Supercomput. 2022;78(15):17403-17422. doi: 10.1007/s11227-022-04568-3. Epub 2022 May 18.

引用本文的文献

1
Enhancing health care through medical cognitive virtual agents.通过医学认知虚拟代理提升医疗保健水平。
Digit Health. 2024 Aug 19;10:20552076241256732. doi: 10.1177/20552076241256732. eCollection 2024 Jan-Dec.
2
Security Risk Assessment for Patient Portals of Hospitals: A Case Study of Taiwan.医院患者门户网站的安全风险评估:以台湾地区为例
Risk Manag Healthc Policy. 2024 Jun 18;17:1647-1656. doi: 10.2147/RMHP.S463408. eCollection 2024.
3
Tourists' Health Risk Threats Amid COVID-19 Era: Role of Technology Innovation, Transformation, and Recovery Implications for Sustainable Tourism.
新冠疫情时代游客的健康风险威胁:技术创新、转型及复苏对可持续旅游的影响
Front Psychol. 2022 Apr 7;12:769175. doi: 10.3389/fpsyg.2021.769175. eCollection 2021.
4
Machine and cognitive intelligence for human health: systematic review.用于人类健康的机器与认知智能:系统综述
Brain Inform. 2022 Feb 12;9(1):5. doi: 10.1186/s40708-022-00153-9.
5
Religious Affiliation, Daily Spirituals, and Private Religious Factors Promote Marital Commitment Among Married Couples: Does Religiosity Help People Amid the COVID-19 Crisis?宗教信仰、日常宗教活动及个人宗教因素促进已婚夫妇的婚姻承诺:在新冠疫情危机中宗教虔诚能帮助人们吗?
Front Psychol. 2021 Aug 6;12:657400. doi: 10.3389/fpsyg.2021.657400. eCollection 2021.
6
The Role of Social Media in the Advent of COVID-19 Pandemic: Crisis Management, Mental Health Challenges and Implications.社交媒体在新冠疫情出现中的作用:危机管理、心理健康挑战及影响
Risk Manag Healthc Policy. 2021 May 12;14:1917-1932. doi: 10.2147/RMHP.S284313. eCollection 2021.