• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于异常的智能健康中的威胁检测:机器学习方法

Anomaly-based threat detection in smart health using machine learning.

机构信息

Department of Computer Science, Bahria University, Islamabad, Pakistan.

Department of Information Systems and Technology, Collage of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia.

出版信息

BMC Med Inform Decis Mak. 2024 Nov 19;24(1):347. doi: 10.1186/s12911-024-02760-4.

DOI:10.1186/s12911-024-02760-4
PMID:39563355
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11577804/
Abstract

BACKGROUND

Anomaly detection is crucial in healthcare data due to challenges associated with the integration of smart technologies and healthcare. Anomaly in electronic health record can be associated with an insider trying to access and manipulate the data. This article focuses around the anomalies under different contexts.

METHODOLOGY

This research has proposed methodology to secure Electronic Health Records (EHRs) within a complex environment. We have employed a systematic approach encompassing data preprocessing, labeling, modeling, and evaluation. Anomalies are not labelled thus a mechanism is required that predicts them with greater accuracy and less false positive results. This research utilized unsupervised machine learning algorithms that includes Isolation Forest and Local Outlier Factor clustering algorithms. By calculating anomaly scores and validating clustering through metrics like the Silhouette Score and Dunn Score, we enhanced the capacity to secure sensitive healthcare data evolving digital threats. Three variations of Isolation Forest (IForest)models (SVM, Decision Tree, and Random Forest) and three variations of Local Outlier Factor (LOF) models (SVM, Decision Tree, and Random Forest) are evaluated based on accuracy, sensitivity, specificity, and F1 Score.

RESULTS

Isolation Forest SVM achieves the highest accuracy of 99.21%, high sensitivity (99.75%) and specificity (99.32%), and a commendable F1 Score of 98.72%. The Isolation Forest Decision Tree also performs well with an accuracy of 98.92% and an F1 Score of 99.35%. However, the Isolation Forest Random Forest exhibits lower specificity (72.84%) than the other models.

CONCLUSION

The experimental results reveal that Isolation Forest SVM emerges as the top performer showcasing the effectiveness of these models in anomaly detection tasks. The proposed methodology utilizing isolation forest and SVM produced better results by detecting anomalies with less false positives in this specific EHR of a hospital in North England. Furthermore the proposal is also able to identify new contextual anomalies that were not identified in the baseline methodology.

摘要

背景

由于与智能技术和医疗保健集成相关的挑战,异常检测在医疗保健数据中至关重要。电子健康记录中的异常可能与试图访问和操纵数据的内部人员有关。本文重点介绍了不同上下文中的异常情况。

方法

本研究提出了一种在复杂环境中保护电子健康记录 (EHR) 的方法。我们采用了一种系统的方法,包括数据预处理、标记、建模和评估。异常没有标记,因此需要一种机制来更准确地预测它们,并减少假阳性结果。本研究利用了包括隔离森林和局部离群因子聚类算法在内的无监督机器学习算法。通过计算异常分数并通过轮廓分数和 Dunn 分数等指标验证聚类,我们提高了应对数字威胁下不断发展的敏感医疗保健数据的安全性。基于准确性、敏感性、特异性和 F1 分数,评估了三种隔离森林 (IForest) 模型(SVM、决策树和随机森林)和三种局部离群因子 (LOF) 模型(SVM、决策树和随机森林)的变体。

结果

隔离森林 SVM 实现了最高的准确性 99.21%、高灵敏度(99.75%)和特异性(99.32%)以及令人赞赏的 F1 分数 98.72%。隔离森林决策树的准确性也达到了 98.92%,F1 分数为 99.35%。然而,隔离森林随机森林的特异性(72.84%)低于其他模型。

结论

实验结果表明,隔离森林 SVM 表现最佳,展示了这些模型在异常检测任务中的有效性。本研究提出的方法利用隔离森林和 SVM,在英格兰北部一家医院的特定 EHR 中,通过更少的假阳性来检测异常,从而产生了更好的结果。此外,该方法还能够识别基线方法未识别的新上下文异常。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/6a0bd553f0bd/12911_2024_2760_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/5d5cc72d0e3b/12911_2024_2760_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/cae92cfe021c/12911_2024_2760_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/f24ee8fe6b92/12911_2024_2760_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/8373e465b555/12911_2024_2760_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/917351b6382d/12911_2024_2760_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/0e63388d4aa8/12911_2024_2760_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/4083e4200c60/12911_2024_2760_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/65b886408e8a/12911_2024_2760_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/52545b3386c3/12911_2024_2760_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/4f4e21ff64bc/12911_2024_2760_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/3298e8ccf413/12911_2024_2760_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/1d4ceb3a233f/12911_2024_2760_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/c2d4c3a78e11/12911_2024_2760_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/6a0bd553f0bd/12911_2024_2760_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/5d5cc72d0e3b/12911_2024_2760_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/cae92cfe021c/12911_2024_2760_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/f24ee8fe6b92/12911_2024_2760_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/8373e465b555/12911_2024_2760_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/917351b6382d/12911_2024_2760_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/0e63388d4aa8/12911_2024_2760_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/4083e4200c60/12911_2024_2760_Figc_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/65b886408e8a/12911_2024_2760_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/52545b3386c3/12911_2024_2760_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/4f4e21ff64bc/12911_2024_2760_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/3298e8ccf413/12911_2024_2760_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/1d4ceb3a233f/12911_2024_2760_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/c2d4c3a78e11/12911_2024_2760_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76b6/11577804/6a0bd553f0bd/12911_2024_2760_Fig11_HTML.jpg

相似文献

1
Anomaly-based threat detection in smart health using machine learning.基于异常的智能健康中的威胁检测:机器学习方法
BMC Med Inform Decis Mak. 2024 Nov 19;24(1):347. doi: 10.1186/s12911-024-02760-4.
2
Anomaly Detection in Biological Early Warning Systems Using Unsupervised Machine Learning.基于无监督机器学习的生物预警系统异常检测
Sensors (Basel). 2023 Mar 1;23(5):2687. doi: 10.3390/s23052687.
3
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
4
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
5
A clustering approach for detecting implausible observation values in electronic health records data.一种用于检测电子健康记录数据中不合理观测值的聚类方法。
BMC Med Inform Decis Mak. 2019 Jul 23;19(1):142. doi: 10.1186/s12911-019-0852-6.
6
Machine Learning-Based Ensemble Classifiers for Anomaly Handling in Smart Home Energy Consumption Data.基于机器学习的集成分类器在智能家居能耗数据中的异常处理。
Sensors (Basel). 2022 Nov 30;22(23):9323. doi: 10.3390/s22239323.
7
Hybrid computing framework security in dynamic offloading for IoT-enabled smart home system.支持物联网的智能家居系统动态卸载中的混合计算框架安全
PeerJ Comput Sci. 2024 Aug 23;10:e2211. doi: 10.7717/peerj-cs.2211. eCollection 2024.
8
Towards Reliability in Smart Water Sensing Technology: Evaluating Classical Machine Learning Models for Outlier Detection.迈向智能水传感技术的可靠性:评估用于异常检测的经典机器学习模型
Sensors (Basel). 2024 Jun 24;24(13):4084. doi: 10.3390/s24134084.
9
Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.基于 FHIR 的电子健康记录表型框架的开发:以从出院小结中识别肥胖且伴有多种合并症的患者为例。
J Biomed Inform. 2019 Nov;99:103310. doi: 10.1016/j.jbi.2019.103310. Epub 2019 Oct 14.
10
Soft Clustering for Enhancing the Diagnosis of Chronic Diseases over Machine Learning Algorithms.基于机器学习算法的软聚类在慢性病诊断中的应用。
J Healthc Eng. 2020 Mar 9;2020:4984967. doi: 10.1155/2020/4984967. eCollection 2020.

引用本文的文献

1
A Unified Deep Learning Ensemble Framework for Voice-Based Parkinson's Disease Detection and Motor Severity Prediction.一种用于基于语音的帕金森病检测和运动严重程度预测的统一深度学习集成框架。
Bioengineering (Basel). 2025 Jun 27;12(7):699. doi: 10.3390/bioengineering12070699.
2
Aging With Artificial Intelligence: How Technology Enhances Older Adults' Health and Independence.借助人工智能实现老龄化:科技如何提升老年人的健康水平与独立性。
J Gerontol A Biol Sci Med Sci. 2025 Jun 10;80(7). doi: 10.1093/gerona/glaf086.

本文引用的文献

1
Healthcare insurance fraud detection using data mining.利用数据挖掘进行医疗保险欺诈检测。
BMC Med Inform Decis Mak. 2024 Apr 26;24(1):112. doi: 10.1186/s12911-024-02512-4.
2
Benefits and barriers associated with the use of smart home health technologies in the care of older persons: a systematic review.智能家居健康技术在老年人护理中的应用的益处和障碍:系统评价。
BMC Geriatr. 2024 Feb 14;24(1):152. doi: 10.1186/s12877-024-04702-1.
3
EHR-BERT: A BERT-based model for effective anomaly detection in electronic health records.EHR-BERT:一种基于 BERT 的电子健康记录中有效异常检测模型。
J Biomed Inform. 2024 Feb;150:104605. doi: 10.1016/j.jbi.2024.104605. Epub 2024 Feb 6.
4
STIF: Intuitionistic fuzzy Gaussian membership function with statistical transformation weight of evidence and information value for private information preservation.STIF:具有统计变换证据权重和信息价值的直觉模糊高斯隶属函数,用于保护隐私信息。
Distrib Parallel Databases. 2023 Apr 21:1-34. doi: 10.1007/s10619-023-07423-3.
5
Unsupervised anomaly detection of implausible electronic health records: a real-world evaluation in cancer registries.无监督的电子健康记录异常检测:癌症登记处的真实世界评估。
BMC Med Res Methodol. 2023 May 24;23(1):125. doi: 10.1186/s12874-023-01946-0.
6
The anatomy of electronic patient record ethics: a framework to guide design, development, implementation, and use.电子病历伦理剖析:指导设计、开发、实施及使用的框架
BMC Med Ethics. 2021 Feb 4;22(1):9. doi: 10.1186/s12910-021-00574-x.
7
Secure and Robust Machine Learning for Healthcare: A Survey.用于医疗保健的安全可靠机器学习:一项综述。
IEEE Rev Biomed Eng. 2021;14:156-180. doi: 10.1109/RBME.2020.3013489. Epub 2021 Jan 22.
8
The design and development of technology platforms in a developing country healthcare context from an ecosystem perspective.从生态系统的角度来看发展中国家医疗保健背景下的技术平台的设计和开发。
BMC Med Inform Decis Mak. 2020 Mar 12;20(1):55. doi: 10.1186/s12911-020-1028-0.
9
FilterK: A new outlier detection method for k-means clustering of physical activity.FilterK:一种用于身体活动 k 均值聚类的新异常值检测方法。
J Biomed Inform. 2020 Apr;104:103397. doi: 10.1016/j.jbi.2020.103397. Epub 2020 Feb 26.
10
Choosing Wisely clinical decision support adherence and associated inpatient outcomes.明智选择临床决策支持的依从性及其与住院患者结局的关联。
Am J Manag Care. 2018 Aug;24(8):361-366.