• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于可解释人工智能(XAI)增强可用性的新冠病毒疾病(COVID-19)医学报告数据分析与机器学习模型

Data Analytics and Machine Learning Models on COVID-19 Medical Reports Enhanced with XAI for Usability.

作者信息

Lohaj Oliver, Paralič Ján, Paraličová Zuzana, Javorská Daniela, Zagorová Elena

机构信息

Department of Cybernetics and Artificial Intelligence, Faculty of Electrical Engineering and Informatics, Technical University of Kosice, Letná 9, 042 00 Košice, Slovakia.

Department of Infectology and Travel Medicine, Faculty of Medicine, L. Pasteur University Hospital, Pavol Jozef Šafárik University, Rastislavova 43, 041 90 Košice, Slovakia.

出版信息

Diagnostics (Basel). 2025 Aug 7;15(15):1981. doi: 10.3390/diagnostics15151981.

DOI:10.3390/diagnostics15151981
PMID:40804945
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12346136/
Abstract

-To identify effective data analytics and machine learning solutions that can help in the decision-making process in the medical domain and contribute to the understanding of COVID-19 disease. In this study, we analyze data from anonymized electronic medical records of 4711 patients with COVID-19 disease admitted to hospital in Atlanta. -We used random forest, LightGBM, XGBoost, CatBoost, KNN, SVM, logistic regression, and MLP neural network models in this work. The models are evaluated depending on the type of prediction by relevant metrics, especially accuracy, F1-score, and ROC AUC score. Another aim of the work was to find out which factors most affected severity and mortality risk among the patients. To identify the important features, different statistical methods were used, as well as LASSO regression, and explainable artificial intelligence (XAI) method SHAP values for model explainability. The best models were implemented in a web application and tested by medical experts. The model for prediction of mortality risk was tested on a validation cohort of 45 patients from the Department of Infectiology and Travel Medicine, L. Pasteur University Hospital in Košice (UNLP). -Our study shows that the best model for predicting COVID-19 disease severity was the LightGBM model with accuracy of 88.4% using all features and 89.5% using the eight most important features. The best model for predicting mortality risk was also the LightGBM model, which achieved a ROC AUC score of 83.7% and a classification accuracy of 81.2% using all features. Using a simplified model trained on the 15 most important features, the ROC AUC score was 83.6% and the classification accuracy was 80.5%. We deployed the simplified models for predicting COVID-19 disease severity and for predicting the risk of COVID-19-related death in a web-based application and tested them with medical experts. This test resulted in a ROC AUC score of 83.6% and an overall prediction accuracy of 73.3%.

摘要

-识别有效的数据分析和机器学习解决方案,以帮助医疗领域的决策过程,并促进对新冠病毒疾病的理解。在本研究中,我们分析了亚特兰大4711例因新冠病毒疾病住院患者的匿名电子病历数据。

-我们在这项工作中使用了随机森林、LightGBM、XGBoost、CatBoost、KNN、支持向量机(SVM)、逻辑回归和多层感知器(MLP)神经网络模型。这些模型根据预测类型通过相关指标进行评估,特别是准确率、F1分数和ROC曲线下面积(AUC)分数。这项工作的另一个目标是找出哪些因素对患者的病情严重程度和死亡风险影响最大。为了识别重要特征,我们使用了不同的统计方法,以及套索回归和用于模型可解释性的可解释人工智能(XAI)方法SHAP值。最佳模型在一个网络应用程序中实现,并由医学专家进行测试。死亡风险预测模型在来自科希策的L. 巴斯德大学医院传染病与旅行医学科的45名患者的验证队列上进行了测试。

-我们的研究表明,预测新冠病毒疾病严重程度的最佳模型是LightGBM模型,使用所有特征时准确率为88.4%,使用八个最重要特征时准确率为89.5%。预测死亡风险的最佳模型也是LightGBM模型,使用所有特征时ROC AUC分数为83.7%,分类准确率为81.2%。使用在15个最重要特征上训练的简化模型,ROC AUC分数为83.6%,分类准确率为80.5%。我们在一个基于网络的应用程序中部署了用于预测新冠病毒疾病严重程度和预测新冠病毒相关死亡风险的简化模型,并由医学专家进行测试。该测试的ROC AUC分数为83.6%,总体预测准确率为73.3%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/3a708743acbe/diagnostics-15-01981-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/f92a1de8ac82/diagnostics-15-01981-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/52d778ce2189/diagnostics-15-01981-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/2cb50bcce3b2/diagnostics-15-01981-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/7c91c72d2423/diagnostics-15-01981-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/26339af20917/diagnostics-15-01981-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/25fc261030a6/diagnostics-15-01981-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/1f79d8d850da/diagnostics-15-01981-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/ca2f38da046b/diagnostics-15-01981-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/3a708743acbe/diagnostics-15-01981-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/f92a1de8ac82/diagnostics-15-01981-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/52d778ce2189/diagnostics-15-01981-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/2cb50bcce3b2/diagnostics-15-01981-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/7c91c72d2423/diagnostics-15-01981-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/26339af20917/diagnostics-15-01981-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/25fc261030a6/diagnostics-15-01981-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/1f79d8d850da/diagnostics-15-01981-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/ca2f38da046b/diagnostics-15-01981-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fce/12346136/3a708743acbe/diagnostics-15-01981-g009.jpg

相似文献

1
Data Analytics and Machine Learning Models on COVID-19 Medical Reports Enhanced with XAI for Usability.基于可解释人工智能(XAI)增强可用性的新冠病毒疾病(COVID-19)医学报告数据分析与机器学习模型
Diagnostics (Basel). 2025 Aug 7;15(15):1981. doi: 10.3390/diagnostics15151981.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
Artificial Intelligence-Based prediction model for surgical site infection in metastatic spinal disease: a multicenter development and validation study.基于人工智能的转移性脊柱疾病手术部位感染预测模型:一项多中心开发与验证研究。
Int J Surg. 2025 Jun 27. doi: 10.1097/JS9.0000000000002806.
7
An artificial intelligence model to predict mortality among hemodialysis patients: A retrospective validated cohort study.一种预测血液透析患者死亡率的人工智能模型:一项回顾性验证队列研究。
Sci Rep. 2025 Jul 29;15(1):27699. doi: 10.1038/s41598-025-06576-8.
8
Parsimonious and explainable machine learning for predicting mortality in patients post hip fracture surgery.用于预测髋部骨折手术后患者死亡率的简约且可解释的机器学习方法。
Sci Rep. 2025 Jul 2;15(1):22922. doi: 10.1038/s41598-025-98713-6.
9
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
10
Construction and validation of HBV-ACLF bacterial infection diagnosis model based on machine learning.基于机器学习的HBV-ACLF细菌感染诊断模型的构建与验证
BMC Infect Dis. 2025 Jul 1;25(1):847. doi: 10.1186/s12879-025-11199-5.

本文引用的文献

1
Deep learning approach for early prediction of COVID-19 mortality using chest X-ray and electronic health records.基于胸部 X 光和电子健康记录的深度学习方法对 COVID-19 死亡率的早期预测。
BMC Bioinformatics. 2023 May 9;24(1):190. doi: 10.1186/s12859-023-05321-0.
2
Attitudes of Covid-19 patients toward sharing their health data: A survey-based study to understand security and privacy concerns.新冠病毒肺炎患者对共享其健康数据的态度:一项基于调查的研究,以了解安全和隐私问题。
Health Sci Rep. 2023 Feb 27;6(3):e1132. doi: 10.1002/hsr2.1132. eCollection 2023 Mar.
3
Association of statin use with outcomes of patients admitted with COVID-19: an analysis of electronic health records using superlearner.
他汀类药物使用与 COVID-19 住院患者结局的关联:使用超级学习者的电子健康记录分析。
BMC Infect Dis. 2023 Feb 24;23(1):115. doi: 10.1186/s12879-023-08026-0.
4
A Precise Method to Detect Post-COVID-19 Pulmonary Fibrosis Through Extreme Gradient Boosting.一种通过极端梯度提升检测新冠后肺纤维化的精确方法。
SN Comput Sci. 2023;4(1):89. doi: 10.1007/s42979-022-01526-x. Epub 2022 Dec 13.
5
Personalized survival probabilities for SARS-CoV-2 positive patients by explainable machine learning.基于可解释机器学习的 SARS-CoV-2 阳性患者个体化生存概率预测。
Sci Rep. 2022 Aug 16;12(1):13879. doi: 10.1038/s41598-022-17953-y.
6
Multiple Imputation of Missing Race and Ethnicity in CDC COVID-19 Case-Level Surveillance Data.美国疾病控制与预防中心新冠病毒病病例级监测数据中缺失种族和族裔信息的多重填补
Int J Stat Med Res. 2022 Jan 28;11:1-11. doi: 10.6000/1929-6029.2022.11.01.
7
A COVID-19 mortality prediction model for Korean patients using nationwide Korean disease control and prevention agency database.利用韩国全国疾病控制和预防机构数据库建立的韩国 COVID-19 死亡预测模型。
Sci Rep. 2022 Feb 28;12(1):3311. doi: 10.1038/s41598-022-07051-4.
8
Respiratory support status from EHR data for adult population: classification, heuristics, and usage in predictive modeling.从电子健康记录数据中获取成年人群的呼吸支持状态:分类、启发式方法以及在预测模型中的应用。
J Am Med Inform Assoc. 2022 Apr 13;29(5):813-821. doi: 10.1093/jamia/ocac005.
9
Image and structured data analysis for prognostication of health outcomes in patients presenting to the ED during the COVID-19 pandemic.COVID-19大流行期间急诊科患者健康结局预后的图像和结构化数据分析
Int J Med Inform. 2021 Dec 9;158:104662. doi: 10.1016/j.ijmedinf.2021.104662.
10
COVID Mortality Prediction with Machine Learning Methods: A Systematic Review and Critical Appraisal.使用机器学习方法预测 COVID 死亡率:系统评价与批判性评估
J Pers Med. 2021 Sep 7;11(9):893. doi: 10.3390/jpm11090893.