• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用RNA测序、临床和合并症数据预测COVID-19严重程度的机器学习模型。

A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data.

作者信息

Sethi Sahil, Shakyawar Sushil, Reddy Athreya S, Patel Jai Chand, Guda Chittibabu

机构信息

Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE 68105, USA.

Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA.

出版信息

Diagnostics (Basel). 2024 Jun 18;14(12):1284. doi: 10.3390/diagnostics14121284.

DOI:10.3390/diagnostics14121284
PMID:38928699
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11202902/
Abstract

The premise for this study emanated from the need to understand SARS-CoV-2 infections at the molecular level and to develop predictive tools for managing COVID-19 severity. With the varied clinical outcomes observed among infected individuals, creating a reliable machine learning (ML) model for predicting the severity of COVID-19 became paramount. Despite the availability of large-scale genomic and clinical data, previous studies have not effectively utilized multi-modality data for disease severity prediction using data-driven approaches. Our primary goal is to predict COVID-19 severity using a machine-learning model trained on a combination of patients' gene expression, clinical features, and co-morbidity data. Employing various ML algorithms, including Logistic Regression (LR), XGBoost (XG), Naïve Bayes (NB), and Support Vector Machine (SVM), alongside feature selection methods, we sought to identify the best-performing model for disease severity prediction. The results highlighted XG as the superior classifier, with 95% accuracy and a 0.99 AUC (Area Under the Curve), for distinguishing severity groups. Additionally, the SHAP analysis revealed vital features contributing to prediction, including several genes such as COX14, LAMB2, DOLK, SDCBP2, RHBDL1, and IER3-AS1. Notably, two clinical features, the absolute neutrophil count and Viremia Categories, emerged as top contributors. Integrating multiple data modalities has significantly improved the accuracy of disease severity prediction compared to using any single modality. The identified features could serve as biomarkers for COVID-19 prognosis and patient care, allowing clinicians to optimize treatment strategies and refine clinical decision-making processes for enhanced patient outcomes.

摘要

本研究的前提源于在分子水平上了解新型冠状病毒2(SARS-CoV-2)感染以及开发用于管理新冠肺炎严重程度的预测工具的需求。鉴于在受感染个体中观察到的临床结果各异,创建一个可靠的用于预测新冠肺炎严重程度的机器学习(ML)模型变得至关重要。尽管有大规模的基因组和临床数据,但先前的研究尚未有效地利用多模态数据,通过数据驱动的方法进行疾病严重程度预测。我们的主要目标是使用一个基于患者基因表达、临床特征和合并症数据组合训练的机器学习模型来预测新冠肺炎的严重程度。我们采用了各种ML算法,包括逻辑回归(LR)、极端梯度提升(XG)、朴素贝叶斯(NB)和支持向量机(SVM),并结合特征选择方法,试图找出用于疾病严重程度预测的性能最佳的模型。结果表明,XG是区分严重程度组的 superior 分类器,准确率为95%,曲线下面积(AUC)为0.99。此外,SHAP分析揭示了对预测有重要贡献的特征,包括COX14、LAMB2、DOLK、SDCBP2、RHBDL1和IER3-AS1等几个基因。值得注意的是,两个临床特征,即绝对中性粒细胞计数和病毒血症类别,成为主要贡献因素。与使用任何单一模态相比,整合多种数据模态显著提高了疾病严重程度预测的准确性。所确定的特征可作为新冠肺炎预后和患者护理的生物标志物,使临床医生能够优化治疗策略并完善临床决策过程,以改善患者预后。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/b050e2efaf09/diagnostics-14-01284-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/ba51e99d9003/diagnostics-14-01284-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/e952561ba181/diagnostics-14-01284-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/38cb69dd63bd/diagnostics-14-01284-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/211a7889775e/diagnostics-14-01284-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/ab9af176a2ef/diagnostics-14-01284-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/b050e2efaf09/diagnostics-14-01284-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/ba51e99d9003/diagnostics-14-01284-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/e952561ba181/diagnostics-14-01284-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/38cb69dd63bd/diagnostics-14-01284-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/211a7889775e/diagnostics-14-01284-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/ab9af176a2ef/diagnostics-14-01284-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/30ab/11202902/b050e2efaf09/diagnostics-14-01284-g006.jpg

相似文献

1
A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data.一种使用RNA测序、临床和合并症数据预测COVID-19严重程度的机器学习模型。
Diagnostics (Basel). 2024 Jun 18;14(12):1284. doi: 10.3390/diagnostics14121284.
2
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
3
Comparing different machine learning techniques for predicting COVID-19 severity.比较不同机器学习技术预测 COVID-19 严重程度的效果。
Infect Dis Poverty. 2022 Feb 17;11(1):19. doi: 10.1186/s40249-022-00946-4.
4
Interpretable machine-learning model for Predicting the Convalescent COVID-19 patients with pulmonary diffusing capacity impairment.用于预测具有肺弥散能力障碍的康复期 COVID-19 患者的可解释机器学习模型。
BMC Med Inform Decis Mak. 2023 Aug 29;23(1):169. doi: 10.1186/s12911-023-02192-6.
5
Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data.比较机器学习算法,使用包含胸部计算机断层扫描严重程度评分数据的数据集来预测 COVID-19 死亡率。
Sci Rep. 2023 Jul 13;13(1):11343. doi: 10.1038/s41598-023-38133-6.
6
Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study.使用多预后指标领域评分、临床数据和机器学习提高老年住院患者 12 个月死亡率风险预测:前瞻性队列研究。
J Med Internet Res. 2021 Jun 21;23(6):e26139. doi: 10.2196/26139.
7
COVID-Net Biochem: an explainability-driven framework to building machine learning models for predicting survival and kidney injury of COVID-19 patients from clinical and biochemistry data.COVID-Net 生化:一个基于可解释性的框架,用于构建基于临床和生化数据预测 COVID-19 患者生存和肾脏损伤的机器学习模型。
Sci Rep. 2023 Oct 9;13(1):17001. doi: 10.1038/s41598-023-42203-0.
8
Development and Validation of a Robust and Interpretable Early Triaging Support System for Patients Hospitalized With COVID-19: Predictive Algorithm Modeling and Interpretation Study.开发和验证用于 COVID-19 住院患者的强大且可解释的早期分诊支持系统:预测算法建模和解释研究。
J Med Internet Res. 2024 Jan 11;26:e52134. doi: 10.2196/52134.
9
Non-Contrasted CT Radiomics for SAH Prognosis Prediction.用于蛛网膜下腔出血预后预测的非增强CT影像组学
Bioengineering (Basel). 2023 Aug 16;10(8):967. doi: 10.3390/bioengineering10080967.
10
Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review.COVID-19大流行期间临床护理中的人工智能:一项系统综述。
Comput Struct Biotechnol J. 2021;19:2833-2850. doi: 10.1016/j.csbj.2021.05.010. Epub 2021 May 7.

引用本文的文献

1
Predicting COVID-19 severity in pediatric patients using machine learning: a comparative analysis of algorithms and ensemble methods.使用机器学习预测儿科患者的COVID-19严重程度:算法与集成方法的比较分析
Sci Rep. 2025 Aug 8;15(1):29118. doi: 10.1038/s41598-025-15366-1.
2
COVID-19 risk stratification among older adults: a machine learning approach to identify personal and health-related risk factors.老年人中的新冠病毒疾病风险分层:一种用于识别个人及健康相关风险因素的机器学习方法
BMC Public Health. 2025 Jul 29;25(1):2577. doi: 10.1186/s12889-025-23862-2.

本文引用的文献

1
Evaluating methods for risk prediction of Covid-19 mortality in nursing home residents before and after vaccine availability: a retrospective cohort study.评估疫苗供应前后养老院居民 COVID-19 死亡率风险预测方法的回顾性队列研究。
BMC Med Res Methodol. 2024 Mar 27;24(1):77. doi: 10.1186/s12874-024-02189-3.
2
Logistic regression and other statistical tools in diagnostic biomarker studies.逻辑回归和其他统计工具在诊断生物标志物研究中的应用。
Clin Transl Oncol. 2024 Sep;26(9):2172-2180. doi: 10.1007/s12094-024-03413-8. Epub 2024 Mar 26.
3
XGBoost in the Prediction of 28-Day Mortality in Critical Elderly Patients with Hip Fracture: A MIMIC-IV Cohort Study.
基于 MIMIC-IV 队列研究的 XGBoost 对髋部骨折重症老年患者 28 天死亡率的预测
Altern Ther Health Med. 2024 Sep;30(9):432-436.
4
Application of artificial intelligence (AI) to control COVID-19 pandemic: Current status and future prospects.人工智能在控制新冠疫情中的应用:现状与未来前景
Heliyon. 2024 Feb 9;10(4):e25754. doi: 10.1016/j.heliyon.2024.e25754. eCollection 2024 Feb 29.
5
Augmentation strategies for an imbalanced learning problem on a novel COVID-19 severity dataset.针对新型 COVID-19 严重程度数据集的不平衡学习问题的增强策略。
Sci Rep. 2023 Oct 25;13(1):18299. doi: 10.1038/s41598-023-45532-2.
6
[[Fundamentals] 5. Python+scikit-learn for Machine Learning in Medical Imaging].[基础篇] 5. 医学成像中用于机器学习的Python+scikit-learn
Nihon Hoshasen Gijutsu Gakkai Zasshi. 2023;79(10):1189-1193. doi: 10.6009/jjrt.2023-2266.
7
Hematological alterations associated with long COVID-19.与新冠长期症状相关的血液学改变。
Front Physiol. 2023 Jul 24;14:1203472. doi: 10.3389/fphys.2023.1203472. eCollection 2023.
8
Analysis of Differential Gene Expression and Core Canonical Pathways Involved in the Epithelial to Mesenchymal Transition of Triple Negative Breast Cancer Cells by Ingenuity Pathway Analysis.通过 Ingenuity 通路分析对三阴性乳腺癌细胞上皮-间质转化中涉及的差异基因表达和核心经典通路进行分析。
Comput Mol Biosci. 2023 Jun;13(2):21-34. doi: 10.4236/cmb.2023.132002. Epub 2023 May 19.
9
Neutrophils in COVID-19: recent insights and advances.新型冠状病毒肺炎中的中性粒细胞:最新的认识和进展。
Virol J. 2023 Aug 2;20(1):169. doi: 10.1186/s12985-023-02116-w.
10
Computational identification of differentially-expressed genes as suggested novel COVID-19 biomarkers: A bioinformatics analysis of expression profiles.作为新型冠状病毒肺炎潜在生物标志物的差异表达基因的计算鉴定:表达谱的生物信息学分析
Comput Struct Biotechnol J. 2023;21:3339-3354. doi: 10.1016/j.csbj.2023.06.007. Epub 2023 Jun 12.