• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

COVIDOUTCOME——基于严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组中的突变特征估计新冠严重程度。

COVIDOUTCOME-estimating COVID severity based on mutation signatures in the SARS-CoV-2 genome.

作者信息

Nagy Ádám, Ligeti Balázs, Szebeni János, Pongor Sándor, Gyrffy Balázs

机构信息

Department of Bioinformatics, Semmelweis University, u 7-9, Tűzoltó, Budapest H-1094, Hungary.

TTK Momentum Cancer Biomarker Research Group, 2, Magyar tudósok körútja, Budapest H-1117, Hungary.

出版信息

Database (Oxford). 2021 May 8;2021. doi: 10.1093/database/baab020.

DOI:10.1093/database/baab020
PMID:33963845
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8106440/
Abstract

Numerous studies demonstrate frequent mutations in the genome of SARS-CoV-2. Our goal was to statistically link mutations to severe disease outcome. We used an automated machine learning approach where 1594 viral genomes with available clinical follow-up data were used as the training set (797 'severe' and 797 'mild'). The best algorithm, based on random forest classification combined with the LASSO feature selection algorithm, was employed to the training set to link mutation signatures and outcome. The performance of the final model was estimated by repeated, stratified, 10-fold cross validation (CV) and then adjusted for multiple testing with Bootstrap Bias Corrected CV. We identified 26 protein and Untranslated Region (UTR) mutations significantly linked to severe outcome. The best classification algorithm uses a mutation signature of 22 mutations as well as the patient's age as the input and shows high classification efficiency with an area under the curve (AUC) of 0.94 [confidence interval (CI): [0.912, 0.962]] and a prediction accuracy of 87% (CI: [0.830, 0.903]). Finally, we established an online platform (https://covidoutcome.com/) that is capable to use a viral sequence and the patient's age as the input and provides a percentage estimation of disease severity. We demonstrate a statistical association between mutation signatures of SARS-CoV-2 and severe outcome of COVID-19. The established analysis platform enables a real-time analysis of new viral genomes.

摘要

大量研究表明,严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因组中存在频繁突变。我们的目标是从统计学角度将突变与严重疾病结局联系起来。我们采用了一种自动化机器学习方法,将1594个具有可用临床随访数据的病毒基因组用作训练集(797个“重症”和797个“轻症”)。基于随机森林分类与套索特征选择算法相结合的最佳算法应用于训练集,以关联突变特征与结局。通过重复分层10折交叉验证(CV)评估最终模型的性能,然后使用Bootstrap偏差校正CV对多重检验进行调整。我们鉴定出26个与严重结局显著相关的蛋白质和非翻译区(UTR)突变。最佳分类算法使用22个突变的突变特征以及患者年龄作为输入,曲线下面积(AUC)为0.94[置信区间(CI):[0.912,

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9f9/8106440/59981355aaf1/baab020f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9f9/8106440/a1eb0b3d5217/baab020f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9f9/8106440/59981355aaf1/baab020f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9f9/8106440/a1eb0b3d5217/baab020f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9f9/8106440/59981355aaf1/baab020f2.jpg

相似文献

1
COVIDOUTCOME-estimating COVID severity based on mutation signatures in the SARS-CoV-2 genome.COVIDOUTCOME——基于严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组中的突变特征估计新冠严重程度。
Database (Oxford). 2021 May 8;2021. doi: 10.1093/database/baab020.
2
Can machines learn the mutation signatures of SARS-CoV-2 and enable viral-genotype guided predictive prognosis?机器能否学习 SARS-CoV-2 的突变特征,并实现基于病毒基因型的预测预后?
J Mol Biol. 2022 Aug 15;434(15):167684. doi: 10.1016/j.jmb.2022.167684. Epub 2022 Jun 11.
3
Profiling SARS-CoV-2 mutation fingerprints that range from the viral pangenome to individual infection quasispecies.分析 SARS-CoV-2 突变指纹,范围从病毒泛基因组到个体感染准种。
Genome Med. 2021 Apr 19;13(1):62. doi: 10.1186/s13073-021-00882-2.
4
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
5
Association between prognostic factors and the outcomes of patients infected with SARS-CoV-2 harboring multiple spike protein mutations.与携带多个刺突蛋白突变的 SARS-CoV-2 感染患者预后因素及结局的关系。
Sci Rep. 2021 Nov 1;11(1):21352. doi: 10.1038/s41598-021-00459-4.
6
Automated machine learning optimizes and accelerates predictive modeling from COVID-19 high throughput datasets.自动化机器学习优化并加速了从 COVID-19 高通量数据集中进行预测建模的过程。
Sci Rep. 2021 Jul 23;11(1):15107. doi: 10.1038/s41598-021-94501-0.
7
Genome sequencing of SARS-CoV-2 in a cohort of Egyptian patients revealed mutation hotspots that are related to clinical outcomes.对埃及患者队列中的 SARS-CoV-2 进行基因组测序揭示了与临床结局相关的突变热点。
Biochim Biophys Acta Mol Basis Dis. 2021 Aug 1;1867(8):166154. doi: 10.1016/j.bbadis.2021.166154. Epub 2021 Apr 28.
8
The role of microRNAs in modulating SARS-CoV-2 infection in human cells: a systematic review.microRNAs 在调节人类细胞中 SARS-CoV-2 感染中的作用:系统评价。
Infect Genet Evol. 2021 Jul;91:104832. doi: 10.1016/j.meegid.2021.104832. Epub 2021 Apr 1.
9
Machine learning-based approach KEVOLVE efficiently identifies SARS-CoV-2 variant-specific genomic signatures.基于机器学习的方法 KEVOLVE 能够有效地识别 SARS-CoV-2 变异特异性基因组特征。
PLoS One. 2024 Jan 19;19(1):e0296627. doi: 10.1371/journal.pone.0296627. eCollection 2024.
10
Feature selection for effective prediction of SARS-COV-2 using machine learning.基于机器学习的 SARS-CoV-2 有效预测的特征选择。
Genes Genomics. 2024 Mar;46(3):341-354. doi: 10.1007/s13258-023-01467-6. Epub 2023 Nov 20.

引用本文的文献

1
Identification of patient demographic, clinical, and SARS-CoV-2 genomic factors associated with severe COVID-19 using supervised machine learning: a retrospective multicenter study.使用监督式机器学习识别与重症新型冠状病毒肺炎相关的患者人口统计学、临床和严重急性呼吸综合征冠状病毒2基因组因素:一项回顾性多中心研究
BMC Infect Dis. 2025 Jan 28;25(1):132. doi: 10.1186/s12879-025-10450-3.
2
Enhanced predictability and interpretability of COVID-19 severity based on SARS-CoV-2 genomic diversity: a comprehensive study encompassing four years of data.基于 SARS-CoV-2 基因组多样性的 COVID-19 严重程度的可预测性和可解释性增强:一项涵盖四年数据的综合研究。
Sci Rep. 2024 Nov 6;14(1):26992. doi: 10.1038/s41598-024-78493-1.
3

本文引用的文献

1
Just Add Data: automated predictive modeling for knowledge discovery and feature selection.只需添加数据:用于知识发现和特征选择的自动预测建模
NPJ Precis Oncol. 2022 Jun 16;6(1):38. doi: 10.1038/s41698-022-00274-8.
2
Spread of a SARS-CoV-2 variant through Europe in the summer of 2020.2020 年夏 SARS-CoV-2 变异株在欧洲的传播。
Nature. 2021 Jul;595(7869):707-712. doi: 10.1038/s41586-021-03677-y. Epub 2021 Jun 7.
3
Hospitalization and mortality associated with SARS-CoV-2 viral clades in COVID-19.与 COVID-19 中 SARS-CoV-2 病毒亚群相关的住院和死亡。
Prediction models for COVID-19 disease outcomes.
用于预测 COVID-19 疾病结局的模型。
Emerg Microbes Infect. 2024 Dec;13(1):2361791. doi: 10.1080/22221751.2024.2361791. Epub 2024 Jun 14.
4
Applying the digital data and the bioinformatics tools in SARS-CoV-2 research.将数字数据和生物信息学工具应用于2019冠状病毒病研究。
Comput Struct Biotechnol J. 2023 Oct 1;21:4697-4705. doi: 10.1016/j.csbj.2023.09.044. eCollection 2023.
5
Automated machine learning for genome wide association studies.基于自动化机器学习的全基因组关联研究。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad545.
6
Interpretable and Predictive Deep Neural Network Modeling of the SARS-CoV-2 Spike Protein Sequence to Predict COVID-19 Disease Severity.用于预测COVID-19疾病严重程度的SARS-CoV-2刺突蛋白序列的可解释性和预测性深度神经网络建模
Biology (Basel). 2022 Dec 8;11(12):1786. doi: 10.3390/biology11121786.
7
Prediction of Recurrent Mutations in SARS-CoV-2 Using Artificial Neural Networks.利用人工神经网络预测 SARS-CoV-2 的复发性突变。
Int J Mol Sci. 2022 Nov 24;23(23):14683. doi: 10.3390/ijms232314683.
8
Identification and classification of coronavirus genomic signals based on linear predictive coding and machine learning methods.基于线性预测编码和机器学习方法的冠状病毒基因组信号识别与分类
Biomed Signal Process Control. 2023 Feb;80:104192. doi: 10.1016/j.bspc.2022.104192. Epub 2022 Sep 23.
9
Identifying COVID-19 Severity-Related SARS-CoV-2 Mutation Using a Machine Learning Method.使用机器学习方法识别与新冠病毒疾病严重程度相关的严重急性呼吸综合征冠状病毒2突变
Life (Basel). 2022 May 28;12(6):806. doi: 10.3390/life12060806.
10
SARS-CoV-2 Genome-Based Severity Predictions Correspond to Lower qPCR Values and Higher Viral Load.基于 SARS-CoV-2 基因组的严重程度预测与较低的 qPCR 值和较高的病毒载量相对应。
Glob Health Epidemiol Genom. 2022 May 31;2022:6499217. doi: 10.1155/2022/6499217. eCollection 2022.
Sci Rep. 2021 Feb 26;11(1):4802. doi: 10.1038/s41598-021-82850-9.
4
Different mutations in SARS-CoV-2 associate with severe and mild outcome.SARS-CoV-2 的不同突变与严重和轻度结果相关。
Int J Antimicrob Agents. 2021 Feb;57(2):106272. doi: 10.1016/j.ijantimicag.2020.106272. Epub 2020 Dec 23.
5
Spike mutation D614G alters SARS-CoV-2 fitness.刺突突变 D614G 改变了 SARS-CoV-2 的适应性。
Nature. 2021 Apr;592(7852):116-121. doi: 10.1038/s41586-020-2895-3. Epub 2020 Oct 26.
6
Global cataloguing of variations in untranslated regions of viral genome and prediction of key host RNA binding protein-microRNA interactions modulating genome stability in SARS-CoV-2.对病毒基因组非翻译区变异的全球编目,以及预测关键宿主 RNA 结合蛋白-微小 RNA 相互作用,调节 SARS-CoV-2 基因组稳定性。
PLoS One. 2020 Aug 11;15(8):e0237559. doi: 10.1371/journal.pone.0237559. eCollection 2020.
7
SARS-CoV-2 genomic variations associated with mortality rate of COVID-19.SARS-CoV-2 基因组变异与 COVID-19 死亡率的关系。
J Hum Genet. 2020 Dec;65(12):1075-1082. doi: 10.1038/s10038-020-0808-9. Epub 2020 Jul 22.
8
SARS-CoV-2: fear versus data.SARS-CoV-2:恐惧与数据。
Int J Antimicrob Agents. 2020 May;55(5):105947. doi: 10.1016/j.ijantimicag.2020.105947. Epub 2020 Mar 19.
9
Multiple alignment of DNA sequences with MAFFT.使用MAFFT对DNA序列进行多重比对。
Methods Mol Biol. 2009;537:39-64. doi: 10.1007/978-1-59745-251-9_3.
10
Phosphorylation of the arginine/serine dipeptide-rich motif of the severe acute respiratory syndrome coronavirus nucleocapsid protein modulates its multimerization, translation inhibitory activity and cellular localization.严重急性呼吸综合征冠状病毒核衣壳蛋白富含精氨酸/丝氨酸二肽基序的磷酸化调节其多聚化、翻译抑制活性和细胞定位。
FEBS J. 2008 Aug;275(16):4152-63. doi: 10.1111/j.1742-4658.2008.06564.x. Epub 2008 Jul 9.