• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习方法识别与新冠病毒疾病严重程度相关的严重急性呼吸综合征冠状病毒2突变

Identifying COVID-19 Severity-Related SARS-CoV-2 Mutation Using a Machine Learning Method.

作者信息

Huang Feiming, Chen Lei, Guo Wei, Zhou Xianchao, Feng Kaiyan, Huang Tao, Cai Yudong

机构信息

School of Life Sciences, Shanghai University, Shanghai 200444, China.

College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China.

出版信息

Life (Basel). 2022 May 28;12(6):806. doi: 10.3390/life12060806.

DOI:10.3390/life12060806
PMID:35743837
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9225528/
Abstract

SARS-CoV-2 shows great evolutionary capacity through a high frequency of genomic variation during transmission. Evolved SARS-CoV-2 often demonstrates resistance to previous vaccines and can cause poor clinical status in patients. Mutations in the SARS-CoV-2 genome involve mutations in structural and nonstructural proteins, and some of these proteins such as spike proteins have been shown to be directly associated with the clinical status of patients with severe COVID-19 pneumonia. In this study, we collected genome-wide mutation information of virulent strains and the severity of COVID-19 pneumonia in patients varying depending on their clinical status. Important protein mutations and untranslated region mutations were extracted using machine learning methods. First, through Boruta and four ranking algorithms (least absolute shrinkage and selection operator, light gradient boosting machine, max-relevance and min-redundancy, and Monte Carlo feature selection), mutations that were highly correlated with the clinical status of the patients were screened out and sorted in four feature lists. Some mutations such as D614G and V1176F were shown to be associated with viral infectivity. Moreover, previously unreported mutations such as A320V of nsp14 and I164ILV of nsp14 were also identified, which suggests their potential roles. We then applied the incremental feature selection method to each feature list to construct efficient classifiers, which can be directly used to distinguish the clinical status of COVID-19 patients. Meanwhile, four sets of quantitative rules were set up, which can help us to more intuitively understand the role of each mutation in differentiating the clinical status of COVID-19 patients. Identified key mutations linked to virologic properties will help better understand the mechanisms of infection and will aid in the development of antiviral treatments.

摘要

严重急性呼吸综合征冠状病毒2(SARS-CoV-2)在传播过程中通过高频基因组变异展现出强大的进化能力。进化后的SARS-CoV-2常常表现出对先前疫苗的抗性,并可导致患者临床状况不佳。SARS-CoV-2基因组中的突变涉及结构蛋白和非结构蛋白的突变,其中一些蛋白如刺突蛋白已被证明与重症冠状病毒病2019(COVID-19)肺炎患者的临床状况直接相关。在本研究中,我们收集了不同临床状况患者的毒株全基因组突变信息以及COVID-19肺炎的严重程度。使用机器学习方法提取重要的蛋白质突变和非翻译区突变。首先,通过博鲁塔算法和四种排序算法(最小绝对收缩和选择算子、轻梯度提升机、最大相关最小冗余以及蒙特卡罗特征选择),筛选出与患者临床状况高度相关的突变,并将其归入四个特征列表中进行排序。一些突变如D614G和V1176F被证明与病毒传染性有关。此外,还鉴定出了先前未报道的突变,如非结构蛋白14(nsp14)的A320V和nsp14的I164ILV,这表明了它们的潜在作用。然后,我们将增量特征选择方法应用于每个特征列表以构建高效分类器,该分类器可直接用于区分COVID-19患者的临床状况。同时,建立了四套定量规则,这有助于我们更直观地理解每个突变在区分COVID-19患者临床状况中的作用。确定与病毒学特性相关的关键突变将有助于更好地理解感染机制,并有助于开发抗病毒治疗方法。

相似文献

1
Identifying COVID-19 Severity-Related SARS-CoV-2 Mutation Using a Machine Learning Method.使用机器学习方法识别与新冠病毒疾病严重程度相关的严重急性呼吸综合征冠状病毒2突变
Life (Basel). 2022 May 28;12(6):806. doi: 10.3390/life12060806.
2
Machine Learning Classification of Time since BNT162b2 COVID-19 Vaccination Based on Array-Measured Antibody Activity.基于阵列测量抗体活性的BNT162b2 COVID-19疫苗接种后时间的机器学习分类
Life (Basel). 2023 May 31;13(6):1304. doi: 10.3390/life13061304.
3
Identification of Gene Markers Associated with COVID-19 Severity and Recovery in Different Immune Cell Subtypes.不同免疫细胞亚型中与COVID-19严重程度和恢复相关的基因标志物的鉴定
Biology (Basel). 2023 Jul 2;12(7):947. doi: 10.3390/biology12070947.
4
V367F Mutation in SARS-CoV-2 Spike RBD Emerging during the Early Transmission Phase Enhances Viral Infectivity through Increased Human ACE2 Receptor Binding Affinity.SARS-CoV-2 刺突 RBD 中的 V367F 突变增强了与人类 ACE2 受体的结合亲和力,从而提高了病毒的感染性。
J Virol. 2021 Jul 26;95(16):e0061721. doi: 10.1128/JVI.00617-21.
5
Identification of Genes Associated with the Impairment of Olfactory and Gustatory Functions in COVID-19 via Machine-Learning Methods.通过机器学习方法鉴定与COVID-19嗅觉和味觉功能受损相关的基因
Life (Basel). 2023 Mar 15;13(3):798. doi: 10.3390/life13030798.
6
Identification of methylation signatures and rules for predicting the severity of SARS-CoV-2 infection with machine learning methods.利用机器学习方法识别甲基化特征及预测新型冠状病毒2019感染严重程度的规则。
Front Microbiol. 2022 Sep 23;13:1007295. doi: 10.3389/fmicb.2022.1007295. eCollection 2022.
7
Identifying Transcriptomic Signatures and Rules for SARS-CoV-2 Infection.确定新型冠状病毒感染的转录组特征和规律
Front Cell Dev Biol. 2021 Jan 11;8:627302. doi: 10.3389/fcell.2020.627302. eCollection 2020.
8
Identifying COVID-19-Specific Transcriptomic Biomarkers with Machine Learning Methods.运用机器学习方法识别新冠病毒特异性转录组生物标志物
Biomed Res Int. 2021 Jul 6;2021:9939134. doi: 10.1155/2021/9939134. eCollection 2021.
9
Immune responses of different COVID-19 vaccination strategies by analyzing single-cell RNA sequencing data from multiple tissues using machine learning methods.通过使用机器学习方法分析来自多个组织的单细胞RNA测序数据,研究不同新冠疫苗接种策略的免疫反应。
Front Genet. 2023 Mar 17;14:1157305. doi: 10.3389/fgene.2023.1157305. eCollection 2023.
10
Genome-wide association analysis of COVID-19 mortality risk in SARS-CoV-2 genomes identifies mutation in the SARS-CoV-2 spike protein that colocalizes with P.1 of the Brazilian strain.全基因组关联分析 SARS-CoV-2 基因组中 COVID-19 死亡率风险,鉴定出与巴西变异株 P.1 共定位的 SARS-CoV-2 刺突蛋白突变。
Genet Epidemiol. 2021 Oct;45(7):685-693. doi: 10.1002/gepi.22421. Epub 2021 Jun 22.

引用本文的文献

1
Identification of severity related mutation hotspots in SARS-CoV-2 using a density-based clustering approach.使用基于密度的聚类方法识别新冠病毒中与严重程度相关的突变热点。
BioData Min. 2025 Sep 1;18(1):61. doi: 10.1186/s13040-025-00476-3.
2
Association of RBD mutations with COVID-19 disease severity in the Iranian population.伊朗人群中RBD突变与COVID-19疾病严重程度的关联。
Virus Genes. 2025 Jun 26. doi: 10.1007/s11262-025-02168-w.
3
Identification of patient demographic, clinical, and SARS-CoV-2 genomic factors associated with severe COVID-19 using supervised machine learning: a retrospective multicenter study.

本文引用的文献

1
Identification of Cell Markers and Their Expression Patterns in Skin Based on Single-Cell RNA-Sequencing Profiles.基于单细胞RNA测序图谱鉴定皮肤中的细胞标志物及其表达模式。
Life (Basel). 2022 Apr 7;12(4):550. doi: 10.3390/life12040550.
2
Similarity-Based Method with Multiple-Feature Sampling for Predicting Drug Side Effects.基于相似性的多特征采样方法预测药物副作用。
Comput Math Methods Med. 2022 Apr 1;2022:9547317. doi: 10.1155/2022/9547317. eCollection 2022.
3
Identification of protein functions in mouse with a label space partition method.
使用监督式机器学习识别与重症新型冠状病毒肺炎相关的患者人口统计学、临床和严重急性呼吸综合征冠状病毒2基因组因素:一项回顾性多中心研究
BMC Infect Dis. 2025 Jan 28;25(1):132. doi: 10.1186/s12879-025-10450-3.
4
Emerging SARS-CoV-2 Variants in Uganda in the Era of COVID-19 Vaccination.新冠疫苗接种时代乌干达出现的新型严重急性呼吸综合征冠状病毒2变种
Viruses. 2024 Nov 29;16(12):1860. doi: 10.3390/v16121860.
5
Benefits of Repeated SARS-CoV-2 Vaccination and Virus-induced Cross-neutralization Potential in Immunocompromised Transplant Patients and Healthy Individuals.免疫功能低下的移植患者和健康个体中重复接种SARS-CoV-2疫苗的益处及病毒诱导的交叉中和潜力
Open Forum Infect Dis. 2024 Sep 9;11(10):ofae527. doi: 10.1093/ofid/ofae527. eCollection 2024 Oct.
6
Multivariate analyses and machine learning link sex and age with antibody responses to SARS-CoV-2 and vaccination.多变量分析和机器学习将性别和年龄与针对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的抗体反应及疫苗接种联系起来。
iScience. 2024 Jul 10;27(8):110484. doi: 10.1016/j.isci.2024.110484. eCollection 2024 Aug 16.
7
Prediction models for COVID-19 disease outcomes.用于预测 COVID-19 疾病结局的模型。
Emerg Microbes Infect. 2024 Dec;13(1):2361791. doi: 10.1080/22221751.2024.2361791. Epub 2024 Jun 14.
8
An AI-based novel system for predicting respiratory support in COVID-19 patients through CT imaging analysis.基于人工智能的新型系统,通过 CT 成像分析预测 COVID-19 患者的呼吸支持需求。
Sci Rep. 2024 Jan 8;14(1):851. doi: 10.1038/s41598-023-51053-9.
9
Identification of key gene expression associated with quality of life after recovery from COVID-19.鉴定与 COVID-19 康复后生活质量相关的关键基因表达。
Med Biol Eng Comput. 2024 Apr;62(4):1031-1048. doi: 10.1007/s11517-023-02988-8. Epub 2023 Dec 21.
10
Epigenetic age acceleration in surviving versus deceased COVID-19 patients with acute respiratory distress syndrome following hospitalization.住院后急性呼吸窘迫综合征的存活与死亡 COVID-19 患者的表观遗传年龄加速。
Clin Epigenetics. 2023 Nov 28;15(1):186. doi: 10.1186/s13148-023-01597-4.
用标签空间划分方法鉴定小鼠中的蛋白质功能。
Math Biosci Eng. 2022 Feb 10;19(4):3820-3842. doi: 10.3934/mbe.2022176.
4
Predicting Heart Cell Types by Using Transcriptome Profiles and a Machine Learning Method.利用转录组图谱和机器学习方法预测心脏细胞类型
Life (Basel). 2022 Jan 31;12(2):228. doi: 10.3390/life12020228.
5
Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions.基于关键序列特征和分布预测 RNA 5-甲基胞嘧啶位点
Biomed Res Int. 2022 Jan 13;2022:4035462. doi: 10.1155/2022/4035462. eCollection 2022.
6
Identification of Pan-Cancer Biomarkers Based on the Gene Expression Profiles of Cancer Cell Lines.基于癌细胞系基因表达谱的泛癌生物标志物鉴定
Front Cell Dev Biol. 2021 Nov 30;9:781285. doi: 10.3389/fcell.2021.781285. eCollection 2021.
7
Heavily mutated Omicron variant puts scientists on alert.高度变异的奥密克戎毒株使科学家们警觉起来。
Nature. 2021 Dec;600(7887):21. doi: 10.1038/d41586-021-03552-w.
8
iMPT-FDNPL: Identification of Membrane Protein Types with Functional Domains and a Natural Language Processing Approach.iMPT-FDNPL:基于功能域和自然语言处理方法识别膜蛋白类型。
Comput Math Methods Med. 2021 Oct 11;2021:7681497. doi: 10.1155/2021/7681497. eCollection 2021.
9
SARS-CoV-2 B.1.617.2 Delta variant replication and immune evasion.SARS-CoV-2 B.1.617.2 德尔塔变异株复制和免疫逃逸。
Nature. 2021 Nov;599(7883):114-119. doi: 10.1038/s41586-021-03944-y. Epub 2021 Sep 6.
10
In-Silico analysis reveals lower transcription efficiency of C241T variant of SARS-CoV-2 with host replication factors MADP1 and hnRNP-1.计算机模拟分析显示,新冠病毒(SARS-CoV-2)的C241T变体与宿主复制因子MADP1和hnRNP-1的转录效率较低。
Inform Med Unlocked. 2021;25:100670. doi: 10.1016/j.imu.2021.100670. Epub 2021 Jul 21.