• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)中具有鉴别性的基因组特征进行宏病毒组鉴定。

Metaviromic identification of discriminative genomic features in SARS-CoV-2 using machine learning.

作者信息

Park Jonathan J, Chen Sidi

机构信息

Department of Genetics, Yale University School of Medicine, New Haven, CT, USA.

System Biology Institute, Yale University, West Haven, CT, USA.

出版信息

Patterns (N Y). 2022 Feb 11;3(2):100407. doi: 10.1016/j.patter.2021.100407. Epub 2021 Nov 18.

DOI:10.1016/j.patter.2021.100407
PMID:34812427
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8598947/
Abstract

The COVID-19 pandemic caused by SARS-CoV-2 has become a major threat across the globe. Here, we developed machine learning approaches to identify key pathogenic regions in coronavirus genomes. We trained and evaluated 7,562,625 models on 3,665 genomes including SARS-CoV-2, MERS-CoV, SARS-CoV, and other coronaviruses of human and animal origins to return quantitative and biologically interpretable signatures at nucleotide and amino acid resolutions. We identified hotspots across the SARS-CoV-2 genome, including previously unappreciated features in spike, RdRp, and other proteins. Finally, we integrated pathogenicity genomic profiles with B cell and T cell epitope predictions for enrichment of sequence targets to help guide vaccine development. These results provide a systematic map of predicted pathogenicity in SARS-CoV-2 that incorporates sequence, structural, and immunologic features, providing an unbiased collection of genetic elements for functional studies. This metavirome-based framework can also be applied for rapid characterization of new coronavirus strains or emerging pathogenic viruses.

摘要

由严重急性呼吸综合征冠状病毒2(SARS-CoV-2)引起的2019冠状病毒病大流行已成为全球的重大威胁。在此,我们开发了机器学习方法来识别冠状病毒基因组中的关键致病区域。我们在包括SARS-CoV-2、中东呼吸综合征冠状病毒(MERS-CoV)、严重急性呼吸综合征冠状病毒(SARS-CoV)以及其他人和动物源冠状病毒在内的3665个基因组上训练和评估了7562625个模型,以在核苷酸和氨基酸分辨率下返回定量且具有生物学可解释性的特征。我们在SARS-CoV-2基因组中识别出了热点区域,包括刺突蛋白、RNA依赖的RNA聚合酶(RdRp)和其他蛋白质中以前未被重视的特征。最后,我们将致病性基因组图谱与B细胞和T细胞表位预测相结合,以富集序列靶点,帮助指导疫苗开发。这些结果提供了一个包含序列、结构和免疫学特征的SARS-CoV-2预测致病性系统图谱,为功能研究提供了一组无偏差的遗传元件集合。这种基于宏病毒组的框架也可用于快速鉴定新型冠状病毒毒株或新出现的致病病毒。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/95a46cef24a5/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/8f0e3a136365/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/f7e2e627fc6c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/398a04300f3b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/50fdf25cb0f5/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/3a6934331759/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/2857d2819b86/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/e10e0db921c2/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/95a46cef24a5/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/8f0e3a136365/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/f7e2e627fc6c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/398a04300f3b/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/50fdf25cb0f5/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/3a6934331759/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/2857d2819b86/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/e10e0db921c2/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2728/8848032/95a46cef24a5/gr7.jpg

相似文献

1
Metaviromic identification of discriminative genomic features in SARS-CoV-2 using machine learning.利用机器学习对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)中具有鉴别性的基因组特征进行宏病毒组鉴定。
Patterns (N Y). 2022 Feb 11;3(2):100407. doi: 10.1016/j.patter.2021.100407. Epub 2021 Nov 18.
2
Genomic determinants of pathogenicity in SARS-CoV-2 and other human coronaviruses.新型冠状病毒及其他人类冠状病毒致病性的基因组决定因素
bioRxiv. 2020 Apr 9:2020.04.05.026450. doi: 10.1101/2020.04.05.026450.
3
Genomic determinants of pathogenicity in SARS-CoV-2 and other human coronaviruses.SARS-CoV-2 和其他人类冠状病毒的致病性的基因组决定因素。
Proc Natl Acad Sci U S A. 2020 Jun 30;117(26):15193-15199. doi: 10.1073/pnas.2008176117. Epub 2020 Jun 10.
4
Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs.人工智能预测 SARS-CoV-2 的免疫原性景观,从而为疫苗设计提供通用蓝图。
Sci Rep. 2020 Dec 23;10(1):22375. doi: 10.1038/s41598-020-78758-5.
5
Deep learning based on biologically interpretable genome representation predicts two types of human adaptation of SARS-CoV-2 variants.基于具有生物学可解释性的基因组表示的深度学习预测了 SARS-CoV-2 变体的两种人类适应类型。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac036.
6
Genome-Wide Asymptomatic B-Cell, CD4 and CD8 T-Cell Epitopes, that are Highly Conserved Between Human and Animal Coronaviruses, Identified from SARS-CoV-2 as Immune Targets for Pre-Emptive Pan-Coronavirus Vaccines.从新冠病毒中鉴定出全基因组无症状B细胞、CD4和CD8 T细胞表位,这些表位在人类和动物冠状病毒之间高度保守,可作为先发制全冠状病毒疫苗的免疫靶点。
bioRxiv. 2020 Sep 28:2020.09.27.316018. doi: 10.1101/2020.09.27.316018.
7
Evaluation on the use of Nanopore sequencing for direct characterization of coronaviruses from respiratory specimens, and a study on emerging missense mutations in partial RdRP gene of SARS-CoV-2.评估 Nanopore 测序在直接检测呼吸道样本中的冠状病毒中的应用,以及对 SARS-CoV-2 部分 RdRP 基因中新兴错义突变的研究。
Virol J. 2020 Nov 23;17(1):183. doi: 10.1186/s12985-020-01454-3.
8
Emergence, evolution, and vaccine production approaches of SARS-CoV-2 virus: Benefits of getting vaccinated and common questions.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)病毒的出现、演变及疫苗生产方法:接种疫苗的益处及常见问题
Saudi J Biol Sci. 2022 Apr;29(4):1981-1997. doi: 10.1016/j.sjbs.2021.12.020. Epub 2021 Dec 13.
9
Conserved Genomic Terminals of SARS-CoV-2 as Co-evolving Functional Elements and Potential Therapeutic Targets.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)保守的基因组末端作为共同进化的功能元件和潜在治疗靶点
bioRxiv. 2020 Jul 6:2020.07.06.190207. doi: 10.1101/2020.07.06.190207.
10
Applying high throughput and comprehensive immunoinformatics approaches to design a trivalent subunit vaccine for induction of immune response against emerging human coronaviruses SARS-CoV, MERS-CoV and SARS-CoV-2.应用高通量和综合免疫信息学方法设计一种三价亚单位疫苗,以诱导针对新型人类冠状病毒 SARS-CoV、MERS-CoV 和 SARS-CoV-2 的免疫反应。
J Biomol Struct Dyn. 2022 Aug;40(13):6097-6113. doi: 10.1080/07391102.2021.1876774. Epub 2021 Jan 29.

引用本文的文献

1
Evaluation of Mutual Information and Feature Selection for SARS-CoV-2 Respiratory Infection.新型冠状病毒2型呼吸道感染的互信息评估与特征选择
Bioengineering (Basel). 2023 Jul 24;10(7):880. doi: 10.3390/bioengineering10070880.
2
Human virome: Implications in cancer.人类病毒组:对癌症的影响
Heliyon. 2023 Mar;9(3):e14086. doi: 10.1016/j.heliyon.2023.e14086. Epub 2023 Feb 25.
3
Sequencing meets machine learning to fight emerging pathogens: A preview.测序与机器学习携手对抗新出现的病原体:预览

本文引用的文献

1
SARS-CoV-2 variants, spike mutations and immune escape.SARS-CoV-2 变体、刺突突变和免疫逃逸。
Nat Rev Microbiol. 2021 Jul;19(7):409-424. doi: 10.1038/s41579-021-00573-0. Epub 2021 Jun 1.
2
Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus, Brazil.巴西玛瑙斯市 P.1 型 SARS-CoV-2 谱系的基因组学和流行病学研究。
Science. 2021 May 21;372(6544):815-821. doi: 10.1126/science.abh2644. Epub 2021 Apr 14.
3
Nonstructural Protein 1 of SARS-CoV-2 Is a Potent Pathogenicity Factor Redirecting Host Protein Synthesis Machinery toward Viral RNA.
Patterns (N Y). 2022 Feb 11;3(2):100448. doi: 10.1016/j.patter.2022.100448.
SARS-CoV-2 的非结构蛋白 1 是一种强大的致病性因子,它将宿主蛋白合成机制引导到病毒 RNA 上。
Mol Cell. 2020 Dec 17;80(6):1055-1066.e6. doi: 10.1016/j.molcel.2020.10.034. Epub 2020 Oct 29.
4
SARS-CoV-2-reactive T cells in healthy donors and patients with COVID-19.SARS-CoV-2 反应性 T 细胞在健康供体和 COVID-19 患者中的研究。
Nature. 2020 Nov;587(7833):270-274. doi: 10.1038/s41586-020-2598-9. Epub 2020 Jul 29.
5
Discovery and Genomic Characterization of a 382-Nucleotide Deletion in ORF7b and ORF8 during the Early Evolution of SARS-CoV-2.在 SARS-CoV-2 的早期进化过程中发现并鉴定了 ORF7b 和 ORF8 中 382 个核苷酸的缺失。
mBio. 2020 Jul 21;11(4):e01610-20. doi: 10.1128/mBio.01610-20.
6
Rapid implementation of SARS-CoV-2 sequencing to investigate cases of health-care associated COVID-19: a prospective genomic surveillance study.快速实施 SARS-CoV-2 测序以调查与医疗保健相关的 COVID-19 病例:一项前瞻性基因组监测研究。
Lancet Infect Dis. 2020 Nov;20(11):1263-1272. doi: 10.1016/S1473-3099(20)30562-4. Epub 2020 Jul 14.
7
Genomic determinants of pathogenicity in SARS-CoV-2 and other human coronaviruses.SARS-CoV-2 和其他人类冠状病毒的致病性的基因组决定因素。
Proc Natl Acad Sci U S A. 2020 Jun 30;117(26):15193-15199. doi: 10.1073/pnas.2008176117. Epub 2020 Jun 10.
8
Targets of T Cell Responses to SARS-CoV-2 Coronavirus in Humans with COVID-19 Disease and Unexposed Individuals.COVID-19 疾病患者和未接触者体内针对 SARS-CoV-2 冠状病毒的 T 细胞反应的靶标。
Cell. 2020 Jun 25;181(7):1489-1501.e15. doi: 10.1016/j.cell.2020.05.015. Epub 2020 May 20.
9
Remdesivir for the Treatment of Covid-19 - Final Report.瑞德西韦治疗 COVID-19 的疗效 - 最终报告。
N Engl J Med. 2020 Nov 5;383(19):1813-1826. doi: 10.1056/NEJMoa2007764. Epub 2020 Oct 8.
10
The sprint to solve coronavirus protein structures - and disarm them with drugs.争分夺秒解析新冠病毒蛋白质结构——并用药物使其失效。
Nature. 2020 May;581(7808):252-255. doi: 10.1038/d41586-020-01444-z.