• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过比较44种Sarbecovirus基因组分析严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因含量及2019冠状病毒病(COVID-19)的突变影响

SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.

作者信息

Jungreis Irwin, Sealfon Rachel, Kellis Manolis

机构信息

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA.

Broad Institute of MIT and Harvard, Cambridge, MA.

出版信息

bioRxiv. 2020 Sep 2:2020.06.02.130955. doi: 10.1101/2020.06.02.130955.

DOI:10.1101/2020.06.02.130955
PMID:32577641
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7302193/
Abstract

Despite its overwhelming clinical importance, the SARS-CoV-2 gene set remains unresolved, hindering dissection of COVID-19 biology. Here, we use comparative genomics to provide a high-confidence protein-coding gene set, characterize protein-level and nucleotide-level evolutionary constraint, and prioritize functional mutations from the ongoing COVID-19 pandemic. We select 44 complete Sarbecovirus genomes at evolutionary distances ideally-suited for protein-coding and non-coding element identification, create whole-genome alignments, and quantify protein-coding evolutionary signatures and overlapping constraint. We find strong protein-coding signatures for all named genes and for 3a, 6, 7a, 7b, 8, 9b, and also ORF3c, a novel alternate-frame gene. By contrast, ORF10, and overlapping-ORFs 9c, 3b, and 3d lack protein-coding signatures or convincing experimental evidence and are not protein-coding. Furthermore, we show no other protein-coding genes remain to be discovered. Cross-strain and within-strain evolutionary pressures largely agree at the gene, amino-acid, and nucleotide levels, with some notable exceptions, including fewer-than-expected mutations in nsp3 and Spike subunit S1, and more-than-expected mutations in Nucleocapsid. The latter also shows a cluster of amino-acid-changing variants in otherwise-conserved residues in a predicted B-cell epitope, which may indicate positive selection for immune avoidance. Several Spike-protein mutations, including D614G, which has been associated with increased transmission, disrupt otherwise-perfectly-conserved amino acids, and could be novel adaptations to human hosts. The resulting high-confidence gene set and evolutionary-history annotations provide valuable resources and insights on COVID-19 biology, mutations, and evolution.

摘要

尽管严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因组具有极其重要的临床意义,但其基因集仍未明确,这阻碍了对2019冠状病毒病(COVID-19)生物学特性的剖析。在此,我们利用比较基因组学来提供一个高可信度的蛋白质编码基因集,表征蛋白质水平和核苷酸水平的进化限制,并对正在肆虐的COVID-19大流行中的功能性突变进行优先级排序。我们选择了44个进化距离理想的完整Sarbecovirus属基因组,以用于蛋白质编码和非编码元件的识别,创建全基因组比对,并量化蛋白质编码的进化特征和重叠限制。我们发现所有已命名基因以及3a、6、7a、7b、8、9b还有一个新的可变阅读框基因ORF3c都有很强的蛋白质编码特征。相比之下,ORF10以及重叠阅读框9c、3b和3d缺乏蛋白质编码特征或令人信服的实验证据,并非蛋白质编码基因。此外,我们表明没有其他蛋白质编码基因有待发现。跨毒株和毒株内的进化压力在基因、氨基酸和核苷酸水平上基本一致,但也有一些显著例外,包括非结构蛋白3(nsp3)和刺突蛋白亚基S1中的突变少于预期,而核衣壳蛋白中的突变多于预期。后者在预测的B细胞表位中其他保守残基处还显示出一组氨基酸变化的变体,这可能表明存在免疫逃避的正选择。包括与传播增加有关的D614G在内的几个刺突蛋白突变破坏了原本完全保守的氨基酸,可能是对人类宿主的新适应。由此产生的高可信度基因集和进化历史注释为COVID-19的生物学特性(包括突变和进化)提供了宝贵的资源和见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/abb4754d7668/nihpp-2020.06.02.130955-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/93e7d3bc390c/nihpp-2020.06.02.130955-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/425587ae600a/nihpp-2020.06.02.130955-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/7d2f73ac02e9/nihpp-2020.06.02.130955-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/1c3ae7c39614/nihpp-2020.06.02.130955-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/c95fc31fb7e8/nihpp-2020.06.02.130955-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/7bfa29eec976/nihpp-2020.06.02.130955-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/a837dba390e6/nihpp-2020.06.02.130955-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/bc651af5f4b0/nihpp-2020.06.02.130955-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/39778d73ec41/nihpp-2020.06.02.130955-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/a7182c3c1b2c/nihpp-2020.06.02.130955-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/62dd8d79395f/nihpp-2020.06.02.130955-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/a76b83d5ae48/nihpp-2020.06.02.130955-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/33f3784a10a9/nihpp-2020.06.02.130955-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/abb4754d7668/nihpp-2020.06.02.130955-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/93e7d3bc390c/nihpp-2020.06.02.130955-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/425587ae600a/nihpp-2020.06.02.130955-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/7d2f73ac02e9/nihpp-2020.06.02.130955-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/1c3ae7c39614/nihpp-2020.06.02.130955-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/c95fc31fb7e8/nihpp-2020.06.02.130955-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/7bfa29eec976/nihpp-2020.06.02.130955-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/a837dba390e6/nihpp-2020.06.02.130955-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/bc651af5f4b0/nihpp-2020.06.02.130955-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/39778d73ec41/nihpp-2020.06.02.130955-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/a7182c3c1b2c/nihpp-2020.06.02.130955-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/62dd8d79395f/nihpp-2020.06.02.130955-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/a76b83d5ae48/nihpp-2020.06.02.130955-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/33f3784a10a9/nihpp-2020.06.02.130955-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a031/7478346/abb4754d7668/nihpp-2020.06.02.130955-f0005.jpg

相似文献

1
SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.通过比较44种Sarbecovirus基因组分析严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因含量及2019冠状病毒病(COVID-19)的突变影响
bioRxiv. 2020 Sep 2:2020.06.02.130955. doi: 10.1101/2020.06.02.130955.
2
SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.通过比较44种Sarbecovirus基因组分析严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因内容及2019冠状病毒病(COVID-19)突变的影响
Res Sq. 2020 Oct 1:rs.3.rs-80345. doi: 10.21203/rs.3.rs-80345/v1.
3
SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes.比较 44 种 Sarbecovirus 基因组分析 SARS-CoV-2 的基因组成和 COVID-19 的突变影响。
Nat Commun. 2021 May 11;12(1):2642. doi: 10.1038/s41467-021-22905-7.
4
RNA structure-altering mutations underlying positive selection on Spike protein reveal novel putative signatures to trace crossing host-species barriers in .RNA 结构改变突变是 Spike 蛋白正选择的基础,揭示了追踪跨越宿主物种屏障的新的潜在特征。
RNA Biol. 2022 Jan;19(1):1019-1044. doi: 10.1080/15476286.2022.2115750.
5
Characterization of accessory genes in coronavirus genomes.冠状病毒基因组中辅助基因的特征。
Virol J. 2020 Aug 27;17(1):131. doi: 10.1186/s12985-020-01402-1.
6
Geographical distribution of SARS-CoV-2 amino acids mutations and the concomitant evolution of seven distinct clades in non-human hosts.SARS-CoV-2 氨基酸突变的地理分布及非人类宿主中七个不同进化枝的伴随进化。
Zoonoses Public Health. 2022 Nov;69(7):816-825. doi: 10.1111/zph.12971. Epub 2022 May 25.
7
Computational Structural and Functional Analyses of ORF10 in Novel Coronavirus SARS-CoV-2 Variants to Understand Evolutionary Dynamics.新型冠状病毒SARS-CoV-2变体中ORF10的计算结构和功能分析以了解进化动态
Evol Bioinform Online. 2022 Jul 7;18:11769343221108218. doi: 10.1177/11769343221108218. eCollection 2022.
8
An ncRNA transcriptomics-based approach to design siRNA molecules against SARS-CoV-2 double membrane vesicle formation and accessory genes.基于 ncRNA 转录组学的方法设计针对 SARS-CoV-2 双层囊泡形成和辅助基因的 siRNA 分子。
BMC Infect Dis. 2023 Dec 12;23(1):872. doi: 10.1186/s12879-023-08870-0.
9
Comparative Genomics and Integrated Network Approach Unveiled Undirected Phylogeny Patterns, Co-mutational Hot Spots, Functional Cross Talk, and Regulatory Interactions in SARS-CoV-2.比较基因组学与整合网络方法揭示了新冠病毒的无向系统发育模式、共突变热点、功能串扰及调控相互作用。
mSystems. 2021 Feb 23;6(1):e00030-21. doi: 10.1128/mSystems.00030-21.
10
V367F Mutation in SARS-CoV-2 Spike RBD Emerging during the Early Transmission Phase Enhances Viral Infectivity through Increased Human ACE2 Receptor Binding Affinity.SARS-CoV-2 刺突 RBD 中的 V367F 突变增强了与人类 ACE2 受体的结合亲和力,从而提高了病毒的感染性。
J Virol. 2021 Jul 26;95(16):e0061721. doi: 10.1128/JVI.00617-21.

本文引用的文献

1
Pervasive generation of non-canonical subgenomic RNAs by SARS-CoV-2.SARS-CoV-2 普遍产生非规范亚基因组 RNA。
Genome Med. 2020 Dec 1;12(1):108. doi: 10.1186/s13073-020-00802-w.
2
SARS-CoV-2 ORF3b Is a Potent Interferon Antagonist Whose Activity Is Increased by a Naturally Occurring Elongation Variant.SARS-CoV-2 ORF3b 是一种有效的干扰素拮抗剂,其活性可被一种自然发生的延长变异体增强。
Cell Rep. 2020 Sep 22;32(12):108185. doi: 10.1016/j.celrep.2020.108185. Epub 2020 Sep 4.
3
Emergence of SARS-CoV-2 through recombination and strong purifying selection.
SARS-CoV-2 通过重组和强烈的纯化选择而出现。
Sci Adv. 2020 Jul 1;6(27). doi: 10.1126/sciadv.abb9153. Print 2020 Jul.
4
The UCSC SARS-CoV-2 Genome Browser.UCSC SARS-CoV-2 基因组浏览器。
Nat Genet. 2020 Oct;52(10):991-998. doi: 10.1038/s41588-020-0700-8.
5
The coding capacity of SARS-CoV-2.SARS-CoV-2 的编码能力。
Nature. 2021 Jan;589(7840):125-130. doi: 10.1038/s41586-020-2739-1. Epub 2020 Sep 9.
6
Characterisation of the transcriptome and proteome of SARS-CoV-2 reveals a cell passage induced in-frame deletion of the furin-like cleavage site from the spike glycoprotein.对 SARS-CoV-2 的转录组和蛋白质组进行了特征描述,发现刺突糖蛋白上的弗林样裂解位点发生了细胞传代诱导的框内缺失。
Genome Med. 2020 Jul 28;12(1):68. doi: 10.1186/s13073-020-00763-0.
7
Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus.追踪 SARS-CoV-2 刺突蛋白的变化:D614G 增加 COVID-19 病毒感染力的证据。
Cell. 2020 Aug 20;182(4):812-827.e19. doi: 10.1016/j.cell.2020.06.043. Epub 2020 Jul 3.
8
A putative new SARS-CoV protein, 3c, encoded in an ORF overlapping ORF3a.一种假定的新型 SARS-CoV 蛋白,3c,由 ORF3a 重叠编码的 ORF 编码。
J Gen Virol. 2020 Oct;101(10):1085-1089. doi: 10.1099/jgv.0.001469. Epub 2020 Jul 13.
9
New insights into the evolutionary features of viral overlapping genes by discriminant analysis.通过判别分析深入了解病毒重叠基因的进化特征。
Virology. 2020 Jul;546:51-66. doi: 10.1016/j.virol.2020.03.007. Epub 2020 Apr 2.
10
Proteomics of SARS-CoV-2-infected host cells reveals therapy targets.SARS-CoV-2 感染宿主细胞的蛋白质组学研究揭示了治疗靶点。
Nature. 2020 Jul;583(7816):469-472. doi: 10.1038/s41586-020-2332-7. Epub 2020 May 14.