• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种快速准确的 SARS-CoV-2 基因组溯源方法。

A fast and accurate method for SARS-CoV-2 genomic tracing.

机构信息

Beijing Institute of Genomics, Chinese Academy of Sciences, and China National Center for Bioinformation, Beijing 100101, China.

University of Chinese Academy of Sciences, Beijing 100049, China.

出版信息

Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad339.

DOI:10.1093/bib/bbad339
PMID:37779249
Abstract

To contain infectious diseases, it is crucial to determine the origin and transmission routes of the pathogen, as well as how the virus evolves. With the development of genome sequencing technology, genome epidemiology has emerged as a powerful approach for investigating the source and transmission of pathogens. In this study, we first presented the rationale for genomic tracing of SARS-CoV-2 and the challenges we currently face. Identifying the most genetically similar reference sequence to the query sequence is a critical step in genome tracing, typically achieved using either a phylogenetic tree or a sequence similarity search. However, these methods become inefficient or computationally prohibitive when dealing with tens of millions of sequences in the reference database, as we encountered during the COVID-19 pandemic. To address this challenge, we developed a novel genomic tracing algorithm capable of processing 6 million SARS-CoV-2 sequences in less than a minute. Instead of constructing a giant phylogenetic tree, we devised a weighted scoring system based on mutation characteristics to quantify sequences similarity. The developed method demonstrated superior performance compared to previous methods. Additionally, an online platform was developed to facilitate genomic tracing and visualization of the spatiotemporal distribution of sequences. The method will be a valuable addition to standard epidemiological investigations, enabling more efficient genomic tracing. Furthermore, the computational framework can be easily adapted to other pathogens, paving the way for routine genomic tracing of infectious diseases.

摘要

为了控制传染病,确定病原体的来源和传播途径以及病毒的进化方式至关重要。随着基因组测序技术的发展,基因组流行病学已成为研究病原体来源和传播的有力方法。在本研究中,我们首先介绍了对 SARS-CoV-2 进行基因组溯源的基本原理和我们目前面临的挑战。确定与查询序列最具遗传相似性的参考序列是基因组溯源的关键步骤,通常使用系统发育树或序列相似性搜索来实现。然而,当我们在 COVID-19 大流行期间遇到参考数据库中包含数千万个序列时,这些方法变得效率低下或计算上不可行。为了解决这一挑战,我们开发了一种新的基因组溯源算法,能够在不到一分钟的时间内处理 600 万个 SARS-CoV-2 序列。我们没有构建巨大的系统发育树,而是设计了一种基于突变特征的加权评分系统来量化序列的相似性。与以前的方法相比,所开发的方法表现出了优越的性能。此外,还开发了一个在线平台,以促进基因组溯源和序列时空分布的可视化。该方法将是标准流行病学调查的有力补充,使基因组溯源更加高效。此外,计算框架可以轻松适应其他病原体,为传染病的常规基因组溯源铺平道路。

相似文献

1
A fast and accurate method for SARS-CoV-2 genomic tracing.一种快速准确的 SARS-CoV-2 基因组溯源方法。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad339.
2
Taxonium, a web-based tool for exploring large phylogenetic trees.Taxonium,一个用于探索大型系统发育树的网络工具。
Elife. 2022 Nov 15;11:e82392. doi: 10.7554/eLife.82392.
3
Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks.宿主内多样性提高了 SARS-CoV-2 爆发的系统发育和传播重建。
Elife. 2023 Sep 21;12:e84384. doi: 10.7554/eLife.84384.
4
Comparative Genomics Reveals Early Emergence and Biased Spatiotemporal Distribution of SARS-CoV-2.比较基因组学揭示了 SARS-CoV-2 的早期出现和偏时空分布。
Mol Biol Evol. 2021 May 19;38(6):2547-2565. doi: 10.1093/molbev/msab049.
5
Cov2clusters: genomic clustering of SARS-CoV-2 sequences.Cov2clusters:SARS-CoV-2 序列的基因组聚类。
BMC Genomics. 2022 Oct 19;23(1):710. doi: 10.1186/s12864-022-08936-4.
6
Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic.超快现有树木样本放置 (UShER) 可实现 SARS-CoV-2 大流行的实时系统发生学。
Nat Genet. 2021 Jun;53(6):809-816. doi: 10.1038/s41588-021-00862-7. Epub 2021 May 10.
7
Identification of Epidemiological Traits by Analysis of SARS-CoV-2 Sequences.通过分析 SARS-CoV-2 序列鉴定流行病学特征。
Viruses. 2021 Apr 27;13(5):764. doi: 10.3390/v13050764.
8
Integrated genomic surveillance enables tracing of person-to-person SARS-CoV-2 transmission chains during community transmission and reveals extensive onward transmission of travel-imported infections, Germany, June to July 2021.综合基因组监测可追踪社区传播期间人与人之间的 SARS-CoV-2 传播链,并揭示了旅行输入性感染的广泛传播,德国,2021 年 6 月至 7 月。
Euro Surveill. 2022 Oct;27(43). doi: 10.2807/1560-7917.ES.2022.27.43.2101089.
9
Genomic characterization and phylogenetic analysis of SARS-COV-2 in Italy.意大利的 SARS-COV-2 的基因组特征和系统进化分析。
J Med Virol. 2020 Sep;92(9):1637-1640. doi: 10.1002/jmv.25794. Epub 2020 Apr 10.
10
SARS-CoV-2 Genomic Epidemiology Dashboards: A Review of Functionality and Technological Frameworks for the Public Health Response.SARS-CoV-2 基因组流行病学数据看板:公共卫生应对的功能和技术框架综述。
Genes (Basel). 2024 Jul 3;15(7):876. doi: 10.3390/genes15070876.