• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对 SARS-CoV-2 基因组序列在生物信息学工作流程中的统一变异调用的建议。

Recommendations for Uniform Variant Calling of SARS-CoV-2 Genome Sequence across Bioinformatic Workflows.

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.

Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.

出版信息

Viruses. 2024 Mar 11;16(3):430. doi: 10.3390/v16030430.

DOI:10.3390/v16030430
PMID:38543795
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10975397/
Abstract

Genomic sequencing of clinical samples to identify emerging variants of SARS-CoV-2 has been a key public health tool for curbing the spread of the virus. As a result, an unprecedented number of SARS-CoV-2 genomes were sequenced during the COVID-19 pandemic, which allowed for rapid identification of genetic variants, enabling the timely design and testing of therapies and deployment of new vaccine formulations to combat the new variants. However, despite the technological advances of deep sequencing, the analysis of the raw sequence data generated globally is neither standardized nor consistent, leading to vastly disparate sequences that may impact identification of variants. Here, we show that for both Illumina and Oxford Nanopore sequencing platforms, downstream bioinformatic protocols used by industry, government, and academic groups resulted in different virus sequences from same sample. These bioinformatic workflows produced consensus genomes with differences in single nucleotide polymorphisms, inclusion and exclusion of insertions, and/or deletions, despite using the same raw sequence as input datasets. Here, we compared and characterized such discrepancies and propose a specific suite of parameters and protocols that should be adopted across the field. Consistent results from bioinformatic workflows are fundamental to SARS-CoV-2 and future pathogen surveillance efforts, including pandemic preparation, to allow for a data-driven and timely public health response.

摘要

对临床样本进行基因组测序以鉴定 SARS-CoV-2 的新兴变体一直是遏制病毒传播的主要公共卫生工具。因此,在 COVID-19 大流行期间对 SARS-CoV-2 基因组进行了前所未有的测序,这使得能够快速鉴定遗传变异体,从而及时设计和测试疗法并部署新的疫苗配方来对抗新的变异体。然而,尽管深度测序技术取得了进步,但全球生成的原始序列数据的分析既没有标准化,也不一致,导致差异很大的序列可能会影响变异体的鉴定。在这里,我们表明,对于 Illumina 和 Oxford Nanopore 测序平台,行业、政府和学术团体使用的下游生物信息学协议导致来自同一样本的病毒序列不同。尽管使用相同的原始序列作为输入数据集,但这些生物信息学工作流程生成的共识基因组在单核苷酸多态性、插入的包含和排除以及/或缺失方面存在差异。在这里,我们比较和描述了这些差异,并提出了一套应在整个领域采用的特定参数和协议。生物信息学工作流程的一致结果对于 SARS-CoV-2 和未来的病原体监测工作至关重要,包括大流行准备,以实现数据驱动和及时的公共卫生应对。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/29fe2184de5c/viruses-16-00430-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/c0d7616a5401/viruses-16-00430-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/d1cb842ab851/viruses-16-00430-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/e8051299a835/viruses-16-00430-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/491b381d91e4/viruses-16-00430-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/b657b3aaf1b2/viruses-16-00430-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/29fe2184de5c/viruses-16-00430-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/c0d7616a5401/viruses-16-00430-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/d1cb842ab851/viruses-16-00430-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/e8051299a835/viruses-16-00430-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/491b381d91e4/viruses-16-00430-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/b657b3aaf1b2/viruses-16-00430-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67cf/10975397/29fe2184de5c/viruses-16-00430-g006.jpg

相似文献

1
Recommendations for Uniform Variant Calling of SARS-CoV-2 Genome Sequence across Bioinformatic Workflows.针对 SARS-CoV-2 基因组序列在生物信息学工作流程中的统一变异调用的建议。
Viruses. 2024 Mar 11;16(3):430. doi: 10.3390/v16030430.
2
Towards increased accuracy and reproducibility in SARS-CoV-2 next generation sequence analysis for public health surveillance.提高用于公共卫生监测的SARS-CoV-2下一代序列分析的准确性和可重复性。
bioRxiv. 2022 Nov 3:2022.11.03.515010. doi: 10.1101/2022.11.03.515010.
3
Benchmark datasets for SARS-CoV-2 surveillance bioinformatics.用于 SARS-CoV-2 监测生物信息学的基准数据集。
PeerJ. 2022 Sep 5;10:e13821. doi: 10.7717/peerj.13821. eCollection 2022.
4
EDGE COVID-19: a web platform to generate submission-ready genomes from SARS-CoV-2 sequencing efforts.EDGE COVID-19:一个从 SARS-CoV-2 测序工作中生成可提交基因组的网络平台。
Bioinformatics. 2022 May 13;38(10):2700-2704. doi: 10.1093/bioinformatics/btac176.
5
From SARS-CoV-2 to Global Preparedness: A Graphical Interface for Standardised High-Throughput Bioinformatics Analysis in Pandemic Scenarios and Surveillance of Drug Resistance.从 SARS-CoV-2 到全球准备:一种图形界面,用于在大流行情况下进行标准化高通量生物信息学分析和耐药性监测。
Int J Mol Sci. 2024 Jun 17;25(12):6645. doi: 10.3390/ijms25126645.
6
Bioinformatic investigation of discordant sequence data for SARS-CoV-2: insights for robust genomic analysis during pandemic surveillance.新冠病毒 S 基因序列数据的生物信息学研究:大流行监测期间稳健基因组分析的启示。
Microb Genom. 2023 Nov;9(11). doi: 10.1099/mgen.0.001146.
7
Taxonium, a web-based tool for exploring large phylogenetic trees.Taxonium,一个用于探索大型系统发育树的网络工具。
Elife. 2022 Nov 15;11:e82392. doi: 10.7554/eLife.82392.
8
Robust Mutation Profiling of SARS-CoV-2 Variants from Multiple Raw Illumina Sequencing Data with Cloud Workflow.利用云工作流程对来自多个原始 Illumina 测序数据的 SARS-CoV-2 变体进行稳健的突变分析。
Genes (Basel). 2022 Apr 13;13(4):686. doi: 10.3390/genes13040686.
9
Short-Read and Long-Read Whole Genome Sequencing for SARS-CoV-2 Variants Identification.用于鉴定SARS-CoV-2变体的短读长和长读长全基因组测序
Viruses. 2025 Apr 18;17(4):584. doi: 10.3390/v17040584.
10
Mini-XT, a miniaturized tagmentation-based protocol for efficient sequencing of SARS-CoV-2.Mini-XT,一种基于小型化标签化的协议,用于提高 SARS-CoV-2 测序效率。
J Transl Med. 2022 Mar 3;20(1):105. doi: 10.1186/s12967-022-03307-9.

引用本文的文献

1
Short-Read and Long-Read Whole Genome Sequencing for SARS-CoV-2 Variants Identification.用于鉴定SARS-CoV-2变体的短读长和长读长全基因组测序
Viruses. 2025 Apr 18;17(4):584. doi: 10.3390/v17040584.
2
PathoSeq-QC: a decision support bioinformatics workflow for robust genomic surveillance.PathoSeq-QC:一种用于可靠基因组监测的决策支持生物信息学工作流程。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf102.

本文引用的文献

1
Challenges and Opportunities for Global Genomic Surveillance Strategies in the COVID-19 Era.新冠疫情时代全球基因组监测策略的挑战与机遇。
Viruses. 2022 Nov 16;14(11):2532. doi: 10.3390/v14112532.
2
A general approach to identify low-frequency variants within influenza samples collected during routine surveillance.一种在常规监测中鉴定流感样本中低频变异的通用方法。
Microb Genom. 2022 Sep;8(9). doi: 10.1099/mgen.0.000867.
3
Pandemic-scale phylogenomics reveals the SARS-CoV-2 recombination landscape.大流行规模的系统发生基因组学揭示了 SARS-CoV-2 的重组景观。
Nature. 2022 Sep;609(7929):994-997. doi: 10.1038/s41586-022-05189-9. Epub 2022 Aug 11.
4
Emergence and widespread circulation of a recombinant SARS-CoV-2 lineage in North America.在北美出现并广泛传播的一种 SARS-CoV-2 重组谱系。
Cell Host Microbe. 2022 Aug 10;30(8):1112-1123.e3. doi: 10.1016/j.chom.2022.06.010. Epub 2022 Jun 20.
5
Co-infection with SARS-CoV-2 Omicron and Delta variants revealed by genomic surveillance.通过基因组监测发现的 SARS-CoV-2 奥密克戎和德尔塔变异株的共同感染。
Nat Commun. 2022 May 18;13(1):2745. doi: 10.1038/s41467-022-30518-x.
6
EDGE COVID-19: a web platform to generate submission-ready genomes from SARS-CoV-2 sequencing efforts.EDGE COVID-19:一个从 SARS-CoV-2 测序工作中生成可提交基因组的网络平台。
Bioinformatics. 2022 May 13;38(10):2700-2704. doi: 10.1093/bioinformatics/btac176.
7
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update.Galaxy 平台:用于可访问、可重复和协作的生物医学分析:2022 更新。
Nucleic Acids Res. 2022 Jul 5;50(W1):W345-W351. doi: 10.1093/nar/gkac247.
8
Comparison of GATK and DeepVariant by trio sequencing.基于 trio 测序的 GATK 和 DeepVariant 比较。
Sci Rep. 2022 Feb 2;12(1):1809. doi: 10.1038/s41598-022-05833-4.
9
Assessing reproducibility of inherited variants detected with short-read whole genome sequencing.评估使用短读长全基因组测序检测到的遗传变异的可重复性。
Genome Biol. 2022 Jan 3;23(1):2. doi: 10.1186/s13059-021-02569-8.
10
Variant calling: Considerations, practices, and developments.变异调用:考虑因素、实践和发展。
Hum Mutat. 2022 Aug;43(8):976-985. doi: 10.1002/humu.24311. Epub 2021 Dec 16.