• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算机分析确定了用于产生 SARS-CoV-2 序列的 Nanopore 序列污染的阈值。

In silico analyses identify sequence contamination thresholds for Nanopore-generated SARS-CoV-2 sequences.

机构信息

Public Health Agency of Canada, National Microbiology Laboratory, Winnipeg, Canada.

Cadham Provincial Laboratory, Winnipeg, Canada.

出版信息

PLoS Comput Biol. 2024 Aug 19;20(8):e1011539. doi: 10.1371/journal.pcbi.1011539. eCollection 2024 Aug.

DOI:10.1371/journal.pcbi.1011539
PMID:39159257
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11398645/
Abstract

The SARS-CoV-2 pandemic has brought molecular biology and genomic sequencing into the public consciousness and lexicon. With an emphasis on rapid turnaround, genomic data informed both diagnostic and surveillance decisions for the current pandemic at a previously unheard-of scale. The surge in the submission of genomic data to publicly available databases proved essential as comparing different genome sequences offers a wealth of knowledge, including phylogenetic links, modes of transmission, rates of evolution, and the impact of mutations on infection and disease severity. However, the scale of the pandemic has meant that sequencing runs are rarely repeated due to limited sample material and/or the availability of sequencing resources, resulting in the upload of some imperfect runs to public repositories. As a result, it is crucial to investigate the data obtained from these imperfect runs to determine whether the results are reliable prior to depositing them in a public database. Numerous studies have identified a variety of sources of contamination in public next-generation sequencing (NGS) data as the number of NGS studies increases along with the diversity of sequencing technologies and procedures. For this study, we conducted an in silico experiment with known SARS-CoV-2 sequences produced from Oxford Nanopore Technologies sequencing to investigate the effect of contamination on lineage calls and single nucleotide variants (SNVs). A contamination threshold below which runs are expected to generate accurate lineage calls and maintain genome-relatedness and integrity was identified. Together, these findings provide a benchmark below which imperfect runs may be considered robust for reporting results to both stakeholders and public repositories and reduce the need for repeat or wasted runs.

摘要

SARS-CoV-2 大流行使分子生物学和基因组测序进入了公众的意识和词汇中。由于强调快速周转,基因组数据以前所未有的规模为当前大流行的诊断和监测决策提供了信息。向公共可用数据库提交基因组数据的激增被证明是至关重要的,因为比较不同的基因组序列提供了丰富的知识,包括系统发育联系、传播模式、进化速度以及突变对感染和疾病严重程度的影响。然而,大流行的规模意味着由于样本材料有限和/或测序资源的可用性,很少重复测序运行,导致一些不完美的运行被上传到公共存储库。因此,在将这些不完美的运行结果存入公共数据库之前,必须调查从这些运行中获得的数据,以确定结果是否可靠。随着高通量测序(NGS)研究数量的增加以及测序技术和程序的多样性,许多研究已经确定了公共 NGS 数据中存在多种污染来源。在这项研究中,我们使用来自牛津纳米孔技术测序的已知 SARS-CoV-2 序列进行了计算机模拟实验,以研究污染对谱系调用和单核苷酸变异(SNV)的影响。确定了一个低于该污染阈值的运行,预计该运行将产生准确的谱系调用,并保持基因组相关性和完整性。这些发现共同提供了一个基准,低于该基准,不完美的运行可能被认为是可靠的,可以向利益相关者和公共存储库报告结果,并减少重复或浪费运行的需要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/c427fcb92449/pcbi.1011539.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/e0e4eda46a3b/pcbi.1011539.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/39da6e7c1a73/pcbi.1011539.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/98e710d08925/pcbi.1011539.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/c427fcb92449/pcbi.1011539.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/e0e4eda46a3b/pcbi.1011539.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/39da6e7c1a73/pcbi.1011539.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/98e710d08925/pcbi.1011539.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eb87/11398645/c427fcb92449/pcbi.1011539.g004.jpg

相似文献

1
In silico analyses identify sequence contamination thresholds for Nanopore-generated SARS-CoV-2 sequences.计算机分析确定了用于产生 SARS-CoV-2 序列的 Nanopore 序列污染的阈值。
PLoS Comput Biol. 2024 Aug 19;20(8):e1011539. doi: 10.1371/journal.pcbi.1011539. eCollection 2024 Aug.
2
Assessment of two-pool multiplex long-amplicon nanopore sequencing of SARS-CoV-2.评估 SARS-CoV-2 两池复用长扩增子纳米孔测序。
J Med Virol. 2022 Jan;94(1):327-334. doi: 10.1002/jmv.27336. Epub 2021 Sep 23.
3
From SARS-CoV-2 to Global Preparedness: A Graphical Interface for Standardised High-Throughput Bioinformatics Analysis in Pandemic Scenarios and Surveillance of Drug Resistance.从 SARS-CoV-2 到全球准备:一种图形界面,用于在大流行情况下进行标准化高通量生物信息学分析和耐药性监测。
Int J Mol Sci. 2024 Jun 17;25(12):6645. doi: 10.3390/ijms25126645.
4
A Novel High-Throughput Nanopore-Sequencing-Based Strategy for Rapid and Automated S-Protein Typing of SARS-CoV-2 Variants.一种基于高通量纳米孔测序的新型快速自动化 SARS-CoV-2 变异株 S 蛋白分型策略。
Viruses. 2021 Dec 19;13(12):2548. doi: 10.3390/v13122548.
5
Tiled-ClickSeq for targeted sequencing of complete coronavirus genomes with simultaneous capture of RNA recombination and minority variants.平铺点击测序技术用于靶向测序完整的冠状病毒基因组,同时捕获 RNA 重组和少数变体。
Elife. 2021 Sep 28;10:e68479. doi: 10.7554/eLife.68479.
6
Combining Short- and Long-Read Sequencing Technologies to Identify SARS-CoV-2 Variants in Wastewater.结合短读和长读测序技术鉴定污水中的 SARS-CoV-2 变体。
Viruses. 2024 Sep 21;16(9):1495. doi: 10.3390/v16091495.
7
Benchmark datasets for SARS-CoV-2 surveillance bioinformatics.用于 SARS-CoV-2 监测生物信息学的基准数据集。
PeerJ. 2022 Sep 5;10:e13821. doi: 10.7717/peerj.13821. eCollection 2022.
8
Next generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities.下一代 SARS-CoV-2 基因组测序:挑战、应用和机遇。
Brief Bioinform. 2021 Mar 22;22(2):616-630. doi: 10.1093/bib/bbaa297.
9
Evaluation on the use of Nanopore sequencing for direct characterization of coronaviruses from respiratory specimens, and a study on emerging missense mutations in partial RdRP gene of SARS-CoV-2.评估 Nanopore 测序在直接检测呼吸道样本中的冠状病毒中的应用,以及对 SARS-CoV-2 部分 RdRP 基因中新兴错义突变的研究。
Virol J. 2020 Nov 23;17(1):183. doi: 10.1186/s12985-020-01454-3.
10
Analytical validity of nanopore sequencing for rapid SARS-CoV-2 genome analysis.纳米孔测序快速分析 SARS-CoV-2 基因组的分析有效性。
Nat Commun. 2020 Dec 9;11(1):6272. doi: 10.1038/s41467-020-20075-6.

本文引用的文献

1
The evolution of SARS-CoV-2.严重急性呼吸综合征冠状病毒2的进化
Nat Rev Microbiol. 2023 Jun;21(6):361-379. doi: 10.1038/s41579-023-00878-2. Epub 2023 Apr 5.
2
Emergence of the Omicron SARS-CoV-2 subvariants during the COVID-19 pandemic.新冠疫情期间奥密克戎SARS-CoV-2亚变体的出现。
Int J Surg. 2022 Dec;108:106994. doi: 10.1016/j.ijsu.2022.106994. Epub 2022 Nov 11.
3
Genomic epidemiology of the first two waves of SARS-CoV-2 in Canada.加拿大 SARS-CoV-2 前两波的基因组流行病学。
Elife. 2022 Aug 2;11:e73896. doi: 10.7554/eLife.73896.
4
Human-host transcriptomic analysis reveals unique early innate immune responses in different sub-phenotypes of COVID-19.人宿主转录组分析揭示了新冠肺炎不同亚表型中独特的早期固有免疫反应。
Clin Transl Med. 2022 Jun;12(6):e856. doi: 10.1002/ctm2.856.
5
Contamination detection in genomic data: more is not enough.基因组数据中的污染检测:更多并不一定更好。
Genome Biol. 2022 Feb 21;23(1):60. doi: 10.1186/s13059-022-02619-9.
6
Contamination in Reference Sequence Databases: Time for Divide-and-Rule Tactics.参考序列数据库中的污染:是时候采取分而治之的策略了。
Front Microbiol. 2021 Oct 22;12:755101. doi: 10.3389/fmicb.2021.755101. eCollection 2021.
7
Emerging SARS-CoV-2 Variants: A Review of Its Mutations, Its Implications and Vaccine Efficacy.新型严重急性呼吸综合征冠状病毒2(SARS-CoV-2)变体:其突变、影响及疫苗效力综述
Vaccines (Basel). 2021 Oct 18;9(10):1195. doi: 10.3390/vaccines9101195.
8
Sequencing DNA with nanopores: Troubles and biases.用纳米孔测序 DNA:问题和偏差。
PLoS One. 2021 Oct 1;16(10):e0257521. doi: 10.1371/journal.pone.0257521. eCollection 2021.
9
Post-acute COVID-19 syndrome (PCS) and health-related quality of life (HRQoL)-A systematic review and meta-analysis.新冠后综合征(PCS)和健康相关生活质量(HRQoL)的系统评价和荟萃分析。
J Med Virol. 2022 Jan;94(1):253-262. doi: 10.1002/jmv.27309. Epub 2021 Sep 7.
10
High-precision and cost-efficient sequencing for real-time COVID-19 surveillance.实时 COVID-19 监测的高精度、低成本测序。
Sci Rep. 2021 Jul 1;11(1):13669. doi: 10.1038/s41598-021-93145-4.