利用多重PCR测序技术检测COVID-19患者SARS-CoV-2基因组中的缺失：消除假阳性

Deletion detection in SARS-CoV-2 genomes using multiplex-PCR sequencing from COVID-19 patients: elimination of false positives.

作者信息

Jiang Nan, Dewey Colin N, Yin John

机构信息

Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI, USA.

Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA.

出版信息

medRxiv. 2025 Apr 28:2025.04.15.25325794. doi: 10.1101/2025.04.15.25325794.

DOI:10.1101/2025.04.15.25325794

PMID:40343046

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12060941/

Abstract

Deletions are prevalent in the genomes of SARS-CoV-2 isolates from COVID-19 patients, but their roles in the severity, transmission, and persistence of disease are poorly understood. Millions of COVID-19 swab samples from patients have been sequenced and made available online, offering an unprecedented opportunity to study such deletions. Multiplex PCR-based amplicon sequencing (amplicon-seq) has been the most widely used method for sequencing clinical COVID-19 samples. However, existing bioinformatics methods applied to negative control samples sequenced by multiplex-PCR sequencing often yield large numbers of false-positive deletions. We found that these false positives commonly occur in short alignments, at low frequency and depth, and near primer-binding sites used for whole-genome amplification. To address this issue, we developed a filtering strategy, validated with positive control samples containing a known deletion. Our strategy accurately detected the known deletion and removed more than 99% of false positives. This method, applied to public COVID-19 swab data, revealed that deletions occurring independently of transcription regulatory sequences were about 20-fold less common than previously reported; however, they remain more frequent in symptomatic patients. Our optimized approach should enhance the reliability of SARS-CoV-2 deletion characterization from surveillance studies. Finally, our approach may guide the development of more reliable bioinformatics pipelines for genome sequence analyses of other viruses.

摘要

新冠病毒（SARS-CoV-2）感染者分离株的基因组中普遍存在缺失现象，但其在疾病严重程度、传播和持续性方面所起的作用却鲜为人知。来自患者的数百万份新冠病毒拭子样本已进行测序并在网上公开，这为研究此类缺失提供了前所未有的机会。基于多重PCR的扩增子测序（amplicon-seq）是对临床新冠病毒样本进行测序最广泛使用的方法。然而，应用于通过多重PCR测序的阴性对照样本的现有生物信息学方法，常常会产生大量假阳性缺失。我们发现，这些假阳性通常出现在短比对中，频率和深度较低，且靠近用于全基因组扩增的引物结合位点。为解决这一问题，我们开发了一种过滤策略，并使用含有已知缺失的阳性对照样本进行了验证。我们的策略准确检测到了已知缺失，并去除了超过99%的假阳性。将该方法应用于公开的新冠病毒拭子数据，结果显示，独立于转录调控序列发生的缺失比之前报道的情况少约20倍；然而，它们在有症状患者中仍然更为常见。我们优化后的方法应能提高监测研究中新冠病毒缺失特征描述的可靠性。最后，我们的方法可能会为开发用于其他病毒基因组序列分析的更可靠生物信息学流程提供指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d24/12060941/e645d94df0a8/nihpp-2025.04.15.25325794v2-f0001.jpg

相似文献

Deletion detection in SARS-CoV-2 genomes using multiplex-PCR sequencing from COVID-19 patients: elimination of false positives.利用多重PCR测序技术检测COVID-19患者SARS-CoV-2基因组中的缺失：消除假阳性

medRxiv. 2025 Apr 28:2025.04.15.25325794. doi: 10.1101/2025.04.15.25325794.

Assessment of two-pool multiplex long-amplicon nanopore sequencing of SARS-CoV-2.评估 SARS-CoV-2 两池复用长扩增子纳米孔测序。

J Med Virol. 2022 Jan;94(1):327-334. doi: 10.1002/jmv.27336. Epub 2021 Sep 23.

Non-SARS-CoV-2 respiratory viral detection and whole genome sequencing from COVID-19 rapid antigen test devices: a laboratory evaluation study.从 COVID-19 快速抗原检测设备中检测非 SARS-CoV-2 呼吸道病毒及全基因组测序：一项实验室评估研究。

Lancet Microbe. 2024 Apr;5(4):e317-e325. doi: 10.1016/S2666-5247(23)00375-0. Epub 2024 Feb 12.

Amplicon-Based Detection and Sequencing of SARS-CoV-2 in Nasopharyngeal Swabs from Patients With COVID-19 and Identification of Deletions in the Viral Genome That Encode Proteins Involved in Interferon Antagonism.基于扩增子的 SARS-CoV-2 在 COVID-19 患者鼻咽拭子中的检测和测序，以及鉴定编码干扰素拮抗相关蛋白的病毒基因组中的缺失。

Viruses. 2020 Oct 14;12(10):1164. doi: 10.3390/v12101164.

A short plus long-amplicon based sequencing approach improves genomic coverage and variant detection in the SARS-CoV-2 genome.一种基于短片段和长片段扩增子的测序方法提高了 SARS-CoV-2 基因组的覆盖度和变异检测能力。

PLoS One. 2022 Jan 13;17(1):e0261014. doi: 10.1371/journal.pone.0261014. eCollection 2022.

Multiple Occurrences of a 168-Nucleotide Deletion in SARS-CoV-2 ORF8, Unnoticed by Standard Amplicon Sequencing and Variant Calling Pipelines.SARS-CoV-2 ORF8 中 168 核苷酸缺失的多次出现，未被标准扩增子测序和变异 calling 分析流程注意到。

Viruses. 2021 Sep 18;13(9):1870. doi: 10.3390/v13091870.

Multiple approaches for massively parallel sequencing of SARS-CoV-2 genomes directly from clinical samples.多种方法可直接从临床样本中大规模平行测序 SARS-CoV-2 基因组。

Genome Med. 2020 Jun 30;12(1):57. doi: 10.1186/s13073-020-00751-4.

Lessons learned: overcoming common challenges in reconstructing the SARS-CoV-2 genome from short-read sequencing data via CoVpipe2.经验教训：通过CoVpipe2从短读长测序数据重建严重急性呼吸综合征冠状病毒2（SARS-CoV-2）基因组时克服常见挑战。

F1000Res. 2024 Apr 16;12:1091. doi: 10.12688/f1000research.136683.1. eCollection 2023.

Emerging Variants of SARS-CoV-2 and Novel Therapeutics Against Coronavirus (COVID-19)严重急性呼吸综合征冠状病毒2（SARS-CoV-2）的新变种及针对冠状病毒（COVID-19）的新型疗法

Optimization of primer sets and detection protocols for SARS-CoV-2 of coronavirus disease 2019 (COVID-19) using PCR and real-time PCR.优化用于 2019 年冠状病毒病（COVID-19）的 SARS-CoV-2 冠状病毒的聚合酶链反应（PCR）和实时 PCR 引物和检测方案。

Exp Mol Med. 2020 Jun;52(6):963-977. doi: 10.1038/s12276-020-0452-7. Epub 2020 Jun 16.

本文引用的文献

Influenza A genomic diversity during human infections underscores the strength of genetic drift and the existence of tight transmission bottlenecks.人类感染期间甲型流感的基因组多样性凸显了基因漂变的强度以及紧密传播瓶颈的存在。

Virus Evol. 2024 Jun 1;10(1):veae042. doi: 10.1093/ve/veae042. eCollection 2024.

Evolutionary deletions within the SARS-CoV-2 genome as signature trends for virus fitness and adaptation.SARS-CoV-2 基因组内的进化缺失是病毒适应和适应能力的特征趋势。

J Virol. 2024 Jan 23;98(1):e0140423. doi: 10.1128/jvi.01404-23. Epub 2023 Dec 13.

High-resolution mapping reveals the mechanism and contribution of genome insertions and deletions to RNA virus evolution.高分辨率图谱揭示了基因组插入和缺失对 RNA 病毒进化的机制和贡献。

Proc Natl Acad Sci U S A. 2023 Aug;120(31):e2304667120. doi: 10.1073/pnas.2304667120. Epub 2023 Jul 24.

Generation and Functional Analysis of Defective Viral Genomes during SARS-CoV-2 Infection.在 SARS-CoV-2 感染期间，缺陷型病毒基因组的产生和功能分析。

mBio. 2023 Jun 27;14(3):e0025023. doi: 10.1128/mbio.00250-23. Epub 2023 Apr 19.

DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data.DVGfinder：一种用于鉴定 RNA-Seq 数据中缺陷病毒基因组的元搜索工具。

Viruses. 2022 May 23;14(5):1114. doi: 10.3390/v14051114.

Reduced subgenomic RNA expression is a molecular indicator of asymptomatic SARS-CoV-2 infection.亚基因组RNA表达降低是无症状SARS-CoV-2感染的分子指标。

Commun Med (Lond). 2021 Sep 22;1:33. doi: 10.1038/s43856-021-00034-y. eCollection 2021.

Host-Virus Chimeric Events in SARS-CoV-2-Infected Cells Are Infrequent and Artifactual.宿主-病毒嵌合事件在感染 SARS-CoV-2 的细胞中罕见且为人工假象。

J Virol. 2021 Jul 12;95(15):e0029421. doi: 10.1128/JVI.00294-21.

Recommendations for accurate genotyping of SARS-CoV-2 using amplicon-based sequencing of clinical samples.使用基于扩增子的临床样本测序对 SARS-CoV-2 进行准确基因分型的建议。

Clin Microbiol Infect. 2021 Jul;27(7):1036.e1-1036.e8. doi: 10.1016/j.cmi.2021.03.029. Epub 2021 Apr 2.

The coronavirus proofreading exoribonuclease mediates extensive viral recombination.冠状病毒校对外切核糖核酸酶介导广泛的病毒重组。

PLoS Pathog. 2021 Jan 19;17(1):e1009226. doi: 10.1371/journal.ppat.1009226. eCollection 2021 Jan.

Next generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities.下一代 SARS-CoV-2 基因组测序：挑战、应用和机遇。

Brief Bioinform. 2021 Mar 22;22(2):616-630. doi: 10.1093/bib/bbaa297.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用多重PCR测序技术检测COVID-19患者SARS-CoV-2基因组中的缺失：消除假阳性

Deletion detection in SARS-CoV-2 genomes using multiplex-PCR sequencing from COVID-19 patients: elimination of false positives.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献