• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高通量测序数据中人为病毒少数变异体检测的改进

Improved detection of artifactual viral minority variants in high-throughput sequencing data.

作者信息

Welkers Matthijs R A, Jonges Marcel, Jeeninga Rienk E, Koopmans Marion P G, de Jong Menno D

机构信息

Department of Medical Microbiology, Academic Medical Centre Amsterdam, Netherlands.

Centre for Infectious Disease Control, National Institute for Public Health and the Environment Bilthoven, Netherlands ; Department of Viroscience, Erasmus Medical Center Rotterdam, Netherlands.

出版信息

Front Microbiol. 2015 Jan 22;5:804. doi: 10.3389/fmicb.2014.00804. eCollection 2014.

DOI:10.3389/fmicb.2014.00804
PMID:25657642
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4302989/
Abstract

High-throughput sequencing (HTS) of viral samples provides important information on the presence of viral minority variants. However, detection and accurate quantification is limited by the capacity to distinguish biological from artificial variation. In this study, errors related to the Illumina HiSeq2000 library generation and HTS process were investigated by determining minority variant frequencies in an influenza A/WSN/1933(H1N1) virus reverse-genetics plasmid pool. Errors related to amplification and sequencing were determined using the same plasmid pool, by generation of infectious virus using reverse genetics followed by in duplo reverse-transcriptase PCR (RT-PCR) amplification and HTS in the same sequence run. Results showed that after "best practice" quality control (QC), within the plasmid pool, one minority variant with a frequency >0.5% was identified, while 84 and 139 were identified in the RT-PCR amplified samples, indicating RT-PCR amplification artificially increased variation. Detailed analysis showed that artifactual minority variants could be identified by two major technical characteristics: their predominant presence in a single read orientation and uneven distribution of mismatches over the length of the reads. We demonstrate that by addition of two QC steps 95% of the artifactual minority variants could be identified. When our analysis approach was applied to three clinical samples 68% of the initially identified minority variants were identified as artifacts. Our study clearly demonstrated that, without additional QC steps, overestimation of viral minority variants is very likely to occur, mainly as a consequence of the required RT-PCR amplification step. The improved ability to detect and correct for artifactual minority variants, increases data resolution and could aid both past and future studies incorporating HTS. The source code has been made available through Sourceforge (https://sourceforge.net/projects/mva-ngs).

摘要

病毒样本的高通量测序(HTS)可提供有关病毒少数变异体存在情况的重要信息。然而,检测和准确量化受到区分生物学变异与人为变异能力的限制。在本研究中,通过测定甲型流感病毒A/WSN/1933(H1N1)病毒反向遗传学质粒库中的少数变异体频率,研究了与Illumina HiSeq2000文库构建和HTS过程相关的误差。使用相同的质粒库,通过反向遗传学产生感染性病毒,随后进行双份逆转录酶PCR(RT-PCR)扩增并在同一次测序运行中进行HTS,确定与扩增和测序相关的误差。结果显示,经过“最佳实践”质量控制(QC)后,在质粒库中鉴定出一个频率>0.5%的少数变异体,而在RT-PCR扩增样本中鉴定出84个和139个,表明RT-PCR扩增人为增加了变异。详细分析表明,人为少数变异体可通过两个主要技术特征来识别:它们主要存在于单一读取方向,以及错配在读取长度上分布不均。我们证明,通过添加两个QC步骤,可以识别95%的人为少数变异体。当我们的分析方法应用于三个临床样本时,最初鉴定出的少数变异体中有68%被鉴定为假象。我们的研究清楚地表明,如果没有额外的QC步骤,很可能会高估病毒少数变异体,这主要是所需RT-PCR扩增步骤的结果。检测和校正人为少数变异体的能力提高,增加了数据分辨率,并有助于过去和未来纳入HTS的研究。源代码已通过Sourceforge(https://sourceforge.net/projects/mva-ngs)提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/439719b4d2a3/fmicb-05-00804-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/e338a7ca28cf/fmicb-05-00804-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/65d3053192a1/fmicb-05-00804-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/0b20e4b5ead9/fmicb-05-00804-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/fc46a9b239b7/fmicb-05-00804-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/9b1f76c7d10d/fmicb-05-00804-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/439719b4d2a3/fmicb-05-00804-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/e338a7ca28cf/fmicb-05-00804-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/65d3053192a1/fmicb-05-00804-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/0b20e4b5ead9/fmicb-05-00804-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/fc46a9b239b7/fmicb-05-00804-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/9b1f76c7d10d/fmicb-05-00804-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0784/4302989/439719b4d2a3/fmicb-05-00804-g0006.jpg

相似文献

1
Improved detection of artifactual viral minority variants in high-throughput sequencing data.高通量测序数据中人为病毒少数变异体检测的改进
Front Microbiol. 2015 Jan 22;5:804. doi: 10.3389/fmicb.2014.00804. eCollection 2014.
2
Viral population analysis and minority-variant detection using short read next-generation sequencing.使用短读长下一代测序进行病毒群体分析和少数变体检测。
Philos Trans R Soc Lond B Biol Sci. 2013 Feb 4;368(1614):20120205. doi: 10.1098/rstb.2012.0205. Print 2013 Mar 19.
3
Distinguishing low frequency mutations from RT-PCR and sequence errors in viral deep sequencing data.从病毒深度测序数据中的逆转录聚合酶链式反应(RT-PCR)和序列错误中区分低频突变。
BMC Genomics. 2015 Mar 24;16(1):229. doi: 10.1186/s12864-015-1456-x.
4
Optimized quantification of intra-host viral diversity in SARS-CoV-2 and influenza virus sequence data.优化 SARS-CoV-2 和流感病毒序列数据中宿主内病毒多样性的定量分析。
mBio. 2023 Aug 31;14(4):e0104623. doi: 10.1128/mbio.01046-23. Epub 2023 Jun 30.
5
Next-generation sequencing of HIV-1 RNA genomes: determination of error rates and minimizing artificial recombination.HIV-1 RNA 基因组的下一代测序:错误率的确定和最小化人为重组。
PLoS One. 2013 Sep 18;8(9):e74249. doi: 10.1371/journal.pone.0074249. eCollection 2013.
6
Optimized Quantification of Intrahost Viral Diversity in SARS-CoV-2 and Influenza Virus Sequence Data.SARS-CoV-2和流感病毒序列数据中宿主内病毒多样性的优化定量分析
bioRxiv. 2022 Aug 16:2021.05.05.442873. doi: 10.1101/2021.05.05.442873.
7
Fluorescent amplification for next generation sequencing (FA-NGS) library preparation.荧光扩增下一代测序(FA-NGS)文库制备。
BMC Genomics. 2020 Jan 28;21(1):85. doi: 10.1186/s12864-020-6481-8.
8
Measurements of Intrahost Viral Diversity Are Extremely Sensitive to Systematic Errors in Variant Calling.宿主内病毒多样性的测量对变异位点检测中的系统误差极为敏感。
J Virol. 2016 Jul 11;90(15):6884-95. doi: 10.1128/JVI.00667-16. Print 2016 Aug 1.
9
Analysis of the genetic diversity of influenza A viruses using next-generation DNA sequencing.使用下一代DNA测序技术分析甲型流感病毒的遗传多样性。
BMC Genomics. 2015 Feb 14;16(1):79. doi: 10.1186/s12864-015-1284-z.
10
Detection of low-level HCV variants in DAA treated patients: comparison amongst three different NGS data analysis protocols.在 DAA 治疗的患者中检测低水平 HCV 变异体:三种不同 NGS 数据分析方案的比较。
Virol J. 2020 Jul 13;17(1):103. doi: 10.1186/s12985-020-01381-3.

引用本文的文献

1
Amino Acid Prevalence of HIV-1 Mutations by Direct Polymerase Chain Reaction and Single Genome Sequencing.通过直接聚合酶链反应和单基因组测序对HIV-1突变的氨基酸流行率
AIDS Res Hum Retroviruses. 2019 Oct;35(10):924-929. doi: 10.1089/AID.2018.0289. Epub 2019 Aug 26.
2
Genetic diversity and host adaptation of avian H5N1 influenza viruses during human infection.人类感染期间禽流感 H5N1 病毒的遗传多样性和宿主适应性。
Emerg Microbes Infect. 2019;8(1):262-271. doi: 10.1080/22221751.2019.1575700.
3
Seasonal Genetic Drift of Human Influenza A Virus Quasispecies Revealed by Deep Sequencing.

本文引用的文献

1
Emergence of the virulence-associated PB2 E627K substitution in a fatal human case of highly pathogenic avian influenza virus A(H7N7) infection as determined by Illumina ultra-deep sequencing.Illumina 超深度测序确定人感染高致病性禽流感病毒 A(H7N7)致死病例中与毒力相关的 PB2 E627K 取代的出现。
J Virol. 2014 Feb;88(3):1694-702. doi: 10.1128/JVI.02044-13. Epub 2013 Nov 20.
2
Viral population analysis and minority-variant detection using short read next-generation sequencing.使用短读长下一代测序进行病毒群体分析和少数变体检测。
Philos Trans R Soc Lond B Biol Sci. 2013 Feb 4;368(1614):20120205. doi: 10.1098/rstb.2012.0205. Print 2013 Mar 19.
3
深度测序揭示人类甲型流感病毒准种的季节性基因漂移
Front Microbiol. 2018 Oct 31;9:2596. doi: 10.3389/fmicb.2018.02596. eCollection 2018.
4
Comparison of an Diagnostic Next-Generation Sequencing Assay with Sanger Sequencing for HIV-1 Genotypic Resistance Testing.比较基于新一代测序技术的诊断检测与 Sanger 测序技术在 HIV-1 基因型耐药检测中的应用。
J Clin Microbiol. 2018 May 25;56(6). doi: 10.1128/JCM.00105-18. Print 2018 Jun.
5
Deep Sequencing of Influenza A Virus from a Human Challenge Study Reveals a Selective Bottleneck and Only Limited Intrahost Genetic Diversification.一项人体挑战研究中甲型流感病毒的深度测序揭示了一个选择性瓶颈以及仅有限的宿主内基因多样化。
J Virol. 2016 Nov 28;90(24):11247-11258. doi: 10.1128/JVI.01657-16. Print 2016 Dec 15.
Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data.
从下一代测序数据估计病毒遗传多样性面临的挑战与机遇
Front Microbiol. 2012 Sep 11;3:329. doi: 10.3389/fmicb.2012.00329. eCollection 2012.
4
Viral quasispecies evolution.病毒准种进化。
Microbiol Mol Biol Rev. 2012 Jun;76(2):159-216. doi: 10.1128/MMBR.05023-11.
5
A survey of error-correction methods for next-generation sequencing.下一代测序错误纠正方法综述。
Brief Bioinform. 2013 Jan;14(1):56-66. doi: 10.1093/bib/bbs015. Epub 2012 Apr 6.
6
Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID.使用引物 ID 对 HIV-1 蛋白酶基因进行准确的取样和深度测序。
Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):20166-71. doi: 10.1073/pnas.1110064108. Epub 2011 Nov 30.
7
Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and genome analyzer systems.Illumina HiSeq 和基因组分析仪系统生成的基因组高通量测序数据评估。
Genome Biol. 2011 Nov 8;12(11):R112. doi: 10.1186/gb-2011-12-11-r112.
8
ShoRAH: estimating the genetic diversity of a mixed sample from next-generation sequencing data.ShōRAH:基于下一代测序数据估计混合样本的遗传多样性。
BMC Bioinformatics. 2011 Apr 26;12:119. doi: 10.1186/1471-2105-12-119.
9
Beyond the consensus: dissecting within-host viral population diversity of foot-and-mouth disease virus by using next-generation genome sequencing.超越共识:利用下一代基因组测序技术剖析口蹄疫病毒的宿主内病毒种群多样性。
J Virol. 2011 Mar;85(5):2266-75. doi: 10.1128/JVI.01396-10. Epub 2010 Dec 15.
10
Error correction of next-generation sequencing data and reliable estimation of HIV quasispecies.下一代测序数据纠错和 HIV 准种可靠估计。
Nucleic Acids Res. 2010 Nov;38(21):7400-9. doi: 10.1093/nar/gkq655. Epub 2010 Jul 29.