Gallet Romain, Fabre Frédéric, Michalakis Yannis, Blanc Stéphane
INRA, UMR BGPI, INRA-CIRAD-SupAgro, Cirad TA-A54/K, Campus International de Baillarguet, Montpellier, France
UMR 1065 Santé et Agroécologie du Vignoble, INRA, Villenave d'Ornon, France.
J Virol. 2017 Jul 27;91(16). doi: 10.1128/JVI.00561-17. Print 2017 Aug 15.
The invention of next-generation sequencing (NGS) techniques marked the coming of a new era in the detection of the genetic diversity of intrahost viral populations. A good understanding of the genetic structure of these populations requires, first, the ability to identify the different isolates or variants and, second, the ability to accurately quantify them. However, the initial amplification step of NGS studies can impose potential quantitative biases, modifying the variant relative frequencies. In particular, the number of target molecules (NTM) used during the amplification step is vastly overlooked although of primary importance, as it sets the limit of the accuracy and sensitivity of the sequencing procedure. In the present article, we investigated quantitative biases in an NGS study of populations of a multipartite single-stranded DNA (ssDNA) virus at different steps of the procedure. We studied 20 independent populations of the ssDNA virus faba bean necrotic stunt virus (FBNSV) in two host plants, and FBNSV is a multipartite virus composed of eight genomic segments, whose specific and host-dependent relative frequencies are defined as the "genome formula." Our results show a significant distortion of the FBNSV genome formula after the amplification and sequencing steps. We also quantified the genetic bottleneck occurring at the amplification step by documenting the NTM of two genomic segments of FBNSV. We argue that the NTM must be documented and carefully considered when determining the sensitivity and accuracy of data from NGS studies. The advent of next-generation sequencing (NGS) techniques now enables study of the genetic diversity of viral populations. A good understanding of the genetic structure of these populations first requires the ability to identify the different isolates or variants and second requires the ability to accurately quantify them. Prior to sequencing, viral genomes need to be amplified, a step that potentially imposes quantitative biases and modifies the viral population structure. In particular, the number of target molecules (NTM) used during the amplification step is of primary importance, as it sets the limit of the accuracy and sensitivity of the sequencing procedure. In this work, we used 20 replicated populations of the multipartite faba bean necrotic stunt virus (FBNSV) to estimate the various limitations of ultradeep-sequencing studies performed on intrahost viral populations. We report quantitative biases during rolling-circle amplification and the NTM of two genomic segments of FBNSV.
下一代测序(NGS)技术的发明标志着宿主内病毒群体遗传多样性检测新时代的到来。要深入了解这些群体的遗传结构,首先需要具备识别不同分离株或变体的能力,其次需要具备准确量化它们的能力。然而,NGS研究的初始扩增步骤可能会产生潜在的定量偏差,改变变体的相对频率。特别是,扩增步骤中使用的靶分子数量(NTM)虽然至关重要,但却被大大忽视了,因为它设定了测序程序准确性和灵敏度的极限。在本文中,我们在一个多组分单链DNA(ssDNA)病毒群体的NGS研究中,调查了该程序不同步骤中的定量偏差。我们研究了两种宿主植物中20个独立的ssDNA病毒蚕豆坏死矮化病毒(FBNSV)群体,FBNSV是一种由八个基因组片段组成的多组分病毒,其特定的和宿主依赖的相对频率被定义为“基因组公式”。我们的结果显示,在扩增和测序步骤之后,FBNSV基因组公式出现了显著扭曲。我们还通过记录FBNSV两个基因组片段的NTM,量化了扩增步骤中出现的遗传瓶颈。我们认为,在确定NGS研究数据的灵敏度和准确性时,必须记录并仔细考虑NTM。下一代测序(NGS)技术的出现现在使得研究病毒群体的遗传多样性成为可能。要深入了解这些群体的遗传结构,首先需要具备识别不同分离株或变体的能力,其次需要具备准确量化它们的能力。在测序之前,病毒基因组需要进行扩增,这一步骤可能会产生定量偏差并改变病毒群体结构。特别是,扩增步骤中使用的靶分子数量(NTM)至关重要,因为它设定了测序程序准确性和灵敏度的极限。在这项工作中,我们使用了20个重复的多组分蚕豆坏死矮化病毒(FBNSV)群体,来估计对宿主内病毒群体进行的超深度测序研究的各种局限性。我们报告了滚环扩增过程中的定量偏差以及FBNSV两个基因组片段的NTM。