Operational Directorate of Virology, Veterinary and Agrochemical Research Centre, Ukkel, Brussels, Belgium.
PLoS One. 2013 Sep 26;8(9):e76144. doi: 10.1371/journal.pone.0076144. eCollection 2013.
Sequence Independent Single Primer Amplification is one of the most widely used random amplification approaches in virology for sequencing template preparation. This technique relies on oligonucleotides consisting of a 3' random part used to prime complementary DNA synthesis and a 5' defined tag sequence for subsequent amplification. Recently, this amplification method was combined with next generation sequencing to obtain viral sequences. However, these studies showed a biased distribution of the resulting sequence reads over the analyzed genomes. The aim of this study was to elucidate the mechanisms that lead to biased sequence depth when using random amplification. Avian paramyxovirus type 8 was used as a model RNA virus to investigate these mechanisms. We showed, based on in silico analysis of the sequence depth in relation to GC-content, predicted RNA secondary structure and sequence complementarity to the 3' part of the tag sequence, that the tag sequence has the main contribution to the observed bias in sequence depth. We confirmed this finding experimentally using both fragmented and non-fragmented viral RNAs as well as primers differing in random oligomer length (6 or 12 nucleotides) and in the sequence of the amplification tag. The observed oligonucleotide annealing bias can be reduced by extending the random oligomer sequence and by in silico combining sequence data from SISPA experiments using different 5' defined tag sequences. These findings contribute to the optimization of random nucleic acid amplification protocols that are currently required for downstream applications such as viral metagenomics and microarray analysis.
序列独立单引物扩增是病毒学中最广泛应用的随机扩增方法之一,用于测序模板制备。该技术依赖于由 3' 随机部分组成的寡核苷酸,用于启动互补 DNA 合成,以及 5' 定义的标签序列,用于随后的扩增。最近,这种扩增方法与下一代测序相结合,以获得病毒序列。然而,这些研究表明,在分析基因组时,所得序列读数的分布存在偏倚。本研究旨在阐明使用随机扩增导致序列深度偏倚的机制。禽副黏病毒 8 型被用作模型 RNA 病毒来研究这些机制。我们基于序列深度与 GC 含量、预测 RNA 二级结构和与标签序列 3' 部分的序列互补性的关系的计算机分析表明,标签序列是导致观察到的序列深度偏倚的主要因素。我们通过使用碎片化和非碎片化病毒 RNA 以及随机寡核苷酸长度(6 或 12 个核苷酸)和扩增标签序列不同的引物,从实验上证实了这一发现。通过延长随机寡核苷酸序列和通过计算机将使用不同 5' 定义标签序列的 SISPA 实验的序列数据组合,可以减少观察到的寡核苷酸退火偏倚。这些发现有助于优化目前用于下游应用(如病毒宏基因组学和微阵列分析)的随机核酸扩增方案。