Molecular Ecology and Fisheries Genetics Laboratory, School of Biological Sciences, Environment Centre Wales, Bangor University, Deiniol Road, Gwynedd LL57 2UW, UK.
Nucleic Acids Res. 2012 May;40(9):e66. doi: 10.1093/nar/gks002. Epub 2012 Jan 25.
Eukaryotic diversity in environmental samples is often assessed via PCR-based amplification of nSSU genes. However, estimates of diversity derived from pyrosequencing environmental data sets are often inflated, mainly because of the formation of chimeric sequences during PCR amplification. Chimeras are hybrid products composed of distinct parental sequences that can lead to the misinterpretation of diversity estimates. We have analyzed the effect of sample richness, evenness and phylogenetic diversity on the formation of chimeras using a nSSU data set derived from 454 Roche pyrosequencing of replicated, large control pools of closely and distantly related nematode mock communities, of known intragenomic identity and richness. To further investigate how chimeric molecules are formed, the nSSU gene secondary structure was analyzed in several individuals. For the first time in eukaryotes, chimera formation proved to be higher in both richer and more genetically diverse samples, thus providing a novel perspective of chimera formation in pyrosequenced environmental data sets. Findings contribute to a better understanding of the nature and mechanisms involved in chimera formation during PCR amplification of environmentally derived DNA. Moreover, given the similarities between biodiversity analyses using amplicon sequencing and those used to assess genomic variation, our findings have potential broad application for identifying genetic variation in homologous loci or multigene families in general.
真核生物多样性通常通过基于 PCR 的扩增 nSSU 基因进行评估。然而,来自焦磷酸测序环境数据集的多样性估计值通常会偏高,这主要是由于 PCR 扩增过程中形成嵌合体序列。嵌合体是由不同的亲本序列组成的杂交产物,可能导致多样性估计值的错误解释。我们使用源自 454 Roche 焦磷酸测序的、经过复制的、密切和远距离相关的线虫模拟群落的大控制池的 nSSU 数据集,分析了样本丰富度、均匀度和系统发育多样性对嵌合体形成的影响,这些模拟群落具有已知的基因组内身份和丰富度。为了进一步研究嵌合分子是如何形成的,我们在几个个体中分析了 nSSU 基因的二级结构。在真核生物中首次证明,嵌合体形成在丰富度更高和遗传多样性更高的样本中更高,从而为焦磷酸测序环境数据集的嵌合体形成提供了新的视角。研究结果有助于更好地理解在环境衍生 DNA 的 PCR 扩增过程中嵌合体形成的性质和机制。此外,鉴于使用扩增子测序进行生物多样性分析与用于评估基因组变异的分析之间的相似性,我们的发现有可能广泛应用于鉴定同源基因座或一般多基因家族中的遗传变异。