Department of Botany, University of British Columbia, Vancouver BC, Canada V6T 1Z4.
Am J Bot. 2012 Feb;99(2):320-9. doi: 10.3732/ajb.1100570. Epub 2012 Feb 1.
To reliably identify lineages below the species level such as subspecies or varieties, we propose an extension to DNA-barcoding using next-generation sequencing to produce whole organellar genomes and substantial nuclear ribosomal sequence. Because this method uses much longer versions of the traditional DNA-barcoding loci in the plastid and ribosomal DNA, we call our approach ultra-barcoding (UBC).
We used high-throughput next-generation sequencing to scan the genome and generate reliable sequence of high copy number regions. Using this method, we examined whole plastid genomes as well as nearly 6000 bases of nuclear ribosomal DNA sequences for nine genotypes of Theobroma cacao and an individual of the related species T. grandiflorum, as well as an additional publicly available whole plastid genome of T. cacao.
All individuals of T. cacao examined were uniquely distinguished, and evidence of reticulation and gene flow was observed. Sequence variation was observed in some of the canonical barcoding regions between species, but other regions of the chloroplast were more variable both within species and between species, as were ribosomal spacers. Furthermore, no single region provides the level of data available using the complete plastid genome and rDNA.
Our data demonstrate that UBC is a viable, increasingly cost-effective approach for reliably distinguishing varieties and even individual genotypes of T. cacao. This approach shows great promise for applications where very closely related or interbreeding taxa must be distinguished.
为了可靠地鉴定物种以下的谱系,如亚种或变种,我们提出了一种扩展 DNA 条码技术的方法,使用下一代测序来产生完整的细胞器基因组和大量核核糖体序列。由于这种方法使用了传统 DNA 条码基因座在质体和核糖体 DNA 中更长的版本,我们称之为超条码(UBC)。
我们使用高通量下一代测序来扫描基因组并生成高拷贝数区域的可靠序列。使用这种方法,我们检查了九个可可基因型的整个质体基因组以及核核糖体 DNA 序列的近 6000 个碱基,以及可可相关物种 T. grandiflorum 的一个个体,以及可可的另一个公开可用的完整质体基因组。
所有检查的可可个体都被独特地区分,并且观察到了网状和基因流的证据。在一些经典的条码区域中观察到了种间的序列变异,但质体的其他区域在种内和种间都更加多变,核糖体间隔区也是如此。此外,没有一个单一的区域能够提供使用完整质体基因组和 rDNA 获得的数据水平。
我们的数据表明,UBC 是一种可行的、成本效益不断提高的方法,能够可靠地区分可可的品种,甚至是个别基因型。这种方法在必须区分非常近缘或杂交的分类群的应用中具有很大的潜力。