Beijing Advanced Innovation Center for Food Nutrition and Human Health, College of Plant Protection, China Agricultural University, Beijing 100193, People's Republic of China.
BGI-Shenzhen, Shenzhen, 518083, China.
Gigascience. 2017 Dec 1;6(12):1-8. doi: 10.1093/gigascience/gix104.
Over the past decade, biodiversity researchers have dedicated tremendous efforts to constructing DNA reference barcodes for rapid species registration and identification. Although analytical cost for standard DNA barcoding has been significantly reduced since early 2000, further dramatic reduction in barcoding costs is unlikely because Sanger sequencing is approaching its limits in throughput and chemistry cost. Constraints in barcoding cost not only led to unbalanced barcoding efforts around the globe, but also prevented high-throughput sequencing (HTS)-based taxonomic identification from applying binomial species names, which provide crucial linkages to biological knowledge. We developed an Illumina-based pipeline, HIFI-Barcode, to produce full-length Cytochrome c oxidase subunit I (COI) barcodes from pooled polymerase chain reaction amplicons generated by individual specimens. The new pipeline generated accurate barcode sequences that were comparable to Sanger standards, even for different haplotypes of the same species that were only a few nucleotides different from each other. Additionally, the new pipeline was much more sensitive in recovering amplicons at low quantity. The HIFI-Barcode pipeline successfully recovered barcodes from more than 78% of the polymerase chain reactions that didn't show clear bands on the electrophoresis gel. Moreover, sequencing results based on the single molecular sequencing platform Pacbio confirmed the accuracy of the HIFI-Barcode results. Altogether, the new pipeline can provide an improved solution to produce full-length reference barcodes at about one-tenth of the current cost, enabling construction of comprehensive barcode libraries for local fauna, leading to a feasible direction for DNA barcoding global biomes.
在过去的十年中,生物多样性研究人员投入了大量精力来构建 DNA 参考条形码,以实现快速的物种注册和鉴定。尽管自 2000 年初以来标准 DNA 条形码的分析成本已经大大降低,但条形码成本的进一步大幅降低不太可能,因为 Sanger 测序在通量和化学成本方面已经接近其极限。条形码成本的限制不仅导致了全球范围内不平衡的条形码工作,而且还阻止了基于高通量测序(HTS)的分类鉴定采用二项式物种名称,这些名称为生物知识提供了关键的联系。我们开发了一种基于 Illumina 的管道 HIFI-Barcode,用于从单个标本生成的聚合酶链反应扩增子的混合池中产生全长细胞色素 c 氧化酶亚基 I(COI)条形码。新的管道生成了准确的条形码序列,与 Sanger 标准相当,即使是彼此之间只有几个核苷酸差异的同一物种的不同单倍型也是如此。此外,新的管道在回收低数量的扩增子时更加敏感。HIFI-Barcode 管道成功地从超过 78%的聚合酶链反应中回收了条形码,这些反应在电泳凝胶上没有显示出清晰的条带。此外,基于单分子测序平台 Pacbio 的测序结果证实了 HIFI-Barcode 结果的准确性。总的来说,新的管道可以以当前成本的十分之一左右提供一种改进的解决方案来产生全长参考条形码,从而为当地动物群构建全面的条形码库,为 DNA 条形码全球生物群落提供可行的方向。