Department of Biology, University of Florida, Gainesville, Florida 32611-8525 USA ; Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611-7800 USA.
Department of Biology, Oberlin College, Oberlin, Ohio 44074-1097 USA.
Appl Plant Sci. 2013 Jan 31;1(2). doi: 10.3732/apps.1200497. eCollection 2013 Feb.
We explored a targeted enrichment strategy to facilitate rapid and low-cost next-generation sequencing (NGS) of numerous complete plastid genomes from across the phylogenetic breadth of angiosperms. •
A custom RNA probe set including the complete sequences of 22 previously sequenced eudicot plastomes was designed to facilitate hybridization-based targeted enrichment of eudicot plastid genomes. Using this probe set and an Agilent SureSelect targeted enrichment kit, we conducted an enrichment experiment including 24 angiosperms (22 eudicots, two monocots), which were subsequently sequenced on a single lane of the Illumina GAIIx with single-end, 100-bp reads. This approach yielded nearly complete to complete plastid genomes with exceptionally high coverage (mean coverage: 717×), even for the two monocots. •
Our enrichment experiment was highly successful even though many aspects of the capture process employed were suboptimal. Hence, significant improvements to this methodology are feasible. With this general approach and probe set, it should be possible to sequence more than 300 essentially complete plastid genomes in a single Illumina GAIIx lane (achieving ∼50× mean coverage). However, given the complications of pooling numerous samples for multiplex sequencing and the limited number of barcodes (e.g., 96) available in commercial kits, we recommend 96 samples as a current practical maximum for multiplex plastome sequencing. This high-throughput approach should facilitate large-scale plastid genome sequencing at any level of phylogenetic diversity in angiosperms.
我们探索了一种靶向富集策略,以促进快速且低成本地对来自开花植物系统发育广泛范围的大量完整质体基因组进行下一代测序(NGS)。
设计了一个定制的 RNA 探针组,其中包括 22 个先前测序的真双子叶植物质体基因组的完整序列,以促进基于杂交的真双子叶植物质体基因组的靶向富集。使用该探针组和 Agilent SureSelect 靶向富集试剂盒,我们进行了一项富集实验,其中包括 24 种被子植物(22 种真双子叶植物,两种单子叶植物),随后在 Illumina GAIIx 上的单个泳道上进行单端、100-bp 读取的测序。该方法产生了近乎完整到完整的质体基因组,具有极高的覆盖率(平均覆盖率:717×),即使对于两种单子叶植物也是如此。
即使我们采用的捕获过程的许多方面都不是最佳的,但我们的富集实验还是非常成功的。因此,这种方法具有很大的改进空间。通过这种通用方法和探针组,应该有可能在单个 Illumina GAIIx 泳道上测序 300 多个基本上完整的质体基因组(实现平均覆盖率约为 50×)。然而,鉴于为了实现多重测序而对大量样本进行混合以及商业试剂盒中可用的有限条码数量(例如 96)的复杂性,我们建议将 96 个样本作为当前的实际多重质体测序的最大数量。这种高通量方法应该有助于在开花植物的任何系统发育多样性水平上进行大规模质体基因组测序。