Blischak Paul D, Wenzel Aaron J, Wolfe Andrea D
Department of Evolution, Ecology, and Organismal Biology, Ohio State University, 318 W. 12th Avenue, Columbus, Ohio 43210 USA.
Appl Plant Sci. 2014 Dec 4;2(12). doi: 10.3732/apps.1400044. eCollection 2014 Dec.
Penstemon (Plantaginaceae) is a large and diverse genus endemic to North America. However, determining the phylogenetic relationships among its 280 species has been difficult due to its recent evolutionary radiation. The development of a large, multilocus data set can help to resolve this challenge. •
Using both previously sequenced genomic libraries and our own low-coverage whole-genome shotgun sequencing libraries, we used the MAKER2 Annotation Pipeline to identify gene regions for the development of sequencing loci from six extremely low-coverage Penstemon genomes (∼0.005×-0.007×). We also compared this approach to BLAST searches, and conducted analyses to characterize sequence divergence across the species sequenced. •
Annotations and gene predictions were successfully added to more than 10,000 contigs for potential use in downstream primer design. Primers were then designed for chloroplast, mitochondrial, and nuclear loci from these annotated sequences. MAKER2 identified longer gene regions in all six Penstemon genomes when compared with BLASTN and BLASTX searches. The average level of sequence divergence among the six species was 7.14%. •
Combining bioinformatics tools into a workflow that produces annotations can be useful for creating potential phylogenetic markers from thousands of sequences even when genome coverage is extremely low and reference data are only available from distant relatives. Furthermore, the output from MAKER2 contains information about important gene features, such as exon boundaries, and can be easily integrated with visualization tools to facilitate the process of marker development.
钓钟柳属(车前科)是北美特有的一个庞大且多样的属。然而,由于其近期的进化辐射,确定其280个物种之间的系统发育关系一直很困难。开发一个大型的多位点数据集有助于应对这一挑战。
我们利用先前测序的基因组文库和我们自己的低覆盖度全基因组鸟枪法测序文库,使用MAKER2注释管道从六个极低覆盖度的钓钟柳属基因组(约0.005× - 0.007×)中识别基因区域,以开发测序位点。我们还将这种方法与BLAST搜索进行了比较,并进行了分析以表征所测序物种间的序列差异。
成功地为超过10,000个重叠群添加了注释和基因预测,可用于下游引物设计。然后从这些注释序列中为叶绿体、线粒体和核基因座设计引物。与BLASTN和BLASTX搜索相比,MAKER2在所有六个钓钟柳属基因组中识别出了更长的基因区域。六个物种之间的平均序列差异水平为7.14%。
将生物信息学工具整合到一个产生注释的工作流程中,即使在基因组覆盖度极低且仅能从远缘亲属获得参考数据的情况下,也有助于从数千个序列中创建潜在的系统发育标记。此外,MAKER2的输出包含有关重要基因特征(如外显子边界)的信息,并且可以很容易地与可视化工具集成,以促进标记开发过程。