Suppr超能文献

胡麻草属(车前科)的基因预测与注释:一种基于极低覆盖度基因组测序进行标记开发的工作流程

Gene prediction and annotation in Penstemon (Plantaginaceae): A workflow for marker development from extremely low-coverage genome sequencing.

作者信息

Blischak Paul D, Wenzel Aaron J, Wolfe Andrea D

机构信息

Department of Evolution, Ecology, and Organismal Biology, Ohio State University, 318 W. 12th Avenue, Columbus, Ohio 43210 USA.

出版信息

Appl Plant Sci. 2014 Dec 4;2(12). doi: 10.3732/apps.1400044. eCollection 2014 Dec.

Abstract

PREMISE OF THE STUDY

Penstemon (Plantaginaceae) is a large and diverse genus endemic to North America. However, determining the phylogenetic relationships among its 280 species has been difficult due to its recent evolutionary radiation. The development of a large, multilocus data set can help to resolve this challenge. •

METHODS

Using both previously sequenced genomic libraries and our own low-coverage whole-genome shotgun sequencing libraries, we used the MAKER2 Annotation Pipeline to identify gene regions for the development of sequencing loci from six extremely low-coverage Penstemon genomes (∼0.005×-0.007×). We also compared this approach to BLAST searches, and conducted analyses to characterize sequence divergence across the species sequenced. •

RESULTS

Annotations and gene predictions were successfully added to more than 10,000 contigs for potential use in downstream primer design. Primers were then designed for chloroplast, mitochondrial, and nuclear loci from these annotated sequences. MAKER2 identified longer gene regions in all six Penstemon genomes when compared with BLASTN and BLASTX searches. The average level of sequence divergence among the six species was 7.14%. •

DISCUSSION

Combining bioinformatics tools into a workflow that produces annotations can be useful for creating potential phylogenetic markers from thousands of sequences even when genome coverage is extremely low and reference data are only available from distant relatives. Furthermore, the output from MAKER2 contains information about important gene features, such as exon boundaries, and can be easily integrated with visualization tools to facilitate the process of marker development.

摘要

研究前提

钓钟柳属(车前科)是北美特有的一个庞大且多样的属。然而,由于其近期的进化辐射,确定其280个物种之间的系统发育关系一直很困难。开发一个大型的多位点数据集有助于应对这一挑战。

方法

我们利用先前测序的基因组文库和我们自己的低覆盖度全基因组鸟枪法测序文库,使用MAKER2注释管道从六个极低覆盖度的钓钟柳属基因组(约0.005× - 0.007×)中识别基因区域,以开发测序位点。我们还将这种方法与BLAST搜索进行了比较,并进行了分析以表征所测序物种间的序列差异。

结果

成功地为超过10,000个重叠群添加了注释和基因预测,可用于下游引物设计。然后从这些注释序列中为叶绿体、线粒体和核基因座设计引物。与BLASTN和BLASTX搜索相比,MAKER2在所有六个钓钟柳属基因组中识别出了更长的基因区域。六个物种之间的平均序列差异水平为7.14%。

讨论

将生物信息学工具整合到一个产生注释的工作流程中,即使在基因组覆盖度极低且仅能从远缘亲属获得参考数据的情况下,也有助于从数千个序列中创建潜在的系统发育标记。此外,MAKER2的输出包含有关重要基因特征(如外显子边界)的信息,并且可以很容易地与可视化工具集成,以促进标记开发过程。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23d5/4259454/e9a7127cb817/apps.1400044fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验