Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries.

Author information

Saski Christopher A, Bhattacharjee Ranjana, Scheffler Brian E, Asiedu Robert

Affiliations

Institute for Translational Genomics, Genomics and Computational Biology Laboratory, Clemson University, Clemson, SC, United States of America.

Bioscience Center, International Institute of Tropical Agriculture, Ibadan, PMB 5320, Nigeria.

Publication information

PLoS One. 2015 Jul 29;10(7):e0134031. doi: 10.1371/journal.pone.0134031. eCollection 2015.

Abstract

The falling cost of and rapid progress in next-generation sequencing techniques, coupled with high-performance computational approaches, have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp.) is a major food and cash crop in many countries, but research efforts to understand its genetics and generate genomic information for the crop have been limited. The availability of a large number of genomic resources, including genome-wide molecular markers, will accelerate breeding efforts and the application of genomic selection in yams. In the present study, several methods, including expressed sequence tag (EST) sequencing, de novo sequencing, and genotyping-by-sequencing (GBS) profiling, were applied to two yam (Dioscorea alata L.) genotypes (TDa 95/00328 and TDa 95-310) to generate genomic resources for use in improvement programs. These resources include a comprehensive set of EST-SSRs, genomic SSRs, whole-genome SNPs, and reduced-representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST sequences generated from the two genotypes. A set of 388 EST-SSRs was validated as polymorphic, a polymorphism rate of 34%, when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole-genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom-made pipeline selected 573 genomic SSRs common to the two genotypes, of which only eight failed, 478 were polymorphic, and 62 were monomorphic, indicating a polymorphism rate of 83.5%. Additionally, 288,505 high-quality SNPs were identified between these two genotypes. GBS reads from these two genotypes also revealed 36,790 overlapping SNP positions distributed throughout the genome. Our use of different approaches to generate genomic resources provides an unbiased glimpse into the publicly available EST sequences, the yam genome, and GBS profiles, and affirms that the genomic complexity can be methodically unraveled. These resources constitute a critical foundation for future studies in linkage mapping, germplasm analysis, and predictive breeding.
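
The polymorphism rates quoted in the abstract can be approximately reproduced from the reported counts. The sketch below is illustrative only and is not the authors' pipeline. The function name `polymorphism_rate` is hypothetical, and the choice of denominators (all 1,152 developed EST-SSRs and all 573 selected genomic SSRs) is an assumption, since the abstract does not state how the rates were calculated.

```python
def polymorphism_rate(polymorphic: int, tested: int) -> float:
    """Return the percentage of tested markers that were polymorphic."""
    if tested <= 0:
        raise ValueError("tested must be a positive count")
    return 100.0 * polymorphic / tested


# Counts come from the abstract; the denominators are an assumption.
est_ssr_rate = polymorphism_rate(388, 1152)      # EST-SSRs
genomic_ssr_rate = polymorphism_rate(478, 573)   # genomic SSRs

print(f"EST-SSR polymorphism rate: {est_ssr_rate:.1f}%")
print(f"Genomic SSR polymorphism rate: {genomic_ssr_rate:.1f}%")
```

Under these assumptions the computed values fall close to the 34% and 83.5% stated in the abstract. Small differences may reflect rounding or a different denominator in the original analysis.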

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3060/4519108/9dfb97af290a/pone.0134031.g001.jpg
