Suppr超能文献

开发和验证油棕(Elaeis guineensis Jacq.)全基因组和基因微卫星标记:第一个微卫星数据库(OpSatdb)。

Development and validation of whole genome-wide and genic microsatellite markers in oil palm (Elaeis guineensis Jacq.): First microsatellite database (OpSatdb).

机构信息

ICAR-Indian Institute of Oil Palm Research, Pedavegi-534 450, West Godavari (Dt), Andhra Pradesh, India.

ICAR-Indian Agricultural Statistics Research Institute, New Delhi, 12, India.

出版信息

Sci Rep. 2019 Feb 13;9(1):1899. doi: 10.1038/s41598-018-37737-7.

Abstract

The availability of large expressed sequence tag (EST) and whole genome databases of oil palm enabled the development of a data base of microsatellite markers. For this purpose, an EST database consisting of 40,979 EST sequences spanning 27 Mb and a chromosome-wise whole genome databases were downloaded. A total of 3,950 primer pairs were identified and developed from EST sequences. The tri and tetra nucleotide repeat motifs were most prevalent (each 24.75%) followed by di-nucleotide repeat motifs. Whole genome-wide analysis found a total of 245,654 SSR repeats across the 16 chromosomes of oil palm, of which 38,717 were compound microsatellite repeats. A web application, OpSatdb, the first microsatellite database of oil palm, was developed using the PHP and MySQL database ( https://ssr.icar.gov.in/index.php ). It is a simple and systematic web-based search engine for searching SSRs based on repeat motif type, repeat type, and primer details. High synteny was observed between oil palm and rice genomes. The mapping of ESTs having SSRs by Blast2GO resulted in the identification of 19.2% sequences with gene ontology (GO) annotations. Randomly, a set of ten genic SSRs and five genomic SSRs were used for validation and genetic diversity on 100 genotypes belonging to the world oil palm genetic resources. The grouping pattern was observed to be broadly in accordance with the geographical origin of the genotypes. The identified genic and genome-wide SSRs can be effectively useful for various genomic applications of oil palm, such as genetic diversity, linkage map construction, mapping of QTLs, marker-assisted selection, and comparative population studies.

摘要

油棕大量表达序列标签 (EST) 和全基因组数据库的可用性使得微卫星标记数据库的开发成为可能。为此,下载了一个包含 40979 个 EST 序列、跨越 27 Mb 的 EST 数据库和一个按染色体划分的全基因组数据库。总共从 EST 序列中鉴定和开发了 3950 对引物。三核苷酸和四核苷酸重复基序最为常见(各占 24.75%),其次是二核苷酸重复基序。全基因组分析在油棕的 16 条染色体上共发现了 245654 个 SSR 重复序列,其中 38717 个是复合微卫星重复序列。OpSatdb 是油棕的第一个微卫星数据库,是使用 PHP 和 MySQL 数据库开发的网络应用程序(https://ssr.icar.gov.in/index.php)。它是一个简单而系统的基于网络的搜索引擎,可根据重复基序类型、重复类型和引物详细信息搜索 SSR。油棕和水稻基因组之间观察到高度同线性。通过 Blast2GO 将具有 SSR 的 ESTs 进行映射,导致 19.2%的序列具有基因本体 (GO) 注释。随机选择了一组 10 个基因 SSR 和 5 个基因组 SSR,用于对 100 个属于世界油棕遗传资源的基因型进行验证和遗传多样性分析。观察到的分组模式与基因型的地理起源大致相符。鉴定的基因和全基因组 SSR 可有效用于油棕的各种基因组应用,如遗传多样性、连锁图谱构建、QTL 作图、标记辅助选择和比较群体研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c45/6374426/3e9bcc12bf8a/41598_2018_37737_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验