Characterization of RFLP probe sequences for gene discovery and SSR development in Sorghum bicolor (L.) Moench.

Authors

Schloss J., Mitchell E., White M., Kukatla R., Bowers E., Paterson H., Kresovich S.

Affiliation

Department of Plant Breeding and Institute for Genomic Diversity, Cornell University, 157 Biotechnology Building, Ithaca, NY 14853, USA.

Publication information

Theor Appl Genet. 2002 Nov;105(6-7):912-920. doi: 10.1007/s00122-002-0991-4. Epub 2002 Jul 30.

DOI:10.1007/s00122-002-0991-4

PMID:12582917

Abstract

In this study, we collected and analyzed DNA sequence data for 789 previously mapped RFLP probes from Sorghum bicolor (L.) Moench. DNA sequences, comprising 894 non-redundant contigs and end sequences, were searched against three GenBank databases, nucleotide (nt), protein (nr) and EST (dbEST), using BLAST algorithms. Matching ESTs were also searched against nt and nr. Translated DNA sequences were then searched against the conserved domain database (CDD) to determine if functional domains/motifs were congruent with the proteins identified in previous searches. More than half (500/894 or 56%) of the query sequences had significant matches in at least one of the GenBank searches. Overall, proteins identified for 148 sequences (17%) were consistent among all searches, of which 66 sequences (7%) contained congruent coding domains. The RFLP probe sequences were also evaluated for the presence of simple sequence repeats (SSRs) and 60 SSRs were developed and assayed in an array of sorghum germplasm comprising inbreds, landraces and wild relatives. Overall, these SSR loci had lower levels of polymorphism (D = 0.46, averaged over 51 polymorphic loci) compared with sorghum SSRs that were isolated by library hybridization screens (D = 0.69, averaged over 38 polymorphic loci). This result was probably due to the relatively small proportion of di-nucleotide repeat-containing markers (42% of the total SSR loci) obtained from the DNA sequence data. These di-nucleotide markers also contained shorter repeat motifs than those isolated from genomic libraries. Based on BLAST results, 24 SSRs (40%) were located within, or near, previously annotated or hypothetical genes. We determined the location of 19 of these SSRs relative to putative coding regions. In general, SSRs located in coding regions were less polymorphic (D = 0.07, averaged over three loci) than those from gene flanking regions, UTRs and introns (D = 0.49, averaged over 16 loci). The sequence information and SSR loci generated through this study will be valuable for application to sorghum genetics and improvement, including gene discovery, marker-assisted selection, diversity and pedigree analyses, comparative mapping and evolutionary genetic studies.
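The two computational ideas in the abstract, scanning sequences for SSRs and summarizing polymorphism with a per-locus D value, can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The repeat thresholds in `MIN_REPEATS` are hypothetical, the function names are invented for illustration, and the abstract does not define D, so the sketch assumes the common gene diversity formula D = 1 - sum(p_i^2) over allele frequencies p_i.

```python
import re
from collections import Counter

# Hypothetical thresholds (not from the paper): minimum number of
# tandem copies required for each motif length.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def is_primitive(motif):
    """True if the motif is not itself a repeat of a shorter unit (e.g. AGAG, AAA)."""
    n = len(motif)
    return not any(
        n % d == 0 and motif == motif[:d] * (n // d) for d in range(1, n)
    )


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, copies) for each perfect tandem repeat in seq."""
    seq = seq.upper()
    hits = []
    for unit_len, min_n in sorted(min_repeats.items()):
        # Group 1 captures one motif; the backreference demands at least
        # min_n - 1 further copies, i.e. min_n copies in total.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // unit_len))
    return hits


def gene_diversity(alleles):
    """Assumed definition: D = 1 - sum of squared allele frequencies at one locus."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return 1.0 - sum((c / total) ** 2 for c in counts.values())


if __name__ == "__main__":
    print(find_ssrs("TT" + "AG" * 6 + "CC"))
    print(gene_diversity(["a", "a", "b", "b"]))
```

Under this assumed definition, a locus with a single allele across all accessions gives D = 0, and D rises as alleles become more numerous and more evenly distributed, which matches the direction of the comparisons reported in the abstract.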

Similar articles

Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.).

Theor Appl Genet. 2003 Feb;106(3):411-22. doi: 10.1007/s00122-002-1031-0. Epub 2002 Sep 14.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

An integrated SSR and RFLP linkage map of Sorghum bicolor (L.) Moench.

Genome. 2000 Dec;43(6):988-1002.

An in silico mining for simple sequence repeats from expressed sequence tags of zebrafish, medaka, Fundulus, and Xiphophorus.

In Silico Biol. 2005;5(5-6):439-63.

Exploiting EST databases for the development and characterization of EST-SSRs in the Pacific oyster (Crassostrea gigas).

J Hered. 2008 Mar-Apr;99(2):208-14. doi: 10.1093/jhered/esm124. Epub 2008 Jan 30.

A database of simple sequence repeats from cereal and legume expressed sequence tags mined in silico: survey and evaluation.

In Silico Biol. 2006;6(6):607-20.

Microsatellites within genes: structure, function, and evolution.

Mol Biol Evol. 2004 Jun;21(6):991-1007. doi: 10.1093/molbev/msh073. Epub 2004 Feb 12.

Utility of EST-derived SSRs as population genetics markers in a beetle.

J Hered. 2008 Mar-Apr;99(2):112-24. doi: 10.1093/jhered/esm104. Epub 2008 Jan 24.

An SSR-based genetic linkage map for perennial ryegrass (Lolium perenne L.).

Theor Appl Genet. 2002 Sep;105(4):577-584. doi: 10.1007/s00122-002-0907-3. Epub 2002 May 23.

Cited by

De novo identification and targeted sequencing of SSRs efficiently fingerprints Sorghum bicolor sub-population identity.

PLoS One. 2021 Mar 8;16(3):e0248213. doi: 10.1371/journal.pone.0248213. eCollection 2021.

Introgression of Shoot Fly (Atherigona soccata L. Moench) Resistance QTLs into Elite Post-rainy Season Sorghum Varieties Using Marker Assisted Backcrossing (MABC).

Front Plant Sci. 2017 Sep 1;8:1494. doi: 10.3389/fpls.2017.01494. eCollection 2017.

Molecular mapping and candidate gene analysis of a new epicuticular wax locus in sorghum (Sorghum bicolor L. Moench).

Theor Appl Genet. 2017 Oct;130(10):2109-2125. doi: 10.1007/s00122-017-2945-x. Epub 2017 Jul 12.

MSH1-induced non-genetic variation provides a source of phenotypic diversity in Sorghum bicolor.

PLoS One. 2014 Oct 27;9(10):e108407. doi: 10.1371/journal.pone.0108407. eCollection 2014.

Next generation characterisation of cereal genomes for marker discovery.

Biology (Basel). 2013 Nov 25;2(4):1357-77. doi: 10.3390/biology2041357.

Assessment of genetic diversity in the sorghum reference set using EST-SSR markers.

Theor Appl Genet. 2013 Aug;126(8):2051-64. doi: 10.1007/s00122-013-2117-6. Epub 2013 May 25.

Genetic analysis of recombinant inbred lines for Sorghum bicolor × Sorghum propinquum.

G3 (Bethesda). 2013 Jan;3(1):101-8. doi: 10.1534/g3.112.004499. Epub 2013 Jan 1.

Functional markers for gene mapping and genetic diversity studies in sugarcane.

BMC Res Notes. 2011 Jul 28;4:264. doi: 10.1186/1756-0500-4-264.

QTL for fibre-related traits in grain × sweet sorghum as a tool for the enhancement of sorghum as a biomass crop.

Theor Appl Genet. 2011 Oct;123(6):999-1011. doi: 10.1007/s00122-011-1642-4. Epub 2011 Jul 8.

Genetic structure and diversity of wild sorghum populations (Sorghum spp.) from different eco-geographical regions of Kenya.

Theor Appl Genet. 2011 Aug;123(4):571-83. doi: 10.1007/s00122-011-1608-6. Epub 2011 Jun 4.
