INRA, UMR1202 BIOGECO, F-33610 Cestas, France.
BMC Genomics. 2010 Oct 15;11:570. doi: 10.1186/1471-2164-11-570.
Expressed Sequence Tags (ESTs) are a source of simple sequence repeats (SSRs) that can be used to develop molecular markers for genetic studies. The availability of ESTs for Quercus robur and Quercus petraea provided a unique opportunity to develop microsatellite markers to accelerate research aimed at studying adaptation of these long-lived species to their environment. As a first step toward the construction of a SSR-based linkage map of oak for quantitative trait locus (QTL) mapping, we describe the mining and survey of EST-SSRs as well as a fast and cost-effective approach (bin mapping) to assign these markers to an approximate map position. We also compared the level of polymorphism between genomic and EST-derived SSRs and address the transferability of EST-SSRs in Castanea sativa (chestnut).
A catalogue of 103,000 Sanger ESTs was assembled into 28,024 unigenes from which 18.6% presented one or more SSR motifs. More than 42% of these SSRs corresponded to trinucleotides. Primer pairs were designed for 748 putative unigenes. Overall 37.7% (283) were found to amplify a single polymorphic locus in a reference full-sib pedigree of Quercus robur. The usefulness of these loci for establishing a genetic map was assessed using a bin mapping approach. Bin maps were constructed for the male and female parental tree for which framework linkage maps based on AFLP markers were available. The bin set consisting of 14 highly informative offspring selected based on the number and position of crossover sites. The female and male maps comprised 44 and 37 bins, with an average bin length of 16.5 cM and 20.99 cM, respectively. A total of 256 EST-SSRs were assigned to bins and their map position was further validated by linkage mapping. EST-SSRs were found to be less polymorphic than genomic SSRs, but their transferability rate to chestnut, a phylogenetically related species to oak, was higher.
We have generated a bin map for oak comprising 256 EST-SSRs. This resource constitutes a first step toward the establishment of a gene-based map for this genus that will facilitate the dissection of QTLs affecting complex traits of ecological importance.
表达序列标签 (EST) 是简单序列重复 (SSR) 的来源,可以用于开发遗传研究的分子标记。栎属榉木和栎属石栎的 EST 的可用性为加速研究这些长寿物种对环境的适应提供了一个独特的机会。作为构建橡木 SSR 连锁图谱以进行数量性状基因座 (QTL) 作图的第一步,我们描述了 EST-SSR 的挖掘和调查,以及一种快速且具有成本效益的方法(bin 作图)来将这些标记分配到近似图谱位置。我们还比较了基因组和 EST 衍生 SSR 之间的多态性水平,并解决了 EST-SSR 在栗 (板栗) 中的可转移性问题。
组装了 103000 个 Sanger EST,组装成 28024 个非冗余基因,其中 18.6%的基因含有一个或多个 SSR 基序。这些 SSR 中有超过 42%对应于三核苷酸。为 748 个推定的非冗余基因设计了引物对。总体而言,在栎属榉木的参考全同胞家系中,有 37.7%(283)的引物对扩增出单一的多态性位点。使用 bin 作图方法评估了这些位点建立遗传图谱的有用性。为具有 AFLP 标记的框架连锁图谱的雄性和雌性亲树构建了 bin 图谱。根据交叉点的数量和位置,选择了 14 个高度信息的后代组成 bin 集。雌性和雄性图谱分别包含 44 和 37 个 bin,平均 bin 长度分别为 16.5 cM 和 20.99 cM。共有 256 个 EST-SSR 被分配到 bin 中,其图谱位置通过连锁作图进一步验证。EST-SSR 的多态性低于基因组 SSR,但它们向栗(与橡木亲缘关系密切的物种)的转移率更高。
我们已经为橡木生成了一个包含 256 个 EST-SSR 的 bin 图谱。这一资源构成了为该属建立基于基因图谱的第一步,这将有助于剖析影响生态重要性状的 QTL。