National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China.
DNA Res. 2013 Feb;20(1):31-44. doi: 10.1093/dnares/dss031. Epub 2012 Dec 4.
Gene-derived simple sequence repeats (genic SSRs), also known as functional markers, are often preferred over random genomic markers because they represent variation in gene coding and/or regulatory regions. We characterized 544 genic SSR loci derived from 138 candidate genes involved in wood formation, distributed throughout the genome of Populus tomentosa, a key ecological and cultivated wood production species. Of these SSRs, three-quarters were located in the promoter or intron regions, and dinucleotide (59.7%) and trinucleotide repeat motifs (26.5%) predominated. By screening 15 wild P. tomentosa ecotypes, we identified 188 polymorphic genic SSRs with 861 alleles, 2-7 alleles for each marker. Transferability analysis of 30 random genic SSRs, testing whether these SSRs work in 26 genotypes of five genus Populus sections (outgroup, Salix matsudana), showed that 72% of the SSRs could be amplified in Turanga and 100% could be amplified in Leuce. Based on genotyping of these 26 genotypes, a neighbour-joining analysis showed the expected six phylogenetic groupings. In silico analysis of SSR variation in 220 sequences that are homologous between P. tomentosa and Populus trichocarpa suggested that genic SSR variations between relatives were predominantly affected by repeat motif variations or flanking sequence mutations. Inheritance tests and single-marker associations demonstrated the power of genic SSRs in family-based linkage mapping and candidate gene-based association studies, as well as marker-assisted selection and comparative genomic studies of P. tomentosa and related species.
基因衍生的简单序列重复(genic SSRs),也称为功能标记,通常优于随机基因组标记,因为它们代表基因编码和/或调控区域的变异。我们从参与木材形成的 138 个候选基因中鉴定了 544 个基因 SSR 位点,这些基因 SSR 分布在毛白杨基因组中,毛白杨是一种重要的生态和栽培木材生产物种。这些 SSR 中有四分之三位于启动子或内含子区域,二核苷酸(59.7%)和三核苷酸重复基序(26.5%)占主导地位。通过筛选 15 个野生毛白杨生态型,我们鉴定出 188 个多态性基因 SSR,具有 861 个等位基因,每个标记 2-7 个等位基因。对 30 个随机基因 SSR 的转移分析,测试这些 SSR 在 5 个杨树属(柳属外类群)6 个节的 26 个基因型中的工作情况,结果表明 72%的 SSR 可以在 Turanga 中扩增,100%可以在 Leuce 中扩增。基于这 26 个基因型的基因分型,邻接聚类分析显示了预期的六个系统发育分组。在毛白杨和杨属的 220 个同源序列的 SSR 变异的计算机分析表明,亲缘关系之间的基因 SSR 变异主要受到重复基序变异或侧翼序列突变的影响。遗传测试和单标记关联表明,基因 SSR 在基于家族的连锁作图和候选基因关联研究、标记辅助选择以及毛白杨和相关物种的比较基因组研究中具有强大的功能。