UMR 1136, INRA-Nancy Université, Interactions Arbres/Microorganismes, INRA-Nancy, 54280, Champenoux, France.
Curr Genet. 2011 Apr;57(2):75-88. doi: 10.1007/s00294-010-0328-9. Epub 2010 Dec 4.
It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in the L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.
重复序列在真菌基因组组织中起着重要作用,它们是遗传标记的重要来源,可用于群体遗传学和减数分裂图谱。我们通过计算机预测鉴定了双色蜡蘑基因组中的简单重复序列(SSR),并分析了它们在不同基因组区域的分布。我们还比较了双色蜡蘑 SSR 的丰度和分布与以下真菌基因组的 SSR 丰度和分布:白腐菌、毛栓菌、玉蜀黍黑粉菌、新型隐球菌、构巢曲霉、稻瘟病菌、粗糙脉孢菌和酿酒酵母。使用 MISA 计算机程序,我们在双色蜡蘑基因组中检测到 277062 个 SSR,占组装基因组序列的 8%。在所分析的担子菌中,尽管没有观察到相对丰度与基因组大小之间的相关性,但双色蜡蘑表现出最高的 SSR 密度。在大多数基因组中,短重复基序(单核苷酸到三核苷酸)比长重复 SSR 更丰富。一般来说,在每种生物中,SSR 的出现、相对丰度和相对密度随着重复单元的增加而降低。此外,每种生物都有其自身的常见和最长 SSR。在双色蜡蘑基因组中,大多数 SSR 位于基因间区(73.3%),转座元件(TEs)中 SSR 密度最高(6706 SSRs/Mb)。然而,81%的蛋白质编码基因在其外显子中含有 SSR,表明 SSR 多态性可能改变基因表型。在双色蜡蘑后代中,78 个 SSR 的序列多态性主要在非 TEs 基因间区检测到。与以前开发的微卫星标记不同,这些新的标记分布在整个基因组中;这些标记可以立即应用于群体遗传学。