Karaoglu Haydar, Lee Crystal Man Ying, Meyer Wieland
School of Molecular and Microbial Biosciences, University of Sydney, Sydney, Australia.
Mol Biol Evol. 2005 Mar;22(3):639-49. doi: 10.1093/molbev/msi057. Epub 2004 Nov 24.
The use of simple sequence repeats or microsatellites as genetic markers has become very popular because of their abundance and length variation between different individuals. SSRs are tandem repeat units of 1 to 6 base pairs that are found abundantly in many prokaryotic and eukaryotic genomes. This is the first study examining and comparing SSRs in completely sequenced fungal genomes. We analyzed and compared the occurrences, relative abundance, relative density, most common, and longest SSRs in nine taxonomically different fungal species: Aspergillus nidulans, Cryptococcus neoformans, Encephalitozoon cuniculi, Fusarium graminearum, Magnaporthe grisea, Neurospora crassa, Saccharomyces cerevisiae, Schizosaccharomyces pombe, and Ustilago maydis. Our analysis revealed that, in all of the genomes studied, the occurrence, abundance, and relative density of SSRs varied and was not influenced by the genome sizes. No correlation between relative abundance and the genome sizes was observed, but it was shown that N. crassa, the largest genome analyzed had the highest relative abundance of SSRs. In most genomes, mononucleotide, dinucleotide, and trinucleotide repeats were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. Our analysis showed that the relative abundance of SSRs in fungi is low compared with the human genome and that longer SSRs in fungi are rare. In addition to providing new information concerning the abundance of SSRs for each of these fungi, the results provide a general source of molecular markers that could be useful for a variety of applications such as population genetics and strain identification of fungal organisms.
由于简单序列重复(SSRs)或微卫星在不同个体间具有丰富性和长度变异性,因此作为遗传标记被广泛应用。SSRs是1至6个碱基对的串联重复单元,在许多原核生物和真核生物基因组中大量存在。这是第一项对完全测序的真菌基因组中的SSRs进行检测和比较的研究。我们分析并比较了9种分类学上不同的真菌物种中SSRs的出现情况、相对丰度、相对密度、最常见的以及最长的SSRs,这9种真菌分别是:构巢曲霉、新型隐球菌、兔脑炎微孢子虫、禾谷镰刀菌、稻瘟病菌、粗糙脉孢菌、酿酒酵母、粟酒裂殖酵母和玉米黑粉菌。我们的分析表明,在所有研究的基因组中,SSRs的出现情况、丰度和相对密度各不相同,且不受基因组大小的影响。未观察到相对丰度与基因组大小之间的相关性,但研究表明,所分析的最大基因组——粗糙脉孢菌,其SSRs的相对丰度最高。在大多数基因组中,单核苷酸、二核苷酸和三核苷酸重复比更长的重复SSRs更为丰富。一般来说,在每个生物体中,随着重复单元的增加,SSRs的出现情况、相对丰度和相对密度会降低。此外,每个生物体都有其自身常见和最长的SSRs。我们的分析表明,与人类基因组相比,真菌中SSRs的相对丰度较低,而且真菌中较长的SSRs很少见。除了为这些真菌各自的SSRs丰度提供新信息外,这些结果还提供了一个通用的分子标记来源,可用于多种应用,如真菌生物的群体遗传学和菌株鉴定。