López-Giráldez Francesc, Andrés Olga, Domingo-Roura Xavier, Bosch Montserrat
Genètica de la Conservació, Institut de Recerca i Tecnologia Agroalimentàries, Ctra, de Cabrils Km 2, 08348 Cabrils (Barcelona), Spain.
BMC Genomics. 2006 Oct 23;7:269. doi: 10.1186/1471-2164-7-269.
The popularity of microsatellites has greatly increased in the last decade on account of their many applications. However, little is currently understood about the factors that influence their genesis and distribution among and within species genomes. In this work, we analyzed carnivore microsatellite clones from GenBank to study their association with interspersed repeats and elucidate the role of the latter in microsatellite genesis and distribution.
We constructed a comprehensive carnivore microsatellite database comprising 1236 clones from GenBank. Thirty-three species of 11 out of 12 carnivore families were represented, although two distantly related species, the domestic dog and cat, were clearly overrepresented. Of these clones, 330 contained tRNALys-derived SINEs and 357 contained other interspersed repeats. Our rough estimates of tRNA SINE copies per haploid genome were much higher than published ones. Our results also revealed a distinct juxtaposition of AG and A-rich repeats and tRNALys-derived SINEs suggesting their coevolution. Both microsatellites arose repeatedly in two regions of the interspersed repeat. Moreover, microsatellites associated with tRNALys-derived SINEs showed the highest complexity and less potential instability.
Our results suggest that tRNALys-derived SINEs are a significant source for microsatellite generation in carnivores, especially for AG and A-rich repeat motifs. These observations indicate two modes of microsatellite generation: the expansion and variation of pre-existing tandem repeats and the conversion of sequences with high cryptic simplicity into a repeat array; mechanisms which are not specific to tRNALys-derived SINEs. Microsatellite and interspersed repeat coevolution could also explain different distribution of repeat types among and within species genomes.Finally, due to their higher complexity and lower potential informative content of microsatellites associated with tRNALys-derived SINEs, we recommend avoiding their use as genetic markers.
在过去十年中,微卫星因其众多应用而广受欢迎。然而,目前对于影响其在物种基因组间及基因组内起源和分布的因素了解甚少。在这项研究中,我们分析了来自GenBank的食肉动物微卫星克隆,以研究它们与散布重复序列的关联,并阐明后者在微卫星起源和分布中的作用。
我们构建了一个包含来自GenBank的1236个克隆的综合食肉动物微卫星数据库。代表了12个食肉动物科中的11个科的33个物种,尽管两个远缘物种家犬和家猫明显占比过高。在这些克隆中,330个包含源自tRNALys的SINEs,357个包含其他散布重复序列。我们对每个单倍体基因组中tRNA SINE拷贝数的粗略估计远高于已发表的估计值。我们的结果还揭示了AG和富含A的重复序列与源自tRNALys的SINEs的明显并列,表明它们的共同进化。两种微卫星在散布重复序列的两个区域反复出现。此外,与源自tRNALys的SINEs相关的微卫星显示出最高的复杂性和较低的潜在不稳定性。
我们的结果表明,源自tRNALys的SINEs是食肉动物微卫星产生的重要来源,特别是对于AG和富含A的重复基序。这些观察结果表明了微卫星产生的两种模式:先前存在的串联重复序列的扩增和变异,以及具有高度隐蔽简单性的序列转化为重复阵列;这些机制并非源自tRNALys的SINEs所特有。微卫星和散布重复序列的共同进化也可以解释重复类型在物种基因组间及基因组内的不同分布。最后,由于与源自tRNALys的SINEs相关的微卫星具有更高的复杂性和更低的潜在信息含量,我们建议避免将它们用作遗传标记。