Edwards Y J, Elgar G, Clark M S, Bishop M J
UK Human Genome Mapping Project Resource Centre, Hinxton, Cambridge, UK.
J Mol Biol. 1998 May 15;278(4):843-54. doi: 10.1006/jmbi.1998.1752.
Fugu rubripes (Fugu) has one of the smallest recorded vertebrate genomes and is an economic tool for comparative DNA sequence analysis. Initial characterization of 128 kb of Fugu DNA attributed the compactness of this genome, in part, to a sparseness of repetitive DNA sequence compared with mammalian genomic sequences. This paper describes a new and comprehensive analysis in which 501 theoretically possible microsatellites with a repeat unit of one to six bases were used to query two orders of magnitude more Fugu DNA (i.e. 11.338 Mb). A total of 6042 microsatellites were identified and categorized. In decreasing order, the 20 most frequently occurring microsatellites are AC, A, C, AGG, AG, AGC, AAT, AAAT, ACAG, ACGC, ATCC, AAC, ATC, AGGG, AAAG, AAG, AAAC, AT, CCG and TTAGGG. The 20 most frequently occurring microsatellites represent 81.79% of all microsatellites identified. Our results indicate that one microsatellite occurs every 1.876 kb of DNA in Fugu, 11.55% of the microsatellites are detected in open reading frames that are predicted protein coding regions. With respect to the proportion of microsatellites present in open reading frames and the total abundance (bp) of all microsatellites, the genome of Fugu is similar to the genome of many other vertebrate species. Previous estimates performed indicate that approximately 1% of many vertebrate genomes are comprized of microsatellite sequences. However, many differences prevail in the abundance and frequency of the individual microsatellite classes. Many of the frequently occurring microsatellites in Fugu are known to code in other species for regions in proteins such as transcription factors, whilst others are associated with known functions, such as transcription factor binding sites and form part of promoter regions in DNA sequences of genes. Therefore, it is likely that such repeats in genomes have a role in the evolution of genes, regulation of gene expression and consequently the evolution of species.
红鳍东方鲀(东方鲀)拥有有记录的最小脊椎动物基因组之一,是用于比较DNA序列分析的一种经济工具。对128 kb东方鲀DNA的初步特征分析表明,与哺乳动物基因组序列相比,该基因组的紧凑性部分归因于重复DNA序列的稀少。本文描述了一项新的全面分析,其中使用501个理论上可能的、重复单元为1至6个碱基的微卫星,对超过两个数量级的东方鲀DNA(即11.338 Mb)进行查询。总共鉴定并分类了6042个微卫星。按出现频率从高到低排列,20个最常见的微卫星是AC、A、C、AGG、AG、AGC、AAT、AAAT、ACAG、ACGC、ATCC、AAC、ATC、AGGG、AAAG、AAG、AAAC、AT、CCG和TTAGGG。这20个最常见的微卫星占所有鉴定出的微卫星的81.79%。我们的结果表明,东方鲀基因组中每1.876 kb DNA就有一个微卫星,11.55%的微卫星在预测为蛋白质编码区的开放阅读框中被检测到。就开放阅读框中微卫星的比例以及所有微卫星的总丰度(碱基对)而言,东方鲀的基因组与许多其他脊椎动物物种的基因组相似。先前的估计表明,许多脊椎动物基因组约1%由微卫星序列组成。然而,各个微卫星类别的丰度和频率存在许多差异。东方鲀中许多常见的微卫星在其他物种中已知编码蛋白质中的区域,如转录因子,而其他一些微卫星与已知功能相关,如转录因子结合位点,并构成基因DNA序列中启动子区域的一部分。因此,基因组中的此类重复很可能在基因进化、基因表达调控以及物种进化中发挥作用。