Tagle D A, Stanhope M J, Siemieniak D R, Benson P, Goodman M, Slightom J L
Department of Molecular Biology, Wayne State University School of Medicine, Detroit, Michigan 48201.
Genomics. 1992 Jul;13(3):741-60. doi: 10.1016/0888-7543(92)90150-q.
The nucleotide sequence of the beta globin gene cluster of the prosimian Galago crassicaudatus has been determined. A total sequence spanning 41,101 bp contains and links together previously published sequences of the five galago beta-like globin genes (5'-epsilon-gamma-psi eta-delta-beta-3'). A computer-aided search for middle interspersed repetitive sequences identified 10 LINE (L1) elements, including a 5' truncated repeat that is orthologous to the full-length L1 element found in the human epsilon-gamma intergenic region. SINE elements that were identified included one Alu type I repeat, four Alu type II repeats, and two methionine tRNA-derived Monomer (type III) elements. Alu type II and Monomer sequences are unique to the galago genome. Structural analyses of the cluster sequence reveals that it is relatively A+T rich (about 62%) and regions with high G+C content are associated primarily with globin coding regions. Comparative analyses with the beta globin cluster sequences of human, rabbit, and mouse reveal extensive sequence homologies in their genic regions, but only human, galago, and rabbit sequences share extensive intergenic sequence homologies. Divergence analyses of aligned intergenic and flanking sequences from orthologous human, galago, and rabbit sequences show a gradation in the rate of nucleotide sequence evolution along the cluster where sequences 5' of the epsilon globin gene region show the least sequence divergence and sequences just 5' of the beta globin gene region show the greatest sequence divergence.
已确定原猴粗尾婴猴β珠蛋白基因簇的核苷酸序列。一段总长41,101 bp的序列包含并连接了先前发表的五个婴猴β样珠蛋白基因(5'-ε-γ-ψη-δ-β-3')的序列。通过计算机辅助搜索中间散布的重复序列,鉴定出10个LINE(L1)元件,其中包括一个5'端截短的重复序列,该序列与在人类ε-γ基因间隔区发现的全长L1元件是直系同源的。鉴定出的SINE元件包括一个I型Alu重复序列、四个II型Alu重复序列和两个甲硫氨酸tRNA衍生的单体(III型)元件。II型Alu和单体序列是婴猴基因组特有的。对该基因簇序列的结构分析表明,它相对富含A+T(约62%),高G+C含量的区域主要与珠蛋白编码区相关。与人类、兔子和小鼠的β珠蛋白基因簇序列进行比较分析发现,它们的基因区域存在广泛的序列同源性,但只有人类、婴猴和兔子的序列在基因间隔区有广泛的序列同源性。对直系同源的人类、婴猴和兔子序列的比对基因间隔区和侧翼序列进行分歧分析表明,沿着基因簇核苷酸序列进化速率存在梯度变化,其中ε珠蛋白基因区域5'端的序列分歧最小,而β珠蛋白基因区域5'端的序列分歧最大。