Hardison R C, Sawada I, Cheng J F, Shen C K, Schmid C W
Nucleic Acids Res. 1986 Feb 25;14(4):1903-11. doi: 10.1093/nar/14.4.1903.
The sequence of the DNA between two pseudogenes in the human alpha-like globin gene cluster has been determined. Comparison of this sequence with sequences from other alpha-like globin gene clusters revealed another pseudogene, psi alpha 2, between the previously recognized pseudogenes zeta 1 and psi alpha 1. Therefore, the human alpha-like globin gene family is organized 5'-zeta 2-zeta 1-psi alpha 2-psi alpha 1-alpha 2-alpha 1-3'. The new pseudogene psi alpha 2 is very close to zeta 1, beginning only 65 base pairs 3' to the polyadenylation site of zeta 1. The first exon and the first intron of psi alpha 2 are interrupted by large inserts which are flanked by short (6 to 8 base pairs) direct repeats. The pseudogene psi alpha 2 lacks a promoter for transcription by RNA polymerase II, the first exon is highly divergent, one splice site is mutated, and five different frameshift mutations have occurred in the coding regions. Thus psi alpha 2 cannot encode a globin polypeptide. This pseudogene was not recognized in previous hybridization analyses of the human alpha-like globin gene cluster, and our discovery of it by sequence analysis suggests that divergent copies of a large number of genes may comprise a substantial fraction of the slowly renaturing DNA of mammalian genomes.
人类α-类珠蛋白基因簇中两个假基因之间的DNA序列已被测定。将该序列与其他α-类珠蛋白基因簇的序列进行比较,发现在先前识别的假基因ζ1和ψα1之间存在另一个假基因ψα2。因此,人类α-类珠蛋白基因家族的组织形式为5'-ζ2-ζ1-ψα2-ψα1-α2-α1-3'。新的假基因ψα2非常靠近ζ1,起始于ζ1聚腺苷酸化位点下游仅65个碱基对处。ψα2的第一个外显子和第一个内含子被大的插入序列打断,这些插入序列两侧为短(6至8个碱基对)的直接重复序列。假基因ψα2缺乏RNA聚合酶II转录所需的启动子,第一个外显子高度分化,一个剪接位点发生突变,编码区出现了五个不同的移码突变。因此,ψα2不能编码珠蛋白多肽。在先前对人类α-类珠蛋白基因簇的杂交分析中未识别出该假基因,我们通过序列分析发现它表明大量基因的差异拷贝可能占哺乳动物基因组缓慢复性DNA的很大一部分。