Reitman M, Grasso J A, Blumenthal R, Lewit P
Diabetes Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892.
Genomics. 1993 Dec;18(3):616-26. doi: 10.1016/s0888-7543(05)80364-7.
The DNA sequence of the Gallus gallus (chicken) beta-globin cluster was completed and analyzed. This G + C-rich region is 23.7 kb in length and includes the rho-, beta H-, beta A-, and epsilon-globin genes, the enhancer found between the beta A and epsilon genes, and three upstream DNase I hypersensitive sites. The CpG dinucleotides are nonrandomly distributed, being present at an increased relative frequency near the promoters and upstream hypersensitive sites. The cluster has an unusually low TA dinucleotide frequency. The upstream hypersensitive sites (5'HS1, 5'HS2, and 5'HS3) contain DNA sequence motifs recognized by erythroid transcription factors. However, no significant sequence similarity was found among the upstream hypersensitive sites and the beta A/epsilon enhancer. The G. gallus upstream site sequences were not similar to the upstream sites of the mammalian globin clusters, probably due to the small size of the functional regions and large evolutionary distance between the classes. The avian cluster evolved by gene duplication from an ancestor beta-globin gene, first producing the epsilon and the rho/beta H/beta A ancestor genes, then the rho and the beta H/beta A ancestor genes, and finally the beta H- and beta A-globins. Four probable gene conversions can be documented: beta A to beta H, epsilon to beta H, and rho/epsilon (twice). The cluster shows a massive overrepresentation of a non-LTR retrotransposon, CR1, which accounts for 16% of the DNA. We suggest that the locus is a preferred site for CR1 insertion.
鸡的β-珠蛋白基因簇的DNA序列已完成测定并进行了分析。这个富含G + C的区域长度为23.7 kb,包括rho-、βH-、βA-和ε-珠蛋白基因,βA和ε基因之间的增强子,以及三个上游DNase I超敏位点。CpG二核苷酸分布不均,在启动子和上游超敏位点附近相对频率增加。该基因簇的TA二核苷酸频率异常低。上游超敏位点(5'HS1、5'HS2和5'HS3)包含红细胞转录因子识别的DNA序列基序。然而,在上游超敏位点和βA/ε增强子之间未发现明显的序列相似性。鸡的上游位点序列与哺乳动物珠蛋白基因簇的上游位点不相似,这可能是由于功能区域较小以及两类生物之间进化距离较大。禽类基因簇通过基因复制从一个祖先β-珠蛋白基因进化而来,首先产生ε和rho/βH/βA祖先基因,然后是rho和βH/βA祖先基因,最后是βH-和βA-珠蛋白。可以记录到四次可能的基因转换:βA到βH、ε到βH以及rho/ε(两次)。该基因簇显示出非LTR逆转座子CR1大量过度存在,其占DNA的16%。我们认为该位点是CR1插入的优选位点。