Aiken J M, McKenzie D, Zhao H Z, States J C, Dixon G H
Nucleic Acids Res. 1983 Jul 25;11(14):4907-22. doi: 10.1093/nar/11.14.4907.
We have sequenced five different rainbow trout protamine genes plus their flanking regions. The genes are not clustered and do not contain intervening sequences. There is an extremely high degree of sequence conservation in the coding and 3' untranslated regions of the gene. Downstream sequences exhibit little homology though conserved regions are found 250 base pairs 3' to the gene. There are four regions upstream of the gene that are highly conserved in the six clones, including the canonical Goldberg - Hogness box which is 45 base pairs 5' to the coding region. A second homologous region is found 90 bases upstream. Although in the same approximate location as the CAAT box found upstream of other genes, it does not contain the canonical CAAT sequence. Further upstream of the protamine genes at -115 there is an A-T rich sequence while a 25 base pair conserved sequence is located 150 bases upstream. In addition we report the presence of a potential Z-DNA region of predominantly A-C repeats approximately one kilobase downstream of one of the genes.
我们已经对五个不同的虹鳟鱼鱼精蛋白基因及其侧翼区域进行了测序。这些基因没有聚集在一起,也不包含间隔序列。该基因的编码区和3'非翻译区存在极高程度的序列保守性。下游序列的同源性较低,不过在基因3'端250个碱基对处发现了保守区域。基因上游有四个区域在六个克隆中高度保守,包括位于编码区5'端45个碱基对处的典型戈德堡-霍格尼斯盒。在其上游90个碱基处发现了第二个同源区域。尽管它与其他基因上游的CAAT盒位置大致相同,但它不包含典型的CAAT序列。在鱼精蛋白基因上游-115处有一个富含A-T的序列,而在其上游150个碱基处有一个25个碱基对的保守序列。此外,我们报告在其中一个基因下游约一千碱基处存在一个主要由A-C重复组成的潜在Z-DNA区域。