Tsuzuki T, Obaru K, Setoyama C, Shimada K
Department of Biochemistry, Kumamoto University Medical School, Japan.
J Mol Biol. 1987 Nov 5;198(1):21-31. doi: 10.1016/0022-2836(87)90454-2.
Structural organization of the entire mouse mitochondrial aspartate aminotransferase (EC 2.6.1.1) gene was determined by analyzing the overlapping genomic clones obtained from a Charon 4A DNA library. The gene is 25 X 10(3) base-pairs long and contains ten exons interrupted by nine introns of various sizes. The 5' and 3'-flanking regions, the exact sizes and boundaries of the exon blocks including the transcription-initiation sites were determined. The 5' end of the gene lacks the prototypical 5' transcriptional regulatory sequence elements, such as TATA and CAAT boxes, but contains G + C-rich sequences, two putative binding sites for a cellular transcription factor, Sp1, and multiple transcription-initiation sites. Moreover, the sequences around the transcription-initiation sites are compatible with the formation of a number of potentially stable stem-loop structures. The leader sequence, which is essential for the transport of the protein into the mitochondria, is coded by the first exon and is separated from the mature protein by the first intron. The pyridoxal 5'-phosphate-binding domain, consisting of seven alternating beta-sheets and alpha-helical polypeptide strands, is separated by four introns present at the ends of alpha-helices. These genomic DNA structures suggest that the introns were not inserted into a previously uninterrupted coding sequence, but rather are products of evolution of the ancestral gene. However, a further correlation between the positions of introns relative to the well-defined structural domains of the mature protein was not obvious.
通过分析从Charon 4A DNA文库获得的重叠基因组克隆,确定了整个小鼠线粒体天冬氨酸氨基转移酶(EC 2.6.1.1)基因的结构组织。该基因长25×10³碱基对,包含10个外显子,被9个大小各异的内含子隔开。确定了5'和3'侧翼区域、包括转录起始位点在内的外显子模块的确切大小和边界。该基因的5'端缺乏典型的5'转录调控序列元件,如TATA盒和CAAT盒,但含有富含G + C的序列、细胞转录因子Sp1的两个假定结合位点以及多个转录起始位点。此外,转录起始位点周围的序列与许多潜在稳定的茎环结构的形成相符。对于蛋白质转运到线粒体至关重要的前导序列由第一个外显子编码,并通过第一个内含子与成熟蛋白分开。由七个交替的β-折叠和α-螺旋多肽链组成的磷酸吡哆醛结合结构域,被α-螺旋末端的四个内含子隔开。这些基因组DNA结构表明,内含子不是插入到先前不间断的编码序列中,而是祖先基因进化的产物。然而,内含子相对于成熟蛋白明确结构域的位置之间的进一步相关性并不明显。