Venta P J, Montgomery J C, Hewett-Emmett D, Wiebauer K, Tashian R E
J Biol Chem. 1985 Oct 5;260(22):12130-5.
We have isolated a cosmid clone containing the entire mouse (YBR strain) carbonic anhydrase (CA) II gene in 38 kilobase pairs of genomic DNA. The gene was found to be composed of seven exons and six introns. A TATA box (TATAAAA) and a possible CCAAT box (CCACT) have been located beginning 92 and 142 base pairs, respectively, upstream from the initiation codon ATG. When the regions encoded by exons and protein domains are examined, all but 1 of the 30 putative active site residues are encoded by four exons: exons 2 and 3 mainly code for hydrophilic residues and exons 4 and 6 mostly hydrophobic residues. Two intron splice positions, one between the codons for Glu-116 and Leu-117 and the other interrupting the codon for Gly-143, are located at the bottom of the active site cavity, and the former separates two of the three histidine residues forming ligands to the active site zinc ion. The other four splice sites map to the exterior of the molecule. Thus, except for the possible association of the 29 active site residues encoded by four exons, no obvious correspondence is seen between the regions coded by exons and the functional or secondary structural domains of the mouse CA II molecule. During this study, the possible basis for the two electrophoretic types, CA IIa and CA IIb, of inbred mouse strains was detected as a Gln/His interchange at position 38.
我们分离出了一个黏粒克隆,它在38千碱基对的基因组DNA中包含完整的小鼠(YBR品系)碳酸酐酶(CA)II基因。发现该基因由7个外显子和6个内含子组成。一个TATA盒(TATAAAA)和一个可能的CCAAT盒(CCACT)分别位于起始密码子ATG上游92和142个碱基对处。当检查由外显子编码的区域和蛋白质结构域时,30个假定的活性位点残基中除1个外均由4个外显子编码:外显子2和3主要编码亲水性残基,外显子4和6大多编码疏水性残基。两个内含子剪接位置,一个在Glu-116和Leu-117密码子之间,另一个打断Gly-143密码子,位于活性位点腔的底部,前者将形成活性位点锌离子配体的三个组氨酸残基中的两个分开。其他四个剪接位点位于分子外部。因此,除了由4个外显子编码的29个活性位点残基可能存在关联外,在小鼠CA II分子的外显子编码区域与功能或二级结构域之间未发现明显对应关系。在这项研究中,检测到近交系小鼠品系的两种电泳类型CA IIa和CA IIb的可能基础是第38位的Gln/His互换。