Hirosawa S, Nakamura Y, Miura O, Sumi Y, Aoki N
First Department of Medicine, Tokyo Medical and Dental University, Japan.
Proc Natl Acad Sci U S A. 1988 Sep;85(18):6836-40. doi: 10.1073/pnas.85.18.6836.
We have isolated overlapping phage genomic clones covering an area of 26 kilobases that encodes the human alpha 2-plasmin inhibitor. The alpha 2-plasmin inhibitor gene contains 10 exons and 9 introns distributed over approximately 16 kilobases of DNA. To our knowledge, the number of introns is the highest yet reported for a member of the serine protease inhibitor (serpin) superfamily. All introns are located in the 5'-half of the corresponding mRNA. The 5'-untranslated region and the leader sequence are interrupted by 3 introns totaling approximately equal to 6 kilobases. A "TATA box" sequence is located 17 nucleotides upstream from the proposed transcription initiation site. Multiple "GC box" sequences, G + C-rich sequences, and "CCAAT box"-like sequence, the hepatitis B virus enhancer element-like sequence and the human immunodeficiency virus enhancer-like sequence appear in the 5'-flanking region. The NH2-terminal region, which implements factor XIII-catalyzed cross-linking of alpha 2-plasmin inhibitor to fibrin, is encoded by the 4th exon. The reactive site and plasminogen-binding site, both located in the COOH-terminal region, are encoded by the 10th exon. When similar amino acids of alpha 2-plasmin inhibitor and other members of the serpin gene superfamily are aligned, the position of the 7th intron of the alpha 2-plasmin inhibitor gene aligns precisely with that of the second intron of the genes for rat angiotensinogen and human alpha 1-antitrypsin genes and is misaligned by only one nucleotide with that of the third intron of antithrombin III, suggesting that the alpha 2-plasmin inhibitor gene originates from the common ancestor of these serine protease inhibitors.
我们分离出了重叠的噬菌体基因组克隆,其覆盖了一个26千碱基的区域,该区域编码人α2 - 纤溶酶抑制剂。α2 - 纤溶酶抑制剂基因包含10个外显子和9个内含子,分布在约16千碱基的DNA上。据我们所知,内含子数量是丝氨酸蛋白酶抑制剂(丝氨酸蛋白酶抑制剂超家族)成员中迄今报道的最高数量。所有内含子都位于相应mRNA的5'端一半区域。5'非翻译区和前导序列被3个内含子打断,总计约6千碱基。一个“TATA盒”序列位于推测的转录起始位点上游17个核苷酸处。多个“GC盒”序列、富含G + C的序列以及“CCAAT盒”样序列、乙型肝炎病毒增强子样序列和人类免疫缺陷病毒增强子样序列出现在5'侧翼区域。实施因子XIII催化α2 - 纤溶酶抑制剂与纤维蛋白交联的NH2末端区域由第4个外显子编码。反应位点和纤溶酶原结合位点都位于COOH末端区域,由第10个外显子编码。当将α2 - 纤溶酶抑制剂与丝氨酸蛋白酶抑制剂基因超家族的其他成员的相似氨基酸进行比对时,α2 - 纤溶酶抑制剂基因第7个内含子的位置与大鼠血管紧张素原基因和人α1 - 抗胰蛋白酶基因的第二个内含子位置精确对齐,与抗凝血酶III的第三个内含子位置仅错配一个核苷酸,这表明α2 - 纤溶酶抑制剂基因起源于这些丝氨酸蛋白酶抑制剂的共同祖先。