Nishio H, Dugaiczyk A
Department of Biochemistry, University of California, Riverside 92521, USA.
Proc Natl Acad Sci U S A. 1996 Jul 23;93(15):7557-61. doi: 10.1073/pnas.93.15.7557.
The nucleotide sequence of the human alpha-albumin gene, including 887 bp of the 5'-flanking region and 1311 bp of the 3-flanking region (24,454 in total), was determined from three overlapping lambda phage clones. The sequence spans 22,256 bp from the cap site to the polyadenylylation site, revealing a gene structure of 15 exons separated by 14 introns. The methionine initiation codon ATG is within exon 1; the termination codon TGA is within exon 14. Exon 15 is entirely untranslated and contains the polyadenylylation signal AATAAA. The deduced polypeptide chain is composed of a 21-amino-acid leader peptide, followed by 578 amino acids of the mature protein. There are seven repetitive DNA elements (Alu and Kpn) in the introns and 3-flanking region. The sizes of the 15 alpha-albumin exons match closely those of the albumin, alpha-fetoprotein, and vitamin D-binding protein genes. The exons are symmetrically placed within the three domains of the individual proteins, and they share a characteristic codon splitting pattern that is conserved among members of the gene family. The results provide strong evidence that alpha-albumin belongs to, and most likely completes with, the serum albumin gene family. Based on structural similarity, alpha-albumin appears to be most closely related to alpha-fetoprotein. The complete structure of this family of four tandemly linked genes provides a well-characterized approximately 200 kb locus in the 4q subcentromeric region of the human genome.
人类α-白蛋白基因的核苷酸序列,包括5'侧翼区的887 bp和3'侧翼区的1311 bp(总计24454 bp),是从三个重叠的λ噬菌体克隆中确定的。该序列从帽位点到聚腺苷酸化位点跨度为22256 bp,揭示了一个由15个外显子被14个内含子隔开的基因结构。甲硫氨酸起始密码子ATG在外显子1内;终止密码子TGA在外显子14内。外显子15完全不编码,并包含聚腺苷酸化信号AATAAA。推导的多肽链由一个21个氨基酸的前导肽组成,随后是578个氨基酸的成熟蛋白。内含子和3'侧翼区有七个重复DNA元件(Alu和Kpn)。15个α-白蛋白外显子的大小与白蛋白、甲胎蛋白和维生素D结合蛋白基因的大小密切匹配。外显子对称地位于各个蛋白质的三个结构域内,并且它们共享一种在基因家族成员中保守的特征性密码子分割模式。结果提供了强有力的证据表明α-白蛋白属于血清白蛋白基因家族,并且很可能与之竞争。基于结构相似性,α-白蛋白似乎与甲胎蛋白关系最为密切。这四个串联连接基因家族的完整结构在人类基因组4q亚着丝粒区域提供了一个特征明确的约200 kb位点。