Dugaiczyk A, Law S W, Dennison O E
Proc Natl Acad Sci U S A. 1982 Jan;79(1):71-5. doi: 10.1073/pnas.79.1.71.
The complete nucleotide sequence of human serum albumin mRNA has been determined from recombinant cDNA clones and from a primer-extended cDNA synthesis on the mRNA template. The sequence is composed of 2078 nucleotides, starting upstream from a potential ribosome binding site in the 5' untranslated region. It contains all the translated codons and extends into the poly(A) at the 3' terminus. Part of the translated sequence codes for a hydrophobic prepeptide, Met-Lys-Trp-Val-Thr-Phe-Ile-Ser-Leu-Leu-Phe-Leu-Phe-Ser-Ser-Ala-Tyr-Ser, followed by a basic propeptide, Arg-Gly-Val-Phe-Arg-Arg. These signal peptides are absent from mature normal serum albumin and, so far, have not been identified in their nascent state in humans. The remaining 1755 nucleotides of the translated mRNA sequence code for 585 amino acids, which agree, with few exceptions, with the published amino acid sequence for human serum albumin. The mRNA sequence verifies and refines the repeating homology in the triple-domain structure of the serum albumin molecule.
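The codon arithmetic implied by the abstract can be checked with a short sketch. The peptide sequences and the figures 2078, 1755, and 585 come from the abstract; the function name, the variable names, and the assumption of a single three-nucleotide stop codon are illustrative and are not stated in the source.

```python
# Sketch of the codon bookkeeping in the abstract. All names are hypothetical.
NT_PER_CODON = 3

# Peptide sequences as given in the abstract (three-letter residue codes).
PREPEPTIDE = "Met-Lys-Trp-Val-Thr-Phe-Ile-Ser-Leu-Leu-Phe-Leu-Phe-Ser-Ser-Ala-Tyr-Ser".split("-")
PROPEPTIDE = "Arg-Gly-Val-Phe-Arg-Arg".split("-")

# Figures stated in the abstract.
TOTAL_MRNA_NT = 2078
MATURE_NT = 1755
MATURE_RESIDUES = 585


def coding_nucleotides(residue_count: int) -> int:
    """Nucleotides needed to encode a given number of amino acids."""
    return residue_count * NT_PER_CODON


# Residues removed from the precursor before the mature protein is formed.
leader_residues = len(PREPEPTIDE) + len(PROPEPTIDE)
leader_nt = coding_nucleotides(leader_residues)

# Assumption (not in the abstract): one stop codon ends the reading frame.
STOP_CODON_NT = NT_PER_CODON
open_reading_frame_nt = leader_nt + MATURE_NT + STOP_CODON_NT

# Everything else in the sequenced mRNA lies outside the reading frame
# (5' and 3' untranslated regions plus any sequenced poly(A)).
non_coding_nt = TOTAL_MRNA_NT - open_reading_frame_nt

if __name__ == "__main__":
    print("mature protein codons consistent:",
          coding_nucleotides(MATURE_RESIDUES) == MATURE_NT)
    print("prepeptide residues:", len(PREPEPTIDE))
    print("propeptide residues:", len(PROPEPTIDE))
    print("leader nucleotides:", leader_nt)
    print("open reading frame nucleotides (with assumed stop):", open_reading_frame_nt)
    print("nucleotides outside the reading frame:", non_coding_nt)
```

Under these assumptions, the 24-residue prepro leader accounts for 72 nucleotides, and roughly 248 of the 2078 nucleotides fall outside the reading frame.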