Beckmann R J, Schmidt R J, Santerre R F, Plutzky J, Crabtree G R, Long G L
Nucleic Acids Res. 1985 Jul 25;13(14):5233-47. doi: 10.1093/nar/13.14.5233.
Human liver cDNA coding for protein C has been synthesized, cloned and sequenced. The abundance of protein C message is approximately 0.02% of total mRNA. Three overlapping clones contain 1,798 nucleotides of contiguous sequence, which approximates the size of the protein's mRNA as determined by Northern hybridization. The cDNA sequence consists of 73 5'-noncoding bases, coding sequence for a 461 amino acid nascent polypeptide precursor, a TAA termination codon, 296 3'-noncoding bases, and a 38 base polyadenylation segment. The nascent protein consists of a 33 amino acid "signal", a 9 amino acid propeptide, a 155 amino acid "light" chain, a Lys-Arg connecting dipeptide, and a 262 amino acid "heavy" chain. The precursors of human protein C, Factor IX and Factor X are identical at about one third of their amino acid positions (59% in the gamma-carboxyglutamate domain), and each includes two 46 amino acid segments homologous to epidermal growth factor. Human protein C shows similar homology with prothrombin in the "leader", gamma-carboxyglutamate and serine protease domains, but lacks the two "kringle" domains found in prothrombin.
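The segment lengths reported for the nascent protein can be cross-checked against the stated 461 amino acid precursor with simple arithmetic. The following is a minimal sketch in Python; the variable names are illustrative and do not come from the paper, and the only inputs are the figures quoted in the abstract.

```python
# Segment lengths (in amino acids) as quoted in the abstract.
segments = {
    "signal": 33,
    "propeptide": 9,
    "light chain": 155,
    "Lys-Arg connecting dipeptide": 2,
    "heavy chain": 262,
}

# Total length of the nascent polypeptide precursor.
precursor_length = sum(segments.values())

# Three bases per codon; the TAA termination codon is not included here.
coding_nucleotides = 3 * precursor_length

print(precursor_length)    # 461
print(coding_nucleotides)  # 1383
```

The five segments sum to the 461 residues given for the precursor, which corresponds to 1,383 coding bases before the termination codon.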