Verweij C L, Diergaarde P J, Hart M, Pannekoek H
EMBO J. 1986 Aug;5(8):1839-47. doi: 10.1002/j.1460-2075.1986.tb04435.x.
Full-length human von Willebrand factor (vWF) cDNA was assembled from partial, overlapping vWF cDNAs. This cDNA construct includes a coding sequence of 8439 nucleotides which encode a single-chain precursor of 2813 amino-acid residues, representing a putative signal peptide, a prosequence and mature vWF of 22, 741 and 2050 amino acids, respectively. This represents the longest coding sequence determined to date. In-vitro expression of full-length vWF cDNA revealed the synthesis of a polypeptide with a mol. wt corresponding with that of the unglycosylated precursor. The precursor is a highly repetitive protein which consists of two duplicated (B, C), a triplicated (A), a quadruplicated (D) and a partly duplicated domain (D'), in the following order: H-D1-D2-D'-D3-A1-A2-A3-D4-B1-B2-C1-C2-OH. Both the prosequence, composed of two D domains (D1, D2), and mature vWF harbor an arg-gly-asp ('R-G-D') sequence which has been implicated in cell-attachment functions. It is argued that the pro-sequence is equivalent to von Willebrand Antigen II (vW AgII).
全长人血管性血友病因子(vWF)cDNA由部分重叠的vWF cDNA组装而成。该cDNA构建体包含一个8439个核苷酸的编码序列,其编码一个由2813个氨基酸残基组成的单链前体,分别代表一个推定的信号肽、一个前序列和成熟的vWF,其氨基酸数分别为22、741和2050。这是迄今为止确定的最长编码序列。全长vWF cDNA的体外表达揭示了一种分子量与未糖基化前体相对应的多肽的合成。前体是一种高度重复的蛋白质,由两个重复的结构域(B、C)、一个三重重复的结构域(A)、一个四重重复的结构域(D)和一个部分重复的结构域(D')按以下顺序组成:H-D1-D2-D'-D3-A1-A2-A3-D4-B1-B2-C1-C2-OH。由两个D结构域(D1、D2)组成的前序列和成熟的vWF都含有一个与细胞附着功能有关的精氨酸-甘氨酸-天冬氨酸('R-G-D')序列。有人认为前序列等同于血管性血友病因子抗原II(vW AgII)。