Klickstein L B, Wong W W, Smith J A, Weis J H, Wilson J G, Fearon D T
J Exp Med. 1987 Apr 1;165(4):1095-112. doi: 10.1084/jem.165.4.1095.
10 overlapping CR1 cDNA clones that span 5.5 kb were isolated from a tonsillar library and sequenced in whole or in part. A single long open reading frame beginning at the 5' end of the clones and extending 4.7 kb downstream to a stop codon was identified. This sequence represents approximately 80% of the estimated 6 kb of coding sequence for the F allotype of CR1. Three tandem, direct, long homologous repeats (LHRs) of 450 amino acids were identified. Analysis of the sequences of tryptic peptides provided evidence for a fourth LHR in the F allotype of CR1. Amino acid identity between the LHRs ranged from 70% between the first and third repeats to 99% between the NH2-terminal 250 amino acids of the first and second repeats. Each LHR comprises seven short consensus repeats (SCRs) of 60-70 amino acids that resemble the SCRs of other C3/C4 binding proteins, such as complement receptor type 2, factors B and H, C4 binding protein, and C2. Two additional SCRs join the LHRs to a single membrane-spanning domain of 25 amino acids; thus, the F allotype of CR1 probably contains at least 30 SCRs, 23 of which have been sequenced. Each SCR is predicted to form a triple loop structure in which the four conserved half-cystines form disulfide linkages. The linear alignment of 30 SCRs as a semi-rigid structure would extend 1,140A from the plasma membrane and might facilitate the interaction of CR1 with C3b and C4b located within the interstices of immune complexes and microbial cell walls. The COOH-terminal cytoplasmic domain of 43 residues contains a six-amino-acid sequence that is homologous to the sequence in the epidermal growth factor receptor that is phosphorylated by protein kinase C.
从扁桃体文库中分离出10个重叠的CR1 cDNA克隆,其跨度为5.5 kb,并进行了全序列或部分序列测定。鉴定出一个单一的长开放阅读框,它从克隆的5'端开始,向下游延伸4.7 kb至一个终止密码子。该序列约占CR1 F等位基因估计6 kb编码序列的80%。鉴定出三个由450个氨基酸组成的串联、直接、长同源重复序列(LHRs)。对胰蛋白酶肽段序列的分析为CR1 F等位基因中的第四个LHR提供了证据。LHRs之间的氨基酸同一性范围从第一个和第三个重复序列之间的70%到第一个和第二个重复序列的NH2-末端250个氨基酸之间的99%。每个LHR包含七个由60-70个氨基酸组成的短共有重复序列(SCRs),它们类似于其他C3/C4结合蛋白的SCRs,如补体受体2型、B因子和H因子、C4结合蛋白和C2。另外两个SCRs将LHRs与一个由25个氨基酸组成的单跨膜结构域相连;因此,CR1的F等位基因可能至少包含30个SCRs,其中23个已被测序。每个SCR预计形成一个三环结构,其中四个保守的半胱氨酸形成二硫键。30个SCRs作为一个半刚性结构的线性排列将从质膜延伸1140A,并可能促进CR1与位于免疫复合物和微生物细胞壁间隙内的C3b和C4b的相互作用。43个残基的COOH-末端胞质结构域包含一个六氨基酸序列,该序列与表皮生长因子受体中被蛋白激酶C磷酸化的序列同源。