Doege K, Sasaki M, Horigan E, Hassell J R, Yamada Y
Laboratory of Developmental Biology and Anomalies, National Institute of Dental Research, Bethesda, Maryland 20892.
J Biol Chem. 1987 Dec 25;262(36):17757-67.
We have obtained overlapping cDNA clones for the entire coding sequence of the rat cartilage proteoglycan core protein from the Swarm rat chondrosarcoma. These cDNAs hybridize to RNA transcripts of two sizes, 8.2 and 8.9 kilobases, which contain large 3'-untranslated sequences. The total contiguous cDNA is 6.55 kilobase pairs in size and codes for a 2,124-residue protein, including a 19-residue signal peptide. The sequence forms a series of eight structural domains, including two globules (Mr = 37,000 and 22,000) at the NH2 terminus of the molecule, one a complete and one a partial copy of the cartilage link protein. The major feature of the deduced protein sequence is a 1,104-residue segment containing 117 Ser-Gly sequences, the presumed chondroitin sulfate attachment sites. These are arranged in three domains of 428, 503, and 173 amino acids. The first domain contains 11 complete or partial repeats of a 40-residue unit, and the second domain is composed of six copies of a 100-residue repeating sequence. The first pattern is the more highly conserved and may have given rise to the second. The carboxyl-terminal domain is a third globule that has homology with animal lectins.