Doege K J, Garrison K, Coulter S N, Yamada Y
Research Unit, Shriners Hospitals for Crippled Children, Portland, Oregon 97201.
J Biol Chem. 1994 Nov 18;269(46):29232-40.
Aggrecan is a major structural component of cartilage extracellular matrix and a specific gene product of differentiated chondrocytes. cDNA clones have been used to isolate rat aggrecan genomic clones from phage and cosmid libraries, producing over 80 kilobases (kb) of overlapping DNA containing the complete rat aggrecan gene, including 12 kb of 5'- and 8 kb of 3'-flanking DNA. DNA sequencing shows 18 exons, most of which encode structural or functional modules; exceptions are domains G1-B and G2-B, which are split into two exons and the G3 lectin domain, which is encoded by three exons. There is one expressed epidermal growth factor-like exon and in addition a non-expressed "pseudo-exon" encoding a heavily mutated epidermal growth factor-like domain. Intron sizes have been determined by restriction mapping and inter-exon polymerase chain reaction; a 30-kb intron separates exons 1 and 2. Exon 1 has been mapped by primer extension and S1 nuclease protection; it encodes 381 base pairs (bp) of 5'-untranslated sequence. There is a minor promoter which initiates transcription an additional 68 bp 5' of the major promoter start site. DNA sequence is reported for a 529-bp fragment encompassing exon 1, including 120 bp of 5'-flanking DNA comprising the promoter. This promoter is lacking the TATAA or CCAAT elements but has several putative binding sites for transcription factors. A 922-bp DNA fragment with 640-bp 5'-flanking DNA and 282-bp exon 1 sequence showed higher promoter activity in transfected chondrocytes than in fibroblasts, is completely inactive in the reverse orientation, and is strongly enhanceable in the forward direction by the SV40 enhancer.
聚集蛋白聚糖是软骨细胞外基质的主要结构成分,是分化软骨细胞的特异性基因产物。已利用cDNA克隆从噬菌体和黏粒文库中分离大鼠聚集蛋白聚糖基因组克隆,产生了超过80千碱基(kb)的重叠DNA,其中包含完整的大鼠聚集蛋白聚糖基因,包括12 kb的5'侧翼DNA和8 kb的3'侧翼DNA。DNA测序显示有18个外显子,其中大多数编码结构或功能模块;例外的是结构域G1-B和G2-B,它们被分成两个外显子,以及G3凝集素结构域,它由三个外显子编码。有一个表达的表皮生长因子样外显子,此外还有一个非表达的“假外显子”,其编码一个高度突变的表皮生长因子样结构域。内含子大小已通过限制性图谱分析和外显子间聚合酶链反应确定;一个30 kb的内含子将外显子1和2分开。外显子1已通过引物延伸和S1核酸酶保护进行定位;它编码5'非翻译序列的381个碱基对(bp)。有一个次要启动子,它在主要启动子起始位点的5'端另外68 bp处启动转录。报道了一个包含外显子1的529 bp片段的DNA序列,包括由启动子组成的120 bp的5'侧翼DNA。该启动子缺乏TATAA或CCAAT元件,但有几个转录因子的假定结合位点。一个具有640 bp 5'侧翼DNA和282 bp外显子1序列的922 bp DNA片段在转染的软骨细胞中显示出比在成纤维细胞中更高的启动子活性,在反向时完全无活性,并且在正向可被SV40增强子强烈增强。