Grover J, Chen X N, Korenberg J R, Recklies A D, Roughley P J
Genetics Unit, Shriners Hospital for Crippled Children, Montreal, Quebec, Canada.
Genomics. 1996 Dec 1;38(2):109-17. doi: 10.1006/geno.1996.0605.
The gene corresponding to a 55-kDa matrix protein previously described in adult human articular cartilage was characterized by sequencing of genomic clones. The deduced protein sequence corresponds to the recently described matrix protein PRELP. The protein was encoded by messages of 1.7, 4.6, and 6.7 kb, whose relative abundance increased as their size decreased. The message heterogeneity appears to originate from variation in the length of the 3'-untranslated region, with the smallest message being contained within the reported sequence and the larger messages having extended 3'-untranslated regions. Two introns were identified within the genomic sequence encoding the smallest message. The first intron, of about 6.7 kb, resides 16 nucleotides prior to the translation initiation codon, and the second intron, of about 2.6 kb, resides 173 nucleotides prior to the translation termination codon. The gene, which encompasses at least 16 kb of genomic DNA, was shown to reside on chromosome 1q32. Primer extension techniques were used to establish that the coding sequence commences 199 bp downstream from the major transcription start site. Analysis of the DNA sequence upstream from the transcription start site reveals the presence of numerous potential transcription factor binding sites, but no CAAT or TATA box. At the message level, the gene was highly expressed in juvenile and adult cartilage, but not in the fetus or neonate. The abundance of the protein in the cartilage matrix was also much lower in the neonate than in the adult. In noncartilaginous tissues, appreciable message levels were observed only in the adult lung.
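The sizes quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the reported work: the function names are hypothetical, and it assumes (the abstract does not state this) that the longer messages differ from the shortest only by an uninterrupted 3'-untranslated extension and that the 199 bp figure is a distance along the mature message. Under those assumptions, the largest message plus the two introns gives a span that agrees with the reported "at least 16 kb".

```python
# Approximate sizes as quoted in the abstract (kb unless noted).
INTRON_KB = (6.7, 2.6)          # first and second intron
MESSAGE_KB = (1.7, 4.6, 6.7)    # the three message sizes


def min_gene_span_kb(message_kb, introns_kb=INTRON_KB):
    """Lower bound on genomic span: message length plus both introns.

    Assumption (not from the abstract): the extra length of the larger
    messages is contiguous 3'-UTR with no additional introns.
    """
    return round(message_kb + sum(introns_kb), 1)


def exon1_utr_nt(utr5_nt=199, intron1_offset_nt=16):
    """5'-UTR nucleotides lying upstream of the first intron.

    Assumption: the 199 bp between the major transcription start site and
    the initiation codon is measured on the spliced message, so the
    16 nucleotides after the intron are subtracted from it.
    """
    return utr5_nt - intron1_offset_nt


# Map each message size to its implied minimum genomic span.
spans = {m: min_gene_span_kb(m) for m in MESSAGE_KB}

if __name__ == "__main__":
    for message, span in spans.items():
        print(f"{message} kb message -> about {span} kb of genomic DNA")
    print(f"5'-UTR upstream of intron 1: {exon1_utr_nt()} nt")
```

These are rough lower bounds: the message sizes are approximate gel estimates and may include a poly(A) tail, so agreement to within a few hundred base pairs is the most that should be read into them.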