Perälä M, Elima K, Metsäranta M, Rosati R, de Crombrugghe B, Vuorio E
Department of Medical Biochemistry, University of Turku, Finland.
J Biol Chem. 1994 Feb 18;269(7):5064-71.
One cosmid and two overlapping phage clones covering the entire mouse alpha 2(IX) collagen gene, including 12 kilobase pairs (kb) of 5'-flanking and 8 kb of 3'-flanking sequence, were isolated from two genomic libraries. The overall gene structure was determined by restriction mapping and nucleotide sequencing. The gene spans 16 kb from the start of transcription to the polyadenylation site and contains 32 exons. It codes for an mRNA of 3 kb that translates into a polypeptide of 688 amino acids. The intron-exon junctions and mRNA structure were confirmed by amplification of cDNA made from mouse cartilage RNA. The coding sequence of the mouse alpha 2(IX) collagen gene shows marked similarities to those of other type IX collagen chains. Although the overall exon-intron organization of the mouse gene is very similar to that of the chick alpha 2(IX) gene, some unexpected differences were observed at the splice junctions. Split codons characteristic of the central triple-helical domain of the chick gene were not found in the mouse gene, which thus exhibits a long stretch of exons with sizes that are multiples of 9 base pairs in this domain. The promoter of the mouse alpha 2(IX) collagen gene contains some G + C-rich elements, including three Sp1 consensus recognition sites and a far upstream CCAAT box, but no TATAA box. Both primer extension and RNase protection assays revealed several transcription start sites within 418 base pairs of the promoter. The present study reports the first complete nucleotide sequence of any type IX collagen gene and forms the basis for comparative structural studies on this collagen type and for experiments involving transgenic mice.