Lui V C, Ng L J, Sat E W, Cheah K S
Department of Biochemistry, University of Hong Kong.
Genomics. 1996 Mar 15;32(3):401-12. doi: 10.1006/geno.1996.0135.
Type XI collagen, a fibril-forming collagen, is important for the integrity and development of the skeleton because mutations in the genes encoding its consituent alpha chains have been found in some osteochondrodysplasias. We provide data that complete information for the coding sequence of human alpha 2(XI) procollagen, with details of the promoter region and intron-exon organization at the 5' and 3' ends of the gene (COL11A2), including the transcription start and polyadenylation sites. COL11A2 is 30.5 kb with a minimum of 62 exons, differing from other reported fibrillar collagen genes because the amino propeptide is encoded by 14 not 5 to 8 exons. But exon numbers for the carboxy propeptide and 3'-untranslated region are conserved. The promoter region of COL11A2 lacks a TATA box but is GC-rich with two potential SP1 binding sites. Mouse alpha 2(XI) collagen mRNAs undergo complex alternative splicing involving three amino-terminal propeptide exons but only one of these has been reported for COL11A2. We have located these missing human exons and have identified splice signals that point to additional splice variants. We have precisely mapped COL11A2 within the major histocompatibility complex on chromosome 6. The retinoid X receptor beta (RXR beta) gene is located 1.1 kb upstream of COL11A2. KE5, previously thought to be a distinct transcribed gene sequence, was mapped within COL11A2 in the alternatively spliced region, raising the question whether KE5 and COL11A2 are separate genes.
XI型胶原蛋白是一种形成纤维的胶原蛋白,对骨骼的完整性和发育很重要,因为在一些骨软骨发育不良中发现了编码其组成α链的基因突变。我们提供的数据包含了人类α2(XI)前胶原编码序列的完整信息,以及该基因(COL11A2)5'和3'端启动子区域和内含子-外显子组织的详细信息,包括转录起始位点和聚腺苷酸化位点。COL11A2基因全长30.5 kb,至少有62个外显子,与其他已报道的纤维状胶原蛋白基因不同,因为其氨基前肽由14个外显子编码,而非5至8个外显子。但其羧基前肽和3'非翻译区的外显子数量是保守的。COL11A2的启动子区域缺乏TATA盒,但富含GC,有两个潜在的SP1结合位点。小鼠α2(XI)胶原蛋白mRNA经历复杂的可变剪接,涉及三个氨基末端前肽外显子,但COL11A2仅报道了其中一个。我们定位了这些缺失的人类外显子,并确定了指向其他剪接变体的剪接信号。我们精确地将COL11A2定位在6号染色体上的主要组织相容性复合体中。视黄酸X受体β(RXRβ)基因位于COL11A2上游1.1 kb处。KE5以前被认为是一个独特的转录基因序列,被定位在COL11A2的可变剪接区域内,这就提出了KE5和COL11A2是否为独立基因的问题。