Christiano AM, Hoffman GG, Chung-Honet LC, Lee S, Cheng W, Uitto J, Greenspan DS
Department of Dermatology, Jefferson Medical College, Philadelphia, Pennsylvania 19107.
Genomics. 1994 May 1;21(1):169-79. doi: 10.1006/geno.1994.1239.
The human type VII collagen (COL7A1) gene is the locus for mutations in at least some cases of dystrophic epidermolysis bullosa. Here we describe the entire intron/exon organization of COL7A1, which is shown to have 118 exons, more than any previously described gene. Despite this complexity, COL7A1 is compact. Consisting of 31,132 bp from transcription start site to polyadenylation site, it is only about three times the size of type VII collagen mRNA. Thus, COL7A1 introns are small. A 71-nucleotide COL7A1 intron is the smallest intron yet reported in a collagen gene, and only one COL7A1 intron is greater than 1 kb in length. All exons in the COL7A1 triple helix coding region that do not begin with sequences corresponding to imperfections of the triple helix begin with intact codons for Gly residues of Gly-X-Y repeats. This is reminiscent of the structure of fibrillar rather than other nonfibrillar collagen genes. In addition, the COL7A1 triple helix coding region contains many exons of recurring sizes (e.g., 25 exons are 36 bp, 12 exons are 45 bp, 8 exons are 63 bp), suggesting an evolutionary origin distinct from those of other nonfibrillar collagen genes. Sequences from the 5' portion of COL7A1 are presented along with the 3766-bp intergenic sequence, which separates COL7A1 from the upstream gene encoding the core I protein of the cytochrome bc1 complex. The COL7A1 promoter region is found to lack extensive homologies with promoter regions of other genes expressed primarily in skin.
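The recurring exon sizes quoted in the abstract (36, 45, and 63 bp) are all multiples of 9 bp, the length of one Gly-X-Y repeat (three codons of 3 bp each). The following is a minimal arithmetic sketch of that observation, not an analysis from the paper. The names `recurring_exons` and `gly_xy_repeats` are illustrative, and reading each size as a whole number of repeats assumes the exon begins in frame with a Gly codon, as the abstract states for exons that do not start at triple-helix imperfections.

```python
# Exon counts by size, as quoted in the abstract (exon size in bp -> number of exons).
recurring_exons = {36: 25, 45: 12, 63: 8}

CODON_BP = 3
TRIPLET_BP = 3 * CODON_BP  # one Gly-X-Y repeat = 3 codons = 9 bp


def gly_xy_repeats(exon_bp):
    """Return how many whole Gly-X-Y repeats fit an exon of this size,
    or None if the size is not a multiple of 9 bp."""
    quotient, remainder = divmod(exon_bp, TRIPLET_BP)
    return quotient if remainder == 0 else None


# Number of whole Gly-X-Y repeats per recurring exon size.
summary = {size: gly_xy_repeats(size) for size in recurring_exons}

# Totals across the three recurring size classes only (not the whole gene).
total_exons = sum(recurring_exons.values())
total_bp = sum(size * count for size, count in recurring_exons.items())

for size, repeats in summary.items():
    print(f"{size} bp exon: {repeats} Gly-X-Y repeats, {recurring_exons[size]} exons")
print(f"{total_exons} exons, {total_bp} bp in these three size classes")
```

Under these assumptions the three size classes correspond to 4, 5, and 7 Gly-X-Y repeats per exon and together account for 45 of the 118 exons.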