Helaakoski T, Veijola J, Vuori K, Rehn M, Chow L T, Taillon-Miller P, Kivirikko K I, Pihlajaniemi T
Collagen Research Unit, University of Oulu, Finland.
J Biol Chem. 1994 Nov 11;269(45):27847-54.
Prolyl 4-hydroxylase, an alpha 2 beta 2 tetramer, plays a central role in collagen synthesis as it catalyzes the formation of 4-hydroxyproline residues by the hydroxylation of proline in X-Pro-Gly sequences. We report here that the human gene for the catalytically important alpha subunit is more than 69 kilobase pairs and consists of 16 exons. The exons that encode solely protein sequences vary from 54 to 240 base pairs (bp), and the introns vary from 750 to more than 16,000 bp. The 133 bp of 5'-untranslated sequences of the mRNA are coded by two exons, and these sequences contain inverted repeats with a potential for stem-loop formation, which may be involved in translational control of the expression of this gene. The 5'-flanking region contains a TATa motif at -29 relative to the major transcription site but no CCAAT motif. The 5'-flanking region and the downstream sequences contain several motifs that may act as binding sites for various transcription factors. Evidence has previously been reported for a mutually exclusive alternative splicing of RNA transcripts of this gene. The present data indicate that the mutually exclusive sequences found in the mRNAs are coded by two consecutive, homologous 71-bp exons 9 and 10. These exons are identical in their first 5 bp and the overall identity between them is 61% at the nucleotide level and 58% at the level of the coded amino acids. Both types of mRNA were found to be expressed in all of the tissues studied, but in some tissues the type coding for exon 9 or 10 sequences was more abundant than the other type.
脯氨酰4-羟化酶是一种α2β2四聚体,在胶原蛋白合成中起核心作用,因为它通过催化X-Pro-Gly序列中的脯氨酸羟基化形成4-羟脯氨酸残基。我们在此报告,具有催化重要性的α亚基的人类基因超过69千碱基对,由16个外显子组成。仅编码蛋白质序列的外显子长度从54到240碱基对(bp)不等,内含子长度从750到超过16,000 bp不等。mRNA的133 bp 5'-非翻译序列由两个外显子编码,这些序列包含具有形成茎环结构潜力的反向重复序列,这可能参与该基因表达的翻译控制。5'-侧翼区域在相对于主要转录位点-29处含有一个TATA基序,但没有CCAAT基序。5'-侧翼区域和下游序列包含几个可能作为各种转录因子结合位点的基序。此前已有报道该基因的RNA转录本存在相互排斥的可变剪接。目前的数据表明,在mRNA中发现的相互排斥序列由两个连续的、同源的71-bp外显子9和10编码。这些外显子在其前5 bp相同,它们之间在核苷酸水平的总体同一性为61%,在编码氨基酸水平为58%。在所有研究的组织中都发现了两种类型的mRNA,但在某些组织中,编码外显子9或10序列的类型比另一种类型更丰富。