Vogeli G, Ohkubo H, Sobel M E, Yamada Y, Pastan I, de Crombrugghe B
Proc Natl Acad Sci U S A. 1981 Sep;78(9):5334-8. doi: 10.1073/pnas.78.9.5334.
The chicken alpha 2 type I collagen gene is 38 kilobases long and its coding information is subdivided into more than 50 exons. In the current study, we used primer extension and S1 nuclease mapping to determine the sequence of the 5' end of alpha 2 collagen mRNA and to locate the start site for transcription of the alpha 2 collagen gene. The DNA sequence around the start site for transcription shows a typical Goldberg-Hogness sequence, 5' T-A-T-A-A-A-T 3', between -33 and -26 and a 5' G-C-C-C-A-T-T 3' sequence ("CAT" box) between -84 and -78. Three AUGs are found in the initial portion of the mRNA, the first from +54 to +56, the second from +117 to +119, and the third from +134 to +136. The first two AUGs are followed by short coding sequences that could specify a hexapeptide a tetrapeptide, respectively. Only the third AUG is followed by an open reading frame coding for a sequence that presents considerable homology with the previously determined amino acid sequence of prepro alpha 1 collagen. In the promoter region sequence there are several extensive dyads of symmetry. Three of these inverted repeats which precede the start site for transcription overlap each other and may have a role in the developmental regulation of this gene.
鸡α2 I型胶原基因长度为38千碱基对,其编码信息被细分为50多个外显子。在本研究中,我们使用引物延伸和S1核酸酶图谱分析来确定α2胶原mRNA 5'端的序列,并定位α2胶原基因的转录起始位点。转录起始位点周围的DNA序列在-33至-26之间显示出典型的戈德堡-霍格内斯序列5'T-A-T-A-A-A-T 3',在-84至-78之间显示出5'G-C-C-C-A-T-T 3'序列(“CAT”框)。在mRNA的起始部分发现了三个AUG,第一个从+54至+56,第二个从+117至+119,第三个从+134至+136。前两个AUG后面跟着短编码序列,分别可以指定一个六肽和一个四肽。只有第三个AUG后面跟着一个开放阅读框,编码一个与先前确定的前原α1胶原氨基酸序列具有相当同源性的序列。在启动子区域序列中有几个广泛的对称二元组。转录起始位点之前的三个反向重复序列相互重叠,可能在该基因的发育调控中起作用。