Aho S, Tate V, Boedtker H
Nucleic Acids Res. 1984 Aug 10;12(15):6117-25. doi: 10.1093/nar/12.15.6117.
During the fine structural analysis of the 5' end of the 38 kb chicken pro alpha 2(I) collagen gene, we failed to locate an exon, only 11 bp in size, which had been predicted from the DNA sequence analysis of a cDNA clone complementary to the 5' end of the pro alpha 2(I) collagen mRNA (1). We know report the location of this 11 bp exon, exon 2, at the 5' end of a 180 bp Pst I fragment, 1900 bp 3' to exon 1 and 600 bp 5' to exon 3. Its sequence, ATGTGAGTGAG, is highly unusual in that it contains two overlapping consensus donor splice sequences. Moreover, it is flanked by two overlapping donor splice sequences but only one of the four splice sequences is actually spliced (1). The first half of intron 1 also has an unusual sequence: it is 68% GC, contains 88 CpG dinucleotides and 11 Hpa II sites. The second half is more like other intron sequences in the collagen gene with a GC content of 41%, 19 CpG, and no Hpa II sites. However it contains two sequences with 7 and 9 bp homology to the 14 bp SV40 enhancer core sequence. It is suggested that some part of intron 1 may be involved in regulation.
在对38kb鸡原α2(I)型胶原基因5'端进行精细结构分析的过程中,我们未能找到一个仅11bp大小的外显子,该外显子是根据与原α2(I)型胶原mRNA 5'端互补的cDNA克隆的DNA序列分析预测出来的(1)。我们现在报道这个11bp外显子(外显子2)的位置,它位于一个180bp Pst I片段的5'端,在1号外显子下游1900bp处,3号外显子上游600bp处。它的序列ATGTGAGTGAG非常独特,因为它包含两个重叠的共有供体剪接序列。此外,它两侧还有两个重叠的供体剪接序列,但四个剪接序列中只有一个实际发生了剪接(1)。1号内含子的前半部分也有一个不寻常的序列:它的GC含量为68%,含有88个CpG二核苷酸和11个Hpa II位点。后半部分更类似于胶原基因中的其他内含子序列,GC含量为41%,有19个CpG,没有Hpa II位点。然而,它包含两个与14bp SV40增强子核心序列有7bp和9bp同源性的序列。有人认为1号内含子的某些部分可能参与调控。