Avvedimento V E, Musti A M, Obici S, Cocozza S, Di Lauro R
Nucleic Acids Res. 1984 Apr 25;12(8):3461-72. doi: 10.1093/nar/12.8.3461.
We report the structural organization of an 80 kb segment of rat DNA that encodes about 40% of thyroglobulin mRNA, at its 3' end. The coding information in this segment is split into 17 exons of homogeneous size (about 200 bp). The seven exons at the extreme 3' end have been precisely defined by DNA sequence analysis. No clear sequence homology is found among the exons, even though their coding capacities are quite similar, ranging from 55 to 63 amino acid residues. We located two hormonogenic (T4-forming) sites at the extreme 3' end of the gene, in different exons. The DNA sequences coding for these functional sites show 70% homology over a 50-nucleotide segment. In addition, we found remnants of this sequence in other exons of the gene. Two large introns have been found at the 3' end of the gene: one is 17 kb long and the other is more than 30 kb long. On the basis of these findings and of preliminary studies on the remaining 5' end of the gene, we predict that the minimum length of the rat TGB gene will be 150 kb, which makes it the largest eukaryotic gene identified so far. We also propose that the 3' end exons arose by duplication of a common ancestor.