Naora H, Deacon N J
Proc Natl Acad Sci U S A. 1982 Oct;79(20):6196-200. doi: 10.1073/pnas.79.20.6196.
We have attempted to ascertain the correlation between the genetic information content in the exons and the surrounding intron sequences with regard to their spatial arrangement within a gene. A comparison is made of the sizes, taken from recent publications, of exons and introns of approximately equal to 80 different protein-coding chromosomal genes, mostly from higher eukaryotes. The exons of these genes do not show very marked variation in size and can be classified into three major discrete and two minor additional size groups, whereas individual introns vary considerably in size within and between genes. Notwithstanding, the overall length of all introns present within a given gene is a function of the total size, mostly corresponding to the total genetic information content, of the exons. Three cases that violate this exon-size dependency of introns are genes coding for (i) histone H1, feather keratin, and interferons, (ii) tubulin and actin, and (iii) silk fibroin. The exons of these genes are larger than 0.7 kilobase pair in total size and the genes show a strong sequence homogeneity among the repetitious family members or internal repeats of coding sequences within the gene. We propose that conservation of sequences, which is required by the family members, internal repeats, or the entire gene, would actually motivate the removal of introns.
我们试图确定外显子中的遗传信息内容与其周围内含子序列在基因内的空间排列之间的相关性。我们比较了近期出版物中约80种不同蛋白质编码染色体基因(主要来自高等真核生物)的外显子和内含子大小。这些基因的外显子大小变化不太明显,可分为三个主要离散大小组和两个较小的额外大小组,而各个内含子在基因内部和基因之间的大小差异很大。尽管如此,给定基因内所有内含子的总长度是外显子总大小(大多对应于总遗传信息内容)的函数。有三种情况违反了内含子对外显子大小的这种依赖性,即编码(i)组蛋白H1、羽毛角蛋白和干扰素的基因,(ii)微管蛋白和肌动蛋白的基因,以及(iii)丝心蛋白的基因。这些基因的外显子总大小大于0.7千碱基对,并且这些基因在重复家族成员或基因内编码序列的内部重复序列之间表现出很强的序列同质性。我们提出,家族成员、内部重复序列或整个基因所需的序列保守性实际上会促使内含子的去除。