Lipman D J, Maizel J
Nucleic Acids Res. 1982 Apr 24;10(8):2723-39. doi: 10.1093/nar/10.8.2723.
We describe two measures of a nucleic acid sequence, derived from Information Theory, which characterize the constraints toward nonuniform base composition, and the constraints on the ordering of the bases. These two measures distinguish extra-chromosomal coding sequences from all other coding sequences examined. The two measures separate eukaryotic coding sequences into two groups: those with introns and those without introns. We have also found a relationship between the general constraints of a subsequence and its degree of conservation in related genes.
我们描述了两种源自信息论的核酸序列度量方法,它们分别表征了对非均匀碱基组成的限制以及对碱基排序的限制。这两种度量方法可将染色体外编码序列与所研究的所有其他编码序列区分开来。这两种度量方法还将真核生物编码序列分为两组:有内含子的和没有内含子的。我们还发现了子序列的一般限制与其在相关基因中的保守程度之间的关系。