Elton R A
J Mol Evol. 1975 Mar 24;4(4):323-46. doi: 10.1007/BF01732535.
A doublet frequency count (the set of frequencies of the 16 possible two-base sequences) can be calculated from the experimentally determined overall sequence of a nucleic acid. In this paper, a statistical methodology is developed for comparing such counts with those expected at random, with other counts of the same type, or with doublet proportions found in whole DNAs. The methods are applied to two major categories of sequenced RNAs. It is found that vertebrate ribosomal and transfer RNAs show significant differences from the overall vertebrate DNA pattern, especially in the frequency of the doublet CG. Bacterial rRNA and tRNA, on the other hand, show less dissimilarity from total DNA. In the RNA of the small bacteriophage MS2, the doublet frequencies of the translated regions of the genome resemble those in the host E. coli, whereas those in the intercistronic regions differ substantially. All these findings are discussed in relation to the origin, evolution and selection of the nucleic acids concerned.
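As a rough illustration of what a doublet frequency count is, the following Python sketch tallies the 16 overlapping two-base sequences in an RNA string and compares each count with the value expected if the first and second bases were independent. It is not the statistical methodology of the paper: the function names are hypothetical, and the observed/expected ratio is a simple stand-in for a formal test, since overlapping doublets are not independent observations.

```python
from itertools import product

BASES = "ACGU"


def doublet_counts(seq):
    """Count the overlapping two-base sequences in an RNA (or DNA) string."""
    seq = seq.upper().replace("T", "U")  # treat DNA input as RNA
    counts = {a + b: 0 for a, b in product(BASES, repeat=2)}
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip doublets containing ambiguous symbols
            counts[pair] += 1
    return counts


def expected_counts(counts):
    """Expected doublet counts if first and second base were independent.

    Assumption: expectation = (row total * column total) / grand total,
    as in an ordinary 4x4 contingency table.
    """
    total = sum(counts.values())
    first = {b: sum(counts[b + c] for c in BASES) for b in BASES}
    second = {b: sum(counts[a + b] for a in BASES) for b in BASES}
    if total == 0:
        return {pair: 0.0 for pair in counts}
    return {a + b: first[a] * second[b] / total for a, b in product(BASES, repeat=2)}


def observed_over_expected(counts):
    """Ratio of observed to expected count; None where expectation is zero."""
    expected = expected_counts(counts)
    return {
        pair: (counts[pair] / expected[pair]) if expected[pair] > 0 else None
        for pair in counts
    }


if __name__ == "__main__":
    counts = doublet_counts("ACGACG")
    ratios = observed_over_expected(counts)
    for pair in sorted(counts):
        if counts[pair]:
            print(pair, counts[pair], ratios[pair])
```

A ratio well below 1 for a doublet such as CG would indicate under-representation relative to the base composition, which is the kind of departure the abstract reports for vertebrate sequences. Judging whether such a departure is significant requires a proper test of the sort the paper develops.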