Peterson R C
Uniformed Services University of the Health Sciences.
Biotechniques. 1988 Jan;6(1):34-40.
A method is described for calculating the probabilities of nucleotide sequences from dinucleotide frequencies. The dinucleotide and mononucleotide frequencies used can be obtained from nearest neighbor analysis or from databank sequences. If frequencies from nearest neighbor analysis are used, oligonucleotide probabilities can be calculated for genomes for which little or no sequence data are available. Within a given genome, hexanucleotide palindromes with the same base composition are predicted, and shown, to span a broad range of probabilities (14).
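The abstract does not give the formula itself. One common way to compute a sequence probability from dinucleotide and mononucleotide frequencies, assumed here rather than taken from the paper, is the first-order Markov (nearest neighbor) estimate: P(b1 b2 ... bn) = f(b1b2) f(b2b3) ... f(b(n-1)bn) / [f(b2) f(b3) ... f(b(n-1))]. The sketch below implements that estimate in Python. The names `oligo_probability` and `example_tables` are hypothetical, and the frequency table is invented for illustration only; it is not measured data from any genome.

```python
import math

BASES = "ACGT"


def oligo_probability(seq, di_freq, mono_freq):
    """First-order Markov estimate of an oligonucleotide's probability.

    Multiplies the frequencies of all overlapping dinucleotides and divides
    by the frequencies of the internal bases, each of which is shared by two
    adjacent dinucleotides. This is an assumed formulation, not necessarily
    the exact one used in the paper.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    if len(seq) == 1:
        return mono_freq[seq]
    numerator = math.prod(di_freq[seq[i:i + 2]] for i in range(len(seq) - 1))
    denominator = math.prod(mono_freq[base] for base in seq[1:-1])
    return numerator / denominator


def example_tables(shift=0.04):
    """Build a HYPOTHETICAL dinucleotide table (illustrative values only).

    Starts from a uniform table and moves frequency around the cycle
    CG -> CA, AA -> AG, so that CG and AA are depleted while the row and
    column sums, and hence the mononucleotide frequencies, stay unchanged.
    """
    di_freq = {a + b: 1 / 16 for a in BASES for b in BASES}
    di_freq["CG"] -= shift
    di_freq["AA"] -= shift
    di_freq["CA"] += shift
    di_freq["AG"] += shift
    # Mononucleotide frequency = sum over all dinucleotides starting with it.
    mono_freq = {a: sum(di_freq[a + b] for b in BASES) for a in BASES}
    return di_freq, mono_freq


if __name__ == "__main__":
    di_freq, mono_freq = example_tables()
    # Hexanucleotide palindromes sharing one base composition (2 A, 2 T, 1 G, 1 C).
    for site in ("GAATTC", "AAGCTT", "AGATCT", "TTCGAA"):
        print(site, oligo_probability(site, di_freq, mono_freq))
```

With uniform frequencies every hexanucleotide gets the same probability, 4^-6. With a non-uniform table such as the hypothetical one above, palindromes of identical base composition receive different probabilities because they are built from different dinucleotides, which is the kind of spread the abstract reports.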