Kleffe J, Grau E
Institut für Molekularbiologie und Biochemie, Freie Universität Berlin.
Comput Appl Biosci. 1993 Jun;9(3):275-83. doi: 10.1093/bioinformatics/9.3.275.
A method was previously developed for computation of pattern probabilities in random sequences under Markov chain models. We extend this method to the calculation of the joint distribution for two patterns. An application yields the distribution of the right choice measure for expressivity and how significance bounds depend on sequence length. These bounds are used to show that the choice of pyrimidine in codon position 3 of Escherichia coli genes deviates considerably from a general Markov process model for coding regions. We also derive some statistical evidence that this significant deviation is limited to codon position 3.
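To make the idea of a pattern probability under a Markov chain model concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a first-order chain over the alphabet ACGT and computes the probability that a single pattern occurs at least once in a random sequence of length n, using a generic dynamic program over a string-matching automaton. The function names and the `init` and `trans` parameters (initial and transition probabilities) are hypothetical, introduced only for illustration. The joint distribution for two patterns discussed in the abstract is not covered here.

```python
ALPHABET = "ACGT"


def build_automaton(pattern, alphabet=ALPHABET):
    """Return delta, where delta[state][symbol] is the next matching state.

    State k means the last k symbols read equal the first k symbols of
    the pattern (the longest such k).
    """
    m = len(pattern)
    delta = [dict() for _ in range(m + 1)]
    for state in range(m + 1):
        for c in alphabet:
            s = pattern[:state] + c
            k = min(m, len(s))
            # Longest prefix of the pattern that is a suffix of s.
            while k > 0 and pattern[:k] != s[-k:]:
                k -= 1
            delta[state][c] = k
    return delta


def prob_pattern_occurs(pattern, n, init, trans, alphabet=ALPHABET):
    """P(pattern occurs at least once) in a length-n first-order Markov sequence.

    init[a]     : probability that the first symbol is a   (assumed input)
    trans[a][b] : probability that b follows a             (assumed input)
    """
    m = len(pattern)
    if n < m:
        return 0.0
    delta = build_automaton(pattern, alphabet)
    hit = 0.0   # probability mass of sequences that already contain the pattern
    dist = {}   # (last symbol, automaton state) -> probability, pattern not yet seen
    for c in alphabet:
        state = delta[0][c]
        if state == m:
            hit += init[c]
        else:
            dist[(c, state)] = dist.get((c, state), 0.0) + init[c]
    for _ in range(n - 1):
        new_dist = {}
        for (a, state), p in dist.items():
            for c in alphabet:
                q = p * trans[a][c]
                nxt = delta[state][c]
                if nxt == m:
                    hit += q   # first occurrence completed; absorb this mass
                else:
                    new_dist[(c, nxt)] = new_dist.get((c, nxt), 0.0) + q
        dist = new_dist
    return hit
```

Tracking the last symbol alongside the automaton state is what lets the sketch use first-order transition probabilities. A higher-order chain would need a longer context in the state, which is a design choice outside this example.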