Ogata K, Ohya M, Umeyama H
School of Pharmaceutical Sciences, Kitasato University, Tokyo, Japan.
J Mol Graph Model. 1998 Aug-Dec;16(4-6):178-89, 254. doi: 10.1016/s1093-3263(98)80002-8.
In this paper, we obtained a similarity matrix for homology modeling based on the structure of proteins in a structural alignment. The alignment procedure was executed within dynamic programming generally used in alignment methods. An initial matrix derived from the structural alignment was optimized by the Markov chain Monte Carlo method at low temperature to fit its sequence alignment to the structural alignment. Structural alignment was performed on the basis of the superposition of C alpha atoms for two protein structures. The objective function in the Monte Carlo procedure was defined by entropy in the information theory, allowing us to show that the amino acid similarity matrix aligned accurately. When compared with the structural alignment, the average number of incorrect amino acid residues in the sequence alignment was 22.6 for all residues and about 3.7 for residues in structurally conserved regions. The alignment with our matrix was more similar to structural alignment than to sequence alignments using other amino acid substitution matrices.
在本文中,我们基于结构比对中蛋白质的结构获得了用于同源建模的相似性矩阵。比对过程是在比对方法中普遍使用的动态规划内执行的。从结构比对中导出的初始矩阵通过低温下的马尔可夫链蒙特卡罗方法进行优化,以使序列比对与结构比对相匹配。结构比对是基于两个蛋白质结构的Cα原子的叠加进行的。蒙特卡罗过程中的目标函数由信息论中的熵定义,这使我们能够表明氨基酸相似性矩阵准确对齐。与结构比对相比,序列比对中所有残基的平均错误氨基酸残基数为22.6,结构保守区域中的残基平均约为3.7。与使用其他氨基酸替换矩阵的序列比对相比,我们的矩阵进行的比对与结构比对更相似。