Miyazawa Sanzo, Kinjo Akira R
Graduate School of Engineering, Gunma University, Kiryu, Gunma 376-8515, Japan.
Phys Rev E Stat Nonlin Soft Matter Phys. 2008 May;77(5 Pt 1):051910. doi: 10.1103/PhysRevE.77.051910. Epub 2008 May 14.
The properties of contact matrices ( C matrices) needed for native proteins to be the lowest-energy conformations are considered in relation to a contact energy matrix ( E matrix). The total conformational energy is assumed to consist of pairwise interaction energies between atoms or residues, each of which is expressed as a product of a conformation-dependent function (an element of the C matrix) and a sequence-dependent energy parameter (an element of the E matrix). Such pairwise interactions in proteins force native C matrices to be in a relationship as if the interactions are a Go-like potential [N. Go, Annu. Rev. Biophys. Bioeng. 12, 183 (1983)] for the native C matrix, because the lowest bound of the total energy function is equal to the total energy of the native conformation interacting in a Go-like pairwise potential. This relationship between C and E matrices corresponds to (a) a parallel relationship between the eigenvectors of the C and E matrices and a linear relationship between their eigenvalues and (b) a parallel relationship between a contact number vector and the principal eigenvectors of the C and E matrices, where the E matrix is expanded in a series of eigenspaces with an additional constant term. The additional constant term in the spectral expansion of the E matrix is indicated by the lowest bound of the total energy function to correspond to a threshold of contact energy that approximately separates native contacts from non-native ones. Inner products between the principal eigenvector of the C matrix, that of the E matrix, and a contact number vector have been examined for 182 proteins, each of which is a representative from each family of the SCOP database [Murzin, J. Mol. Biol. 247, 536 (1995)], and the results indicate the parallel tendencies between those vectors. A statistical contact potential [S. Miyazawa and R. L. Jernigan, Proteins 34, 49 (1999); S. Miyazawa and R. L. Jernigan, Proteins50, 35 (2003)] estimated from protein crystal structures was used to evaluate pairwise residue-residue interactions in the proteins. In addition, the spectral representation of C and E matrices reveals that pairwise residue-residue interactions, which depend only on the types of interacting amino acids, but not on other residues in a protein, are insufficient and other interactions including residue connectivities and steric hindrance are needed to make native structures unique lowest-energy conformations.
考虑天然蛋白质处于最低能量构象所需的接触矩阵(C矩阵)的性质与接触能量矩阵(E矩阵)的关系。假设总构象能量由原子或残基之间的成对相互作用能组成,其中每一项都表示为构象相关函数(C矩阵的一个元素)与序列相关能量参数(E矩阵的一个元素)的乘积。蛋白质中的这种成对相互作用迫使天然C矩阵处于一种关系中,就好像这些相互作用是天然C矩阵的类Go势 [N. Go, Annu. Rev. Biophys. Bioeng. 12, 183 (1983)],因为总能量函数的下限等于以类Go成对势相互作用的天然构象的总能量。C矩阵和E矩阵之间的这种关系对应于:(a)C矩阵和E矩阵的特征向量之间的平行关系以及它们特征值之间的线性关系;(b)接触数向量与C矩阵和E矩阵的主特征向量之间的平行关系,其中E矩阵在一系列特征子空间中展开并带有一个附加常数项。E矩阵谱展开中的附加常数项由总能量函数的下限表示,对应于一个接触能量阈值,该阈值近似地将天然接触与非天然接触区分开。已对182种蛋白质研究了C矩阵的主特征向量、E矩阵的主特征向量与接触数向量之间的内积,每种蛋白质都是SCOP数据库 [Murzin, J. Mol. Biol. 247, 536 (1995)] 中每个家族的代表,结果表明这些向量之间存在平行趋势。利用从蛋白质晶体结构估计的统计接触势 [S. Miyazawa和R. L. Jernigan, Proteins 34, 49 (1999); S. Miyazawa和R. L. Jernigan, Proteins50, 35 (2003)] 来评估蛋白质中残基间的成对相互作用。此外,C矩阵和E矩阵的谱表示表明,仅依赖于相互作用氨基酸类型而不依赖于蛋白质中其他残基的残基间成对相互作用是不够的,还需要包括残基连接性和空间位阻在内的其他相互作用才能使天然结构成为唯一的最低能量构象。