Pereira De Araújo A F
Departamento de Biologia Celular, International Center of Condensed Matter Physics, Universidade de Brasília, Brasília-DF 70910-900, Brazil.
Proc Natl Acad Sci U S A. 1999 Oct 26;96(22):12482-7. doi: 10.1073/pnas.96.22.12482.
The present study explores a "hydrophobic" energy function for folding simulations of the protein lattice model. The contribution of each monomer to conformational energy is the product of its "hydrophobicity" and the number of contacts it makes, i.e., E(h,c) = -Sigma N/i=1 c(i)h(i) = -(h.c) is the negative scalar product between two vectors in N-dimensional cartesian space: h = (h1,., hN), which represents monomer hydrophobicities and is sequence-dependent; and c = (c(1),., c(N)), which represents the number of contacts made by each monomer and is conformation-dependent. A simple theoretical analysis shows that restrictions are imposed concomitantly on both sequences and native structures if the stability criterion for protein-like behavior is to be satisfied. Given a conformation with vector c, the best sequence is a vector h on the direction upon which the projection of c - c is maximal, where c is the diagonal vector with components equal to c, the average number of contacts per monomer in the unfolded state. Best native conformations are suggested to be not maximally compact, as assumed in many studies, but the ones with largest variance of contacts among its monomers, i.e., with monomers tending to occupy completely buried or completely exposed positions. This inside/outside segregation is reflected on an apolar/polar distribution on the corresponding sequence. Monte Carlo simulations in two dimensions corroborate this general scheme. Sequences targeted to conformations with large contact variances folded cooperatively with thermodynamics of a two-state transition. Sequences targeted to maximally compact conformations, which have lower contact variance, were either found to have degenerate ground state or to fold with much lower cooperativity.
本研究探索了一种用于蛋白质晶格模型折叠模拟的“疏水”能量函数。每个单体对构象能量的贡献是其“疏水性”与其形成的接触数的乘积,即E(h,c)= -ΣN/i=1 c(i)h(i)= -(h·c),这是N维笛卡尔空间中两个向量的负标量积:h = (h1, ..., hN),它表示单体疏水性且依赖于序列;c = (c(1), ..., c(N)),它表示每个单体形成的接触数且依赖于构象。一个简单的理论分析表明,如果要满足类似蛋白质行为的稳定性标准,那么对序列和天然结构都会同时施加限制。给定一个具有向量c的构象,最佳序列是c - c的投影最大方向上的向量h,其中c是对角向量,其分量等于c,即未折叠状态下每个单体的平均接触数。最佳天然构象并非如许多研究中假设的那样是最紧密的,而是其单体间接触差异最大的构象,即单体倾向于占据完全埋藏或完全暴露的位置。这种内外分离反映在相应序列的非极性/极性分布上。二维的蒙特卡罗模拟证实了这一总体方案。针对具有大接触差异构象的序列以两态转变的热力学方式协同折叠。针对具有较低接触差异的最紧密构象的序列,要么发现具有简并基态,要么以低得多的协同性折叠。