Felts Anthony K, Gallicchio Emilio, Wallqvist Anders, Levy Ronald M
Department of Chemistry and Chemical Biology, Rutgers University, Wright-Rieman Laboratories, Piscataway, New Jersey 08854-8087, USA.
Proteins. 2002 Aug 1;48(2):404-22. doi: 10.1002/prot.10171.
Protein decoy data sets provide a benchmark for testing scoring functions designed for fold recognition and protein homology modeling problems. It is commonly believed that statistical potentials based on reduced atomic models are better able to discriminate native-like from misfolded decoys than scoring functions based on more detailed molecular mechanics models. Recent benchmark tests on small data sets, however, suggest otherwise. In this work, we report the results of extensive decoy detection tests using an effective free energy function based on the OPLS all-atom (OPLS-AA) force field and the Surface Generalized Born (SGB) model for the solvent electrostatic effects. The OPLS-AA/SGB effective free energy is used as a scoring function to detect native protein folds among a total of 48,832 decoys for 32 different proteins from Park and Levitt's 4-state-reduced, Levitt's local-minima, Baker's ROSETTA all-atom, and Skolnick's decoy sets. Solvent electrostatic effects are included through the Surface Generalized Born (SGB) model. All structures are locally minimized without restraints. From an analysis of the individual energy components of the OPLS-AA/SGB energy function for the native and the best-ranked decoy, it is determined that a balance of the terms of the potential is responsible for the minimized energies that most successfully distinguish the native from the misfolded conformations. Different combinations of individual energy terms provide less discrimination than the total energy. The results are consistent with observations that all-atom molecular potentials coupled with intermediate level solvent dielectric models are competitive with knowledge-based potentials for decoy detection and protein modeling problems such as fold recognition and homology modeling.
蛋白质诱饵数据集为测试针对折叠识别和蛋白质同源性建模问题设计的评分函数提供了一个基准。人们普遍认为,基于简化原子模型的统计势比基于更详细分子力学模型的评分函数更能区分天然样构象和错误折叠的诱饵。然而,最近对小数据集的基准测试却表明并非如此。在这项工作中,我们报告了使用基于OPLS全原子(OPLS-AA)力场和用于溶剂静电效应的表面广义玻恩(SGB)模型的有效自由能函数进行广泛诱饵检测测试的结果。OPLS-AA/SGB有效自由能被用作评分函数,以在来自Park和Levitt的四态简化、Levitt的局部最小值、Baker的ROSETTA全原子以及Skolnick的诱饵集中的32种不同蛋白质的总共48,832个诱饵中检测天然蛋白质折叠。通过表面广义玻恩(SGB)模型纳入溶剂静电效应。所有结构在无约束的情况下进行局部最小化。通过对天然构象和排名最佳的诱饵的OPLS-AA/SGB能量函数的各个能量成分进行分析,确定势项的平衡导致了最小化能量,这些能量最成功地区分了天然构象和错误折叠的构象。单个能量项的不同组合提供的区分度不如总能量。这些结果与以下观察结果一致,即全原子分子势与中级溶剂介电模型相结合,在诱饵检测以及诸如折叠识别和同源性建模等蛋白质建模问题中与基于知识的势具有竞争力。