Fogolari Federico, Tosatto Silvio C E, Colombo Giorgio
Dipartimento di Scienze e Tecnologie Biomediche, Università di Udine, P,le Kolbe 4, 33100 Udine, Italy.
BMC Bioinformatics. 2005 Dec 14;6:301. doi: 10.1186/1471-2105-6-301.
Estimators of free energies are routinely used to judge the quality of protein structural models. As these estimators still present inaccuracies, they are frequently evaluated by discriminating native or native-like conformations from large ensembles of so-called decoy structures.
A decoy set is obtained from snapshots taken from 5 long (100 ns) molecular dynamics (MD) simulations of the thermostable subdomain from chicken villin headpiece. An evaluation of the energy of the decoys is given using: i) a residue based contact potential supplemented by a term for the quality of dihedral angles; ii) a recently introduced combination of four statistical scoring functions for model quality estimation (FRST); iii) molecular mechanics with solvation energy estimated either according to the generalized Born surface area (GBSA) or iv) the Poisson-Boltzmann surface area (PBSA) method.
The decoy set presented here has the following features which make it attractive for testing energy scoring functions:1) it covers a broad range of RMSD values (from less than 2.0 A to more than 12 A);2) it has been obtained from molecular dynamics trajectories, starting from different non-native-like conformations which have diverse behaviour, with secondary structure elements correctly or incorrectly formed, and in one case folding to a native-like structure. This allows not only for scoring of static structures, but also for studying, using free energy estimators, the kinetics of folding;3) all structures have been obtained from accurate MD simulations in explicit solvent and after molecular mechanics (MM) energy minimization using an implicit solvent method. The quality of the covalent structure therefore does not suffer from steric or covalent problems. The statistical and physical effective energy functions tested on the set behave differently when native simulation snapshots are included or not in the set and when averaging over the trajectory is performed.
自由能估计器经常被用于评判蛋白质结构模型的质量。由于这些估计器仍存在不准确之处,它们常常通过从大量所谓的诱饵结构集合中区分出天然或类天然构象来进行评估。
通过从鸡肌动蛋白头部热稳定亚结构域的5次长时间(100纳秒)分子动力学(MD)模拟所拍摄的快照中获取一个诱饵集。使用以下方法对诱饵的能量进行评估:i)基于残基的接触势,并辅以二面角质量项;ii)最近引入的用于模型质量估计的四个统计评分函数的组合(FRST);iii)分子力学,其溶剂化能根据广义玻恩表面积(GBSA)进行估计,或者iv)泊松-玻尔兹曼表面积(PBSA)方法。
此处呈现的诱饵集具有以下使其对测试能量评分函数具有吸引力的特征:1)它涵盖了广泛的均方根偏差(RMSD)值范围(从小于2.0埃到大于12埃);2)它是从分子动力学轨迹中获得的,起始于具有不同行为的不同非类天然构象,二级结构元件正确或错误形成,并且在一种情况下折叠成类天然结构。这不仅允许对静态结构进行评分,还允许使用自由能估计器研究折叠动力学;3)所有结构均通过在显式溶剂中的精确MD模拟以及使用隐式溶剂方法进行分子力学(MM)能量最小化后获得。因此,共价结构的质量不存在空间或共价问题。当该集合中包含或不包含天然模拟快照以及对轨迹进行平均时,在该集合上测试的统计和物理有效能量函数表现不同。