Benkert Pascal, Tosatto Silvio C E, Schomburg Dietmar
Institute for Biochemistry, University of Cologne, 50674 Cologne, Germany.
Proteins. 2008 Apr;71(1):261-77. doi: 10.1002/prot.21715.
In protein structure prediction, a considerable number of alternative models are usually produced from which subsequently the final model has to be selected. Thus, a scoring function for the identification of the best model within an ensemble of alternative models is a key component of most protein structure prediction pipelines. QMEAN, which stands for Qualitative Model Energy ANalysis, is a composite scoring function describing the major geometrical aspects of protein structures. Five different structural descriptors are used. The local geometry is analyzed by a new kind of torsion angle potential over three consecutive amino acids. A secondary structure-specific distance-dependent pairwise residue-level potential is used to assess long-range interactions. A solvation potential describes the burial status of the residues. Two simple terms describing the agreement of predicted and calculated secondary structure and solvent accessibility, respectively, are also included. A variety of different implementations are investigated and several approaches to combine and optimize them are discussed. QMEAN was tested on several standard decoy sets including a molecular dynamics simulation decoy set as well as on a comprehensive data set of totally 22,420 models from server predictions for the 95 targets of CASP7. In a comparison to five well-established model quality assessment programs, QMEAN shows a statistically significant improvement over nearly all quality measures describing the ability of the scoring function to identify the native structure and to discriminate good from bad models. The three-residue torsion angle potential turned out to be very effective in recognizing the native fold.
在蛋白质结构预测中,通常会产生大量的替代模型,随后必须从中选择最终模型。因此,用于在一组替代模型中识别最佳模型的评分函数是大多数蛋白质结构预测流程的关键组成部分。QMEAN代表定性模型能量分析,是一种描述蛋白质结构主要几何特征的复合评分函数。它使用了五种不同的结构描述符。通过一种针对三个连续氨基酸残基的新型扭转角势来分析局部几何结构。使用一种依赖于二级结构的距离相关的成对残基水平势来评估远程相互作用。溶剂化势描述了残基的埋藏状态。还包括两个分别描述预测二级结构与计算二级结构以及溶剂可及性一致性的简单项。研究了各种不同的实现方式,并讨论了几种组合和优化它们的方法。QMEAN在包括分子动力学模拟诱饵集在内的几个标准诱饵集上进行了测试,还在来自CASP7的95个目标的服务器预测的总共22420个模型的综合数据集上进行了测试。与五个成熟的模型质量评估程序相比,QMEAN在几乎所有描述评分函数识别天然结构以及区分好坏模型能力的质量指标上都显示出统计学上的显著改进。三残基扭转角势在识别天然折叠方面非常有效。