Suppr超能文献

对接配体到柔性和溶剂化的大分子中。5. 基于力场的配体与蛋白质结合亲和力的预测。

Docking ligands into flexible and solvated macromolecules. 5. Force-field-based prediction of binding affinities of ligands to proteins.

机构信息

Department of Chemistry, McGill University, 801 Sherbrooke Street West, Montreal, Quebec, Canada H3A 2K6.

出版信息

J Chem Inf Model. 2009 Nov;49(11):2564-71. doi: 10.1021/ci900251k.

Abstract

We report herein our efforts in the development of three empirical scoring functions with application in protein-ligand docking. A first scoring function was developed from 209 crystal structures of protein-ligand complexes and a second one from 946 cross-docked complexes. Tuning of the coefficients for the different terms making up these functions was performed by an iterative approach to optimize the correlations between observed activities and calculated scores. A third scoring function was developed from libraries of known actives and decoys docked to six different protein conformational ensembles. In the latter case, the tuning of the coefficients was performed so as to optimize the area under the curve of a receiver operating characteristic (ROC) for the discrimination of actives and inactives. The newly developed scoring functions were next assessed on independent sets of protein-ligand complexes for their ability to predict binding affinities and to discriminate actives from inactives. In the first validation the first function, which was trained on active compounds only, performed as well as other commonly used ones. On a high-throughput virtual screening validation on five protein conformational ensembles, the third scoring function that included data from inactive compounds performed significantly better. This validation showed that the inclusion of data from inactive compounds is critical for performance in virtual high-throughput screening applications.

摘要

我们在此报告了开发三个经验评分函数的努力,这些函数可应用于蛋白质-配体对接。第一个评分函数是从 209 个蛋白质-配体复合物的晶体结构中开发的,第二个是从 946 个交叉对接复合物中开发的。通过迭代方法调整组成这些函数的不同项的系数,以优化观察到的活性和计算得分之间的相关性。第三个评分函数是从已知活性和诱饵库中开发的,这些库与六个不同的蛋白质构象集合对接。在后一种情况下,调整系数的目的是优化接收者操作特征 (ROC) 曲线下的面积,以区分活性和非活性。接下来,将新开发的评分函数应用于独立的蛋白质-配体复合物数据集,以评估它们预测结合亲和力和区分活性和非活性的能力。在第一项验证中,仅在活性化合物上进行训练的第一个函数表现与其他常用函数一样好。在对五个蛋白质构象集合的高通量虚拟筛选验证中,包含非活性化合物数据的第三个评分函数表现明显更好。这项验证表明,在虚拟高通量筛选应用中,包含非活性化合物的数据对于性能至关重要。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验