Kumar Sivakumar Prasanth
Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India.
J Mol Model. 2018 Sep 15;24(10):282. doi: 10.1007/s00894-018-3820-7.
Ensemble methods are gaining more importance in structure-based approaches as single protein-ligand complexes strongly influence the outcomes of virtual screening. Structure-based pharmacophore modeling based on a single protein-ligand complex with complex feature combinations is often limited to certain chemical classes. The REPHARMBLE (receptor pharmacophore ensemble) approach presented here examines the ability of an ensemble of selected protein-ligand complexes to populate pharmacophore space in the ligand binding site, rigorously assesses the importance of pharmacophore features using Poisson statistic and information theory-based entropy calculations, and generates pharmacophore models with high probabilities. In addition, an ensemble scoring function that combines all the resultant high-scoring pharmacophore models to score molecules is derived. The REPHARMBLE approach was evaluated on ten DUD-E benchmark datasets and afforded good screening performance, as measured by receiver operating characteristic, enrichment factor and Güner-Henry score. Although one of the high-scoring models achieved superior statistical results in each dataset, the ensemble scoring function balanced the shortcomings of each model and passed with close performance measures. This approach offers a reliable way of choosing the best-scoring features to build four-feature pharmacophore queries and customize a target-biased 'pharmacophore ensemble' scoring function for subsequent virtual screening.
在基于结构的方法中,集成方法正变得越来越重要,因为单个蛋白质-配体复合物会强烈影响虚拟筛选的结果。基于具有复杂特征组合的单个蛋白质-配体复合物的基于结构的药效团建模通常仅限于某些化学类别。本文提出的REPHARMBLE(受体药效团集成)方法研究了一组选定的蛋白质-配体复合物在配体结合位点填充药效团空间的能力,使用泊松统计和基于信息论的熵计算严格评估药效团特征的重要性,并生成具有高概率的药效团模型。此外,还推导了一种集成评分函数,该函数结合所有生成的高分药效团模型对分子进行评分。REPHARMBLE方法在十个DUD-E基准数据集上进行了评估,并通过接收器操作特征、富集因子和Güner-Henry评分衡量,表现出良好的筛选性能。尽管每个数据集中的一个高分模型都取得了优异的统计结果,但集成评分函数平衡了每个模型的缺点,并以相近的性能指标通过。这种方法提供了一种可靠的方式来选择最佳评分特征,以构建四特征药效团查询,并为后续的虚拟筛选定制一个针对目标的“药效团集成”评分函数。