Nicolotti Orazio, Giangreco Ilenia, Miscioscia Teresa Fabiola, Carotti Angelo
Dipartimento Farmaco-Chimico, University of Bari, via Orabona 4, I-70125 Bari, Italy.
J Chem Inf Model. 2009 Oct;49(10):2290-302. doi: 10.1021/ci9002409.
A multiobjective optimization algorithm was proposed for the automated integration of structure- and ligand-based molecular design. Driven by a genetic algorithm, the herein proposed approach enabled the detection of a number of trade-off QSAR models accounting simultaneously for two independent objectives. The first was biased toward best regressions among docking scores and biological affinities; the second minimized the atom displacements from a properly established crystal-based binding topology. Based on the concept of dominance, 3D QSAR equivalent models profiled the Pareto frontier and were, thus, designated as nondominated solutions of the search space. K-means clustering was, then, operated to select a representative subset of the available trade-off models. These were effectively subjected to GRID/GOLPE analyses for quantitatively featuring molecular determinants of ligand binding affinity. More specifically, it was demonstrated that a) diverse binding conformations occurred on the basis of the ligand ability to profitably contact different part of protein binding site; b) enzyme selectivity was better approached and interpreted by combining diverse equivalent models; and c) trade-off models were successful and even better than docking virtual screening, in retrieving at high sensitivity active hits from a large pool of chemically similar decoys. The approach was tested on a large series, very well-known to QSAR practitioners, of 3-amidinophenylalanine inhibitors of thrombin and trypsin, two serine proteases having rather different biological actions despite a high sequence similarity.
提出了一种多目标优化算法,用于基于结构和配体的分子设计的自动整合。在遗传算法的驱动下,本文提出的方法能够检测出一些兼顾两个独立目标的权衡定量构效关系(QSAR)模型。第一个目标倾向于对接分数和生物亲和力之间的最佳回归;第二个目标是使基于适当建立的晶体结构的结合拓扑结构的原子位移最小化。基于优势概念,三维定量构效关系等效模型描绘了帕累托前沿,因此被指定为搜索空间的非支配解。然后,运用K均值聚类算法从可用的权衡模型中选择一个代表性子集。对这些模型有效地进行GRID/GOLPE分析,以定量表征配体结合亲和力的分子决定因素。更具体地说,结果表明:a)基于配体与蛋白质结合位点不同部分有效接触的能力,会出现多种结合构象;b)通过组合不同的等效模型能更好地接近和解释酶的选择性;c)在从大量化学相似的诱饵中高灵敏度地检索活性命中物方面,权衡模型是成功的,甚至比对接虚拟筛选更好。该方法在一系列对QSAR从业者来说非常知名的、针对凝血酶和胰蛋白酶的3-脒基苯丙氨酸抑制剂上进行了测试,凝血酶和胰蛋白酶是两种丝氨酸蛋白酶,尽管序列相似度很高,但具有相当不同的生物学作用。