Sandia National Laboratories, Box 969, MS 9291, Livermore, California 94551, USA.
J Med Chem. 2012 Mar 8;55(5):1926-39. doi: 10.1021/jm200979x. Epub 2012 Feb 17.
We present a new approach for identifying features of ligand-protein binding interfaces that predict binding selectivity and demonstrate its effectiveness for predicting kinase inhibitor specificity. We analyzed a large set of human kinases and kinase inhibitors using clustering of experimentally determined inhibition constants (to define specificity classes of kinases and inhibitors) and virtual ligand docking (to extract structural and chemical features of the ligand-protein binding interfaces). We then used statistical methods to identify features characteristic of each class. Machine learning was employed to determine which combinations of characteristic features were predictive of class membership and to predict binding specificities and affinities of new compounds. Experiments showed predictions were 70% accurate. These results show that our method can automatically pinpoint on the three-dimensional binding interfaces pharmacophore-like features that act as "selectivity filters". The method is not restricted to kinases, requires no prior hypotheses about specific interactions, and can be applied to any protein families for which sets of structures and ligand binding data are available.
我们提出了一种新的方法来识别配体-蛋白结合界面的特征,这些特征可以预测结合选择性,并证明其在预测激酶抑制剂特异性方面的有效性。我们使用实验测定的抑制常数聚类(定义激酶和抑制剂的特异性类别)和虚拟配体对接(提取配体-蛋白结合界面的结构和化学特征)分析了一大组人类激酶和激酶抑制剂。然后,我们使用统计方法来识别每个类别的特征。机器学习用于确定哪些特征组合可以预测类别成员,并预测新化合物的结合特异性和亲和力。实验表明预测的准确率为 70%。这些结果表明,我们的方法可以自动确定三维结合界面上的类药性特征,这些特征充当“选择性过滤器”。该方法不仅限于激酶,不需要关于特定相互作用的先验假设,并且可以应用于任何具有结构和配体结合数据集的蛋白质家族。