Laboratory of Systems Pharmacology, Department of Systems Biology, Harvard Program in Therapeutic Science, Harvard Medical School, Boston, Massachusetts 02115, United States.
Novartis Institutes for Biomedical Research, Cambridge, Massachusetts 02139, United States.
J Chem Inf Model. 2023 Sep 11;63(17):5457-5472. doi: 10.1021/acs.jcim.3c00347. Epub 2023 Aug 18.
Kinases have been the focus of drug discovery programs for three decades leading to over 70 therapeutic kinase inhibitors and biophysical affinity measurements for over 130,000 kinase-compound pairs. Nonetheless, the precise target spectrum for many kinases remains only partly understood. In this study, we describe a computational approach to unlocking qualitative and quantitative kinome-wide binding measurements for structure-based machine learning. Our study has three components: (i) a Kinase Inhibitor Complex (KinCo) data set comprising predicted kinase structures paired with experimental binding constants, (ii) a machine learning loss function that integrates qualitative and quantitative data for model training, and (iii) a structure-based machine learning model trained on KinCo. We show that our approach outperforms methods trained on crystal structures alone in predicting binary and quantitative kinase-compound interaction affinities; relative to structure-free methods, our approach also captures known kinase biochemistry and more successfully generalizes to distant kinase sequences and compound scaffolds.
三十年来,激酶一直是药物发现项目的重点,已开发出超过 70 种治疗性激酶抑制剂,并对超过 130,000 种激酶-化合物对进行了生物物理亲和性测量。尽管如此,许多激酶的确切靶标谱仍仅部分了解。在这项研究中,我们描述了一种用于基于结构的机器学习的计算方法,可实现定性和定量的全激酶组结合测量。我们的研究有三个组成部分:(i)包含预测激酶结构与实验结合常数的激酶抑制剂复合物(KinCo)数据集,(ii)用于模型训练的整合定性和定量数据的机器学习损失函数,以及(iii)基于 KinCo 训练的基于结构的机器学习模型。我们表明,我们的方法在预测二进制和定量激酶-化合物相互作用亲和力方面优于仅基于晶体结构训练的方法;与无结构方法相比,我们的方法还捕获了已知的激酶生物化学知识,并更成功地推广到了遥远的激酶序列和化合物支架。