Durai Prasannavenkatesh, Lee Sue Jung, Lee Jae Wook, Pan Cheol-Ho, Park Keunwan
Natural Product Informatics Research Center, Korea Institute of Science and Technology, Gangneung, 25451, Republic of Korea.
Natural Product Research Center, Korea Institute of Science and Technology, Gangneung, 25451, Republic of Korea.
J Cheminform. 2023 Sep 23;15(1):86. doi: 10.1186/s13321-023-00760-6.
Machine learning-based chemical screening has made substantial progress in recent years. However, these predictions often have low accuracy and high uncertainty when identifying new active chemical scaffolds. Hence, a high proportion of retrieved compounds are not structurally novel. In this study, we proposed a strategy to address this issue by iteratively optimizing an evolutionary chemical binding similarity (ECBS) model using experimental validation data. Various data update and model retraining schemes were tested to efficiently incorporate new experimental data into ECBS models, resulting in a fine-tuned ECBS model with improved accuracy and coverage. To demonstrate the effectiveness of our approach, we identified the novel hit molecules for the mitogen-activated protein kinase kinase 1 (MEK1). These molecules showed sub-micromolar affinity (Kd 0.1-5.3 μM) to MEKs and were distinct from previously-known MEK1 inhibitors. We also determined the binding specificity of different MEK isoforms and proposed potential docking models. Furthermore, using de novo drug design tools, we utilized one of the new MEK inhibitors to generate additional drug-like molecules with improved binding scores. This resulted in the identification of several potential MEK1 inhibitors with better binding affinity scores. Our results demonstrated the potential of this approach for identifying novel hit molecules and optimizing their binding affinities.
近年来,基于机器学习的化学筛选取得了显著进展。然而,在识别新的活性化学骨架时,这些预测往往准确率低且不确定性高。因此,大量检索到的化合物在结构上并非新颖。在本研究中,我们提出了一种策略来解决这一问题,即使用实验验证数据迭代优化进化化学结合相似性(ECBS)模型。测试了各种数据更新和模型再训练方案,以有效地将新的实验数据纳入ECBS模型,从而得到一个经过微调的ECBS模型,其准确率和覆盖范围均有所提高。为了证明我们方法的有效性,我们鉴定了丝裂原活化蛋白激酶激酶1(MEK1)的新型命中分子。这些分子对MEK显示出亚微摩尔亲和力(Kd 0.1 - 5.3 μM),且与先前已知的MEK1抑制剂不同。我们还确定了不同MEK亚型的结合特异性,并提出了潜在的对接模型。此外,使用从头药物设计工具,我们利用其中一种新的MEK抑制剂生成了具有更高结合分数的其他类药物分子。这导致鉴定出了几种具有更好结合亲和力分数的潜在MEK1抑制剂。我们的结果证明了这种方法在识别新型命中分子和优化其结合亲和力方面的潜力。