Department of Pharmacology, Kangwon National University School of Medicine, Chuncheon 24341, Republic of Korea.
Department of Medical Environmental Biology and Tropical Medicine, Kangwon National University School of Medicine, Chuncheon 24341, Republic of Korea.
J Chem Inf Model. 2023 Nov 13;63(21):6487-6500. doi: 10.1021/acs.jcim.3c01090. Epub 2023 Oct 31.
Machine learning algorithms have been increasingly applied in drug development due to their efficiency and effectiveness. Machine learning-based drug repurposing can contribute to the identification of novel therapeutic applications for drugs with other indications. The current study used a trained machine learning model to screen a vast chemical library for new JAK2 inhibitors, the biological activities of which were reported. Reference JAK2 inhibitors, comprising 1911 compounds, have experimentally determined IC values. To generate the input to the machine learning model, reference compounds were subjected to RDKit, a cheminformatic toolkit, to extract molecular descriptors. A Random Forest Regression model from the Scikit-learn machine learning library was applied to obtain a predictive regression model and to analyze each molecular descriptor's role in determining IC values in the reference data set. Then, IC values of the library compounds, comprised of 1,576,903 compounds, were predicted using the generated regression model. Interestingly, some compounds that exhibit high IC values from the prediction were reported to possess JAK inhibition activity, which indicates the limitations of the prediction model. To confirm the JAK2 inhibition activity of predicted compounds, molecular docking and molecular dynamics simulation were carried out with the JAK inhibitor reference compound, tofacitinib. The binding affinity of docked compounds in the active region of JAK2 was also analyzed by the gmxMMPBSA approach. Furthermore, experimental validation confirmed the results from the computational analysis. Results showed highly comparable outcomes concerning tofacitinib. Conclusively, the machine learning model can efficiently improve the virtual screening of drugs and drug development.
机器学习算法因其高效性和有效性,在药物开发中得到了越来越多的应用。基于机器学习的药物重新定位可以有助于发现具有其他适应症的药物的新治疗应用。本研究使用经过训练的机器学习模型筛选了一个庞大的化学库,以寻找具有报道的生物活性的新型 JAK2 抑制剂。参考 JAK2 抑制剂包括 1911 种化合物,其 IC 值已通过实验确定。为了生成机器学习模型的输入,参考化合物经过 RDKit(一种化学信息学工具包)处理,以提取分子描述符。来自 Scikit-learn 机器学习库的随机森林回归模型被应用于获得预测回归模型,并分析每个分子描述符在确定参考数据集 IC 值方面的作用。然后,使用生成的回归模型预测库化合物(由 1576903 种化合物组成)的 IC 值。有趣的是,一些预测具有高 IC 值的化合物被报道具有 JAK 抑制活性,这表明预测模型存在局限性。为了确认预测化合物的 JAK2 抑制活性,对 JAK 抑制剂参考化合物托法替尼进行了分子对接和分子动力学模拟。还通过 gmxMMPBSA 方法分析了对接化合物在 JAK2 活性区域的结合亲和力。此外,实验验证证实了计算分析的结果。结果表明,与托法替尼高度可比。总之,机器学习模型可以有效地提高药物虚拟筛选和药物开发的效率。