School of Systems Biomedical Science and Department of Bioinformatics and Life Science, Soongsil University, Seoul, South Korea.
Laboratory of Computational Modeling of Drugs, Higher Medical & Biological School, South Ural State University, Chelyabinsk, Russia.
J Comput Chem. 2023 Jun 15;44(16):1493-1504. doi: 10.1002/jcc.27103. Epub 2023 Mar 16.
Janus kinase 2 (JAK2) is emerging as a potential therapeutic target for many inflammatory diseases such as myeloproliferative disorders (MPD), cancer and rheumatoid arthritis (RA). In this study, we have collected experimental data of JAK2 protein containing 6021 unique inhibitors. We then characterized them based on Morgan (ECFP6) fingerprints followed by clustering into training and test set based on their molecular scaffolds. These data were used to build the classification models with various supervised machine learning (ML) algorithms that could prioritize novel inhibitors for future drug development against JAK2 protein. The best model built by Random Forest (RF) and Morgan fingerprints achieved the G-mean value of 0.84 on the external test set. As an application of our classification model, virtual screening was performed against Drugbank molecules in order to identify the potential inhibitors based on the confidence score by RF model. Nine potential molecules were identified, which were further subject to molecular docking studies to evaluate the virtual screening results of the best RF model. This proposed method can prove useful for developing novel target-specific JAK2 inhibitors.
Janus 激酶 2(JAK2)作为许多炎症性疾病(如骨髓增生性疾病、癌症和类风湿关节炎)的潜在治疗靶点正在兴起。在这项研究中,我们收集了包含 6021 种独特抑制剂的 JAK2 蛋白的实验数据。然后,我们根据 Morgan(ECFP6)指纹对它们进行了特征描述,并根据它们的分子支架将其聚类为训练集和测试集。这些数据用于构建具有各种监督机器学习(ML)算法的分类模型,以便为未来针对 JAK2 蛋白的药物开发优先考虑新型抑制剂。基于随机森林(RF)和 Morgan 指纹构建的最佳模型在外部测试集上的 G-均值值达到了 0.84。作为我们分类模型的应用,针对 Drugbank 分子进行了虚拟筛选,以根据 RF 模型的置信度得分识别潜在抑制剂。确定了 9 个潜在分子,进一步进行分子对接研究以评估最佳 RF 模型的虚拟筛选结果。该方法可用于开发新型靶向 JAK2 抑制剂。