Al-Tawil Mai Fayiz, Daoud Safa, Hatmal Ma'mon M, Taha Mutasem Omar
Department of Pharmaceutical Sciences, Faculty of Pharmacy, University of Jordan Amman 11942 Jordan
Department of Pharmaceutical Chemistry and Pharmacognosy, Faculty of Pharmacy, Applied Sciences Private University Amman Jordan.
RSC Adv. 2022 Apr 5;12(17):10686-10700. doi: 10.1039/d2ra00136e. eCollection 2022 Mar 31.
Cdc2-like kinase 4 (CLK4) inhibitors are of potential therapeutic value in many diseases particularly cancer. In this study, we combined extensive ligand-based pharmacophore exploration, ligand-receptor contact fingerprints generated by flexible docking, physicochemical descriptors and machine learning-quantitative structure-activity relationship (ML-QSAR) analysis to investigate the pharmacophoric/binding requirements for potent CLK4 antagonists. Several ML methods were attempted to tie these properties with anti-CLK4 bioactivities including multiple linear regression (MLR), random forests (RF), extreme gradient boosting (XGBoost), probabilistic neural network (PNN), and support vector regression (SVR). A genetic function algorithm (GFA) was combined with each method for feature selection. Eventually, GFA-SVR was found to produce the best self-consistent and predictive model. The model selected three pharmacophores, three ligand-receptor contacts and two physicochemical descriptors. The GFA-SVR model and associated pharmacophore models were used to screen the National Cancer Institute (NCI) structural database for novel CLK4 antagonists. Three potent hits were identified with the best one showing an anti-CLK4 IC value of 57 nM.
细胞周期蛋白依赖性激酶2样激酶4(CLK4)抑制剂在许多疾病尤其是癌症中具有潜在的治疗价值。在本研究中,我们结合了基于配体的广泛药效团探索、通过柔性对接生成的配体-受体接触指纹、物理化学描述符以及机器学习-定量构效关系(ML-QSAR)分析,以研究强效CLK4拮抗剂的药效团/结合要求。尝试了几种机器学习方法将这些性质与抗CLK4生物活性联系起来,包括多元线性回归(MLR)、随机森林(RF)、极端梯度提升(XGBoost)、概率神经网络(PNN)和支持向量回归(SVR)。将遗传函数算法(GFA)与每种方法相结合进行特征选择。最终,发现GFA-SVR产生了最佳的自洽和预测模型。该模型选择了三个药效团、三个配体-受体接触和两个物理化学描述符。利用GFA-SVR模型和相关的药效团模型在国立癌症研究所(NCI)结构数据库中筛选新型CLK4拮抗剂。鉴定出三个强效命中物,其中最佳的一个显示抗CLK4 IC值为57 nM。