Key Laboratory for Carbonaceous Waste Processing and Process Intensification Research of Zhejiang Province, University of Nottingham Ningbo China, 199 Taikang East Road, Ningbo 315100, China.
School of Computer Science, University of Nottingham Ningbo China, 199 Taikang East Road, Ningbo 315100, China.
Molecules. 2021 Dec 10;26(24):7492. doi: 10.3390/molecules26247492.
A machine learning approach has been applied to virtual screening for lysine specific demethylase 1 (LSD1) inhibitors. LSD1 is an important anti-cancer target. Machine learning models to predict activity were constructed using Morgan molecular fingerprints. The dataset, consisting of 931 molecules with LSD1 inhibition activity, was obtained from the ChEMBL database. An evaluation of several candidate algorithms on the main dataset revealed that the support vector regressor gave the best model, with a coefficient of determination (R2) of 0.703. Virtual screening, using this model, identified five predicted potent inhibitors from the ZINC database comprising more than 300,000 molecules. The virtual screening recovered a known inhibitor, RN1, as well as four compounds where activity against LSD1 had not previously been suggested. Thus, we performed a machine-learning-enabled virtual screening of LSD1 inhibitors using only the structural information of the molecules.
一种机器学习方法已被应用于赖氨酸特异性去甲基化酶 1(LSD1)抑制剂的虚拟筛选。LSD1 是一个重要的抗癌靶点。使用 Morgan 分子指纹构建了用于预测活性的机器学习模型。该数据集由来自 ChEMBL 数据库的 931 个具有 LSD1 抑制活性的分子组成。在主数据集上评估了几种候选算法,结果表明支持向量回归器给出了最佳模型,决定系数(R2)为 0.703。使用该模型进行虚拟筛选,从包含超过 300,000 个分子的 ZINC 数据库中鉴定出五种预测的强效抑制剂。虚拟筛选回收了一种已知抑制剂 RN1 以及四种以前未显示 LSD1 活性的化合物。因此,我们仅使用分子的结构信息进行了 LSD1 抑制剂的机器学习辅助虚拟筛选。