Zhang Li, Ai Hai-Xin, Li Shi-Meng, Qi Meng-Yuan, Zhao Jian, Zhao Qi, Liu Hong-Sheng
School of Life Science, Liaoning University, Shenyang 110036, China.
Research Center for Computer Simulating and Information Processing of Bio-macromolecules of Liaoning Province, Shenyang 110036, China.
Oncotarget. 2017 Sep 15;8(47):83142-83154. doi: 10.18632/oncotarget.20915. eCollection 2017 Oct 10.
In recent years, an epidemic of the highly pathogenic avian influenza H7N9 virus has persisted in China, with a high mortality rate. To develop novel anti-influenza therapies, we have constructed a machine-learning-based scoring function (RF-NA-Score) for the effective virtual screening of lead compounds targeting the viral neuraminidase (NA) protein. RF-NA-Score is more accurate than RF-Score, with a root-mean-square error of 1.46, Pearson's correlation coefficient of 0.707, and Spearman's rank correlation coefficient of 0.707 in a 5-fold cross-validation study. The performance of RF-NA-Score in a docking-based virtual screening of NA inhibitors was evaluated with a dataset containing 281 NA inhibitors and 322 noninhibitors. Compared with other docking-rescoring virtual screening strategies, rescoring with RF-NA-Score significantly improved the efficiency of virtual screening, and a strategy that averaged the scores given by RF-NA-Score, based on the binding conformations predicted with AutoDock, AutoDock Vina, and LeDock, was shown to be the best strategy. This strategy was then applied to the virtual screening of NA inhibitors in the SPECS database. The 100 selected compounds were tested in an H7N9 NA inhibition assay, and two compounds with novel scaffolds showed moderate inhibitory activities. These results indicate that RF-NA-Score improves the efficiency of virtual screening for NA inhibitors, and can be used successfully to identify new NA inhibitor scaffolds. Scoring functions specific for other drug targets could also be established with the same method.
近年来,高致病性禽流感H7N9病毒在中国持续流行,死亡率很高。为了开发新型抗流感疗法,我们构建了一种基于机器学习的评分函数(RF-NA-Score),用于有效虚拟筛选靶向病毒神经氨酸酶(NA)蛋白的先导化合物。在一项5折交叉验证研究中,RF-NA-Score比RF-Score更准确,均方根误差为1.46,皮尔逊相关系数为0.707,斯皮尔曼等级相关系数为0.707。使用包含281种NA抑制剂和322种非抑制剂的数据集评估了RF-NA-Score在基于对接的NA抑制剂虚拟筛选中的性能。与其他对接重评分虚拟筛选策略相比,使用RF-NA-Score进行重评分显著提高了虚拟筛选的效率,并且基于AutoDock、AutoDock Vina和LeDock预测的结合构象对RF-NA-Score给出的分数进行平均的策略被证明是最佳策略。然后将该策略应用于SPECS数据库中NA抑制剂的虚拟筛选。对筛选出的100种化合物进行了H7N9 NA抑制试验,两种具有新型骨架的化合物表现出中等抑制活性。这些结果表明,RF-NA-Score提高了NA抑制剂虚拟筛选的效率,并可成功用于鉴定新的NA抑制剂骨架。也可以用相同的方法建立针对其他药物靶点的评分函数。