Xu Zhongren, Yang Lixiang, Zhang Xinghao, Zhang Qiling, Yang Zhibin, Liu Yuanhao, Wei Shuang, Liu Wukun
Jiangsu Collaborative Innovation Center of Chinese Medicinal Resources Industrialization, School of Medicine and Holistic Integrative Medicine, School of Pharmacy, Nanjing University of Chinese Medicine, Nanjing, China.
Shenzhen Bay Laboratory, Shenzhen, China.
Front Mol Biosci. 2020 Sep 29;7:556481. doi: 10.3389/fmolb.2020.556481. eCollection 2020.
The outbreak of 2019 novel coronavirus (COVID-19) has caused serious threat to public health. Discovery of new anti-COVID-19 drugs is urgently needed. Fortunately, the crystal structure of COVID-19 3CL proteinase was recently resolved. The proteinase has been identified as a promising target for drug discovery in this crisis. Here, a dataset including 2030 natural compounds was screened and refined based on the machine learning and molecular docking. The performance of six machine learning (ML) methods of predicting active coronavirus inhibitors had achieved satisfactory accuracy, especially, the AUC (Area Under ROC Curve) scores with fivefold cross-validation of Logistic Regression (LR) reached up to 0.976. Comprehensive ML prediction and molecular docking results accounted for the compound Rutin, which was approved by NMPA (National Medical Products Administration), exhibited the best AUC and the most promising binding affinity compared to other compounds. Therefore, Rutin might be a promising agent in anti-COVID-19 drugs development.
2019新型冠状病毒(COVID-19)的爆发对公众健康造成了严重威胁。迫切需要发现新的抗COVID-19药物。幸运的是,COVID-19 3CL蛋白酶的晶体结构最近得到了解析。在这场危机中,该蛋白酶已被确定为药物研发的一个有前景的靶点。在此,基于机器学习和分子对接筛选并优化了一个包含2030种天然化合物的数据集。六种预测活性冠状病毒抑制剂的机器学习(ML)方法的性能达到了令人满意的准确率,特别是,逻辑回归(LR)五重交叉验证的AUC(ROC曲线下面积)得分高达0.976。综合的ML预测和分子对接结果表明,与其他化合物相比,经国家药品监督管理局(NMPA)批准的化合物芦丁表现出最佳的AUC和最有前景的结合亲和力。因此,芦丁可能是抗COVID-19药物研发中有前景的药物。