State Key Laboratory of Primate Biomedical Research, Institute of Primate Translational Medicine, Kunming University of Science and Technology, Kunming 650500, China.
Yunnan Key Laboratory of Primate Biomedical Research, Kunming 650500, China.
Molecules. 2022 Dec 26;28(1):208. doi: 10.3390/molecules28010208.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative pathogen of COVID-19, is spreading rapidly and has caused hundreds of millions of infections and millions of deaths worldwide. Due to the lack of specific vaccines and effective treatments for COVID-19, there is an urgent need to identify effective drugs. Traditional Chinese medicine (TCM) is a valuable resource for identifying novel anti-SARS-CoV-2 drugs based on the important contribution of TCM and its potential benefits in COVID-19 treatment. Herein, we aimed to discover novel anti-SARS-CoV-2 compounds and medicinal plants from TCM by establishing a prediction method of anti-SARS-CoV-2 activity using machine learning methods. We first constructed a benchmark dataset from anti-SARS-CoV-2 bioactivity data collected from the ChEMBL database. Then, we established random forest (RF) and support vector machine (SVM) models that both achieved satisfactory predictive performance with AUC values of 0.90. By using this method, a total of 1011 active anti-SARS-CoV-2 compounds were predicted from the TCMSP database. Among these compounds, six compounds with highly potent activity were confirmed in the anti-SARS-CoV-2 experiments. The molecular fingerprint similarity analysis revealed that only 24 of the 1011 compounds have high similarity to the FDA-approved antiviral drugs, indicating that most of the compounds were structurally novel. Based on the predicted anti-SARS-CoV-2 compounds, we identified 74 anti-SARS-CoV-2 medicinal plants through enrichment analysis. The 74 plants are widely distributed in 68 genera and 43 families, 14 of which belong to antipyretic detoxicate plants. In summary, this study provided several medicinal plants with potential anti-SARS-CoV-2 activity, which offer an attractive starting point and a broader scope to mine for potentially novel anti-SARS-CoV-2 drugs.
严重急性呼吸系统综合症冠状病毒 2(SARS-CoV-2)是 COVID-19 的病原体,正在全球迅速传播,并导致数亿感染和数百万人死亡。由于 COVID-19 缺乏特定的疫苗和有效治疗方法,因此迫切需要确定有效的药物。中医药(TCM)是基于 TCM 的重要贡献及其在 COVID-19 治疗中的潜在益处,寻找新型抗 SARS-CoV-2 药物的宝贵资源。在此,我们旨在通过建立使用机器学习方法预测抗 SARS-CoV-2 活性的方法,从 TCM 中发现新型抗 SARS-CoV-2 化合物和药用植物。我们首先从 ChEMBL 数据库中收集的抗 SARS-CoV-2 生物活性数据构建基准数据集。然后,我们建立了随机森林(RF)和支持向量机(SVM)模型,这两个模型都取得了令人满意的预测性能,AUC 值为 0.90。通过这种方法,从 TCMSP 数据库中总共预测到 1011 种活性抗 SARS-CoV-2 化合物。在这些化合物中,有 6 种化合物在抗 SARS-CoV-2 实验中被证实具有高活性。分子指纹相似性分析表明,在 1011 种化合物中,只有 24 种与 FDA 批准的抗病毒药物具有高相似性,这表明大多数化合物在结构上是新颖的。基于预测的抗 SARS-CoV-2 化合物,我们通过富集分析确定了 74 种抗 SARS-CoV-2 药用植物。这 74 种植物广泛分布在 68 属和 43 科中,其中 14 种属于解热解毒植物。总之,本研究提供了几种具有潜在抗 SARS-CoV-2 活性的药用植物,为寻找潜在的新型抗 SARS-CoV-2 药物提供了一个有吸引力的起点和更广泛的范围。