Laboratory of Macromolecular Engineering, Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Universitas Gadjah Mada Sekip Utara II, 55281, Yogyakarta, Indonesia.
Master Student of Pharmaceutical Sciences, Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Hacettepe University, Ankara, 06100, Turkey.
Daru. 2024 Jun;32(1):47-65. doi: 10.1007/s40199-023-00484-w. Epub 2023 Oct 31.
COVID-19 is an infectious disease caused by SARS-CoV-2, a close relative of SARS-CoV. Several studies have searched for COVID-19 therapies. The topics of these works ranged from vaccine discovery to natural products targeting the SARS-CoV-2 main protease (M), a potential therapeutic target due to its essential role in replication and conserved sequences. However, published research on this target is limited, presenting an opportunity for drug discovery and development.
This study aims to repurpose 10692 drugs in DrugBank by using ligand-based virtual screening (LBVS) machine learning (ML) with Konstanz Information Miner (KNIME) to seek potential therapeutics based on M inhibitors. The top candidate compounds, the native ligand (GC-376) of the M inhibitor, and the positive control boceprevir were then subjected to absorption, distribution, metabolism, excretion, and toxicity (ADMET) characterization, drug-likeness prediction, and molecular docking (MD). Protein-protein interaction (PPI) network analysis was added to provide accurate information about the M regulatory network.
This study identified 3,166 compound candidates inhibiting M. The random forest (RF) molecular access system ML model provided the highest confidence score of 0.95 (bromo-7-nitroindazole) and identified the top 22 candidate compounds. Subjecting the 22 candidate compounds, the native ligand GC-376, and boceprevir to further ADMET property characterization and drug-likeness predictions revealed that one compound had two violations of Lipinski's rule. Additional MD results showed that only five compounds had more negative binding energies than the native ligand (- 12.25 kcal/mol). Among these compounds, CCX-140 exhibited the lowest score of - 13.64 kcal/mol. Through literature analysis, six compound classes with potential activity for M were discovered. They included benzopyrazole, azole, pyrazolopyrimidine, carboxylic acids and derivatives, benzene and substituted derivatives, and diazine. Four pathologies were also discovered on the basis of the M PPI network.
Results demonstrated the efficiency of LBVS combined with MD. This combined strategy provided positive evidence showing that the top screened drugs, including CCX-140, which had the lowest MD score, can be reasonably advanced to the in vitro phase. This combined method may accelerate the discovery of therapies for novel or orphan diseases from existing drugs.
COVID-19 是一种由 SARS-CoV-2 引起的传染病,SARS-CoV-2 是 SARS-CoV 的近亲。已有多项研究针对 COVID-19 疗法展开搜索。这些研究的主题从疫苗发现到针对 SARS-CoV-2 主蛋白酶 (M) 的天然产物,M 是一种潜在的治疗靶点,因为它在复制和保守序列中发挥着重要作用。然而,关于该靶点的已发表研究有限,为药物发现和开发提供了机会。
本研究旨在使用基于配体的虚拟筛选 (LBVS) 机器学习 (ML) 与 Konstanz Information Miner (KNIME),通过对 DrugBank 中的 10692 种药物进行再利用,寻找基于 M 抑制剂的潜在治疗方法。然后对排名最高的候选化合物、M 抑制剂的天然配体 (GC-376) 和阳性对照药 boceprevir 进行吸收、分布、代谢、排泄和毒性 (ADMET) 特征描述、药物相似性预测和分子对接 (MD)。还添加了蛋白质-蛋白质相互作用 (PPI) 网络分析,以提供有关 M 调节网络的准确信息。
本研究鉴定出 3166 种抑制 M 的化合物候选物。随机森林 (RF) 分子访问系统 ML 模型提供了最高置信度 0.95 (溴-7-硝基吲唑) 和 22 个候选化合物。对 22 种候选化合物、天然配体 GC-376 和 boceprevir 进行进一步的 ADMET 特性描述和药物相似性预测,结果显示有 1 种化合物违反了 Lipinski 规则。此外的 MD 结果表明,只有 5 种化合物的结合能比天然配体 (-12.25 kcal/mol) 更负。在这些化合物中,CCX-140 的结合能最低,为-13.64 kcal/mol。通过文献分析,发现了 6 种具有 M 潜在活性的化合物类别,包括苯并吡唑、唑、吡唑并嘧啶、羧酸及其衍生物、苯及其取代衍生物和二嗪。还基于 M PPI 网络发现了 4 种病理学。
结果表明 LBVS 与 MD 相结合的效率。该联合策略提供了积极的证据,表明包括 CCX-140 在内的筛选出的前药具有最低的 MD 评分,可合理推进到体外阶段。这种联合方法可能会加速从现有药物中发现针对新型或孤儿疾病的疗法。