• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用随机森林通过优化分子指纹增强丙型肝炎病毒NS3抑制剂分类

Enhancing HCV NS3 Inhibitor Classification with Optimized Molecular Fingerprints Using Random Forest.

作者信息

Atasever Sema

机构信息

Department of Computer Engineering, Faculty of Engineering and Architecture, Nevsehir Haci Bektas Veli University, 50300 Nevşehir, Turkey.

出版信息

Int J Mol Sci. 2025 Mar 17;26(6):2680. doi: 10.3390/ijms26062680.

DOI:10.3390/ijms26062680
PMID:40141322
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11943357/
Abstract

The classification of Hepatitis C virus (HCV) NS3 inhibitors is essential for identifying potential antiviral agents through computational methods. This study aims to develop an optimized machine learning (ML) model using random forest (RF) and molecular fingerprints to accurately classify HCV NS3 inhibitors. A dataset of 965 molecules was retrieved from the ChEMBL database, and 290 bioactive compounds were selected for model training. Twelve molecular fingerprint descriptors were tested, and the CDK graph-only fingerprint yielded the best performance. In addition to RF, performance comparisons of other classifiers such as instance-based k-nearest neighbor (IBk), logistic regression (LR), AdaBoost, and OneR were conducted using WEKA with various molecular fingerprint descriptors. The optimized RF model achieved an accuracy of 89.6552%, a mean absolute error (MAE) of 0.2114, a root mean square error (RMSE) of 0.3304, and a Matthews correlation coefficient (MCC) of 0.7950 on the test set. These results highlight the effectiveness of optimized molecular fingerprints in enhancing virtual screening (VS) for HCV inhibitors. This approach offers a data-driven method for drug discovery.

摘要

丙型肝炎病毒(HCV)NS3抑制剂的分类对于通过计算方法识别潜在抗病毒药物至关重要。本研究旨在使用随机森林(RF)和分子指纹开发一种优化的机器学习(ML)模型,以准确分类HCV NS3抑制剂。从ChEMBL数据库中检索了一个包含965个分子的数据集,并选择了290种生物活性化合物用于模型训练。测试了12种分子指纹描述符,其中CDK仅图形指纹表现最佳。除了RF之外,还使用WEKA和各种分子指纹描述符对其他分类器(如基于实例的k近邻(IBk)、逻辑回归(LR)、AdaBoost和OneR)进行了性能比较。优化后的RF模型在测试集上的准确率为89.6552%,平均绝对误差(MAE)为0.2114,均方根误差(RMSE)为0.3304,马修斯相关系数(MCC)为0.7950。这些结果突出了优化的分子指纹在增强HCV抑制剂虚拟筛选(VS)方面的有效性。这种方法为药物发现提供了一种数据驱动的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/ffdc5bfdaa33/ijms-26-02680-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/31c4ef7be883/ijms-26-02680-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/c0ddbba3a59c/ijms-26-02680-g002a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/860a0fb0790d/ijms-26-02680-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/00c0dc5e8f50/ijms-26-02680-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/ff4aa3fc804f/ijms-26-02680-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/ffdc5bfdaa33/ijms-26-02680-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/31c4ef7be883/ijms-26-02680-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/c0ddbba3a59c/ijms-26-02680-g002a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/860a0fb0790d/ijms-26-02680-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/00c0dc5e8f50/ijms-26-02680-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/ff4aa3fc804f/ijms-26-02680-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f855/11943357/ffdc5bfdaa33/ijms-26-02680-g006.jpg

相似文献

1
Enhancing HCV NS3 Inhibitor Classification with Optimized Molecular Fingerprints Using Random Forest.利用随机森林通过优化分子指纹增强丙型肝炎病毒NS3抑制剂分类
Int J Mol Sci. 2025 Mar 17;26(6):2680. doi: 10.3390/ijms26062680.
2
On the steroids extracted from soft corals against the NS3/4A protease of hepatitis C virus.关于从软珊瑚中提取的甾体对丙型肝炎病毒NS3/4A蛋白酶的作用。
J Mol Graph Model. 2025 May;136:108936. doi: 10.1016/j.jmgm.2024.108936. Epub 2024 Dec 27.
3
Pharmacophore-Assisted Covalent Docking Identifies a Potential Covalent Inhibitor for Drug-Resistant Genotype 3 Variants of Hepatitis C Viral NS3/4A Serine Protease.基于药效团的共价对接鉴定出一种潜在的针对丙型肝炎病毒 NS3/4A 丝氨酸蛋白酶耐药基因型 3 变异体的共价抑制剂。
Viruses. 2024 Aug 3;16(8):1250. doi: 10.3390/v16081250.
4
Classification models of HCV NS3 protease inhibitors based on support vector machine (SVM).基于支持向量机(SVM)的丙型肝炎病毒NS3蛋白酶抑制剂分类模型
Comb Chem High Throughput Screen. 2015;18(1):24-32. doi: 10.2174/1386207317666141120122554.
5
QSAR studies of the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by multiple linear regression (MLR) and support vector machine (SVM).通过多元线性回归(MLR)和支持向量机(SVM)对丙型肝炎病毒(HCV)NS3/4A蛋白酶抑制剂的生物活性进行定量构效关系(QSAR)研究。
Bioorg Med Chem Lett. 2017 Jul 1;27(13):2931-2938. doi: 10.1016/j.bmcl.2017.05.001. Epub 2017 May 3.
6
Predicting repurposed drugs targeting the NS3 protease of dengue virus using machine learning-based QSAR, molecular docking, and molecular dynamics simulations.基于机器学习的定量构效关系、分子对接和分子动力学模拟预测针对登革热病毒 NS3 蛋白酶的再利用药物。
SAR QSAR Environ Res. 2024 Aug;35(8):707-728. doi: 10.1080/1062936X.2024.2392677. Epub 2024 Aug 30.
7
A review of HCV protease inhibitors.丙型肝炎病毒蛋白酶抑制剂综述。
Curr Opin Investig Drugs. 2009 Aug;10(8):821-37.
8
Unraveling the mechanisms of Sofosbuvir resistance in HCV NS3/4A protease: Structural and molecular simulation-based insights.解析 HCV NS3/4A 蛋白酶中 Sofosbuvir 耐药的机制:结构和分子模拟的见解。
Int J Biol Macromol. 2024 May;267(Pt 2):131629. doi: 10.1016/j.ijbiomac.2024.131629. Epub 2024 Apr 16.
9
Identification of novel small molecule inhibitors against the NS3/4A protease of hepatitis C virus genotype 4a.鉴定新型小分子抑制剂抗丙型肝炎病毒 4a 基因型 NS3/4A 蛋白酶。
Curr Pharm Des. 2018;24(37):4484-4491. doi: 10.2174/1381612825666181203153835.
10
Identification of potential inhibitors for HCV NS3 genotype 4a by combining protein-ligand interaction fingerprint, 3D pharmacophore, docking, and dynamic simulation.通过组合蛋白-配体相互作用指纹、3D 药效团、对接和动态模拟鉴定 HCV NS3 基因型 4a 的潜在抑制剂。
J Biomol Struct Dyn. 2018 May;36(7):1713-1727. doi: 10.1080/07391102.2017.1332689. Epub 2017 Jun 7.

引用本文的文献

1
Drug repurposing targeting COVID-19 3CL protease using molecular docking and machine learning regression approaches.使用分子对接和机器学习回归方法针对新冠病毒3CL蛋白酶进行药物重新利用研究
Sci Rep. 2025 May 28;15(1):18722. doi: 10.1038/s41598-025-02773-7.

本文引用的文献

1
Hybrid Approach to Identifying Druglikeness Leading Compounds against COVID-19 3CL Protease.用于鉴定抗新型冠状病毒3CL蛋白酶类药物先导化合物的混合方法
Pharmaceuticals (Basel). 2022 Oct 28;15(11):1333. doi: 10.3390/ph15111333.
2
Targeting non-structural proteins of Hepatitis C virus for predicting repurposed drugs using QSAR and machine learning approaches.利用定量构效关系(QSAR)和机器学习方法,针对丙型肝炎病毒的非结构蛋白预测重新利用的药物。
Comput Struct Biotechnol J. 2022 Jun 30;20:3422-3438. doi: 10.1016/j.csbj.2022.06.060. eCollection 2022.
3
Privileged substructures for anti-sickling activity cheminformatic analysis.
抗镰状化活性的特权子结构 化学信息学分析
RSC Adv. 2018 Feb 7;8(11):5920-5935. doi: 10.1039/c7ra12079f. eCollection 2018 Feb 2.
4
In silico prediction of potential drug-induced nephrotoxicity with machine learning methods.基于机器学习方法的药物诱导肾毒性的计算机预测。
J Appl Toxicol. 2022 Oct;42(10):1639-1650. doi: 10.1002/jat.4331. Epub 2022 Apr 29.
5
StackHCV: a web-based integrative machine-learning framework for large-scale identification of hepatitis C virus NS5B inhibitors.StackHCV:一种基于网络的集成机器学习框架,用于大规模鉴定丙型肝炎病毒 NS5B 抑制剂。
J Comput Aided Mol Des. 2021 Oct;35(10):1037-1053. doi: 10.1007/s10822-021-00418-1. Epub 2021 Oct 8.
6
A review on machine learning approaches and trends in drug discovery.关于药物发现中机器学习方法与趋势的综述。
Comput Struct Biotechnol J. 2021 Aug 12;19:4538-4558. doi: 10.1016/j.csbj.2021.08.011. eCollection 2021.
7
Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets.基于 SVM 和最优描述符集预测药物性肝毒性。
Int J Mol Sci. 2021 Jul 28;22(15):8073. doi: 10.3390/ijms22158073.
8
Validating ADME QSAR Models Using Marketed Drugs.利用已上市药物验证 ADME QSAR 模型。
SLAS Discov. 2021 Dec;26(10):1326-1336. doi: 10.1177/24725552211017520. Epub 2021 Jun 26.
9
SYBA: Bayesian estimation of synthetic accessibility of organic compounds.SYBA:有机化合物合成可及性的贝叶斯估计
J Cheminform. 2020 May 20;12(1):35. doi: 10.1186/s13321-020-00439-2.
10
HCVpred: A web server for predicting the bioactivity of hepatitis C virus NS5B inhibitors.HCVpred:一个用于预测丙型肝炎病毒 NS5B 抑制剂生物活性的网络服务器。
J Comput Chem. 2020 Jul 30;41(20):1820-1834. doi: 10.1002/jcc.26223. Epub 2020 May 25.