• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大规模比较机器学习算法在天然产物靶标预测中的应用。

Large-scale comparison of machine learning algorithms for target prediction of natural products.

机构信息

State Key Laboratory of Medicinal Chemical Biology, College of Pharmacy and Tianjin Key Laboratory of Molecular Drug Research, Nankai University, Haihe Education Park, 38 Tongyan Road, Tianjin 300353, China.

National Supercomputer Center in Tianjin, 10 Xinhuanxi Road, Tianjin Binhai New Area, Tianjin 300457, China.

出版信息

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac359.

DOI:10.1093/bib/bbac359
PMID:36007240
Abstract

Natural products (NPs) and their derivatives are important resources for drug discovery. There are many in silico target prediction methods that have been reported, however, very few of them distinguish NPs from synthetic molecules. Considering the fact that NPs and synthetic molecules are very different in many characteristics, it is necessary to build specific target prediction models of NPs. Therefore, we collected the activity data of NPs and their derivatives from the public databases and constructed four datasets, including the NP dataset, the NPs and its first-class derivatives dataset, the NPs and all its derivatives and the ChEMBL26 compounds dataset. Conditions, including activity thresholds and input features, were explored to access the performance of eight machine learning methods of target prediction of NPs, including support vector machines (SVM), extreme gradient boosting, random forests, K-nearest neighbor, naive Bayes, feedforward neural networks (FNN), convolutional neural networks and recurrent neural networks. As a result, the NPs and all their derivatives datasets were selected to build the best NP-specific models. Furthermore, the consensus models, as well as the voting models, were additionally applied to improve the prediction performance. More evaluations were made on the external validation set and the results demonstrated that (1) the NP-specific model performed better on the target prediction of NPs than the traditional models training on the whole compounds of ChEMBL26. (2) The consensus model of FNN + SVM possessed the best overall performance, and the voting model can significantly improve recall and specificity.

摘要

天然产物(NPs)及其衍生物是药物发现的重要资源。已经有许多基于计算的靶点预测方法被报道,但很少有方法能够区分 NPs 与合成分子。考虑到 NPs 和合成分子在许多特性上非常不同,有必要建立专门针对 NPs 的靶点预测模型。因此,我们从公共数据库中收集了 NPs 及其衍生物的活性数据,并构建了四个数据集,包括 NP 数据集、NP 及其一级衍生物数据集、NP 及其所有衍生物数据集和 ChEMBL26 化合物数据集。我们探索了条件,包括活性阈值和输入特征,以评估八种机器学习方法对 NPs 靶点预测的性能,包括支持向量机(SVM)、极端梯度提升、随机森林、K-最近邻、朴素贝叶斯、前馈神经网络(FNN)、卷积神经网络和递归神经网络。结果表明,选择 NPs 和所有衍生物数据集来构建最佳的 NP 特异性模型。此外,还应用了共识模型和投票模型来提高预测性能。我们在外部验证集上进行了更多的评估,结果表明:(1)NP 特异性模型在 NP 的靶点预测上的性能优于基于 ChEMBL26 所有化合物的传统模型。(2)FNN+SVM 的共识模型具有最佳的整体性能,投票模型可以显著提高召回率和特异性。

相似文献

1
Large-scale comparison of machine learning algorithms for target prediction of natural products.大规模比较机器学习算法在天然产物靶标预测中的应用。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac359.
2
Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease.机器学习混合模型预测慢性肾脏病。
Comput Intell Neurosci. 2023 Mar 14;2023:9266889. doi: 10.1155/2023/9266889. eCollection 2023.
3
Bioactivity Comparison across Multiple Machine Learning Algorithms Using over 5000 Datasets for Drug Discovery.利用 5000 多个数据集进行药物发现的多种机器学习算法的生物活性比较。
Mol Pharm. 2021 Jan 4;18(1):403-415. doi: 10.1021/acs.molpharmaceut.0c01013. Epub 2020 Dec 16.
4
Development and validation of consensus machine learning-based models for the prediction of novel small molecules as potential anti-tubercular agents.开发和验证基于共识机器学习的模型,用于预测新型小分子作为潜在的抗结核药物。
Mol Divers. 2022 Jun;26(3):1345-1356. doi: 10.1007/s11030-021-10238-y. Epub 2021 Jun 10.
5
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
6
Machine learning prediction of nanoparticle in vitro toxicity: A comparative study of classifiers and ensemble-classifiers using the Copeland Index.基于Copeland 指数的分类器和集成分类器在纳米粒子体外毒性预测中的比较研究
Toxicol Lett. 2019 Sep 15;312:157-166. doi: 10.1016/j.toxlet.2019.05.016. Epub 2019 May 15.
7
Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets.我们是否需要不同的机器学习算法来进行定量构效关系建模?对 16 种机器学习算法在 14 个定量构效关系数据集上的综合评估。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa321.
8
A Review of Machine Learning Algorithms for Biomedical Applications.机器学习算法在生物医学应用中的综述。
Ann Biomed Eng. 2024 May;52(5):1159-1183. doi: 10.1007/s10439-024-03459-3. Epub 2024 Feb 21.
9
Machine learning in medicine: a practical introduction.医学中的机器学习:实用入门
BMC Med Res Methodol. 2019 Mar 19;19(1):64. doi: 10.1186/s12874-019-0681-4.
10
Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study.使用多预后指标领域评分、临床数据和机器学习提高老年住院患者 12 个月死亡率风险预测:前瞻性队列研究。
J Med Internet Res. 2021 Jun 21;23(6):e26139. doi: 10.2196/26139.

引用本文的文献

1
Advances and challenges in drug design against dental caries: Application of approaches.抗龋齿药物设计的进展与挑战:方法的应用
J Pharm Anal. 2025 Jun;15(6):101161. doi: 10.1016/j.jpha.2024.101161. Epub 2024 Dec 9.
2
Integrating multi-omics and machine learning for disease resistance prediction in legumes.整合多组学和机器学习用于豆类抗病性预测
Theor Appl Genet. 2025 Jun 27;138(7):163. doi: 10.1007/s00122-025-04948-2.
3
Multi-omic analysis tools for microbial metabolites prediction.微生物代谢物预测的多组学分析工具。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae264.
4
Non-Contrasted CT Radiomics for SAH Prognosis Prediction.用于蛛网膜下腔出血预后预测的非增强CT影像组学
Bioengineering (Basel). 2023 Aug 16;10(8):967. doi: 10.3390/bioengineering10080967.