• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测天然产物陆生和海洋来源的机器学习方法

Machine Learning Methods to Predict the Terrestrial and Marine Origin of Natural Products.

作者信息

Pereira Florbela

机构信息

LAQV and REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, 2829-516, Caparica, Portugal.

出版信息

Mol Inform. 2021 Jun;40(6):e2060034. doi: 10.1002/minf.202060034. Epub 2021 Mar 30.

DOI:10.1002/minf.202060034
PMID:33787065
Abstract

In recent years there has been a growing interest in studying the differences between the chemical and biological space represented by natural products (NPs) of terrestrial and marine origin. In order to learn more about these two chemical spaces, marine natural products (MNPs) and terrestrial natural products (TNPs), a machine learning (ML) approach was developed in the current work to predict three classes, MNPs, TNPs and a third class of NPs that appear in both the terrestrial and marine environments. In total 22,398 NPs were retrieved from the Reaxys® database, from those 10,790 molecules are recorded as MNPs, 10,857 as TNPs, and 761 NPs appear registered as both MNPs and TNPs. Several ML algorithms such as Random Forest, Support Vector Machines, and deep learning Multilayer Perceptron networks have been benchmarked. The best performance was achieved with a consensus classification model, which predicted the external test set with an overall predictive accuracy up to 81 %. As far as we know this approach has never been intended and therefore allow to be used to better understand the chemical space defined by MNPs, TNPs or both, but also in virtual screening to define the applicability domain of QSAR models of MNPs and TNPs.

摘要

近年来,人们对研究陆地和海洋来源的天然产物(NP)所代表的化学空间和生物空间之间的差异越来越感兴趣。为了更深入了解这两个化学空间,即海洋天然产物(MNP)和陆地天然产物(TNP),在当前工作中开发了一种机器学习(ML)方法,用于预测三类物质:MNP、TNP以及在陆地和海洋环境中均出现的第三类NP。总共从Reaxys®数据库中检索到22398种NP,其中10790个分子被记录为MNP,10857个为TNP,761种NP同时被记录为MNP和TNP。已经对几种ML算法进行了基准测试,如随机森林、支持向量机和深度学习多层感知器网络。通过共识分类模型获得了最佳性能,该模型对外部测试集的总体预测准确率高达81%。据我们所知,这种方法从未被尝试过,因此它不仅可以用于更好地理解由MNP、TNP或两者定义的化学空间,还可以用于虚拟筛选,以定义MNP和TNP的QSAR模型的适用范围。

相似文献

1
Machine Learning Methods to Predict the Terrestrial and Marine Origin of Natural Products.预测天然产物陆生和海洋来源的机器学习方法
Mol Inform. 2021 Jun;40(6):e2060034. doi: 10.1002/minf.202060034. Epub 2021 Mar 30.
2
Cheminformatic Insight into the Differences between Terrestrial and Marine Originated Natural Products.化学生态学视角下的陆地来源和海洋来源天然产物的差异。
J Chem Inf Model. 2018 Jun 25;58(6):1182-1193. doi: 10.1021/acs.jcim.8b00125. Epub 2018 Jun 7.
3
Predicting Antifouling Activity and Acetylcholinesterase Inhibition of Marine-Derived Compounds Using a Computer-Aided Drug Design Approach.运用计算机辅助药物设计方法预测海洋来源化合物的抗污活性和乙酰胆碱酯酶抑制活性。
Mar Drugs. 2022 Feb 8;20(2):129. doi: 10.3390/md20020129.
4
Chemical Space, Scaffolds, and Halogenated Compounds of CMNPD: A Comprehensive Chemoinformatic Analysis.CMNPD的化学空间、骨架与卤代化合物:全面的化学信息学分析
J Chem Inf Model. 2021 Jul 26;61(7):3323-3336. doi: 10.1021/acs.jcim.1c00162. Epub 2021 Jun 22.
5
Predictive classifier models built from natural products with antimalarial bioactivity using machine learning approach.基于具有抗疟生物活性的天然产物,采用机器学习方法构建的预测分类器模型。
PLoS One. 2018 Sep 28;13(9):e0204644. doi: 10.1371/journal.pone.0204644. eCollection 2018.
6
Large-scale comparison of machine learning algorithms for target prediction of natural products.大规模比较机器学习算法在天然产物靶标预测中的应用。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac359.
7
In Silico HCT116 Human Colon Cancer Cell-Based Models En Route to the Discovery of Lead-Like Anticancer Drugs.基于人结肠癌细胞的 HCT116 细胞模型的计算机辅助药物筛选——旨在发现类似先导的抗癌药物。
Biomolecules. 2018 Jul 17;8(3):56. doi: 10.3390/biom8030056.
8
A Computer-Driven Approach to Discover Natural Product Leads for Methicillin-Resistant Infection Therapy.计算机驱动的方法发现治疗耐甲氧西林金黄色葡萄球菌感染的天然产物先导化合物。
Mar Drugs. 2018 Dec 28;17(1):16. doi: 10.3390/md17010016.
9
Assigning the Origin of Microbial Natural Products by Chemical Space Map and Machine Learning.基于化学空间图谱和机器学习的微生物天然产物生源归属研究。
Biomolecules. 2020 Sep 28;10(10):1385. doi: 10.3390/biom10101385.
10
Computational Methodologies in the Exploration of Marine Natural Product Leads.海洋天然产物先导化合物探索中的计算方法学。
Mar Drugs. 2018 Jul 13;16(7):236. doi: 10.3390/md16070236.

引用本文的文献

1
A Chemoinformatics Investigation of Spectral and Quantum Chemistry Patterns for Discovering New Drug Leads from Natural Products Targeting the PD-1/PD-L1 Immune Checkpoint, with a Particular Focus on Naturally Occurring Marine Products.一项针对靶向PD-1/PD-L1免疫检查点的天然产物(特别关注天然海洋产物)发现新药先导物的光谱和量子化学模式的化学信息学研究。
Mar Drugs. 2025 Jun 10;23(6):247. doi: 10.3390/md23060247.
2
Advanced Methods for Natural Products Discovery: Bioactivity Screening, Dereplication, Metabolomics Profiling, Genomic Sequencing, Databases and Informatic Tools, and Structure Elucidation.天然产物发现的先进方法:生物活性筛选、去重复、代谢组学分析、基因组测序、数据库和信息工具以及结构解析。
Mar Drugs. 2023 May 19;21(5):308. doi: 10.3390/md21050308.
3
Classifying natural products from plants, fungi or bacteria using the COCONUT database and machine learning.使用COCONUT数据库和机器学习对来自植物、真菌或细菌的天然产物进行分类。
J Cheminform. 2021 Oct 18;13(1):82. doi: 10.1186/s13321-021-00559-3.