• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用公开可用的Reactome数据,通过监督式机器学习重新设计植物特殊代谢。

Redesigning plant specialized metabolism with supervised machine learning using publicly available reactome data.

作者信息

Lim Peng Ken, Julca Irene, Mutwil Marek

机构信息

School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.

出版信息

Comput Struct Biotechnol J. 2023 Jan 18;21:1639-1650. doi: 10.1016/j.csbj.2023.01.013. eCollection 2023.

DOI:10.1016/j.csbj.2023.01.013
PMID:36874159
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9976193/
Abstract

The immense structural diversity of products and intermediates of plant specialized metabolism (specialized metabolites) makes them rich sources of therapeutic medicine, nutrients, and other useful materials. With the rapid accumulation of reactome data that can be accessible on biological and chemical databases, along with recent advances in machine learning, this review sets out to outline how supervised machine learning can be used to design new compounds and pathways by exploiting the wealth of said data. We will first examine the various sources from which reactome data can be obtained, followed by explaining the different machine learning encoding methods for reactome data. We then discuss current supervised machine learning developments that can be employed in various aspects to help redesign plant specialized metabolism.

摘要

植物特殊代谢(特殊代谢产物)的产物和中间体具有巨大的结构多样性,使其成为治疗药物、营养物质和其他有用材料的丰富来源。随着生物和化学数据库中可获取的反应组数据的快速积累,以及机器学习的最新进展,本综述旨在概述如何通过利用这些丰富的数据,使用监督机器学习来设计新的化合物和途径。我们将首先研究可获取反应组数据的各种来源,然后解释反应组数据的不同机器学习编码方法。接着,我们将讨论当前可用于各个方面以帮助重新设计植物特殊代谢的监督机器学习进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb26/9976193/b06400f5a33f/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb26/9976193/4dc4e51876ec/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb26/9976193/b06400f5a33f/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb26/9976193/4dc4e51876ec/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb26/9976193/b06400f5a33f/gr2.jpg

相似文献

1
Redesigning plant specialized metabolism with supervised machine learning using publicly available reactome data.利用公开可用的Reactome数据,通过监督式机器学习重新设计植物特殊代谢。
Comput Struct Biotechnol J. 2023 Jan 18;21:1639-1650. doi: 10.1016/j.csbj.2023.01.013. eCollection 2023.
2
Predicting instances of pathway ontology classes for pathway integration.预测用于通路整合的通路本体类的实例。
J Biomed Semantics. 2019 Jun 13;10(1):11. doi: 10.1186/s13326-019-0202-8.
3
Interleukins and their signaling pathways in the Reactome biological pathway database.白细胞介素及其信号通路在 Reactome 生物途径数据库中。
J Allergy Clin Immunol. 2018 Apr;141(4):1411-1416. doi: 10.1016/j.jaci.2017.12.992. Epub 2018 Feb 21.
4
Plant Reactome and PubChem: The Plant Pathway and (Bio)Chemical Entity Knowledgebases.植物反应组数据库和化学生物信息数据库:植物通路及(生物)化学实体知识库
Methods Mol Biol. 2022;2443:511-525. doi: 10.1007/978-1-0716-2067-0_27.
5
Comprehensive study of semi-supervised learning for DNA methylation-based supervised classification of central nervous system tumors.基于 DNA 甲基化的中枢神经系统肿瘤有监督分类的半监督学习综合研究。
BMC Bioinformatics. 2022 Jun 8;23(1):223. doi: 10.1186/s12859-022-04764-1.
6
Machine Learning upon RDF Knowledge Graphs for Drug Safety: A Case Study on Reactome Data.基于 RDF 知识图谱的药物安全机器学习:以 Reactome 数据为例的研究
Stud Health Technol Inform. 2024 Aug 22;316:873-874. doi: 10.3233/SHTI240550.
7
Reactome: a knowledgebase of biological pathways.Reactome:生物通路知识库。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D428-32. doi: 10.1093/nar/gki072.
8
Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology.无监督和半监督学习:植物系统生物学机器学习的下一个前沿。
Plant J. 2022 Sep;111(6):1527-1538. doi: 10.1111/tpj.15905. Epub 2022 Jul 27.
9
Machine Learning Using Neural Networks for Metabolomic Pathway Analyses.基于神经网络的代谢组学通路分析的机器学习方法
Methods Mol Biol. 2023;2553:395-415. doi: 10.1007/978-1-0716-2617-7_17.
10
Unsupervised and self-supervised deep learning approaches for biomedical text mining.无监督和自监督深度学习方法在生物医学文本挖掘中的应用。
Brief Bioinform. 2021 Mar 22;22(2):1592-1603. doi: 10.1093/bib/bbab016.

引用本文的文献

1
Evolution of plant metabolism: the state-of-the-art.植物代谢的进化:现状。
Philos Trans R Soc Lond B Biol Sci. 2024 Nov 18;379(1914):20230347. doi: 10.1098/rstb.2023.0347. Epub 2024 Sep 30.

本文引用的文献

1
Comparison of Descriptor- and Fingerprint Sets in Machine Learning Models for ADME-Tox Targets.用于ADME-Tox靶点的机器学习模型中描述符集和指纹集的比较
Front Chem. 2022 Jun 8;10:852893. doi: 10.3389/fchem.2022.852893. eCollection 2022.
2
Deep learning driven biosynthetic pathways navigation for natural products with BioNavi-NP.深度学习驱动的生物合成途径导航用于天然产物的 BioNavi-NP。
Nat Commun. 2022 Jun 10;13(1):3342. doi: 10.1038/s41467-022-30970-9.
3
Protein language-model embeddings for fast, accurate, and alignment-free protein structure prediction.
基于蛋白质语言模型的嵌入来实现快速、准确且无需对齐的蛋白质结构预测。
Structure. 2022 Aug 4;30(8):1169-1177.e4. doi: 10.1016/j.str.2022.05.001. Epub 2022 May 23.
4
An analysis of protein language model embeddings for fold prediction.蛋白质语言模型嵌入物折叠预测分析。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac142.
5
Expanding biochemical knowledge and illuminating metabolic dark matter with ATLASx.利用 ATLASx 扩展生化知识并阐明代谢暗物质。
Nat Commun. 2022 Mar 23;13(1):1560. doi: 10.1038/s41467-022-29238-z.
6
MolData, a molecular benchmark for disease and target based machine learning.MolData,一种基于疾病和靶点的机器学习分子基准。
J Cheminform. 2022 Mar 7;14(1):10. doi: 10.1186/s13321-022-00590-y.
7
Breaking away from labels: The promise of self-supervised machine learning in intelligent health.摆脱标签:自监督机器学习在智能健康领域的前景。
Patterns (N Y). 2022 Feb 11;3(2):100410. doi: 10.1016/j.patter.2021.100410.
8
Biocatalysed synthesis planning using data-driven learning.基于数据驱动学习的生物催化合成规划。
Nat Commun. 2022 Feb 18;13(1):964. doi: 10.1038/s41467-022-28536-w.
9
Machine learning modeling of family wide enzyme-substrate specificity screens.基于机器学习的全家族酶底物特异性筛选模型。
PLoS Comput Biol. 2022 Feb 10;18(2):e1009853. doi: 10.1371/journal.pcbi.1009853. eCollection 2022 Feb.
10
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.