• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测《京都基因与基因组百科全书》中定义的所有通路及相关化合物条目的通路参与情况。

Predicting the Pathway Involvement of All Pathway and Associated Compound Entries Defined in the Kyoto Encyclopedia of Genes and Genomes.

作者信息

Huckvale Erik D, Moseley Hunter N B

机构信息

Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA.

Superfund Research Center, University of Kentucky, Lexington, KY 40536, USA.

出版信息

Metabolites. 2024 Oct 27;14(11):582. doi: 10.3390/metabo14110582.

DOI:10.3390/metabo14110582
PMID:39590818
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11596622/
Abstract

: Predicting the biochemical pathway involvement of a compound could facilitate the interpretation of biological and biomedical research. Prior prediction approaches have largely focused on metabolism, training machine learning models to solely predict based on metabolic pathways. However, there are many other types of pathways in cells and organisms that are of interest to biologists. : While several publications have made use of the metabolites and metabolic pathways available in the Kyoto Encyclopedia of Genes and Genomes (KEGG), we downloaded all the compound entries with pathway annotations available in the KEGG. From these data, we constructed a dataset where each entry contained features representing compounds combined with features representing pathways, followed by a binary label indicating whether the given compound is associated with the given pathway. We trained multi-layer perceptron binary classifiers on variations of this dataset. : The models trained on 6485 KEGG compounds and 502 pathways scored an overall mean Matthews correlation coefficient (MCC) performance of 0.847, a median MCC of 0.848, and a standard deviation of 0.0098. : This performance on all 502 KEGG pathways represents a roughly 6% improvement over the performance of models trained on only the 184 KEGG metabolic pathways, which had a mean MCC of 0.800 and a standard deviation of 0.021. These results demonstrate the capability to effectively predict biochemical pathways in general, in addition to those specifically related to metabolism. Moreover, the improvement in the performance demonstrates additional transfer learning with the inclusion of non-metabolic pathways.

摘要

预测一种化合物参与的生化途径有助于解释生物学和生物医学研究。先前的预测方法主要集中在代谢方面,训练机器学习模型仅基于代谢途径进行预测。然而,细胞和生物体中还有许多其他类型的途径是生物学家感兴趣的。虽然有几篇出版物利用了京都基因与基因组百科全书(KEGG)中可用的代谢物和代谢途径,但我们下载了KEGG中所有带有途径注释的化合物条目。从这些数据中,我们构建了一个数据集,其中每个条目都包含代表化合物的特征与代表途径的特征,随后是一个二元标签,表明给定化合物是否与给定途径相关。我们在这个数据集的变体上训练了多层感知器二元分类器。在6485种KEGG化合物和502条途径上训练的模型,总体平均马修斯相关系数(MCC)性能为0.847,中位数MCC为0.848,标准差为0.0098。在所有502条KEGG途径上的这种性能比仅在184条KEGG代谢途径上训练的模型性能提高了约6%,后者的平均MCC为0.800,标准差为0.021。这些结果表明,除了与代谢特别相关的途径外,总体上有能力有效预测生化途径。此外,性能的提高表明,纳入非代谢途径可实现额外的迁移学习。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/54a5bbcb8ae7/metabolites-14-00582-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/0657e0e3002d/metabolites-14-00582-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/cb95edc52e11/metabolites-14-00582-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/ff8af68d6674/metabolites-14-00582-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/41f188fa719c/metabolites-14-00582-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/54a5bbcb8ae7/metabolites-14-00582-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/0657e0e3002d/metabolites-14-00582-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/cb95edc52e11/metabolites-14-00582-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/ff8af68d6674/metabolites-14-00582-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/41f188fa719c/metabolites-14-00582-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e1c/11596622/54a5bbcb8ae7/metabolites-14-00582-g005.jpg

相似文献

1
Predicting the Pathway Involvement of All Pathway and Associated Compound Entries Defined in the Kyoto Encyclopedia of Genes and Genomes.预测《京都基因与基因组百科全书》中定义的所有通路及相关化合物条目的通路参与情况。
Metabolites. 2024 Oct 27;14(11):582. doi: 10.3390/metabo14110582.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Fabricating mice and dementia: opening up relations in multi-species research制造小鼠与痴呆症:开启多物种研究中的关联
4
AI-based Hepatic Steatosis Detection and Integrated Hepatic Assessment from Cardiac CT Attenuation Scans Enhances All-cause Mortality Risk Stratification: A Multi-center Study.基于人工智能的心脏CT衰减扫描检测肝脂肪变性及综合肝脏评估可增强全因死亡风险分层:一项多中心研究
medRxiv. 2025 Jun 11:2025.06.09.25329157. doi: 10.1101/2025.06.09.25329157.
5
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
6
Nutritional interventions for survivors of childhood cancer.儿童癌症幸存者的营养干预措施。
Cochrane Database Syst Rev. 2016 Aug 22;2016(8):CD009678. doi: 10.1002/14651858.CD009678.pub2.
7
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
8
Audit and feedback: effects on professional practice.审核与反馈:对专业实践的影响
Cochrane Database Syst Rev. 2025 Mar 25;3(3):CD000259. doi: 10.1002/14651858.CD000259.pub4.
9
Short-Term Memory Impairment短期记忆障碍
10
A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。
Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.

引用本文的文献

1
Chemical representation standardization needed to generalize metabolic pathway involvement prediction across the Kyoto Encyclopedia of Genes and Genomes, Reactome, and MetaCyc knowledgebases.需要进行化学表示标准化,以便在《京都基因与基因组百科全书》、Reactome和MetaCyc知识库中推广代谢途径参与预测。
bioRxiv. 2025 Apr 8:2025.04.02.646918. doi: 10.1101/2025.04.02.646918.
2
Chloroplast Functionality at the Interface of Growth, Defense, and Genetic Innovation: A Multi-Omics and Technological Perspective.生长、防御与基因创新界面的叶绿体功能:多组学与技术视角
Plants (Basel). 2025 Mar 20;14(6):978. doi: 10.3390/plants14060978.
3

本文引用的文献

1
Predicting the Association of Metabolites with Both Pathway Categories and Individual Pathways.预测代谢物与通路类别及单个通路之间的关联。
Metabolites. 2024 Sep 21;14(9):510. doi: 10.3390/metabo14090510.
2
Predicting the Pathway Involvement of Metabolites Based on Combined Metabolite and Pathway Features.基于代谢物和通路特征组合预测代谢物的通路参与情况
Metabolites. 2024 May 7;14(5):266. doi: 10.3390/metabo14050266.
3
A cautionary tale about properly vetting datasets used in supervised learning predicting metabolic pathway involvement.
Predicting the Pathway Involvement of Compounds Annotated in the Reactome Knowledgebase.
预测Reactome知识库中注释化合物的通路参与情况。
Metabolites. 2025 Mar 1;15(3):161. doi: 10.3390/metabo15030161.
一个关于在监督学习中使用预测代谢途径参与的数据集进行适当验证的警示故事。
PLoS One. 2024 May 2;19(5):e0299583. doi: 10.1371/journal.pone.0299583. eCollection 2024.
4
md_harmonize: A Python Package for Atom-Level Harmonization of Public Metabolic Databases.md_harmonize:一个用于公共代谢数据库原子级协调的Python包。
Metabolites. 2023 Dec 17;13(12):1199. doi: 10.3390/metabo13121199.
5
Benchmark Dataset for Training Machine Learning Models to Predict the Pathway Involvement of Metabolites.用于训练机器学习模型以预测代谢物途径参与情况的基准数据集。
Metabolites. 2023 Nov 1;13(11):1120. doi: 10.3390/metabo13111120.
6
The Reactome Pathway Knowledgebase 2024.Reactome 通路知识库 2024.
Nucleic Acids Res. 2024 Jan 5;52(D1):D672-D678. doi: 10.1093/nar/gkad1025.
7
kegg_pull: a software package for the RESTful access and pulling from the Kyoto Encyclopedia of Gene and Genomes.KEGG_PULL:一个用于通过 RESTful 访问和从京都基因与基因组百科全书(KEGG)中提取数据的软件包。
BMC Bioinformatics. 2023 Mar 4;24(1):78. doi: 10.1186/s12859-023-05208-0.
8
KEGG for taxonomy-based analysis of pathways and genomes.KEGG 用于基于分类的途径和基因组分析。
Nucleic Acids Res. 2023 Jan 6;51(D1):D587-D592. doi: 10.1093/nar/gkac963.
9
MLGL-MP: a Multi-Label Graph Learning framework enhanced by pathway interdependence for Metabolic Pathway prediction.MLGL-MP:一种通过途径相互依赖性增强的多标签图学习框架,用于代谢途径预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i325-i332. doi: 10.1093/bioinformatics/btac222.
10
Array programming with NumPy.使用 NumPy 进行数组编程。
Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.