利用机器学习和约束规划快速预测细菌异养通量组学

Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming.

作者信息

Wu Stephen Gang, Wang Yuxuan, Jiang Wu, Oyetunde Tolutola, Yao Ruilian, Zhang Xuehong, Shimizu Kazuyuki, Tang Yinjie J, Bao Forrest Sheng

机构信息

Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, St. Louis, Missouri, United States of America.

Department of Computer Science and Engineering, Ohio State University, Columbus, Ohio, United States of America.

出版信息

PLoS Comput Biol. 2016 Apr 19;12(4):e1004838. doi: 10.1371/journal.pcbi.1004838. eCollection 2016 Apr.

DOI:10.1371/journal.pcbi.1004838

PMID:27092947

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4836714/

Abstract

13C metabolic flux analysis (13C-MFA) has been widely used to measure in vivo enzyme reaction rates (i.e., metabolic flux) in microorganisms. Mining the relationship between environmental and genetic factors and metabolic fluxes hidden in existing fluxomic data will lead to predictive models that can significantly accelerate flux quantification. In this paper, we present a web-based platform MFlux (http://mflux.org) that predicts the bacterial central metabolism via machine learning, leveraging data from approximately 100 13C-MFA papers on heterotrophic bacterial metabolisms. Three machine learning methods, namely Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), and Decision Tree, were employed to study the sophisticated relationship between influential factors and metabolic fluxes. We performed a grid search of the best parameter set for each algorithm and verified their performance through 10-fold cross validations. SVM yields the highest accuracy among all three algorithms. Further, we employed quadratic programming to adjust flux profiles to satisfy stoichiometric constraints. Multiple case studies have shown that MFlux can reasonably predict fluxomes as a function of bacterial species, substrate types, growth rate, oxygen conditions, and cultivation methods. Due to the interest of studying model organism under particular carbon sources, bias of fluxome in the dataset may limit the applicability of machine learning models. This problem can be resolved after more papers on 13C-MFA are published for non-model species.

摘要

13C代谢通量分析（13C-MFA）已被广泛用于测量微生物体内的酶反应速率（即代谢通量）。挖掘环境和遗传因素与现有通量组学数据中隐藏的代谢通量之间的关系，将产生能够显著加速通量定量的预测模型。在本文中，我们展示了一个基于网络的平台MFlux（http://mflux.org），它通过机器学习预测细菌的中心代谢，利用来自约100篇关于异养细菌代谢的13C-MFA论文的数据。采用了三种机器学习方法，即支持向量机（SVM）、k近邻（k-NN）和决策树，来研究影响因素与代谢通量之间的复杂关系。我们对每种算法的最佳参数集进行了网格搜索，并通过10折交叉验证来验证它们的性能。在所有三种算法中，SVM的准确率最高。此外，我们采用二次规划来调整通量分布以满足化学计量约束。多个案例研究表明，MFlux可以根据细菌种类、底物类型、生长速率、氧气条件和培养方法合理地预测通量组。由于在特定碳源下研究模式生物的兴趣，数据集中通量组的偏差可能会限制机器学习模型的适用性。在发表更多关于非模式物种的13C-MFA论文后，这个问题可以得到解决。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f0a2/4836714/0809c625e2c2/pcbi.1004838.g001.jpg

相似文献

Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming.利用机器学习和约束规划快速预测细菌异养通量组学

PLoS Comput Biol. 2016 Apr 19;12(4):e1004838. doi: 10.1371/journal.pcbi.1004838. eCollection 2016 Apr.

WUFlux: an open-source platform for C metabolic flux analysis of bacterial metabolism.WUFlux：用于细菌代谢中碳代谢通量分析的开源平台。

BMC Bioinformatics. 2016 Nov 4;17(1):444. doi: 10.1186/s12859-016-1314-0.

Computational Framework for Machine-Learning-Enabled C Fluxomics.机器学习赋能 C 通量组学的计算框架。

ACS Synth Biol. 2022 Jan 21;11(1):103-115. doi: 10.1021/acssynbio.1c00189. Epub 2021 Oct 27.

A Method to Constrain Genome-Scale Models with 13C Labeling Data.一种利用¹³C标记数据约束基因组规模模型的方法。

PLoS Comput Biol. 2015 Sep 17;11(9):e1004363. doi: 10.1371/journal.pcbi.1004363. eCollection 2015 Sep.

p13CMFA: Parsimonious 13C metabolic flux analysis.p13CMFA：简约 13C 代谢通量分析。

PLoS Comput Biol. 2019 Sep 6;15(9):e1007310. doi: 10.1371/journal.pcbi.1007310. eCollection 2019 Sep.

Genome-Scale C Fluxomics Modeling for Metabolic Engineering of Saccharomyces cerevisiae.用于酿酒酵母代谢工程的基因组尺度碳通量组学建模

Methods Mol Biol. 2019;1859:317-345. doi: 10.1007/978-1-4939-8757-3_19.

C-Fingerprinting and Metabolic Flux Analysis of Bacterial Metabolisms.细菌代谢的C-指纹图谱与代谢通量分析

Methods Mol Biol. 2019;1927:215-230. doi: 10.1007/978-1-4939-9142-6_15.

From Escherichia coli mutant 13C labeling data to a core kinetic model: A kinetic model parameterization pipeline.从大肠杆菌突变体 13C 标记数据到核心动力学模型：一个动力学模型参数化管道。

PLoS Comput Biol. 2019 Sep 10;15(9):e1007319. doi: 10.1371/journal.pcbi.1007319. eCollection 2019 Sep.

SUMOFLUX: A Generalized Method for Targeted 13C Metabolic Flux Ratio Analysis.SUMOFLUX：一种用于靶向13C代谢通量比率分析的通用方法。

PLoS Comput Biol. 2016 Sep 14;12(9):e1005109. doi: 10.1371/journal.pcbi.1005109. eCollection 2016 Sep.

ScalaFlux: A scalable approach to quantify fluxes in metabolic subnetworks.ScalaFlux：一种可扩展的方法，用于量化代谢子网络中的通量。

PLoS Comput Biol. 2020 Apr 14;16(4):e1007799. doi: 10.1371/journal.pcbi.1007799. eCollection 2020 Apr.

引用本文的文献

Machine learning methods for predicting essential metabolic genes from Plasmodium falciparum genome-scale metabolic network.基于恶性疟原虫基因组规模代谢网络预测必需代谢基因的机器学习方法

PLoS One. 2024 Dec 23;19(12):e0315530. doi: 10.1371/journal.pone.0315530. eCollection 2024.

Recent advances in culture medium design for enhanced production of monoclonal antibodies in CHO cells: A comparative study of machine learning and systems biology approaches.用于提高中国仓鼠卵巢细胞中单克隆抗体产量的培养基设计的最新进展：机器学习与系统生物学方法的比较研究

Biotechnol Adv. 2025 Jan-Feb;78:108480. doi: 10.1016/j.biotechadv.2024.108480. Epub 2024 Nov 19.

Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges.合成生物学中的机器学习与深度学习：关键架构、应用及挑战

ACS Omega. 2024 Feb 19;9(9):9921-9945. doi: 10.1021/acsomega.3c05913. eCollection 2024 Mar 5.

Biotechnological production of omega-3 fatty acids: current status and future perspectives.ω-3脂肪酸的生物技术生产：现状与未来展望

Front Microbiol. 2023 Nov 7;14:1280296. doi: 10.3389/fmicb.2023.1280296. eCollection 2023.

Determination of Metabolic Fluxes by Deep Learning of Isotope Labeling Patterns.通过对同位素标记模式进行深度学习来确定代谢通量

bioRxiv. 2023 Nov 8:2023.11.06.565907. doi: 10.1101/2023.11.06.565907.

Predicting metabolic fluxes from omics data via machine learning: Moving from knowledge-driven towards data-driven approaches.通过机器学习从组学数据预测代谢通量：从知识驱动方法向数据驱动方法的转变。

Comput Struct Biotechnol J. 2023 Oct 5;21:4960-4973. doi: 10.1016/j.csbj.2023.10.002. eCollection 2023.

Recent advances in proteomics and metabolomics in plants.植物蛋白质组学和代谢组学的最新进展。

Mol Hortic. 2022 Jul 23;2(1):17. doi: 10.1186/s43897-022-00038-9.

FreeFlux: A Python Package for Time-Efficient Isotopically Nonstationary Metabolic Flux Analysis.FreeFlux：一个用于高效同位素非稳定代谢通量分析的 Python 包。

ACS Synth Biol. 2023 Sep 15;12(9):2707-2714. doi: 10.1021/acssynbio.3c00265. Epub 2023 Aug 10.

From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry.从基因型到表型：推断与食品工业相关的微生物特性的计算方法。

FEMS Microbiol Rev. 2023 Jul 5;47(4). doi: 10.1093/femsre/fuad030.

Synthetic Biology Meets Machine Learning.合成生物学与机器学习相遇。

Methods Mol Biol. 2023;2553:21-39. doi: 10.1007/978-1-0716-2617-7_2.

本文引用的文献

Fluxome study of Pseudomonas fluorescens reveals major reorganisation of carbon flux through central metabolic pathways in response to inactivation of the anti-sigma factor MucA.荧光假单胞菌的通量组研究揭示了碳通量通过中心代谢途径的重大重组，以响应抗σ因子MucA的失活。

BMC Syst Biol. 2015 Feb 18;9:6. doi: 10.1186/s12918-015-0148-0.

An ancient Chinese wisdom for metabolic engineering: Yin-Yang.中国古代关于代谢工程的智慧：阴阳。

Microb Cell Fact. 2015 Mar 20;14:39. doi: 10.1186/s12934-015-0219-3.

Integrated 13C-metabolic flux analysis of 14 parallel labeling experiments in Escherichia coli.大肠杆菌中14个平行标记实验的整合13C代谢通量分析

Metab Eng. 2015 Mar;28:151-158. doi: 10.1016/j.ymben.2015.01.001. Epub 2015 Jan 14.

CeCaFDB: a curated database for the documentation, visualization and comparative analysis of central carbon metabolic flux distributions explored by 13C-fluxomics.CeCaFDB：一个用于13C通量组学探索的中心碳代谢通量分布的记录、可视化和比较分析的精选数据库。

Nucleic Acids Res. 2015 Jan;43(Database issue):D549-57. doi: 10.1093/nar/gku1137. Epub 2014 Nov 11.

A de novo NADPH generation pathway for improving lysine production of Corynebacterium glutamicum by rational design of the coenzyme specificity of glyceraldehyde 3-phosphate dehydrogenase.通过合理设计3-磷酸甘油醛脱氢酶的辅酶特异性来改善谷氨酸棒杆菌赖氨酸生产的从头NADPH生成途径。

Metab Eng. 2014 Sep;25:30-7. doi: 10.1016/j.ymben.2014.06.005. Epub 2014 Jun 19.

Incomplete Wood-Ljungdahl pathway facilitates one-carbon metabolism in organohalide-respiring Dehalococcoides mccartyi.不完全伍德-吕格达姆途径促进有机卤化物呼吸的脱卤球菌中一碳代谢。

Proc Natl Acad Sci U S A. 2014 Apr 29;111(17):6419-24. doi: 10.1073/pnas.1321542111. Epub 2014 Apr 14.

Robustness and plasticity of metabolic pathway flux among uropathogenic isolates of Pseudomonas aeruginosa.铜绿假单胞菌尿路致病性分离株中代谢途径通量的稳健性和可塑性。

PLoS One. 2014 Apr 7;9(4):e88368. doi: 10.1371/journal.pone.0088368. eCollection 2014.

Transcriptional regulation is insufficient to explain substrate-induced flux changes in Bacillus subtilis.转录调控不足以解释枯草芽孢杆菌中底物诱导的通量变化。

Mol Syst Biol. 2013 Nov 26;9:709. doi: 10.1038/msb.2013.66.

Central metabolic responses to the overproduction of fatty acids in Escherichia coli based on 13C-metabolic flux analysis.基于 13C 代谢通量分析的大肠杆菌脂肪酸过度生产的中心代谢响应。

Biotechnol Bioeng. 2014 Mar;111(3):575-85. doi: 10.1002/bit.25124.

COMPLETE-MFA: complementary parallel labeling experiments technique for metabolic flux analysis.COMPLETE-MFA：代谢通量分析互补平行标记实验技术。

Metab Eng. 2013 Nov;20:49-55. doi: 10.1016/j.ymben.2013.08.006. Epub 2013 Sep 8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用机器学习和约束规划快速预测细菌异养通量组学

Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献