通过结合基于约束的建模和机器学习来准确预测体内蛋白质丰度。

Accurate prediction of in vivo protein abundances by coupling constraint-based modelling and machine learning.

机构信息

Department of Microbiology, Federal University of Viçosa, Viçosa, Minas Gerais, 36570900, Brazil.

Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, 14476, Germany; Systems Biology and Mathematical Modelling, Max Planck Institute of Molecular Plant Physiology, Potsdam, 14476, Germany.

出版信息

Metab Eng. 2023 Nov;80:184-192. doi: 10.1016/j.ymben.2023.09.014. Epub 2023 Oct 5.

DOI:10.1016/j.ymben.2023.09.014

PMID:37802292

Abstract

Quantification of how different environmental cues affect protein allocation can provide important insights for understanding cell physiology. While absolute quantification of proteins can be obtained by resource-intensive mass-spectrometry-based technologies, prediction of protein abundances offers another way to obtain insights into protein allocation. Here we present CAMEL, a framework that couples constraint-based modelling with machine learning to predict protein abundance for any environmental condition. This is achieved by building machine learning models that leverage static features, derived from protein sequences, and condition-dependent features predicted from protein-constrained metabolic models. Our findings demonstrate that CAMEL results in excellent prediction of protein allocation in E. coli (average Pearson correlation of at least 0.9), and moderate performance in S. cerevisiae (average Pearson correlation of at least 0.5). Therefore, CAMEL outperformed contending approaches without using molecular read-outs from unseen conditions and provides a valuable tool for using protein allocation in biotechnological applications.

摘要

量化不同环境线索如何影响蛋白质分配，可以为理解细胞生理学提供重要的见解。虽然基于质谱的资源密集型技术可以实现蛋白质的绝对定量，但预测蛋白质丰度为深入了解蛋白质分配提供了另一种方法。在这里，我们提出了 CAMEL，这是一种将约束建模与机器学习相结合的框架，可预测任何环境条件下的蛋白质丰度。这是通过构建机器学习模型来实现的，这些模型利用源自蛋白质序列的静态特征和从蛋白质约束代谢模型预测的条件相关特征。我们的研究结果表明，CAMEL 可以出色地预测大肠杆菌中的蛋白质分配（至少 0.9 的平均 Pearson 相关系数），在酿酒酵母中表现中等（至少 0.5 的平均 Pearson 相关系数）。因此，CAMEL 在不使用未见条件的分子读出值的情况下优于竞争方法，并为生物技术应用中的蛋白质分配提供了有价值的工具。

相似文献

Accurate prediction of in vivo protein abundances by coupling constraint-based modelling and machine learning.通过结合基于约束的建模和机器学习来准确预测体内蛋白质丰度。

Metab Eng. 2023 Nov;80:184-192. doi: 10.1016/j.ymben.2023.09.014. Epub 2023 Oct 5.

Protein Abundance Prediction Through Machine Learning Methods.通过机器学习方法进行蛋白质丰度预测

J Mol Biol. 2021 Nov 5;433(22):167267. doi: 10.1016/j.jmb.2021.167267. Epub 2021 Sep 23.

PARROT: Prediction of enzyme abundances using protein-constrained metabolic models.利用蛋白约束代谢模型预测酶丰度。

PLoS Comput Biol. 2023 Oct 19;19(10):e1011549. doi: 10.1371/journal.pcbi.1011549. eCollection 2023 Oct.

Energy metabolism controls phenotypes by protein efficiency and allocation.能量代谢通过蛋白质效率和分配来控制表型。

Proc Natl Acad Sci U S A. 2019 Aug 27;116(35):17592-17597. doi: 10.1073/pnas.1906569116. Epub 2019 Aug 12.

PERISCOPE-Opt: Machine learning-based prediction of optimal fermentation conditions and yields of recombinant periplasmic protein expressed in .潜望镜-Opt：基于机器学习预测在……中表达的重组周质蛋白的最佳发酵条件和产量。（你提供的原文似乎不完整，“expressed in”后面缺少具体内容）

Comput Struct Biotechnol J. 2022 Jun 3;20:2909-2920. doi: 10.1016/j.csbj.2022.06.006. eCollection 2022.

Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction.从蛋白质-蛋白质相互作用预测的角度探讨蛋白质序列中的噪声。

J Theor Biol. 2012 Dec 21;315:64-70. doi: 10.1016/j.jtbi.2012.09.007. Epub 2012 Sep 18.

Prediction and integration of metabolite-protein interactions with genome-scale metabolic models.基于基因组代谢模型预测和整合代谢物-蛋白质相互作用。

Metab Eng. 2024 Mar;82:216-224. doi: 10.1016/j.ymben.2024.02.008. Epub 2024 Feb 15.

Machine learning applied to enzyme turnover numbers reveals protein structural correlates and improves metabolic models.机器学习在酶周转率中的应用揭示了蛋白质结构相关性，并改进了代谢模型。

Nat Commun. 2018 Dec 7;9(1):5252. doi: 10.1038/s41467-018-07652-6.

Prediction of metabolic fluxes from gene expression data with Huber penalty convex optimization function.使用Huber罚函数凸优化函数从基因表达数据预测代谢通量

Mol Biosyst. 2017 May 2;13(5):901-909. doi: 10.1039/c6mb00811a.

Prediction of metabolite-protein interactions based on integration of machine learning and constraint-based modeling.基于机器学习与基于约束的建模整合的代谢物-蛋白质相互作用预测

Bioinform Adv. 2023 Jul 17;3(1):vbad098. doi: 10.1093/bioadv/vbad098. eCollection 2023.

引用本文的文献

PARROT: Prediction of enzyme abundances using protein-constrained metabolic models.利用蛋白约束代谢模型预测酶丰度。

PLoS Comput Biol. 2023 Oct 19;19(10):e1011549. doi: 10.1371/journal.pcbi.1011549. eCollection 2023 Oct.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过结合基于约束的建模和机器学习来准确预测体内蛋白质丰度。

Accurate prediction of in vivo protein abundances by coupling constraint-based modelling and machine learning.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献