Kulyashov M A, Hamilton R, Afshin Y, Kolmykov S K, Sokolova T S, Khlebodarova T M, Kalyuzhnaya M G, Akberdin I R
Department of Computational Biology, Scientific Center for Genetics and Life Sciences, Sirius University of Science and Technology, Sochi, Russia.
Department of Biology and Viral Information Institute, San Diego State University, San Diego, California, USA.
mSystems. 2025 Jan 21;10(1):e0110524. doi: 10.1128/msystems.01105-24. Epub 2024 Dec 19.
Context-specific genome-scale model (CS-GSM) reconstruction is becoming an efficient strategy for integrating and cross-comparing experimental multi-scale data to explore the relationship between cellular genotypes, facilitating fundamental or applied research discoveries. However, the application of CS modeling for non-conventional microbes is still challenging. Here, we present a graphical user interface that integrates COBRApy, EscherPy, and RIPTiDe, Python-based tools within the BioUML platform, and streamlines the reconstruction and interrogation of the CS genome-scale metabolic frameworks via Jupyter Notebook. The approach was tested using -omics data collected for 20Z, a prominent microbial chassis for methane capturing and valorization. We optimized the previously reconstructed whole genome-scale metabolic network by adjusting the flux distribution using gene expression data. The outputs of the automatically reconstructed CS metabolic network were comparable to manually optimized IA409 models for Ca-growth conditions. However, the CS model questions the reversibility of the phosphoketolase pathway and suggests higher flux via primary oxidation pathways. The model also highlighted unresolved carbon partitioning between assimilatory and catabolic pathways at the formaldehyde-formate node. Only a very few genes and only one enzyme with a predicted function in C1 metabolism, a homolog of the formaldehyde oxidation enzyme (), showed a significant change in expression in La-growth conditions. The CS-GSM predictions agreed with the experimental measurements under the assumption that the Fae1-2 is a part of the tetrahydrofolate-linked pathway. The cellular roles of the tungsten (W)-dependent formate dehydrogenase () and homologs ( and ) were investigated via mutagenesis. The phenotype of the f mutant followed the model prediction. Furthermore, a more significant reduction of the biomass yield was observed during growth in La-supplemented media, confirming a higher flux through formate. 20Z mutants lacking did not display any significant defects in methane or methanol-dependent growth. However, contrary to the homolog failed to restore the formaldehyde-activating enzyme function in complementation tests. Overall, the presented data suggest that the developed computational workflow supports the reconstruction and validation of CS-GSM networks of non-model microbes.
The interrogation of various types of data is a routine strategy to explore the relationship between genotype and phenotype. An efficient approach for integrating and cross-comparing experimental multi-scale data in the context of whole-genome-based metabolic network reconstruction becomes a powerful tool that facilitates fundamental and applied research discoveries. The present study describes the reconstruction of a context-specific (CS) model for the methane-utilizing bacterium, 20Z. 20Z is becoming an attractive microbial platform for the production of biofuels, chemicals, pharmaceuticals, and bio-sorbents for capturing atmospheric methane. We demonstrate that this pipeline can help reconstruct metabolic models that are similar to manually curated networks. Furthermore, the model is able to highlight previously overlooked pathways, thus advancing fundamental knowledge of non-model microbial systems or promoting their development toward biotechnological or environmental implementations.
特定背景下的基因组规模模型(CS - GSM)重建正成为整合和交叉比较实验多尺度数据以探索细胞基因型之间关系的有效策略,有助于基础研究或应用研究的发现。然而,将CS建模应用于非常规微生物仍然具有挑战性。在这里,我们展示了一个图形用户界面,它在BioUML平台内集成了基于Python的工具COBRApy、EscherPy和RIPTiDe,并通过Jupyter Notebook简化了CS基因组规模代谢框架的重建和查询。该方法使用为20Z收集的组学数据进行了测试,20Z是用于甲烷捕获和增值的重要微生物底盘。我们通过使用基因表达数据调整通量分布,优化了先前重建的全基因组规模代谢网络。自动重建的CS代谢网络的输出与针对Ca生长条件手动优化的IA409模型相当。然而,CS模型对磷酸酮醇酶途径的可逆性提出了质疑,并表明通过初级氧化途径的通量更高。该模型还突出了在甲醛 - 甲酸节点处同化和分解代谢途径之间未解决的碳分配问题。在La生长条件下,只有极少数基因以及在C1代谢中具有预测功能的一种酶(甲醛氧化酶的同源物)的表达出现了显著变化。在假定Fae1 - 2是四氢叶酸连接途径的一部分的情况下,CS - GSM预测与实验测量结果一致。通过诱变研究了钨(W)依赖性甲酸脱氢酶()及其同源物(和)的细胞作用。f突变体的表型遵循模型预测。此外,在添加La的培养基中生长期间观察到生物量产量有更显著的降低,证实了通过甲酸的通量更高。缺乏的20Z突变体在甲烷或甲醇依赖性生长中未表现出任何显著缺陷。然而,与相反,该同源物在互补测试中未能恢复甲醛激活酶的功能。总体而言,所呈现的数据表明所开发的计算工作流程支持非模式微生物CS - GSM网络的重建和验证。
对各种类型数据的查询是探索基因型和表型之间关系的常规策略。在基于全基因组的代谢网络重建背景下,一种整合和交叉比较实验多尺度数据的有效方法成为促进基础研究和应用研究发现的强大工具。本研究描述了用于甲烷利用细菌20Z的特定背景(CS)模型的重建。20Z正成为用于生产生物燃料、化学品、药品以及用于捕获大气甲烷的生物吸附剂的有吸引力的微生物平台。我们证明该流程有助于重建与手动策划网络相似的代谢模型。此外,该模型能够突出以前被忽视的途径,从而推进对非模式微生物系统的基础知识研究,或促进它们向生物技术或环境应用方向的发展。