Sorbonne University, INSERM, NutriOmics F75013, France., 91, blvd. de l'Hôpital, 75013 Paris, France.
BMC Bioinformatics. 2019 Oct 15;20(1):499. doi: 10.1186/s12859-019-3112-y.
Metabolic networks reflect the relationships between metabolites (biomolecules) and the enzymes (proteins), and are of particular interest since they describe all chemical reactions of an organism. The metabolic networks are constructed from the genome sequence of an organism, and the graphs can be used to study fluxes through the reactions, or to relate the graph structure to environmental characteristics and phenotypes. About ten years ago, Takemoto et al. (2007) stated that the structure of prokaryotic metabolic networks represented as undirected graphs, is correlated to their living environment. Although metabolic networks are naturally directed graphs, they are still usually analysed as undirected graphs.
We implemented a pipeline to reconstruct metabolic networks from genome data and confirmed some of the results of Takemoto et al. (2007) with today data using up-to-date databases. However, Takemoto et al. (2007) used only a fraction of all available enzymes from the genome and taking into account all the enzymes we fail to reproduce the main results. Therefore, we introduce three robust measures on directed representations of graphs, which lead to similar results regardless of the method of network reconstruction. We show that the size of the largest strongly connected component, the flow hierarchy and the Laplacian spectrum are strongly correlated to the environmental conditions.
We found a significant negative correlation between the size of the largest strongly connected component (a cycle) and the optimal growth temperature of the considered prokaryotes. This relationship holds true for the spectrum, high temperature being associated with lower eigenvalues. The hierarchy flow shows a negative correlation with optimal growth temperature. This suggests that the dynamical properties of the network are dependant on environmental factors.
代谢网络反映了代谢物(生物分子)和酶(蛋白质)之间的关系,由于它们描述了生物体的所有化学反应,因此特别有趣。代谢网络是根据生物体的基因组序列构建的,这些图可用于研究反应中的通量,或将图结构与环境特征和表型联系起来。大约十年前,Takemoto 等人(2007 年)指出,作为无向图表示的原核代谢网络的结构与其生活环境有关。尽管代谢网络是自然定向的图,但它们通常仍被分析为无向图。
我们实现了一个从基因组数据重建代谢网络的管道,并使用最新的数据库,用今天的数据证实了 Takemoto 等人(2007 年)的一些结果。然而,Takemoto 等人(2007 年)仅使用了基因组中所有可用酶的一小部分,并且考虑到所有的酶,我们无法重现主要结果。因此,我们引入了三个关于图的有向表示的稳健度量标准,这些标准无论网络重建方法如何,都能得到相似的结果。我们表明,最大强连通分量的大小、流层次结构和拉普拉斯谱与环境条件强烈相关。
我们发现最大强连通分量(一个循环)的大小与所考虑的原核生物的最佳生长温度之间存在显著的负相关关系。这种关系适用于谱,高温与较低的特征值相关。层次流与最佳生长温度呈负相关。这表明网络的动态特性取决于环境因素。