Verwoerd Wynand S
Centre for Advanced Computational Solutions, Dept WF & Molecular Bioscience, Lincoln University, Ellesmere Junction Road, Christchurch, New Zealand.
BMC Syst Biol. 2011 Feb 7;5:25. doi: 10.1186/1752-0509-5-25.
Compared to more general networks, biochemical networks have some special features: while generally sparse, there are a small number of highly connected metabolite nodes; and metabolite nodes can also be divided into two classes: internal nodes with associated mass balance constraints and external ones without. Based on these features, reclassifying selected internal nodes (separators) to external ones can be used to divide a large complex metabolic network into simpler subnetworks. Selection of separators based on node connectivity is commonly used but affords little detailed control and tends to produce excessive fragmentation.The method proposed here (Netsplitter) allows the user to control separator selection. It combines local connection degree partitioning with global connectivity derived from random walks on the network, to produce a more even distribution of subnetwork sizes. Partitioning is performed progressively and the interactive visual matrix presentation used allows the user considerable control over the process, while incorporating special strategies to maintain the network integrity and minimise the information loss due to partitioning.
Partitioning of a genome scale network of 1348 metabolites and 1468 reactions for Arabidopsis thaliana encapsulates 66% of the network into 10 medium sized subnets. Applied to the flavonoid subnetwork extracted in this way, it is shown that Netsplitter separates this naturally into four subnets with recognisable functionality, namely synthesis of lignin precursors, flavonoids, coumarin and benzenoids. A quantitative quality measure called efficacy is constructed and shows that the new method gives improved partitioning for several metabolic networks, including bacterial, plant and mammal species.
For the examples studied the Netsplitter method is a considerable improvement on the performance of connection degree partitioning, giving a better balance of subnet sizes with the removal of fewer mass balance constraints. In addition, the user can interactively control which metabolite nodes are selected for cutting and when to stop further partitioning as the desired granularity has been reached. Finally, the blocking transformation at the heart of the procedure provides a powerful visual display of network structure that may be useful for its exploration independent of whether partitioning is required.
与更一般的网络相比,生化网络具有一些特殊特征:虽然通常较为稀疏,但存在少量高度连接的代谢物节点;代谢物节点还可分为两类:具有相关质量平衡约束的内部节点和无此约束的外部节点。基于这些特征,将选定的内部节点(分隔符)重新分类为外部节点可用于将大型复杂代谢网络划分为更简单的子网。基于节点连通性选择分隔符是常用方法,但控制细节不足,且往往会导致过度碎片化。本文提出的方法(Netsplitter)允许用户控制分隔符的选择。它将局部连接度划分与通过网络上随机游走得出的全局连通性相结合,以使子网大小分布更均匀。划分逐步进行,所使用的交互式可视化矩阵呈现方式使用户能够对该过程进行相当程度的控制,同时纳入特殊策略以维护网络完整性并最小化划分导致的信息损失。
对拟南芥包含1348个代谢物和1468个反应的基因组规模网络进行划分,将66%的网络封装到10个中等大小的子网中。应用于以此方式提取的类黄酮子网时,结果表明Netsplitter将其自然地分离为四个具有可识别功能的子网,即木质素前体合成、类黄酮合成、香豆素合成和苯类化合物合成。构建了一种名为功效的定量质量度量,结果表明该新方法对包括细菌、植物和哺乳动物物种在内的多个代谢网络给出了改进的划分。
对于所研究的示例,Netsplitter方法在连接度划分性能方面有显著改进,在去除较少质量平衡约束的情况下,子网大小实现了更好的平衡。此外,用户可以交互式控制选择哪些代谢物节点进行切割,以及在达到所需粒度时何时停止进一步划分。最后,该过程核心的阻塞变换提供了强大的网络结构可视化显示,这对于网络探索可能是有用的,无论是否需要进行划分。