Department of Plant Biology, University of Minnesota, Saint Paul, MN 55108, USA.
J Exp Bot. 2012 Mar;63(6):2353-62. doi: 10.1093/jxb/err407. Epub 2011 Dec 29.
The green picoalga Ostreococcus is emerging as a simple plant model organism, and two species, O. lucimarinus and O. tauri, have now been sequenced and annotated manually. To evaluate the completeness of the metabolic annotation of both species, metabolic networks of O. lucimarinus and O. tauri were reconstructed from the KEGG database, thermodynamically constrained, elementally balanced, and functionally evaluated. The draft networks contained extensive gaps and, in the case of O. tauri, no biomass components could be produced due to an incomplete Calvin cycle. To find and remove gaps from the networks, an extensive reference biochemical reaction database was assembled using a stepwise approach that minimized the inclusion of microbial reactions. Gaps were then removed from both Ostreococcus networks using two existing gap-filling methodologies. In the first method, a bottom-up approach, a minimal list of reactions was added to each model to enable the production of all metabolites included in our biomass equation. In the second method, a top-down approach, all reactions in the reference database were added to the target networks and subsequently trimmed away based on the sequence alignment scores of identified orthologues. Because current gap-filling methods do not produce unique solutions, a quality metric that includes a weighting for phylogenetic distance and sequence similarity was developed to distinguish between gap-filling results automatically. The draft O. lucimarinus and O. tauri networks required the addition of 56 and 70 reactions, respectively, in order to produce the same biomass precursor metabolites that were produced by our plant reference database.
绿球藻(Ostreococcus)正在成为一种简单的植物模式生物,目前已经对两个物种(O. lucimarinus 和 O. tauri)进行了测序和手动注释。为了评估这两个物种代谢注释的完整性,从 KEGG 数据库中重建了 O. lucimarinus 和 O. tauri 的代谢网络,并对其进行了热力学约束、元素平衡和功能评估。这些草案网络包含广泛的缺口,而且在 O. tauri 的情况下,由于不完全的卡尔文循环,无法产生生物质成分。为了从网络中找到并消除缺口,我们使用逐步方法组装了一个广泛的参考生化反应数据库,该方法最大限度地减少了微生物反应的纳入。然后,使用两种现有的填补缺口方法从两种绿球藻网络中消除缺口。在第一种方法(自下而上方法)中,向每个模型添加了一组最小的反应,以使我们生物质方程中包含的所有代谢物都能够生产。在第二种方法(自上而下方法)中,向目标网络中添加了参考数据库中的所有反应,然后根据鉴定的同源物的序列比对分数将其修剪掉。由于当前的填补缺口方法不能产生唯一的解决方案,因此开发了一种质量度量标准,该标准包括对系统发育距离和序列相似性的加权,以便自动区分填补缺口的结果。为了产生与我们的植物参考数据库相同的生物质前体代谢物,O. lucimarinus 和 O. tauri 的草案网络分别需要添加 56 个和 70 个反应。