Computing, Environment, and Life Sciences Division, Argonne National Laboratory, Lemont, IL 60439, USA.
Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA.
Nucleic Acids Res. 2021 Jan 8;49(D1):D575-D588. doi: 10.1093/nar/gkaa746.
For over 10 years, ModelSEED has been a primary resource for the construction of draft genome-scale metabolic models based on annotated microbial or plant genomes. Now being released, the biochemistry database serves as the foundation of biochemical data underlying ModelSEED and KBase. The biochemistry database embodies several properties that, taken together, distinguish it from other published biochemistry resources by: (i) including compartmentalization, transport reactions, charged molecules and proton balancing on reactions; (ii) being extensible by the user community, with all data stored in GitHub; and (iii) design as a biochemical 'Rosetta Stone' to facilitate comparison and integration of annotations from many different tools and databases. The database was constructed by combining chemical data from many resources, applying standard transformations, identifying redundancies and computing thermodynamic properties. The ModelSEED biochemistry is continually tested using flux balance analysis to ensure the biochemical network is modeling-ready and capable of simulating diverse phenotypes. Ontologies can be designed to aid in comparing and reconciling metabolic reconstructions that differ in how they represent various metabolic pathways. ModelSEED now includes 33,978 compounds and 36,645 reactions, available as a set of extensible files on GitHub, and available to search at https://modelseed.org/biochem and KBase.
十年来,ModelSEED 一直是基于注释的微生物或植物基因组构建草案基因组规模代谢模型的主要资源。现在发布的生物化学数据库是 ModelSEED 和 KBase 中生化数据的基础。该生物化学数据库具有几个特性,这些特性结合在一起,使其有别于其他已发布的生物化学资源:(i) 包括区室化、运输反应、带电分子和反应中的质子平衡;(ii) 可由用户社区扩展,所有数据都存储在 GitHub 中;(iii) 设计为生化“罗塞塔石碑”,以促进来自许多不同工具和数据库的注释的比较和整合。该数据库是通过结合来自多个资源的化学数据、应用标准转换、识别冗余和计算热力学特性来构建的。ModelSEED 生物化学使用通量平衡分析进行持续测试,以确保生化网络具有建模就绪能力,并能够模拟各种表型。可以设计本体论来帮助比较和协调代谢重建,这些重建在如何表示各种代谢途径方面存在差异。ModelSEED 现在包括 33978 种化合物和 36645 种反应,可作为 GitHub 上的一组可扩展文件获取,并可在 https://modelseed.org/biochem 和 KBase 上搜索。