Department of Biomedical Engineering, Computational Biology, Eindhoven University of Technology, 5612 AZ Eindhoven, the Netherlands; Novo Nordisk Foundation Center for Biosustainability at UC San Diego, University of California, San Diego, School of Medicine, La Jolla, CA 92093, USA.
Novo Nordisk Foundation Center for Biosustainability at UC San Diego, University of California, San Diego, School of Medicine, La Jolla, CA 92093, USA; Department of Pediatrics, University of California, San Diego, School of Medicine, La Jolla, CA 92093, USA.
Cell Syst. 2017 Mar 22;4(3):318-329.e6. doi: 10.1016/j.cels.2017.01.010. Epub 2017 Feb 15.
Genome-scale models of metabolism can illuminate the molecular basis of cell phenotypes. Since some enzymes are only active in specific cell types, several algorithms use omics data to construct cell-line- and tissue-specific metabolic models from genome-scale models. However, these methods are often not rigorously benchmarked, and it is unclear how algorithm and parameter selection (e.g., gene expression thresholds, metabolic constraints) affects model content and predictive accuracy. To investigate this, we built hundreds of models of four different cancer cell lines using six algorithms, four gene expression thresholds, and three sets of metabolic constraints. Model content varied substantially across different parameter sets, but the algorithms generally increased accuracy in gene essentiality predictions. However, model extraction method choice had the largest impact on model accuracy. We further highlight how assumptions during model development influence model prediction accuracy. These insights will guide further development of context-specific models, thus more accurately resolving genotype-phenotype relationships.
基因组规模的代谢模型可以阐明细胞表型的分子基础。由于一些酶仅在特定的细胞类型中活跃,因此有几种算法使用组学数据从基因组规模的模型中构建细胞系和组织特异性代谢模型。然而,这些方法通常没有经过严格的基准测试,也不清楚算法和参数选择(例如,基因表达阈值、代谢约束)如何影响模型内容和预测准确性。为了研究这一点,我们使用六种算法、四个基因表达阈值和三套代谢约束,构建了四种不同癌细胞系的数百个模型。模型内容在不同的参数集中差异很大,但这些算法通常可以提高基因必需性预测的准确性。然而,模型提取方法的选择对模型准确性的影响最大。我们进一步强调了模型开发过程中的假设如何影响模型预测的准确性。这些见解将指导特定于上下文的模型的进一步开发,从而更准确地解决基因型-表型关系。