Specialty Division for Systems Biotechnology, Technische Universität München, Garching, Germany.
PLoS One. 2020 Apr 30;15(4):e0230599. doi: 10.1371/journal.pone.0230599. eCollection 2020.
Systems biology applies concepts from engineering in order to understand biological networks. If such an understanding was complete, biologists would be able to design ad hoc biochemical components tailored for different purposes, which is the goal of synthetic biology. Needless to say that we are far away from creating biological subsystems as intricate and precise as those found in nature, but mathematical models and high throughput techniques have brought us a long way in this direction. One of the difficulties that still needs to be overcome is finding the right values for model parameters and dealing with uncertainty, which is proving to be an extremely difficult task. In this work, we take advantage of ensemble modeling techniques, where a large number of models with different parameter values are formulated and then tested according to some performance criteria. By finding features shared by successful models, the role of different components and the synergies between them can be better understood. We will address some of the difficulties often faced by ensemble modeling approaches, such as the need to sample a space whose size grows exponentially with the number of parameters, and establishing useful selection criteria. Some methods will be shown to reduce the predictions from many models into a set of understandable "design principles" that can guide us to improve or manufacture a biochemical network. Our proposed framework formulates models within standard formalisms in order to integrate information from different sources and minimize the dimension of the parameter space. Additionally, the mathematical properties of the formalism enable a partition of the parameter space into independent subspaces. Each of these subspaces can be paired with a set of criteria that depend exclusively on it, thus allowing a separate sampling/screening in spaces of lower dimension. By applying tests in a strict order where computationally cheaper tests are applied first to each subspace and applying computationally expensive tests to the remaining subset thereafter, the use of resources is optimized and a larger number of models can be examined. This can be compared to a complex database query where the order of the requests can make a huge difference in the processing time. The method will be illustrated by analyzing a classical model of a metabolic pathway with end-product inhibition. Even for such a simple model, the method provides novel insight.
系统生物学运用工程学的概念来理解生物网络。如果能够完全理解这些网络,生物学家就能够设计出针对不同目的的特定生化组件,这就是合成生物学的目标。不用说,我们离创造出像自然界中那样复杂和精确的生物子系统还很远,但数学模型和高通量技术已经在这方面取得了很大的进展。我们仍然需要克服的一个困难是为模型参数找到正确的值,并处理不确定性,这被证明是一项极其困难的任务。在这项工作中,我们利用了集合建模技术,其中制定了大量具有不同参数值的模型,然后根据一些性能标准进行测试。通过找到成功模型所共有的特征,可以更好地理解不同组件的作用以及它们之间的协同作用。我们将解决集合建模方法经常面临的一些困难,例如需要对参数数量呈指数增长的空间进行采样,以及建立有用的选择标准。将展示一些方法,这些方法可以将来自许多模型的预测结果简化为一组可理解的“设计原则”,这些原则可以指导我们改进或制造生化网络。我们提出的框架在标准形式主义中制定模型,以便整合来自不同来源的信息并最小化参数空间的维度。此外,形式主义的数学性质使得参数空间可以划分为独立的子空间。每个子空间都可以与一组仅依赖于它的标准相关联,从而可以在更低维度的空间中进行单独的采样/筛选。通过以严格的顺序应用测试,首先对每个子空间应用计算成本较低的测试,然后对剩余的子集应用计算成本较高的测试,可以优化资源的使用,并可以检查更多的模型。这可以与复杂的数据库查询进行比较,其中请求的顺序会对处理时间产生巨大影响。该方法将通过分析具有终产物抑制的经典代谢途径模型来说明。即使对于这样一个简单的模型,该方法也提供了新的见解。