Myasnikova Ekaterina, Kozlov Konstantin N
Department of Computational Biology, St. Petersburg State Polytechnical University, 29 Polytekhnicheskaya, St. Petersburg, 195251, Russia , Department of Bioinformatics, Moscow Institute of Physics and Technology, Institutskiy per. 9, Dolgoprudny 141700, Moscow Region, Russia.
J Bioinform Comput Biol. 2014 Apr;12(2):1441002. doi: 10.1142/S0219720014410029. Epub 2014 Mar 31.
In this paper, a specific aspect of the prediction problem is considered: high predictive power is understood as a possibility to reproduce correct behavior of model solutions at predefined values of a subset of parameters. The problem is discussed in the context of a specific mathematical model, the gene circuit model for segmentation gap gene system in early Drosophila development. A shortcoming of the model is that it cannot be used for predicting the system behavior in mutants when fitted to wild type (WT) data. In order to answer a question whether experimental data contain enough information for the correct prediction we introduce two measures of predictive power. The first measure reveals the biologically substantiated low sensitivity of the model to parameters that are responsible for correct reconstruction of expression patterns in mutants, while the second one takes into account their correlation with the other parameters. It is demonstrated that the model solution, obtained by fitting to gene expression data in WT and Kr⁻ mutants simultaneously, and exhibiting the high predictive power, is characterized by much higher values of both measures than those fitted to WT data alone. This result leads us to the conclusion that information contained in WT data is insufficient to reliably estimate the large number of model parameters and provide predictions of mutants.
在本文中,我们考虑了预测问题的一个特定方面:高预测能力被理解为在参数子集的预定义值处重现模型解的正确行为的可能性。该问题在特定数学模型的背景下进行讨论,即早期果蝇发育中分割间隙基因系统的基因回路模型。该模型的一个缺点是,当拟合野生型(WT)数据时,它不能用于预测突变体中的系统行为。为了回答实验数据是否包含足够信息进行正确预测的问题,我们引入了两种预测能力的度量。第一种度量揭示了模型对负责在突变体中正确重建表达模式的参数的生物学上合理的低敏感性,而第二种度量则考虑了它们与其他参数的相关性。结果表明,通过同时拟合WT和Kr⁻突变体中的基因表达数据而获得的具有高预测能力的模型解,其两种度量的值都比仅拟合WT数据时高得多。这一结果使我们得出结论,WT数据中包含的信息不足以可靠地估计大量模型参数并提供对突变体的预测。