Department of Clinical and Experimental Medicine, Linköping University, Sweden.
FEBS J. 2012 Sep;279(18):3513-27. doi: 10.1111/j.1742-4658.2012.08725.x. Epub 2012 Aug 28.
It is often predicted that model-based data analysis will revolutionize biology, just as it has physics and engineering. A widely used tool within such analysis is hypothesis testing, which focuses on model rejections. However, the fact that a systems biology model is non-rejected is often a relatively weak statement, as such models usually are highly over-parametrized with respect to the available data, and both parameters and predictions may therefore be arbitrarily uncertain. For this reason, we formally define and analyse the concept of a core prediction. A core prediction is a uniquely identified property that must be fulfilled if the given model structure is to explain the data, even if the individual parameters are non-uniquely identified. It is shown that such a prediction is as strong a conclusion as a rejection. Furthermore, a new method for core prediction analysis is introduced, which is beneficial for the uncertainty of specific model properties, as the method only characterizes the space of acceptable parameters in the relevant directions. This avoids the curse of dimensionality associated with the generic characterizations used by previously proposed methods. Analysis on examples shows that the new method is comparable to profile likelihood with regard to practical identifiability, and thus generalizes profile likelihood to the more general problem of observability. If used, the concepts and methods presented herein make it possible to distinguish between a conclusion and a mere suggestion, which hopefully will contribute to a more justified confidence in systems biology analyses.
人们常预测,基于模型的数据分析将像在物理学和工程学领域一样彻底改变生物学。此类分析中广泛使用的一种工具是假设检验,它侧重于模型的拒绝。然而,系统生物学模型未被拒绝通常只是一个相对较弱的陈述,因为这些模型通常针对可用数据进行了高度超参数化,因此参数和预测可能会任意不确定。出于这个原因,我们正式定义并分析了核心预测的概念。核心预测是一个独特识别的属性,如果给定的模型结构要解释数据,就必须满足该属性,即使个别参数无法唯一确定。结果表明,这样的预测与拒绝一样具有强有力的结论。此外,引入了一种新的核心预测分析方法,该方法有利于特定模型属性的不确定性,因为该方法仅在相关方向上对可接受参数的空间进行特征化。这避免了先前提出的方法中使用的通用特征化所带来的维数灾难。对示例的分析表明,新方法在实际可识别性方面与似然比轮廓法相当,因此将似然比轮廓法推广到更一般的可观测性问题。如果使用这些概念和方法,可以区分结论和仅仅是建议,这有望为系统生物学分析带来更合理的信心。