Whitaker Biomedical Engineering Institute, The Johns Hopkins University, Baltimore, MD 21218, USA.
BMC Bioinformatics. 2010 May 12;11:246. doi: 10.1186/1471-2105-11-246.
Sensitivity analysis is an indispensable tool for the analysis of complex systems. In a recent paper, we have introduced a thermodynamically consistent variance-based sensitivity analysis approach for studying the robustness and fragility properties of biochemical reaction systems under uncertainty in the standard chemical potentials of the activated complexes of the reactions and the standard chemical potentials of the molecular species. In that approach, key sensitivity indices were estimated by Monte Carlo sampling, which is computationally very demanding and impractical for large biochemical reaction systems. Computationally efficient algorithms are needed to make variance-based sensitivity analysis applicable to realistic cellular networks, modeled by biochemical reaction systems that consist of a large number of reactions and molecular species.
We present four techniques, derivative approximation (DA), polynomial approximation (PA), Gauss-Hermite integration (GHI), and orthonormal Hermite approximation (OHA), for analytically approximating the variance-based sensitivity indices associated with a biochemical reaction system. By using a well-known model of the mitogen-activated protein kinase signaling cascade as a case study, we numerically compare the approximation quality of these techniques against traditional Monte Carlo sampling. Our results indicate that, although DA is computationally the most attractive technique, special care should be exercised when using it for sensitivity analysis, since it may only be accurate at low levels of uncertainty. On the other hand, PA, GHI, and OHA are computationally more demanding than DA but can work well at high levels of uncertainty. GHI results in a slightly better accuracy than PA, but it is more difficult to implement. OHA produces the most accurate approximation results and can be implemented in a straightforward manner. It turns out that the computational cost of the four approximation techniques considered in this paper is orders of magnitude smaller than traditional Monte Carlo estimation. Software, coded in MATLAB, which implements all sensitivity analysis techniques discussed in this paper, is available free of charge.
Estimating variance-based sensitivity indices of a large biochemical reaction system is a computationally challenging task that can only be addressed via approximations. Among the methods presented in this paper, a technique based on orthonormal Hermite polynomials seems to be an acceptable candidate for the job, producing very good approximation results for a wide range of uncertainty levels in a fraction of the time required by traditional Monte Carlo sampling.
敏感性分析是分析复杂系统不可或缺的工具。在最近的一篇论文中,我们引入了一种热力学一致性的基于方差的敏感性分析方法,用于研究在反应的激活复合物的标准化学势和分子物种的标准化学势的不确定性下,生化反应系统的鲁棒性和脆弱性特性。在该方法中,通过蒙特卡罗抽样估计关键的敏感性指标,这对于大型生化反应系统来说计算量非常大且不切实际。需要计算效率高的算法,使基于方差的敏感性分析能够应用于由大量反应和分子物种组成的生化反应系统建模的现实细胞网络。
我们提出了四种技术,即导数逼近(DA)、多项式逼近(PA)、高斯-赫尔墨特积分(GHI)和正交赫尔墨特逼近(OHA),用于分析逼近与生化反应系统相关的基于方差的敏感性指标。通过使用丝裂原激活蛋白激酶信号级联的一个著名模型作为案例研究,我们数值比较了这些技术与传统蒙特卡罗抽样的逼近质量。我们的结果表明,虽然 DA 在计算上是最有吸引力的技术,但在进行敏感性分析时应特别小心,因为它可能仅在低不确定性水平下准确。另一方面,PA、GHI 和 OHA 比 DA 在计算上要求更高,但在高不确定性水平下效果良好。GHI 的精度略高于 PA,但实施起来更困难。OHA 产生最准确的逼近结果,并且可以以直接的方式实现。事实证明,本文考虑的四种逼近技术的计算成本比传统的蒙特卡罗估计小几个数量级。以 MATLAB 编写的实现本文讨论的所有敏感性分析技术的软件是免费提供的。
估计大型生化反应系统的基于方差的敏感性指数是一项具有挑战性的计算任务,只能通过逼近来解决。在本文提出的方法中,基于正交赫尔墨特多项式的方法似乎是一个可行的候选方法,它可以在传统蒙特卡罗抽样所需时间的一小部分内为广泛的不确定性水平产生非常好的逼近结果。