Fan Ming, Kuwahara Hiroyuki, Wang Xiaolei, Wang Suojin, Gao Xin
Brief Bioinform. 2015 Nov;16(6):987-99. doi: 10.1093/bib/bbv015. Epub 2015 Mar 26.
Parameter estimation is a challenging computational problem in the reverse engineering of biological systems. Because advances in biotechnology have facilitated wide availability of time-series gene expression data, systematic parameter estimation of gene circuit models from such time-series mRNA data has become an important method for quantitatively dissecting the regulation of gene expression. By focusing on the modeling of gene circuits, we examine here the performance of three types of state-of-the-art parameter estimation methods: population-based methods, online methods and model-decomposition-based methods. Our results show that certain population-based methods are able to generate high-quality parameter solutions. The performance of these methods, however, is heavily dependent on the size of the parameter search space, and their computational requirements substantially increase as the size of the search space increases. In comparison, online methods and model decomposition-based methods are computationally faster alternatives and are less dependent on the size of the search space. Among other things, our results show that a hybrid approach that augments computationally fast methods with local search as a subsequent refinement procedure can substantially increase the quality of their parameter estimates to the level on par with the best solution obtained from the population-based methods while maintaining high computational speed. These suggest that such hybrid methods can be a promising alternative to the more commonly used population-based methods for parameter estimation of gene circuit models when limited prior knowledge about the underlying regulatory mechanisms makes the size of the parameter search space vastly large.
参数估计是生物系统逆向工程中一个具有挑战性的计算问题。由于生物技术的进步促进了时间序列基因表达数据的广泛可得性,从这种时间序列mRNA数据中对基因电路模型进行系统的参数估计已成为定量剖析基因表达调控的一种重要方法。通过专注于基因电路的建模,我们在此研究三种类型的先进参数估计方法的性能:基于群体的方法、在线方法和基于模型分解的方法。我们的结果表明,某些基于群体的方法能够生成高质量的参数解。然而,这些方法的性能在很大程度上取决于参数搜索空间的大小,并且随着搜索空间大小的增加,它们的计算需求会大幅增加。相比之下,在线方法和基于模型分解的方法在计算上更快,并且对搜索空间大小的依赖性较小。我们的结果表明,一种混合方法,即先用计算快速的方法,然后用局部搜索作为后续的细化程序,可以在保持高计算速度的同时,将其参数估计的质量大幅提高到与基于群体的方法获得的最佳解相当的水平。这些结果表明,当关于潜在调控机制的先验知识有限,使得参数搜索空间非常大时,这种混合方法对于基因电路模型的参数估计可能是一种比更常用的基于群体的方法更有前景的替代方法。