Penas David R, González Patricia, Egea Jose A, Doallo Ramón, Banga Julio R
BioProcess Engineering Group, IIM-CSIC, Eduardo Cabello 6, Vigo, 36208, Spain.
Computer Architecture Group, Universidade da Coruña, Campus de Elviña s/n, Coruña, 15071 A, Spain.
BMC Bioinformatics. 2017 Jan 21;18(1):52. doi: 10.1186/s12859-016-1452-4.
The development of large-scale kinetic models is one of the current key issues in computational systems biology and bioinformatics. Here we consider the problem of parameter estimation in nonlinear dynamic models. Global optimization methods can be used to solve this type of problems but the associated computational cost is very large. Moreover, many of these methods need the tuning of a number of adjustable search parameters, requiring a number of initial exploratory runs and therefore further increasing the computation times. Here we present a novel parallel method, self-adaptive cooperative enhanced scatter search (saCeSS), to accelerate the solution of this class of problems. The method is based on the scatter search optimization metaheuristic and incorporates several key new mechanisms: (i) asynchronous cooperation between parallel processes, (ii) coarse and fine-grained parallelism, and (iii) self-tuning strategies.
The performance and robustness of saCeSS is illustrated by solving a set of challenging parameter estimation problems, including medium and large-scale kinetic models of the bacterium E. coli, bakerés yeast S. cerevisiae, the vinegar fly D. melanogaster, Chinese Hamster Ovary cells, and a generic signal transduction network. The results consistently show that saCeSS is a robust and efficient method, allowing very significant reduction of computation times with respect to several previous state of the art methods (from days to minutes, in several cases) even when only a small number of processors is used.
The new parallel cooperative method presented here allows the solution of medium and large scale parameter estimation problems in reasonable computation times and with small hardware requirements. Further, the method includes self-tuning mechanisms which facilitate its use by non-experts. We believe that this new method can play a key role in the development of large-scale and even whole-cell dynamic models.
大规模动力学模型的开发是计算系统生物学和生物信息学当前的关键问题之一。在此,我们考虑非线性动态模型中的参数估计问题。全局优化方法可用于解决此类问题,但相关的计算成本非常高。此外,这些方法中的许多都需要调整一些可调整的搜索参数,需要进行多次初始探索性运行,从而进一步增加计算时间。在此,我们提出一种新颖的并行方法,即自适应协作增强散射搜索(saCeSS),以加速此类问题的解决。该方法基于散射搜索优化元启发式算法,并纳入了几个关键的新机制:(i)并行进程之间的异步协作,(ii)粗粒度和细粒度并行,以及(iii)自调整策略。
通过解决一组具有挑战性的参数估计问题,包括大肠杆菌、酿酒酵母、黑腹果蝇、中国仓鼠卵巢细胞的中型和大型动力学模型以及一个通用信号转导网络,说明了saCeSS的性能和鲁棒性。结果一致表明,saCeSS是一种鲁棒且高效的方法,即使仅使用少量处理器,相对于之前的几种现有技术方法,也能显著减少计算时间(在几种情况下,从数天减少到数分钟)。
本文提出的新并行协作方法能够在合理的计算时间内且硬件需求较小的情况下解决中型和大型参数估计问题。此外,该方法包含自调整机制,便于非专家使用。我们相信这种新方法能够在大规模甚至全细胞动态模型的开发中发挥关键作用。