School of Information Science and Engineering, Zaozhuang University, Zaozhuang, China.
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China.
Sci Rep. 2018 Dec 12;8(1):17787. doi: 10.1038/s41598-018-36180-y.
Inference of gene regulatory network (GRN) is crucial to understand intracellular physiological activity and function of biology. The identification of large-scale GRN has been a difficult and hot topic of system biology in recent years. In order to reduce the computation load for large-scale GRN identification, a parallel algorithm based on restricted gene expression programming (RGEP), namely MPRGEP, is proposed to infer instantaneous and time-delayed regulatory relationships between transcription factors and target genes. In MPRGEP, the structure and parameters of time-delayed S-system (TDSS) model are encoded into one chromosome. An original hybrid optimization approach based on genetic algorithm (GA) and gene expression programming (GEP) is proposed to optimize TDSS model with MapReduce framework. Time-delayed GRNs (TDGRN) with hundreds of genes are utilized to test the performance of MPRGEP. The experiment results reveal that MPRGEP could infer more accurately gene regulatory network than other state-of-art methods, and obtain the convincing speedup.
推断基因调控网络(GRN)对于理解细胞内的生理活动和生物学功能至关重要。近年来,大规模 GRN 的识别已成为系统生物学的一个难题和热点。为了降低大规模 GRN 识别的计算负荷,提出了一种基于受限基因表达编程(RGEP)的并行算法,即 MPRGEP,用于推断转录因子和靶基因之间的瞬时和时滞调控关系。在 MPRGEP 中,时滞 S 系统(TDSS)模型的结构和参数被编码到一个染色体中。提出了一种基于遗传算法(GA)和基因表达编程(GEP)的原始混合优化方法,用于在 MapReduce 框架中优化 TDSS 模型。使用具有数百个基因的时滞 GRN(TDGRN)来测试 MPRGEP 的性能。实验结果表明,MPRGEP 可以比其他最先进的方法更准确地推断基因调控网络,并获得令人信服的加速。