Barman Shohag, Farid Fahmid Al, Gope Hira Lal, Hafiz Md Ferdous Bin, Khan Niaz Ashraf, Ahmad Sabbir, Mansor Sarina
Department of Computer Science and Engineering, Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Pirojpur 8500, Bangladesh.
Faculty of Engineering, Multimedia University, Cyberjaya 63000, Selangor, Malaysia.
Genes (Basel). 2024 Nov 27;15(12):1530. doi: 10.3390/genes15121530.
In the realm of system biology, it is a challenging endeavor to infer a gene regulatory network from time-series gene expression data. Numerous Boolean network inference techniques have emerged for reconstructing a gene regulatory network from a time-series gene expression dataset. However, most of these techniques pose scalability concerns given their capability to consider only two to three regulatory genes over a specific target gene.
To overcome this limitation, a novel inference method, LBF-MI, has been proposed in this research. This two-phase method utilizes limited Boolean functions and multivariate mutual information to reconstruct a Boolean gene regulatory network from time-series gene expression data. Initially, Boolean functions are applied to determine the optimum solutions. In case of failure, multivariate mutual information is applied to obtain the optimum solutions.
This research conducted a performance-comparison experiment between LBF-MI and three other methods: mutual information-based Boolean network inference, context likelihood relatedness, and relevance network. When examined on artificial as well as real-time-series gene expression data, the outcomes exhibited that the proposed LBF-MI method outperformed mutual information-based Boolean network inference, context likelihood relatedness, and relevance network on artificial datasets, and two real datasets ( gene regulatory network, and SOS response of regulatory network).
LBF-MI's superior performance in gene regulatory network inference enables researchers to uncover the regulatory mechanisms and cellular behaviors of various organisms.
在系统生物学领域,从时间序列基因表达数据推断基因调控网络是一项具有挑战性的工作。已经出现了许多布尔网络推理技术来从时间序列基因表达数据集中重建基因调控网络。然而,鉴于这些技术大多只能考虑特定目标基因的两到三个调控基因,它们存在可扩展性问题。
为克服这一局限性,本研究提出了一种新的推理方法,即LBF-MI。这种两阶段方法利用有限布尔函数和多变量互信息从时间序列基因表达数据重建布尔基因调控网络。首先,应用布尔函数确定最优解。若失败,则应用多变量互信息来获得最优解。
本研究对LBF-MI与其他三种方法进行了性能比较实验:基于互信息的布尔网络推理、上下文似然相关性和相关网络。在人工以及实时序列基因表达数据上进行检验时,结果表明,所提出的LBF-MI方法在人工数据集以及两个真实数据集(基因调控网络和调控网络的SOS反应)上优于基于互信息的布尔网络推理、上下文似然相关性和相关网络。
LBF-MI在基因调控网络推理中的卓越性能使研究人员能够揭示各种生物体的调控机制和细胞行为。