National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
Key Laboratory of Smart Farming for Agricultural Animals, Huazhong Agricultural University, Wuhan 430070, China.
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac717.
The question of how to construct gene regulatory networks has long been a focus of biological research. Mutual information can be used to measure nonlinear relationships, and it has been widely used in the construction of gene regulatory networks. However, this method cannot measure indirect regulatory relationships under the influence of multiple genes, which reduces the accuracy of inferring gene regulatory networks.
This work proposes a method for constructing gene regulatory networks based on mixed entropy optimizing context-related likelihood mutual information (MEOMI). First, two entropy estimators were combined to calculate the mutual information between genes. Then, distribution optimization was performed using a context-related likelihood algorithm to eliminate some indirect regulatory relationships and obtain the initial gene regulatory network. To obtain the complex interaction between genes and eliminate redundant edges in the network, the initial gene regulatory network was further optimized by calculating the conditional mutual inclusive information (CMI2) between gene pairs under the influence of multiple genes. The network was iteratively updated to reduce the impact of mutual information on the overestimation of the direct regulatory intensity.
The experimental results show that the MEOMI method performed better than several other kinds of gene network construction methods on DREAM challenge simulated datasets (DREAM3 and DREAM5), three real Escherichia coli datasets (E.coli SOS pathway network, E.coli SOS DNA repair network and E.coli community network) and two human datasets.
Source code and dataset are available at https://github.com/Dalei-Dalei/MEOMI/ and http://122.205.95.139/MEOMI/.
Supplementary data are available at Bioinformatics online.
如何构建基因调控网络一直是生物研究的焦点。互信息可用于衡量非线性关系,已被广泛应用于基因调控网络的构建。然而,这种方法无法衡量多个基因影响下的间接调控关系,从而降低了推断基因调控网络的准确性。
本研究提出了一种基于混合熵优化与上下文相关似然互信息(MEOMI)的基因调控网络构建方法。首先,结合两种熵估计器计算基因间的互信息。然后,使用与上下文相关的似然算法进行分布优化,以消除一些间接调控关系,并获得初始基因调控网络。为了获取基因之间的复杂相互作用并消除网络中的冗余边,通过计算多个基因影响下基因对之间的条件互包含信息(CMI2),进一步优化初始基因调控网络。通过迭代更新网络,减少互信息对直接调控强度的高估影响。
实验结果表明,在 DREAM 挑战赛模拟数据集(DREAM3 和 DREAM5)、三个真实大肠杆菌数据集(E.coli SOS 途径网络、E.coli SOS DNA 修复网络和 E.coli 群落网络)和两个人类数据集上,MEOMI 方法的性能优于其他几种基因网络构建方法。
源代码和数据集可在 https://github.com/Dalei-Dalei/MEOMI/ 和 http://122.205.95.139/MEOMI/ 上获取。
补充数据可在生物信息学在线获取。