Gao Zhen, Tang Jin, Xia Junfeng, Zheng Chun-Hou, Wei Pi-Jing
IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):2853-2861. doi: 10.1109/TCBB.2023.3282212. Epub 2023 Oct 9.
Gene regulatory networks (GRNs) participate in many biological processes, and reconstructing them plays an important role in systems biology. Although many advanced methods have been proposed for GRN reconstruction, their predictive performance is far from the ideal standard, so it is urgent to design a more effective method to reconstruct GRN. Moreover, most methods only consider the gene expression data, ignoring the network structure information contained in GRN. In this study, we propose a supervised model named CNNGRN, which infers GRN from bulk time-series expression data via convolutional neural network (CNN) model, with a more informative feature. Bulk time series gene expression data imply the intricate regulatory associations between genes, and the network structure feature of ground-truth GRN contains rich neighbor information. Hence, CNNGRN integrates the above two features as model inputs. In addition, CNN is adopted to extract intricate features of genes and infer the potential associations between regulators and target genes. Moreover, feature importance visualization experiments are implemented to seek the key features. Experimental results show that CNNGRN achieved competitive performance on benchmark datasets compared to the state-of-the-art computational methods. Finally, hub genes identified based on CNNGRN have been confirmed to be involved in biological processes through literature.
基因调控网络(GRNs)参与许多生物过程,重建基因调控网络在系统生物学中起着重要作用。尽管已经提出了许多用于基因调控网络重建的先进方法,但其预测性能远未达到理想标准,因此迫切需要设计一种更有效的方法来重建基因调控网络。此外,大多数方法仅考虑基因表达数据,而忽略了基因调控网络中包含的网络结构信息。在本研究中,我们提出了一种名为CNNGRN的监督模型,该模型通过卷积神经网络(CNN)模型从批量时间序列表达数据中推断基因调控网络,具有更丰富的信息特征。批量时间序列基因表达数据暗示了基因之间复杂的调控关联,而真实基因调控网络的网络结构特征包含丰富的邻居信息。因此,CNNGRN将上述两个特征作为模型输入。此外,采用卷积神经网络来提取基因的复杂特征,并推断调控因子与靶基因之间的潜在关联。此外,还进行了特征重要性可视化实验以寻找关键特征。实验结果表明,与现有最先进的计算方法相比,CNNGRN在基准数据集上取得了具有竞争力的性能。最后,基于CNNGRN识别出的枢纽基因已通过文献证实参与了生物过程。