Suppr超能文献

使用监督和半监督图神经网络对基因调控网络进行归纳推理。

Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks.

作者信息

Wang Juexin, Ma Anjun, Ma Qin, Xu Dong, Joshi Trupti

机构信息

Department of Electrical Engineering and Computer Science, and Christopher S. Bond Life Science Center, University of Missouri, 65211, USA.

Department of Biomedical Informatics, School of Medicine, Ohio State University, OH 43210, USA.

出版信息

Comput Struct Biotechnol J. 2020 Nov 5;18:3335-3343. doi: 10.1016/j.csbj.2020.10.022. eCollection 2020.

Abstract

Discovering gene regulatory relationships and reconstructing gene regulatory networks (GRN) based on gene expression data is a classical, long-standing computational challenge in bioinformatics. Computationally inferring a possible regulatory relationship between two genes can be formulated as a link prediction problem between two nodes in a graph. Graph neural network (GNN) provides an opportunity to construct GRN by integrating topological neighbor propagation through the whole gene network. We propose an end-to-end gene regulatory graph neural network (GRGNN) approach to reconstruct GRNs from scratch utilizing the gene expression data, in both a supervised and a semi-supervised framework. To get better inductive generalization capability, GRN inference is formulated as a graph classification problem, to distinguish whether a subgraph centered at two nodes contains the link between the two nodes. A linked pair between a transcription factor (TF) and a target gene, and their neighbors are labeled as a positive subgraph, while an unlinked TF and target gene pair and their neighbors are labeled as a negative subgraph. A GNN model is constructed with node features from both explicit gene expression and graph embedding. We demonstrate a noisy starting graph structure built from partial information, such as Pearson's correlation coefficient and mutual information can help guide the GRN inference through an appropriate ensemble technique. Furthermore, a semi-supervised scheme is implemented to increase the quality of the classifier. When compared with established methods, GRGNN achieved state-of-the-art performance on the DREAM5 GRN inference benchmarks. GRGNN is publicly available at https://github.com/juexinwang/GRGNN.

摘要

基于基因表达数据发现基因调控关系并重建基因调控网络(GRN)是生物信息学中一个经典且长期存在的计算挑战。通过计算推断两个基因之间可能的调控关系可以被表述为图中两个节点之间的链接预测问题。图神经网络(GNN)为通过整合整个基因网络中的拓扑邻居传播来构建GRN提供了契机。我们提出一种端到端的基因调控图神经网络(GRGNN)方法,在有监督和半监督框架下利用基因表达数据从头开始重建GRN。为了获得更好的归纳泛化能力,GRN推理被表述为一个图分类问题,以区分以两个节点为中心的子图是否包含这两个节点之间的链接。转录因子(TF)与靶基因之间的链接对及其邻居被标记为正子图,而无链接的TF与靶基因对及其邻居被标记为负子图。利用显式基因表达和图嵌入的节点特征构建GNN模型。我们证明,从诸如皮尔逊相关系数和互信息等部分信息构建的有噪声起始图结构可以通过适当的集成技术帮助指导GRN推理。此外,实施了一种半监督方案以提高分类器的质量。与现有方法相比,GRGNN在DREAM5 GRN推理基准测试中取得了领先性能。GRGNN可在https://github.com/juexinwang/GRGNN上公开获取。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验