College of Computer, National University of Defense Technology, Changsha, 410073, China.
State Key Laboratory of High-Performance Computing, National University of Defense Technology, Changsha, 410073, China.
BMC Bioinformatics. 2020 Nov 26;21(1):544. doi: 10.1186/s12859-020-03899-3.
Elucidation of interactive relation between chemicals and genes is of key relevance not only for discovering new drug leads in drug development but also for repositioning existing drugs to novel therapeutic targets. Recently, biological network-based approaches have been proven to be effective in predicting chemical-gene interactions.
We present CGINet, a graph convolutional network-based method for identifying chemical-gene interactions in an integrated multi-relational graph containing three types of nodes: chemicals, genes, and pathways. We investigate two different perspectives on learning node embeddings. One is to view the graph as a whole, and the other is to adopt a subgraph view that initial node embeddings are learned from the binary association subgraphs and then transferred to the multi-interaction subgraph for more focused learning of higher-level target node representations. Besides, we reconstruct the topological structures of target nodes with the latent links captured by the designed substructures. CGINet adopts an end-to-end way that the encoder and the decoder are trained jointly with known chemical-gene interactions. We aim to predict unknown but potential associations between chemicals and genes as well as their interaction types.
We study three model implementations CGINet-1/2/3 with various components and compare them with baseline approaches. As the experimental results suggest, our models exhibit competitive performances on identifying chemical-gene interactions. Besides, the subgraph perspective and the latent link both play positive roles in learning much more informative node embeddings and can lead to improved prediction.
阐明化学物质与基因之间的相互关系不仅对于在药物开发中发现新的药物先导物至关重要,而且对于将现有药物重新定位到新的治疗靶点也至关重要。最近,基于生物网络的方法已被证明可有效预测化学-基因相互作用。
我们提出了 CGINet,这是一种基于图卷积网络的方法,用于识别包含三种类型节点的集成多关系图中的化学-基因相互作用:化学物质、基因和途径。我们研究了学习节点嵌入的两种不同视角。一种是将图视为整体,另一种是采用子图视图,即从二项关联子图中学习初始节点嵌入,然后将其转移到多相互作用子图,以更专注地学习更高层次的目标节点表示。此外,我们使用设计的子结构捕获的潜在链接来重构目标节点的拓扑结构。CGINet 采用端到端的方式,编码器和解码器与已知的化学-基因相互作用一起进行联合训练。我们旨在预测化学物质和基因之间未知但潜在的关联及其相互作用类型。
我们研究了具有不同组件的三种模型实现 CGINet-1/2/3,并将它们与基线方法进行了比较。实验结果表明,我们的模型在识别化学-基因相互作用方面表现出了有竞争力的性能。此外,子图视角和潜在链接都在学习更具信息量的节点嵌入方面发挥了积极作用,并可以提高预测能力。