School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
School of Automation and Electrical Engineering, Linyi University, Linyi, 276000, China.
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac266.
Metabolism is the process by which an organism continuously replaces old substances with new substances. It plays an important role in maintaining human life, body growth and reproduction. More and more researchers have shown that the concentrations of some metabolites in patients are different from those in healthy people. Traditional biological experiments can test some hypotheses and verify their relationships but usually take a considerable amount of time and money. Therefore, it is urgent to develop a new computational method to identify the relationships between metabolites and diseases. In this work, we present a new deep learning algorithm named as graph convolutional network with graph attention network (GCNAT) to predict the potential associations of disease-related metabolites. First, we construct a heterogeneous network based on known metabolite-disease associations, metabolite-metabolite similarities and disease-disease similarities. Metabolite and disease features are encoded and learned through the graph convolutional neural network. Then, a graph attention layer is used to combine the embeddings of multiple convolutional layers, and the corresponding attention coefficients are calculated to assign different weights to the embeddings of each layer. Further, the prediction result is obtained by decoding and scoring the final synthetic embeddings. Finally, GCNAT achieves a reliable area under the receiver operating characteristic curve of 0.95 and the precision-recall curve of 0.405, which are better than the results of existing five state-of-the-art predictive methods in 5-fold cross-validation, and the case studies show that the metabolite-disease correlations predicted by our method can be successfully demonstrated by relevant experiments. We hope that GCNAT could be a useful biomedical research tool for predicting potential metabolite-disease associations in the future.
代谢是生物体不断用新物质取代旧物质的过程。它在维持人类生命、身体生长和繁殖方面起着重要作用。越来越多的研究人员表明,患者体内某些代谢物的浓度与健康人不同。传统的生物实验可以检验一些假设并验证它们之间的关系,但通常需要相当多的时间和金钱。因此,迫切需要开发一种新的计算方法来识别代谢物与疾病之间的关系。在这项工作中,我们提出了一种新的深度学习算法,名为基于图注意力网络的图卷积网络(GCNAT),用于预测与疾病相关的代谢物的潜在关联。首先,我们基于已知的代谢物-疾病关联、代谢物-代谢物相似性和疾病-疾病相似性构建了一个异构网络。通过图卷积神经网络对代谢物和疾病特征进行编码和学习。然后,使用图注意力层将多个卷积层的嵌入结合起来,并计算相应的注意力系数,为每个层的嵌入分配不同的权重。进一步,通过对最终综合嵌入进行解码和评分,得到预测结果。最后,在 5 折交叉验证中,GCNAT 实现了可靠的接收器操作特征曲线下面积为 0.95 和精度-召回曲线为 0.405,优于现有 5 种最先进的预测方法的结果,案例研究表明,我们的方法预测的代谢物-疾病相关性可以通过相关实验成功证明。我们希望 GCNAT 能够成为未来预测潜在代谢物-疾病关联的有用的生物医学研究工具。