Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, Hubei 430072, China.
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae383.
Genome-scale metabolic models (GEMs) are powerful tools for predicting cellular metabolic and physiological states. However, there are still missing reactions in GEMs due to incomplete knowledge. Recent gaps filling methods suggest directly predicting missing responses without relying on phenotypic data. However, they do not differentiate between substrates and products when constructing the prediction models, which affects the predictive performance of the models. In this paper, we propose a hyperedge prediction model that distinguishes substrates and products based on dual-scale fused hypergraph convolution, DSHCNet, for inferring the missing reactions to effectively fill gaps in the GEM. First, we model each hyperedge as a heterogeneous complete graph and then decompose it into three subgraphs at both homogeneous and heterogeneous scales. Then we design two graph convolution-based models to, respectively, extract features of the vertices in two scales, which are then fused via the attention mechanism. Finally, the features of all vertices are further pooled to generate the representative feature of the hyperedge. The strategy of graph decomposition in DSHCNet enables the vertices to engage in message passing independently at both scales, thereby enhancing the capability of information propagation and making the obtained product and substrate features more distinguishable. The experimental results show that the average recovery rate of missing reactions obtained by DSHCNet is at least 11.7% higher than that of the state-of-the-art methods, and that the gap-filled GEMs based on our DSHCNet model achieve the best prediction performance, demonstrating the superiority of our method.
基因组规模代谢模型 (GEMs) 是预测细胞代谢和生理状态的强大工具。然而,由于知识不完整,GEM 中仍然存在缺失的反应。最近的填补空白方法建议直接预测缺失的反应,而不依赖于表型数据。然而,它们在构建预测模型时没有区分底物和产物,这会影响模型的预测性能。在本文中,我们提出了一种基于双尺度融合超图卷积的超边预测模型,即 DSHCNet,用于推断缺失的反应,从而有效地填补 GEM 中的空白。首先,我们将每个超边建模为一个异构完全图,然后将其分解为同质和异构两个尺度的三个子图。然后,我们设计了两个基于图卷积的模型,分别提取两个尺度上顶点的特征,然后通过注意力机制融合。最后,进一步汇集所有顶点的特征,以生成超边的代表性特征。DSHCNet 中的图分解策略使顶点能够在两个尺度上独立进行消息传递,从而增强了信息传播的能力,并使获得的产物和底物特征更加可区分。实验结果表明,DSHCNet 获得的缺失反应的平均恢复率至少比最先进的方法高 11.7%,并且基于我们的 DSHCNet 模型的填补空白 GEM 达到了最佳的预测性能,证明了我们方法的优越性。