School of Computer, China University of Geosciences, Wuhan, 430074, China.
Department of Pharmacy, Lianshui People's Hospital of Kangda College Affiliated to Nanjing Medical University, Huai'an 223300, China.
Comput Biol Med. 2024 Feb;169:107904. doi: 10.1016/j.compbiomed.2023.107904. Epub 2023 Dec 28.
miRNAs are a class of small non-coding RNA molecules that play important roles in gene regulation. They are crucial for maintaining normal cellular functions, and dysregulation or dysfunction of miRNAs which are linked to the onset and advancement of multiple human diseases. Research on miRNAs has unveiled novel avenues in the realm of the diagnosis, treatment, and prevention of human diseases. However, clinical trials pose challenges and drawbacks, such as complexity and time-consuming processes, which create obstacles for many researchers. Graph Attention Network (GAT) has shown excellent performance in handling graph-structured data for tasks such as link prediction. Some studies have successfully applied GAT to miRNA-disease association prediction. However, there are several drawbacks to existing methods. Firstly, most of the previous models rely solely on concatenation operations to merge features of miRNAs and diseases, which results in the deprivation of significant modality-specific information and even the inclusion of redundant information. Secondly, as the number of layers in GAT increases, there is a possibility of excessive smoothing in the feature extraction process, which significantly affects the prediction accuracy. To address these issues and effectively complete miRNA disease prediction tasks, we propose an innovative model called Multiplex Adaptive Modality Fusion Graph Attention Network (MAMFGAT). MAMFGAT utilizes GAT as the main structure for feature aggregation and incorporates a multi-modal adaptive fusion module to extract features from three interconnected networks: the miRNA-disease association network, the miRNA similarity network, and the disease similarity network. It employs adaptive learning and cross-modality contrastive learning to fuse more effective miRNA and disease feature embeddings as well as incorporates multi-modal residual feature fusion to tackle the problem of excessive feature smoothing in GATs. Finally, we employ a Multi-Layer Perceptron (MLP) model that takes the embeddings of miRNA and disease features as input to anticipate the presence of potential miRNA-disease associations. Extensive experimental results provide evidence of the superior performance of MAMFGAT in comparison to other state-of-the-art methods. To validate the significance of various modalities and assess the efficacy of the designed modules, we performed an ablation analysis. Furthermore, MAMFGAT shows outstanding performance in three cancer case studies, indicating that it is a reliable method for studying the association between miRNA and diseases. The implementation of MAMFGAT can be accessed at the following GitHub repository: https://github.com/zixiaojin66/MAMFGAT-master.
miRNAs 是一类小的非编码 RNA 分子,在基因调控中发挥着重要作用。它们对于维持正常的细胞功能至关重要,miRNAs 的失调或功能障碍与多种人类疾病的发生和发展有关。miRNAs 的研究为人类疾病的诊断、治疗和预防开辟了新的途径。然而,临床试验存在着复杂性和耗时的过程等挑战和缺点,这给许多研究人员带来了障碍。图注意网络(GAT)在处理图结构数据的任务中表现出色,例如链接预测。一些研究成功地将 GAT 应用于 miRNA-疾病关联预测。然而,现有的方法存在几个缺点。首先,大多数以前的模型仅依赖于串联操作来合并 miRNA 和疾病的特征,这导致重要的模态特定信息的缺失,甚至包括冗余信息。其次,随着 GAT 层数的增加,在特征提取过程中可能会出现过度平滑,这会显著影响预测准确性。为了解决这些问题,并有效地完成 miRNA 疾病预测任务,我们提出了一种名为多复用自适应模态融合图注意网络(MAMFGAT)的创新模型。MAMFGAT 利用 GAT 作为主要结构进行特征聚合,并结合了一个多模态自适应融合模块,从三个相互连接的网络中提取特征:miRNA-疾病关联网络、miRNA 相似性网络和疾病相似性网络。它采用自适应学习和跨模态对比学习来融合更有效的 miRNA 和疾病特征嵌入,并结合多模态残差特征融合来解决 GAT 中特征平滑过度的问题。最后,我们使用多层感知机(MLP)模型,将 miRNA 和疾病特征的嵌入作为输入,预测潜在的 miRNA-疾病关联。大量的实验结果表明,MAMFGAT 优于其他最先进的方法。为了验证各种模态的重要性并评估所设计模块的效果,我们进行了消融分析。此外,MAMFGAT 在三个癌症案例研究中表现出色,表明它是研究 miRNA 与疾病之间关联的可靠方法。MAMFGAT 的实现可以在以下 GitHub 存储库中访问:https://github.com/zixiaojin66/MAMFGAT-master。