Department of Computer Science and Technology, Xiamen University, Xiamen 361005, China.
Department of School of Aeronautics and Astronautics, Xiamen University, Xiamen 361005, China.
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbac623.
Potential miRNA-disease associations (MDA) play an important role in the discovery of complex human disease etiology. Therefore, MDA prediction is an attractive research topic in the field of biomedical machine learning. Recently, several models have been proposed for this task, but their performance limited by over-reliance on relevant network information with noisy graph structure connections. However, the application of self-supervised graph structure learning to MDA tasks remains unexplored. Our study is the first to use multi-view self-supervised contrastive learning (MSGCL) for MDA prediction. Specifically, we generated a learner view without association labels of miRNAs and diseases as input, and utilized the known association network to generate an anchor view that provides guiding signals for the learner view. The graph structure was optimized by designing a contrastive loss to maximize the consistency between the anchor and learner views. Our model is similar to a pre-trained model that continuously optimizes upstream tasks for high-quality association graph topology, thereby enhancing the latent representation of association predictions. The experimental results show that our proposed method outperforms state-of-the-art methods by 2.79$%$ and 3.20$%$ in area under the receiver operating characteristic curve (AUC) and area under the precision/recall curve (AUPR), respectively.
潜在的 miRNA-疾病关联 (MDA) 在复杂人类疾病病因的发现中起着重要作用。因此,MDA 预测是生物医学机器学习领域中一个很有吸引力的研究课题。最近,已经提出了几种用于该任务的模型,但它们的性能受到对具有噪声图结构连接的相关网络信息的过度依赖的限制。然而,自监督图结构学习在 MDA 任务中的应用仍未得到探索。我们的研究首次将多视图自监督对比学习 (MSGCL) 应用于 MDA 预测。具体来说,我们生成了一个没有 miRNA 和疾病关联标签的学习者视图作为输入,并利用已知的关联网络生成了一个锚点视图,为学习者视图提供了引导信号。通过设计对比损失来优化图结构,以最大化锚点和学习者视图之间的一致性。我们的模型类似于一个预训练模型,它不断优化上游任务,以获得高质量的关联图拓扑,从而增强关联预测的潜在表示。实验结果表明,我们提出的方法在接收者操作特征曲线下的面积 (AUC) 和精度/召回曲线下的面积 (AUPR) 方面分别比最先进的方法高出 2.79%和 3.20%。