School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China.
Guangxi Academy of Science, Nanning, 530007, China.
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae412.
Numerous studies have demonstrated that microRNAs (miRNAs) are critically important for the prediction, diagnosis, and characterization of diseases. However, identifying miRNA-disease associations through traditional biological experiments is both costly and time-consuming. To further explore these associations, we proposed a model based on hybrid high-order moments combined with element-level attention mechanisms (HHOMR). This model innovatively fused hybrid higher-order statistical information along with structural and community information. Specifically, we first constructed a heterogeneous graph based on existing associations between miRNAs and diseases. HHOMR employs a structural fusion layer to capture structure-level embeddings and leverages a hybrid high-order moments encoder layer to enhance features. Element-level attention mechanisms are then used to adaptively integrate the features of these hybrid moments. Finally, a multi-layer perceptron is utilized to calculate the association scores between miRNAs and diseases. Through five-fold cross-validation on HMDD v2.0, we achieved a mean AUC of 93.28%. Compared with four state-of-the-art models, HHOMR exhibited superior performance. Additionally, case studies on three diseases-esophageal neoplasms, lymphoma, and prostate neoplasms-were conducted. Among the top 50 miRNAs with high disease association scores, 46, 47, and 45 associated with these diseases were confirmed by the dbDEMC and miR2Disease databases, respectively. Our results demonstrate that HHOMR not only outperforms existing models but also shows significant potential in predicting miRNA-disease associations.
大量研究表明,微小 RNA(miRNAs)对于疾病的预测、诊断和特征描述至关重要。然而,通过传统的生物学实验来识别 miRNA-疾病的关联既昂贵又耗时。为了进一步探索这些关联,我们提出了一种基于混合高阶矩与元素级注意力机制的模型(HHOMR)。该模型创新性地融合了混合高阶统计信息以及结构和社区信息。具体来说,我们首先基于 miRNA 和疾病之间现有的关联构建了一个异质图。HHOMR 采用结构融合层来捕捉结构级别的嵌入,并利用混合高阶矩编码器层来增强特征。然后,使用元素级注意力机制自适应地整合这些混合矩的特征。最后,使用多层感知机计算 miRNA 和疾病之间的关联得分。在 HMDD v2.0 上进行五折交叉验证,我们实现了平均 AUC 为 93.28%。与四个最先进的模型相比,HHOMR 表现出了优越的性能。此外,我们对三种疾病(食管癌、淋巴瘤和前列腺癌)进行了案例研究。在与疾病关联得分较高的前 50 个 miRNA 中,有 46、47 和 45 个 miRNA 分别被 dbDEMC 和 miR2Disease 数据库所证实与这些疾病相关。我们的结果表明,HHOMR 不仅优于现有的模型,而且在预测 miRNA-疾病关联方面具有显著的潜力。