School of Biomedical Engineering, Guangdong Medical University, Dongguan, 523808, China.
Basic Teaching Department, Zhuhai Campus of Zunyi Medical University, Zhuhai, 519099, China.
BMC Genomics. 2024 Sep 20;25(1):885. doi: 10.1186/s12864-024-10729-w.
MicroRNAs (miRNAs) have been demonstrated to be closely related to human diseases. Studying the potential associations between miRNAs and diseases contributes to our understanding of disease pathogenic mechanisms. As traditional biological experiments are costly and time-consuming, computational models can be considered as effective complementary tools. In this study, we propose a novel model of robust orthogonal non-negative matrix tri-factorization (NMTF) with self-paced learning and dual hypergraph regularization, named SPLHRNMTF, to predict miRNA-disease associations. More specifically, SPLHRNMTF first uses a non-linear fusion method to obtain miRNA and disease comprehensive similarity. Subsequently, the improved miRNA-disease association matrix is reformulated based on weighted k-nearest neighbor profiles to correct false-negative associations. In addition, we utilize norm to replace Frobenius norm to calculate residual error, alleviating the impact of noise and outliers on prediction performance. Then, we integrate self-paced learning into NMTF to alleviate the model from falling into bad local optimal solutions by gradually including samples from easy to complex. Finally, hypergraph regularization is introduced to capture high-order complex relations from hypergraphs related to miRNAs and diseases. In 5-fold cross-validation five times experiments, SPLHRNMTF obtains higher average AUC values than other baseline models. Moreover, the case studies on breast neoplasms and lung neoplasms further demonstrate the accuracy of SPLHRNMTF. Meanwhile, the potential associations discovered are of biological significance.
微小 RNA(miRNA)与人类疾病密切相关。研究 miRNA 与疾病之间的潜在关联有助于我们了解疾病的发病机制。由于传统的生物学实验成本高、耗时长,因此计算模型可以被视为有效的补充工具。在这项研究中,我们提出了一种新的稳健正交非负矩阵三因子分解(NMTF)模型,该模型具有自步学习和双超图正则化,称为 SPLHRNMTF,用于预测 miRNA-疾病关联。具体来说,SPLHRNMTF 首先使用非线性融合方法获得 miRNA 和疾病的综合相似性。随后,基于加权 k-最近邻谱,对改进的 miRNA-疾病关联矩阵进行重新制定,以纠正假阴性关联。此外,我们利用范数代替 Frobenius 范数来计算残差,从而减轻噪声和异常值对预测性能的影响。然后,我们将自步学习集成到 NMTF 中,通过逐步包含从简单到复杂的样本,缓解模型陷入局部最优解的问题。最后,引入超图正则化来捕捉与 miRNA 和疾病相关的超图中的高阶复杂关系。在 5 次 5 折交叉验证实验中,SPLHRNMTF 获得了比其他基线模型更高的平均 AUC 值。此外,对乳腺肿瘤和肺肿瘤的案例研究进一步证明了 SPLHRNMTF 的准确性。同时,发现的潜在关联具有生物学意义。