IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):763-772. doi: 10.1109/TCBB.2020.3027444. Epub 2022 Apr 1.
Numerous studies have shown that microRNAs are associated with the occurrence and development of human diseases. Thus, studying disease-associated miRNAs is significantly valuable to the prevention, diagnosis and treatment of diseases. In this paper, we proposed a novel method based on matrix completion and non-negative matrix factorization (MCNMF)for predicting disease-associated miRNAs. Due to the information inadequacy on miRNA similarities and disease similarities, we calculated the latter via two models, and introduced the Gaussian interaction profile kernel similarity. In addition, the matrix completion (MC)was employed to further replenish the miRNA and disease similarities to improve the prediction performance. And to reduce the sparsity of miRNA-disease association matrix, the method of weighted K nearest neighbor (WKNKN)was used, which is a pre-processing step. We also utilized non-negative matrix factorization (NMF)using dual L-norm, graph Laplacian regularization, and Tikhonov regularization to effectively avoid the overfitting during the prediction. Finally, several experiments and a case study were implemented to evaluate the effectiveness and performance of the proposed MCNMF model. The results indicated that our method could reliably and effectively predict disease-associated miRNAs.
大量研究表明,microRNAs 与人类疾病的发生和发展有关。因此,研究与疾病相关的 microRNAs 对于疾病的预防、诊断和治疗具有重要意义。在本文中,我们提出了一种基于矩阵补全和非负矩阵分解(MCNMF)的新方法,用于预测与疾病相关的 microRNAs。由于 miRNA 相似性和疾病相似性的信息不足,我们通过两种模型计算了后者,并引入了高斯相互作用谱核相似性。此外,我们还采用矩阵补全(MC)进一步补充 miRNA 和疾病的相似性,以提高预测性能。为了减少 miRNA-疾病关联矩阵的稀疏性,我们使用了加权 K 最近邻(WKNKN)方法,这是一个预处理步骤。我们还利用双 L 范数、图拉普拉斯正则化和 Tikhonov 正则化的非负矩阵分解(NMF),有效地避免了预测过程中的过拟合。最后,我们进行了几项实验和案例研究,以评估所提出的 MCNMF 模型的有效性和性能。结果表明,我们的方法可以可靠有效地预测与疾病相关的 microRNAs。