Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan, 411105, China.
School of Computer Science, Xiangtan University, Xiangtan, 411105, China.
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac104.
Increasing evidences show that the occurrence of human complex diseases is closely related to microRNA (miRNA) variation and imbalance. For this reason, predicting disease-related miRNAs is essential for the diagnosis and treatment of complex human diseases. Although some current computational methods can effectively predict potential disease-related miRNAs, the accuracy of prediction should be further improved. In our study, a new computational method via deep forest ensemble learning based on autoencoder (DFELMDA) is proposed to predict miRNA-disease associations. Specifically, a new feature representation strategy is proposed to obtain different types of feature representations (from miRNA and disease) for each miRNA-disease association. Then, two types of low-dimensional feature representations are extracted by two deep autoencoders for predicting miRNA-disease associations. Finally, two prediction scores of the miRNA-disease associations are obtained by the deep random forest and combined to determine the final results. DFELMDA is compared with several classical methods on the The Human microRNA Disease Database (HMDD) dataset. Results reveal that the performance of this method is superior. The area under receiver operating characteristic curve (AUC) values obtained by DFELMDA through 5-fold and 10-fold cross-validation are 0.9552 and 0.9560, respectively. In addition, case studies on colon, breast and lung tumors of different disease types further demonstrate the excellent ability of DFELMDA to predict disease-associated miRNA-disease. Performance analysis shows that DFELMDA can be used as an effective computational tool for predicting miRNA-disease associations.
越来越多的证据表明,人类复杂疾病的发生与 microRNA(miRNA)的变异和失衡密切相关。因此,预测与疾病相关的 miRNA 对于复杂人类疾病的诊断和治疗至关重要。尽管目前一些计算方法可以有效地预测潜在的疾病相关 miRNA,但预测的准确性仍需进一步提高。在我们的研究中,提出了一种基于自动编码器的深度森林集成学习新方法(DFELMDA)来预测 miRNA-疾病关联。具体来说,提出了一种新的特征表示策略,为每个 miRNA-疾病关联获得不同类型的特征表示(来自 miRNA 和疾病)。然后,通过两个深度自动编码器提取两种低维特征表示,用于预测 miRNA-疾病关联。最后,通过深度随机森林获得 miRNA-疾病关联的两个预测分数,并结合起来确定最终结果。在 HMDD 数据集上,将 DFELMDA 与几种经典方法进行了比较。结果表明,该方法的性能更优。通过 5 倍和 10 倍交叉验证,DFELMDA 的接收器工作特征曲线下面积(AUC)值分别为 0.9552 和 0.9560。此外,对不同疾病类型的结肠、乳腺和肺肿瘤的案例研究进一步证明了 DFELMDA 预测疾病相关 miRNA-疾病的优异能力。性能分析表明,DFELMDA 可以作为预测 miRNA-疾病关联的有效计算工具。