Academy of Arts and Design, Tsinghua University, Beijing, 10084, China.
The Future Laboratory, Tsinghua University, Beijing, 10084, China.
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac358.
Since the problem proposed in late 2000s, microRNA-disease association (MDA) predictions have been implemented based on the data fusion paradigm. Integrating diverse data sources gains a more comprehensive research perspective, and brings a challenge to algorithm design for generating accurate, concise and consistent representations of the fused data. After more than a decade of research progress, a relatively simple algorithm like the score function or a single computation layer may no longer be sufficient for further improving predictive performance. Advanced model design has become more frequent in recent years, particularly in the form of reasonably combing multiple algorithms, a process known as model fusion. In the current review, we present 29 state-of-the-art models and introduce the taxonomy of computational models for MDA prediction based on model fusion and non-fusion. The new taxonomy exhibits notable changes in the algorithmic architecture of models, compared with that of earlier ones in the 2017 review by Chen et al. Moreover, we discuss the progresses that have been made towards overcoming the obstacles to effective MDA prediction since 2017 and elaborated on how future models can be designed according to a set of new schemas. Lastly, we analysed the strengths and weaknesses of each model category in the proposed taxonomy and proposed future research directions from diverse perspectives for enhancing model performance.
自 21 世纪 10 年代末提出问题以来,基于数据融合范例已经实现了 microRNA-疾病关联 (MDA) 的预测。整合不同的数据源可以获得更全面的研究视角,这给生成融合数据的准确、简洁和一致表示的算法设计带来了挑战。经过十多年的研究进展,像评分函数或单个计算层这样相对简单的算法可能不再足以进一步提高预测性能。近年来,高级模型设计变得更加频繁,特别是以合理结合多种算法的形式,这一过程被称为模型融合。在当前的综述中,我们展示了 29 个最新模型,并根据模型融合和非融合介绍了 MDA 预测的计算模型分类法。与 2017 年 Chen 等人的综述相比,新的分类法在模型的算法结构上发生了显著变化。此外,我们讨论了自 2017 年以来在克服 MDA 有效预测障碍方面取得的进展,并详细阐述了如何根据一组新方案设计未来的模型。最后,我们分析了所提出分类法中每个模型类别的优缺点,并从不同角度提出了增强模型性能的未来研究方向。