用于预测小分子与微小RNA关联的机器学习方法:全面综述
Machine learning approaches for predicting the small molecule-miRNA associations: a comprehensive review.
作者信息
Panghalia Ashish, Singh Vikram
机构信息
Centre for Computational Biology and Bioinformatics, School of Life Sciences, Central University of Himachal Pradesh, Kangra, 176215, India.
出版信息
Mol Divers. 2025 May 20. doi: 10.1007/s11030-025-11211-9.
MicroRNAs (miRNAs) are evolutionarily conserved small regulatory elements that are ubiquitous in cells and are found to be abnormally expressed during the onset and progression of several human diseases. miRNAs are increasingly recognized as potential diagnostic and therapeutic targets that could be inhibited by small molecules (SMs). The knowledge of SM-miRNA associations (SMAs) is sparse, mainly because of the dynamic and less predictable 3D structures of miRNAs that restrict the high-throughput screening of SMs. Toward augmenting the costly and laborious experiments determining the SM-miRNA interactions, machine learning (ML) has emerged as a cost-effective and efficient platform. In this article, various aspects associated with the ML-guided predictions of SMAs are thoroughly reviewed. Firstly, a detailed account of the SMA data resources useful for algorithms training is provided, followed by an elaboration of various feature extraction methods and similarity measures utilized on SMs and miRNAs. Subsequent to a summary of the ML algorithms basics and a brief description of the performance measures, an exhaustive census of all the 32 ML-based SMA prediction methods developed so far is outlined. Distinctive features of these methods have been described by classifying them into six broad categories, namely, classical ML, deep learning, matrix factorization, network propagation, graph learning, and ensemble learning methods. Trend analyses are performed to investigate the patterns in ML algorithms usage and performance achievement in SMA prediction. Outlining key principles behind the up-to-date methodologies and comparing their accomplishments, this review offers valuable insights into critical areas for future research in ML-based SMA prediction.
微小RNA(miRNA)是进化上保守的小调节元件,在细胞中普遍存在,并且发现在几种人类疾病的发生和发展过程中异常表达。miRNA越来越被认为是潜在的诊断和治疗靶点,可被小分子(SM)抑制。关于SM-miRNA关联(SMA)的知识很少,主要是因为miRNA的动态且较难预测的三维结构限制了对SM的高通量筛选。为了增加确定SM-miRNA相互作用的昂贵且费力的实验,机器学习(ML)已成为一个经济高效的平台。在本文中,对与基于ML的SMA预测相关的各个方面进行了全面综述。首先,详细介绍了对算法训练有用的SMA数据资源,随后阐述了在SM和miRNA上使用的各种特征提取方法和相似性度量。在总结ML算法基础并简要描述性能度量之后,概述了迄今为止开发的所有32种基于ML的SMA预测方法的详尽普查。通过将这些方法分为六大类,即经典ML、深度学习、矩阵分解、网络传播、图学习和集成学习方法,描述了这些方法的独特特征。进行趋势分析以研究ML算法在SMA预测中的使用模式和性能成就。本综述概述了最新方法背后的关键原则并比较了它们的成果,为基于ML的SMA预测未来研究的关键领域提供了有价值的见解。