Bai Tao, Xie Junxi, Liu Yumeng, Liu Bin
IEEE J Biomed Health Inform. 2024 Oct 24;PP. doi: 10.1109/JBHI.2024.3483997.
Identifying subcellular localization of microRNAs (miRNAs) is essential for comprehensive understanding of cellular function and has significant implications for drug design. In the past, several computational methods for miRNA subcellular localization is being used for uncovering multiple facets of RNA function to facilitate the biological applications. Unfortunately, most existing classification methods rely on a single sequencebased view, making the effective fusion of data from multiple heterogeneous networks a primary challenge. Inspired by multi-view multi-label learning strategy, we propose a computational method, named MMLmiRLocNet, for predicting the subcellular localizations of miRNAs. The MMLmiRLocNet predictor extracts multi-perspective sequence representations by analyzing lexical, syntactic, and semantic aspects of biological sequences. Specifically, it integrates lexical attributes derived from k-mer physicochemical profiles, syntactic characteristics obtained via word2vec embeddings, and semantic representations generated by pre-trained feature embeddings. Finally, module for extracting multi-view consensus-level features and specific-level features was constructed to capture consensus and specific features from various perspectives. The full connection networks are utilized as the output module to predict the miRNA subcellular localization. Experimental results suggest that MMLmiRLocNet outperforms existing methods in terms of F1, subACC, and Accuracy, and achieves best performance with the help of multi-view consensus features and specific features extract network.
识别微小RNA(miRNA)的亚细胞定位对于全面理解细胞功能至关重要,并且对药物设计具有重要意义。过去,几种用于miRNA亚细胞定位的计算方法被用于揭示RNA功能的多个方面,以促进生物学应用。不幸的是,大多数现有的分类方法依赖于基于单一序列的观点,这使得有效地融合来自多个异构网络的数据成为一个主要挑战。受多视图多标签学习策略的启发,我们提出了一种名为MMLmiRLocNet的计算方法,用于预测miRNA的亚细胞定位。MMLmiRLocNet预测器通过分析生物序列的词汇、句法和语义方面来提取多视角序列表示。具体来说,它整合了从k-mer理化特征导出的词汇属性、通过word2vec嵌入获得的句法特征以及由预训练特征嵌入生成的语义表示。最后,构建了用于提取多视图共识级特征和特定级特征的模块,以从各种角度捕获共识特征和特定特征。全连接网络被用作输出模块来预测miRNA的亚细胞定位。实验结果表明,MMLmiRLocNet在F1、子ACC和准确率方面优于现有方法,并借助多视图共识特征和特定特征提取网络实现了最佳性能。