• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RNALoc-LM:使用预训练RNA语言模型进行RNA亚细胞定位预测。

RNALoc-LM: RNA subcellular localization prediction using pre-trained RNA language model.

作者信息

Zeng Min, Zhang Xinyu, Li Yiming, Lu Chengqian, Yin Rui, Guo Fei, Li Min

机构信息

School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China.

School of Computer Science, Key Laboratory of Intelligent Computing and Information Processing, Xiangtan University, Xiangtan, Hunan 411105, China.

出版信息

Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf127.

DOI:10.1093/bioinformatics/btaf127
PMID:40119908
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11978386/
Abstract

MOTIVATION

Accurately predicting RNA subcellular localization is crucial for understanding the cellular functions and regulatory mechanisms of RNAs. Although many computational methods have been developed to predict the subcellular localization of lncRNAs, miRNAs, and circRNAs, very few of them are designed to simultaneously predict the subcellular localization of multiple types of RNAs. In addition, the emergence of pre-trained RNA language model has shown remarkable performance in various bioinformatics tasks, such as structure prediction and functional annotation. Despite these advancements, there remains a significant gap in applying pre-trained RNA language models specifically for predicting RNA subcellular localization.

RESULTS

In this study, we proposed RNALoc-LM, the first interpretable deep-learning framework that leverages a pre-trained RNA language model for predicting RNA subcellular localization. RNALoc-LM uses a pre-trained RNA language model to encode RNA sequences, then captures local patterns and long-range dependencies through TextCNN and BiLSTM modules. A multi-head attention mechanism is used to focus on important regions within the RNA sequences. The results demonstrate that RNALoc-LM significantly outperforms both deep-learning baselines and existing state-of-the-art predictors. Additionally, motif analysis highlights RNALoc-LM's potential for discovering important motifs, while an ablation study confirms the effectiveness of the RNA sequence embeddings generated by the pre-trained RNA language model.

AVAILABILITY AND IMPLEMENTATION

The RNALoc-LM web server is available at http://csuligroup.com:8000/RNALoc-LM. The source code can be obtained from https://github.com/CSUBioGroup/RNALoc-LM.

摘要

动机

准确预测RNA的亚细胞定位对于理解RNA的细胞功能和调控机制至关重要。尽管已经开发了许多计算方法来预测长链非编码RNA(lncRNA)、微小RNA(miRNA)和环状RNA(circRNA)的亚细胞定位,但其中很少有设计用于同时预测多种类型RNA亚细胞定位的方法。此外,预训练RNA语言模型的出现已在各种生物信息学任务中表现出卓越性能,如结构预测和功能注释。尽管有这些进展,但在专门应用预训练RNA语言模型来预测RNA亚细胞定位方面仍存在显著差距。

结果

在本研究中,我们提出了RNALoc-LM,这是首个利用预训练RNA语言模型来预测RNA亚细胞定位的可解释深度学习框架。RNALoc-LM使用预训练RNA语言模型对RNA序列进行编码,然后通过TextCNN和双向长短期记忆网络(BiLSTM)模块捕获局部模式和长程依赖性。多头注意力机制用于聚焦RNA序列中的重要区域。结果表明,RNALoc-LM显著优于深度学习基线模型和现有的最先进预测器。此外,基序分析突出了RNALoc-LM发现重要基序的潜力,而消融研究证实了预训练RNA语言模型生成的RNA序列嵌入的有效性。

可用性和实现

RNALoc-LM网络服务器可在http://csuligroup.com:8000/RNALoc-LM获取。源代码可从https://github.com/CSUBioGroup/RNALoc-LM获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/96e835bb7b43/btaf127f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/8bf9fa50f272/btaf127f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/2f324125b195/btaf127f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/2be92216aaba/btaf127f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/4868f4b9f8e4/btaf127f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/96e835bb7b43/btaf127f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/8bf9fa50f272/btaf127f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/2f324125b195/btaf127f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/2be92216aaba/btaf127f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/4868f4b9f8e4/btaf127f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3027/11978386/96e835bb7b43/btaf127f5.jpg

相似文献

1
RNALoc-LM: RNA subcellular localization prediction using pre-trained RNA language model.RNALoc-LM:使用预训练RNA语言模型进行RNA亚细胞定位预测。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf127.
2
LncLocFormer: a Transformer-based deep learning model for multi-label lncRNA subcellular localization prediction by using localization-specific attention mechanism.LncLocFormer:一种基于 Transformer 的深度学习模型,通过使用定位特异性注意力机制,对多标签 lncRNA 亚细胞定位进行预测。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad752.
3
Deep learning model for protein multi-label subcellular localization and function prediction based on multi-task collaborative training.基于多任务协同训练的蛋白质多标签亚细胞定位和功能预测深度学习模型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae568.
4
DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding.DeepLncLoc:一种基于子序列嵌入的深度学习框架,用于长非编码 RNA 亚细胞定位预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab360.
5
DeepLocRNA: an interpretable deep learning model for predicting RNA subcellular localization with domain-specific transfer-learning.DeepLocRNA:一种具有领域特定迁移学习功能的可解释深度学习模型,用于预测 RNA 亚细胞定位。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae065.
6
NN-RNALoc: Neural network-based model for prediction of mRNA sub-cellular localization using distance-based sub-sequence profiles.基于神经网络的模型,使用基于距离的子序列谱预测 mRNA 亚细胞定位。
PLoS One. 2023 Sep 14;18(9):e0258793. doi: 10.1371/journal.pone.0258793. eCollection 2023.
7
DeepLoc: prediction of protein subcellular localization using deep learning.DeepLoc:使用深度学习进行蛋白质亚细胞定位预测。
Bioinformatics. 2017 Nov 1;33(21):3387-3395. doi: 10.1093/bioinformatics/btx431.
8
EDCLoc: a prediction model for mRNA subcellular localization using improved focal loss to address multi-label class imbalance.EDCLoc:一种用于mRNA亚细胞定位的预测模型,使用改进的焦点损失来解决多标签类不平衡问题。
BMC Genomics. 2024 Dec 27;25(1):1252. doi: 10.1186/s12864-024-11173-6.
9
CellCircLoc: Deep Neural Network for Predicting and Explaining Cell Line-Specific CircRNA Subcellular Localization.CellCircLoc:用于预测和解释细胞系特异性环状RNA亚细胞定位的深度神经网络。
IEEE J Biomed Health Inform. 2025 Feb;29(2):1494-1503. doi: 10.1109/JBHI.2024.3491732. Epub 2025 Feb 10.
10
DeepCellEss: cell line-specific essential protein prediction with attention-based interpretable deep learning.DeepCellEss:基于注意力机制的可解释深度学习的细胞系特异性必需蛋白预测。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac779.

引用本文的文献

1
2OMe-LM: predicting 2'-O-methylation sites in human RNA using a pre-trained RNA language model.2OMe-LM:使用预训练的RNA语言模型预测人类RNA中的2'-O-甲基化位点
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf417.
2
Leveraging protein language models for cross-variant CRISPR/Cas9 sgRNA activity prediction.利用蛋白质语言模型进行跨变体CRISPR/Cas9 sgRNA活性预测。
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf385.

本文引用的文献

1
CellCircLoc: Deep Neural Network for Predicting and Explaining Cell Line-Specific CircRNA Subcellular Localization.CellCircLoc:用于预测和解释细胞系特异性环状RNA亚细胞定位的深度神经网络。
IEEE J Biomed Health Inform. 2025 Feb;29(2):1494-1503. doi: 10.1109/JBHI.2024.3491732. Epub 2025 Feb 10.
2
Transipedia.org: k-mer-based exploration of large RNA sequencing datasets and application to cancer data.Transipedia.org:基于 k-mer 的大型 RNA 测序数据集探索及其在癌症数据中的应用。
Genome Biol. 2024 Oct 10;25(1):266. doi: 10.1186/s13059-024-03413-5.
3
DeepLocRNA: an interpretable deep learning model for predicting RNA subcellular localization with domain-specific transfer-learning.
DeepLocRNA:一种具有领域特定迁移学习功能的可解释深度学习模型,用于预测 RNA 亚细胞定位。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae065.
4
LncLocFormer: a Transformer-based deep learning model for multi-label lncRNA subcellular localization prediction by using localization-specific attention mechanism.LncLocFormer:一种基于 Transformer 的深度学习模型,通过使用定位特异性注意力机制,对多标签 lncRNA 亚细胞定位进行预测。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad752.
5
RNA trafficking and subcellular localization-a review of mechanisms, experimental and predictive methodologies.RNA 运输和亚细胞定位——机制、实验和预测方法综述。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad249.
6
DAmiRLocGNet: miRNA subcellular localization prediction by combining miRNA-disease associations and graph convolutional networks.DAmiRLocGNet:通过结合 miRNA-疾病关联和图卷积网络进行 miRNA 亚细胞定位预测。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad212.
7
GraphLncLoc: long non-coding RNA subcellular localization prediction using graph convolutional networks based on sequence to graph transformation.GraphLncLoc:基于序列到图转换的图卷积网络预测长链非编码RNA亚细胞定位
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac565.
8
RNAlight: a machine learning model to identify nucleotide features determining RNA subcellular localization.RNA 亮氨酸:一种用于识别决定 RNA 亚细胞定位的核苷酸特征的机器学习模型。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac509.
9
iLoc-miRNA: extracellular/intracellular miRNA prediction using deep BiLSTM with attention mechanism.iLoc-miRNA:使用具有注意力机制的深度 BiLSTM 进行细胞外/细胞内 miRNA 预测。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac395.
10
Prediction and evaluation of combination pharmacotherapy using natural language processing, machine learning and patient electronic health records.使用自然语言处理、机器学习和患者电子健康记录预测和评估联合药物治疗。
J Biomed Inform. 2022 Sep;133:104164. doi: 10.1016/j.jbi.2022.104164. Epub 2022 Aug 17.