• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。

RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.

机构信息

College of Computer Science and Technology, Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China.

出版信息

Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.

DOI:10.3390/ijms20051070
PMID:30832218
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6429152/
Abstract

Non-coding RNAs (ncRNAs) play crucial roles in multiple fundamental biological processes, such as post-transcriptional gene regulation, and are implicated in many complex human diseases. Mostly ncRNAs function by interacting with corresponding RNA-binding proteins. The research on ncRNA⁻protein interaction is the key to understanding the function of ncRNA. However, the biological experiment techniques for identifying RNA⁻protein interactions (RPIs) are currently still expensive and time-consuming. Due to the complex molecular mechanism of ncRNA⁻protein interaction and the lack of conservation for ncRNA, especially for long ncRNA (lncRNA), the prediction of ncRNA⁻protein interaction is still a challenge. Deep learning-based models have become the state-of-the-art in a range of biological sequence analysis problems due to their strong power of feature learning. In this study, we proposed a hierarchical deep learning framework RPITER to predict RNA⁻protein interaction. For sequence coding, we improved the conjoint triad feature (CTF) coding method by complementing more primary sequence information and adding sequence structure information. For model design, RPITER employed two basic neural network architectures of convolution neural network (CNN) and stacked auto-encoder (SAE). Comprehensive experiments were performed on five benchmark datasets from PDB and NPInter databases to analyze and compare the performances of different sequence coding methods and prediction models. We found that CNN and SAE deep learning architectures have powerful fitting abilities for the -mer features of RNA and protein sequence. The improved CTF coding method showed performance gain compared with the original CTF method. Moreover, our designed RPITER performed well in predicting RNA⁻protein interaction (RPI) and could outperform most of the previous methods. On five widely used RPI datasets, RPI369, RPI488, RPI1807, RPI2241 and NPInter, RPITER obtained A U C of 0.821, 0.911, 0.990, 0.957 and 0.985, respectively. The proposed RPITER could be a complementary method for predicting RPI and constructing RPI network, which would help push forward the related biological research on ncRNAs and lncRNAs.

摘要

非编码 RNA(ncRNA)在多种基本生物过程中发挥着关键作用,例如转录后基因调控,并与许多复杂的人类疾病有关。大多数 ncRNA 通过与相应的 RNA 结合蛋白相互作用发挥作用。ncRNA-蛋白质相互作用的研究是理解 ncRNA 功能的关键。然而,目前用于鉴定 RNA-蛋白质相互作用(RPI)的生物学实验技术仍然昂贵且耗时。由于 ncRNA-蛋白质相互作用的复杂分子机制以及 ncRNA,特别是长 ncRNA(lncRNA)缺乏保守性,因此 ncRNA-蛋白质相互作用的预测仍然是一个挑战。基于深度学习的模型由于其强大的特征学习能力,已成为一系列生物序列分析问题的最新技术。在这项研究中,我们提出了一个层次化的深度学习框架 RPITER 来预测 RNA-蛋白质相互作用。对于序列编码,我们通过补充更多的原始序列信息并添加序列结构信息,改进了联合三联体特征(CTF)编码方法。对于模型设计,RPITER 采用卷积神经网络(CNN)和堆叠自动编码器(SAE)两种基本神经网络架构。我们在来自 PDB 和 NPInter 数据库的五个基准数据集上进行了综合实验,以分析和比较不同序列编码方法和预测模型的性能。我们发现 CNN 和 SAE 深度学习架构对 RNA 和蛋白质序列的-mer 特征具有强大的拟合能力。改进的 CTF 编码方法与原始 CTF 方法相比表现出性能提升。此外,我们设计的 RPITER 在预测 RNA-蛋白质相互作用(RPI)方面表现出色,并且可以优于大多数先前的方法。在五个广泛使用的 RPI 数据集 RPI369、RPI488、RPI1807、RPI2241 和 NPInter 上,RPITER 分别获得了 0.821、0.911、0.990、0.957 和 0.985 的 AUC。所提出的 RPITER 可以作为预测 RPI 和构建 RPI 网络的补充方法,这将有助于推动有关 ncRNA 和 lncRNA 的相关生物学研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/9d3d91650be9/ijms-20-01070-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/566f290d406f/ijms-20-01070-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/6a81eafff837/ijms-20-01070-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/2c94f22e86cb/ijms-20-01070-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/906d9a71743b/ijms-20-01070-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/9d3d91650be9/ijms-20-01070-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/566f290d406f/ijms-20-01070-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/6a81eafff837/ijms-20-01070-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/2c94f22e86cb/ijms-20-01070-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/906d9a71743b/ijms-20-01070-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e4c/6429152/9d3d91650be9/ijms-20-01070-g005.jpg

相似文献

1
RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。
Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.
2
RPI-EDLCN: An Ensemble Deep Learning Framework Based on Capsule Network for ncRNA-Protein Interaction Prediction.RPI-EDLCN:一种基于胶囊网络的 ncRNA-蛋白质相互作用预测的集成深度学习框架。
J Chem Inf Model. 2024 Apr 8;64(7):2221-2235. doi: 10.1021/acs.jcim.3c00377. Epub 2023 May 9.
3
A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information.一种利用进化信息对非编码RNA-蛋白质相互作用进行稳健且准确预测的深度学习框架。
Mol Ther Nucleic Acids. 2018 Jun 1;11:337-344. doi: 10.1016/j.omtn.2018.03.001. Epub 2018 Mar 9.
4
EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.EDLMFC:一种具有多尺度特征组合的集成深度学习框架,用于 ncRNA-蛋白质相互作用预测。
BMC Bioinformatics. 2021 Mar 19;22(1):133. doi: 10.1186/s12859-021-04069-9.
5
RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information.RPI-SE:一种使用序列信息进行 ncRNA-蛋白质相互作用预测的堆叠集成学习框架。
BMC Bioinformatics. 2020 Feb 18;21(1):60. doi: 10.1186/s12859-020-3406-0.
6
DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy.DM-RPIs:基于堆叠集成策略的 ncRNA-蛋白质相互作用预测
Comput Biol Chem. 2019 Dec;83:107088. doi: 10.1016/j.compbiolchem.2019.107088. Epub 2019 Jul 6.
7
Prediction of RNA-protein interactions by combining deep convolutional neural network with feature selection ensemble method.通过结合深度卷积神经网络和特征选择集成方法预测 RNA-蛋白质相互作用。
J Theor Biol. 2019 Jan 14;461:230-238. doi: 10.1016/j.jtbi.2018.10.029. Epub 2018 Oct 12.
8
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
9
Combining High Speed ELM Learning with a Deep Convolutional Neural Network Feature Encoding for Predicting Protein-RNA Interactions.将高速 ELM 学习与深度卷积神经网络特征编码相结合,用于预测蛋白质-RNA 相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):972-980. doi: 10.1109/TCBB.2018.2874267. Epub 2018 Oct 5.
10
Predicting lncRNA-protein interactions through deep learning framework employing multiple features and random forest algorithm.通过深度学习框架利用多种特征和随机森林算法预测 lncRNA-蛋白质相互作用。
BMC Bioinformatics. 2024 Mar 12;25(1):108. doi: 10.1186/s12859-024-05727-4.

引用本文的文献

1
RPIPLM: Prediction of ncRNA-protein interaction by post-training a dual-tower pretrained biological model with supervised contrastive learning.RPIPLM:通过使用监督对比学习对双塔预训练生物模型进行训练后预测非编码RNA与蛋白质的相互作用
PLoS One. 2025 Aug 14;20(8):e0329174. doi: 10.1371/journal.pone.0329174. eCollection 2025.
2
LPItabformer: Enhancing generalization in predicting lncRNA-protein interactions via a tabular Transformer.LPItabformer:通过表格Transformer增强lncRNA-蛋白质相互作用预测中的泛化能力。
Comput Struct Biotechnol J. 2025 May 29;27:2323-2335. doi: 10.1016/j.csbj.2025.05.050. eCollection 2025.
3

本文引用的文献

1
DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.深度绑定:增强对DNA结合蛋白序列特异性的预测
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2016 Dec;2016:178-183. doi: 10.1109/bibm.2016.7822515. Epub 2017 Jan 19.
2
MicroRNA-small molecule association identification: from experimental results to computational models.微小RNA-小分子关联识别:从实验结果到计算模型
Brief Bioinform. 2020 Jan 17;21(1):47-61. doi: 10.1093/bib/bby098.
3
Computational models for lncRNA function prediction and functional similarity calculation.
NPI-HetGNN: A Prediction Model of ncRNA-Protein Interactions Based on Heterogeneous Graph Neural Networks.
NPI-HetGNN:一种基于异构图神经网络的非编码RNA-蛋白质相互作用预测模型。
Interdiscip Sci. 2025 Jun 2. doi: 10.1007/s12539-025-00716-4.
4
Predicting lncRNA-protein interactions using a hybrid deep learning model with dinucleotide-codon fusion feature encoding.使用具有二核苷酸-密码子融合特征编码的混合深度学习模型预测长链非编码RNA-蛋白质相互作用。
BMC Genomics. 2024 Dec 28;25(1):1253. doi: 10.1186/s12864-024-11168-3.
5
BioPrediction-RPI: Democratizing the prediction of interaction between non-coding RNA and protein with end-to-end machine learning.生物预测-RPI:通过端到端机器学习实现非编码RNA与蛋白质相互作用预测的普及。
Comput Struct Biotechnol J. 2024 May 22;23:2267-2276. doi: 10.1016/j.csbj.2024.05.031. eCollection 2024 Dec.
6
Predicting lncRNA-protein interactions through deep learning framework employing multiple features and random forest algorithm.通过深度学习框架利用多种特征和随机森林算法预测 lncRNA-蛋白质相互作用。
BMC Bioinformatics. 2024 Mar 12;25(1):108. doi: 10.1186/s12859-024-05727-4.
7
A task-specific encoding algorithm for RNAs and RNA-associated interactions based on convolutional autoencoder.基于卷积自动编码器的 RNA 及其相关相互作用的特定任务编码算法。
Nucleic Acids Res. 2023 Nov 27;51(21):e110. doi: 10.1093/nar/gkad929.
8
Exploring the landscape of tools and resources for the analysis of long non-coding RNAs.探索用于长链非编码RNA分析的工具和资源全景。
Comput Struct Biotechnol J. 2023 Sep 29;21:4706-4716. doi: 10.1016/j.csbj.2023.09.041. eCollection 2023.
9
DAPTEV: Deep aptamer evolutionary modelling for COVID-19 drug design.DAPTEV:用于 COVID-19 药物设计的深度适体进化建模。
PLoS Comput Biol. 2023 Jul 5;19(7):e1010774. doi: 10.1371/journal.pcbi.1010774. eCollection 2023 Jul.
10
DCiPatho: deep cross-fusion networks for genome scale identification of pathogens.DCiPatho:用于大规模病原体基因组识别的深度交叉融合网络。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad194.
用于 lncRNA 功能预测和功能相似性计算的计算模型。
Brief Funct Genomics. 2019 Feb 14;18(1):58-82. doi: 10.1093/bfgp/ely031.
4
MicroRNAs and complex diseases: from experimental results to computational models.微小 RNA 与复杂疾病:从实验结果到计算模型。
Brief Bioinform. 2019 Mar 22;20(2):515-539. doi: 10.1093/bib/bbx130.
5
An introduction to deep learning on biological sequence data: examples and solutions.深度学习在生物序列数据上的应用:实例与解决方案。
Bioinformatics. 2017 Nov 15;33(22):3685-3690. doi: 10.1093/bioinformatics/btx531.
6
TITER: predicting translation initiation sites by deep learning.TITER:通过深度学习预测翻译起始位点。
Bioinformatics. 2017 Jul 15;33(14):i234-i242. doi: 10.1093/bioinformatics/btx247.
7
Chromatin accessibility prediction via convolutional long short-term memory networks with k-mer embedding.基于 k- -mer 嵌入卷积长短期记忆网络的染色质可及性预测。
Bioinformatics. 2017 Jul 15;33(14):i92-i101. doi: 10.1093/bioinformatics/btx234.
8
Sequence-based prediction of protein protein interaction using a deep-learning algorithm.使用深度学习算法基于序列预测蛋白质-蛋白质相互作用
BMC Bioinformatics. 2017 May 25;18(1):277. doi: 10.1186/s12859-017-1700-2.
9
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
10
Long non-coding RNAs and complex diseases: from experimental results to computational models.长链非编码RNA与复杂疾病:从实验结果到计算模型
Brief Bioinform. 2017 Jul 1;18(4):558-576. doi: 10.1093/bib/bbw060.