• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。

Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.

机构信息

Department of Medical Informatics, Erasmus Medical Center, Rotterdam, The Netherlands.

Institute of Software Engineering, East China Normal University, Shanghai, China.

出版信息

BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.

DOI:10.1186/s12864-018-4889-1
PMID:29970003
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6029131/
Abstract

BACKGROUND

RNA regulation is significantly dependent on its binding protein partner, known as the RNA-binding proteins (RBPs). Unfortunately, the binding preferences for most RBPs are still not well characterized. Interdependencies between sequence and secondary structure specificities is challenging for both predicting RBP binding sites and accurate sequence and structure motifs detection.

RESULTS

In this study, we propose a deep learning-based method, iDeepS, to simultaneously identify the binding sequence and structure motifs from RNA sequences using convolutional neural networks (CNNs) and a bidirectional long short term memory network (BLSTM). We first perform one-hot encoding for both the sequence and predicted secondary structure, to enable subsequent convolution operations. To reveal the hidden binding knowledge from the observed sequences, the CNNs are applied to learn the abstract features. Considering the close relationship between sequence and predicted structures, we use the BLSTM to capture possible long range dependencies between binding sequence and structure motifs identified by the CNNs. Finally, the learned weighted representations are fed into a classification layer to predict the RBP binding sites. We evaluated iDeepS on verified RBP binding sites derived from large-scale representative CLIP-seq datasets. The results demonstrate that iDeepS can reliably predict the RBP binding sites on RNAs, and outperforms the state-of-the-art methods. An important advantage compared to other methods is that iDeepS can automatically extract both binding sequence and structure motifs, which will improve our understanding of the mechanisms of binding specificities of RBPs.

CONCLUSION

Our study shows that the iDeepS method identifies the sequence and structure motifs to accurately predict RBP binding sites. iDeepS is available at https://github.com/xypan1232/iDeepS .

摘要

背景

RNA 的调控显著依赖于与其结合的蛋白质伴侣,即 RNA 结合蛋白(RBPs)。不幸的是,大多数 RBPs 的结合偏好仍未得到很好的描述。序列和二级结构特异性之间的相互依赖关系给预测 RBP 结合位点和准确检测序列和结构基序带来了挑战。

结果

在这项研究中,我们提出了一种基于深度学习的方法 iDeepS,该方法使用卷积神经网络(CNN)和双向长短期记忆网络(BLSTM),从 RNA 序列中同时识别结合序列和结构基序。我们首先对序列和预测的二级结构进行独热编码,以进行后续的卷积操作。为了从观察到的序列中揭示隐藏的结合知识,CNN 用于学习抽象特征。考虑到序列和预测结构之间的密切关系,我们使用 BLSTM 来捕获由 CNN 识别的结合序列和结构基序之间可能的长程依赖关系。最后,学习到的加权表示被馈送到分类层以预测 RBP 结合位点。我们在来自大规模代表性 CLIP-seq 数据集的已验证 RBP 结合位点上评估了 iDeepS。结果表明,iDeepS 可以可靠地预测 RNA 上的 RBP 结合位点,并优于最先进的方法。与其他方法相比,iDeepS 的一个重要优势是它可以自动提取结合序列和结构基序,这将提高我们对 RBPs 结合特异性机制的理解。

结论

我们的研究表明,iDeepS 方法可以识别序列和结构基序,从而准确预测 RBP 结合位点。iDeepS 可在 https://github.com/xypan1232/iDeepS 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/16b18b75baf6/12864_2018_4889_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/799771e97cc4/12864_2018_4889_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/2c8d4bd1bf9a/12864_2018_4889_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/06909a7a12db/12864_2018_4889_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/77a24d53ecb7/12864_2018_4889_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/16b18b75baf6/12864_2018_4889_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/799771e97cc4/12864_2018_4889_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/2c8d4bd1bf9a/12864_2018_4889_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/06909a7a12db/12864_2018_4889_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/77a24d53ecb7/12864_2018_4889_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d2c5/6029131/16b18b75baf6/12864_2018_4889_Fig5_HTML.jpg

相似文献

1
Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。
BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.
2
Prediction of the RBP binding sites on lncRNAs using the high-order nucleotide encoding convolutional neural network.使用高阶核苷酸编码卷积神经网络预测长链非编码RNA上的RBP结合位点
Anal Biochem. 2019 Oct 15;583:113364. doi: 10.1016/j.ab.2019.113364. Epub 2019 Jul 16.
3
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
4
Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks.通过结合局部和全局深度卷积神经网络预测 RNA 与蛋白质的结合位点和基序。
Bioinformatics. 2018 Oct 15;34(20):3427-3436. doi: 10.1093/bioinformatics/bty364.
5
Deep neural networks for inferring binding sites of RNA-binding proteins by using distributed representations of RNA primary sequence and secondary structure.利用 RNA 一级序列和二级结构的分布式表示来推断 RNA 结合蛋白结合位点的深度神经网络。
BMC Genomics. 2020 Dec 17;21(Suppl 13):866. doi: 10.1186/s12864-020-07239-w.
6
DeepA-RBPBS: A hybrid convolution and recurrent neural network combined with attention mechanism for predicting RBP binding site.DeepA-RBPBS:一种结合注意力机制的卷积和循环神经网络混合模型,用于预测 RBP 结合位点。
J Biomol Struct Dyn. 2022 Jun;40(9):4250-4258. doi: 10.1080/07391102.2020.1854861. Epub 2020 Dec 4.
7
Predicting RBP Binding Sites of RNA With High-Order Encoding Features and CNN-BLSTM Hybrid Model.基于高阶编码特征和 CNN-BLSTM 混合模型预测 RNA 的 RBP 结合位点。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2409-2419. doi: 10.1109/TCBB.2021.3083930. Epub 2022 Aug 8.
8
DeepPN: a deep parallel neural network based on convolutional neural network and graph convolutional network for predicting RNA-protein binding sites.DeepPN:一种基于卷积神经网络和图卷积网络的深度并行神经网络,用于预测 RNA-蛋白质结合位点。
BMC Bioinformatics. 2022 Jun 29;23(1):257. doi: 10.1186/s12859-022-04798-5.
9
Prediction of binding property of RNA-binding proteins using multi-sized filters and multi-modal deep convolutional neural network.基于多尺寸滤波器和多模态深度卷积神经网络的 RNA 结合蛋白结合特性预测。
PLoS One. 2019 Apr 26;14(4):e0216257. doi: 10.1371/journal.pone.0216257. eCollection 2019.
10
A deep learning framework for modeling structural features of RNA-binding protein targets.一种用于对RNA结合蛋白靶点的结构特征进行建模的深度学习框架。
Nucleic Acids Res. 2016 Feb 29;44(4):e32. doi: 10.1093/nar/gkv1025. Epub 2015 Oct 13.

引用本文的文献

1
Deep indel mutagenesis reveals the regulatory and modulatory architecture of alternative exon splicing.深度插入缺失诱变揭示了可变外显子剪接的调控和调节结构。
Nat Commun. 2025 Aug 30;16(1):8117. doi: 10.1038/s41467-025-62957-7.
2
Base-resolution binding profile prediction of proteins on RNAs with deep learning.利用深度学习预测蛋白质在RNA上的碱基分辨率结合图谱。
Nucleic Acids Res. 2025 Jul 19;53(14). doi: 10.1093/nar/gkaf748.
3
Old players and new insights: unraveling the role of RNA-binding proteins in brain tumors.老玩家与新见解:解析RNA结合蛋白在脑肿瘤中的作用

本文引用的文献

1
DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.深度绑定:增强对DNA结合蛋白序列特异性的预测
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2016 Dec;2016:178-183. doi: 10.1109/bibm.2016.7822515. Epub 2017 Jan 19.
2
Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks.通过结合局部和全局深度卷积神经网络预测 RNA 与蛋白质的结合位点和基序。
Bioinformatics. 2018 Oct 15;34(20):3427-3436. doi: 10.1093/bioinformatics/bty364.
3
Recognizing RNA structural motifs in HT-SELEX data for ribosomal protein S15.
Theranostics. 2025 Apr 13;15(11):5238-5257. doi: 10.7150/thno.113312. eCollection 2025.
4
Unravelling the conundrum of nucleolar NR2F1 localization using antibody-based approaches in vitro and in vivo.运用基于抗体的方法在体外和体内解析核仁中NR2F1定位的难题。
Commun Biol. 2025 Apr 10;8(1):594. doi: 10.1038/s42003-025-07985-1.
5
RBPsuite 2.0: an updated RNA-protein binding site prediction suite with high coverage on species and proteins based on deep learning.RBPsuite 2.0:一个基于深度学习的、在物种和蛋白质上具有高覆盖率的更新版RNA-蛋白质结合位点预测套件。
BMC Biol. 2025 Mar 11;23(1):74. doi: 10.1186/s12915-025-02182-2.
6
EnrichRBP: an automated and interpretable computational platform for predicting and analysing RNA-binding protein events.EnrichRBP:一个用于预测和分析RNA结合蛋白事件的自动化且可解释的计算平台。
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btaf018.
7
DeepMiRBP: a hybrid model for predicting microRNA-protein interactions based on transfer learning and cosine similarity.DeepMiRBP:一种基于迁移学习和余弦相似度预测微小RNA-蛋白质相互作用的混合模型。
BMC Bioinformatics. 2024 Dec 18;25(1):381. doi: 10.1186/s12859-024-05985-2.
8
AptaDiff: de novo design and optimization of aptamers based on diffusion models.AptaDiff:基于扩散模型的适体从头设计与优化。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae517.
9
RNAelem: an algorithm for discovering sequence-structure motifs in RNA bound by RNA-binding proteins.RNAelem:一种用于发现RNA结合蛋白所结合RNA中的序列-结构基序的算法。
Bioinform Adv. 2024 Sep 28;4(1):vbae144. doi: 10.1093/bioadv/vbae144. eCollection 2024.
10
An effective deep learning-based approach for splice site identification in gene expression.一种有效的基于深度学习的基因表达剪接位点识别方法。
Sci Prog. 2024 Jul-Sep;107(3):368504241266588. doi: 10.1177/00368504241266588.
识别核糖体蛋白S15的HT-SELEX数据中的RNA结构基序。
BMC Bioinformatics. 2017 Jun 6;18(1):298. doi: 10.1186/s12859-017-1704-y.
4
A Compendium of RNA-Binding Proteins that Regulate MicroRNA Biogenesis.RNA 结合蛋白在 microRNA 生物发生中的调控作用概览。
Mol Cell. 2017 Apr 20;66(2):270-284.e13. doi: 10.1016/j.molcel.2017.03.014.
5
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
6
A novel method for the identification of conserved structural patterns in RNA: From small scale to high-throughput applications.一种用于识别RNA中保守结构模式的新方法:从小规模到高通量应用。
Nucleic Acids Res. 2016 Oct 14;44(18):8600-8609. doi: 10.1093/nar/gkw750. Epub 2016 Aug 31.
7
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
8
Convolutional neural network architectures for predicting DNA-protein binding.用于预测DNA-蛋白质结合的卷积神经网络架构。
Bioinformatics. 2016 Jun 15;32(12):i121-i127. doi: 10.1093/bioinformatics/btw255.
9
Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks.巴塞特:利用深度卷积神经网络学习可及基因组的调控密码。
Genome Res. 2016 Jul;26(7):990-9. doi: 10.1101/gr.200535.115. Epub 2016 May 3.
10
DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.DanQ:一种用于量化DNA序列功能的卷积与循环相结合的深度神经网络。
Nucleic Acids Res. 2016 Jun 20;44(11):e107. doi: 10.1093/nar/gkw226. Epub 2016 Apr 15.