• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于对RNA结合蛋白靶点的结构特征进行建模的深度学习框架。

A deep learning framework for modeling structural features of RNA-binding protein targets.

作者信息

Zhang Sai, Zhou Jingtian, Hu Hailin, Gong Haipeng, Chen Ligong, Cheng Chao, Zeng Jianyang

机构信息

Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China.

Department of Pharmacology and Pharmaceutical Sciences, School of Medicine, Tsinghua University, Beijing 100084, China.

出版信息

Nucleic Acids Res. 2016 Feb 29;44(4):e32. doi: 10.1093/nar/gkv1025. Epub 2015 Oct 13.

DOI:10.1093/nar/gkv1025
PMID:26467480
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4770198/
Abstract

RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.

摘要

RNA结合蛋白(RBPs)在RNA的转录后调控中发挥着重要作用。识别RBP结合位点并表征RBP结合偏好是理解转录后基因调控基本机制的关键步骤。尽管已经开发了许多计算方法来模拟RBP结合偏好,但通过整合其在所有三个维度上的可用结构特征来发现RBP靶标的完整结构表示仍然是一项具有挑战性的任务。在本文中,我们开发了一个通用且灵活的深度学习框架,用于模拟结构结合偏好并预测RBPs的结合位点,该框架首次考虑了(预测的)RNA三级结构信息。我们的框架构建了一个统一的表示,该表示在所有三个维度上表征了RBP靶标的结构特异性,可进一步用于预测新的候选结合位点并发现潜在的结合基序。通过对真实的CLIP-seq数据集进行测试,我们证明了我们的深度学习框架可以自动从编码的原始序列和结构概况中提取有效的隐藏结构特征,并预测准确的RBP结合位点。此外,我们进行了首次研究,表明整合额外的RNA三级结构特征可以提高预测RBP结合位点的模型性能,特别是对于多嘧啶序列结合蛋白(PTB),这也为支持RBPs可能具有特定三级结构结合偏好的观点提供了新的证据。特别是,对内部核糖体进入位点(IRES)片段的测试在文献的实验支持下产生了令人满意的结果,并进一步证明了将RNA三级结构信息纳入预测模型的必要性。我们方法的源代码可在https://github.com/thucombio/deepnet-rbp上找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/4eb45f7fa58d/gkv1025fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/c0945b83181c/gkv1025fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/a7b27f57dddc/gkv1025fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/bd1a8e4916d8/gkv1025fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/3be475aa41b3/gkv1025fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/9a4d664a9d5d/gkv1025fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/175410e7f529/gkv1025fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/4eb45f7fa58d/gkv1025fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/c0945b83181c/gkv1025fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/a7b27f57dddc/gkv1025fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/bd1a8e4916d8/gkv1025fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/3be475aa41b3/gkv1025fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/9a4d664a9d5d/gkv1025fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/175410e7f529/gkv1025fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d5e9/4770198/4eb45f7fa58d/gkv1025fig7.jpg

相似文献

1
A deep learning framework for modeling structural features of RNA-binding protein targets.一种用于对RNA结合蛋白靶点的结构特征进行建模的深度学习框架。
Nucleic Acids Res. 2016 Feb 29;44(4):e32. doi: 10.1093/nar/gkv1025. Epub 2015 Oct 13.
2
Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。
BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.
3
A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data.一种基于深度增强学习的方法,用于从高通量CLIP-seq数据中捕获RNA结合蛋白的序列结合偏好。
Nucleic Acids Res. 2017 Aug 21;45(14):e129. doi: 10.1093/nar/gkx492.
4
A combined sequence and structure based method for discovering enriched motifs in RNA from in vivo binding data.一种基于序列和结构相结合的方法,用于从体内结合数据中发现RNA中富集的基序。
Methods. 2017 Apr 15;118-119:73-81. doi: 10.1016/j.ymeth.2017.03.003. Epub 2017 Mar 6.
5
Inferring RNA sequence preferences for poorly studied RNA-binding proteins based on co-evolution.基于共进化推断研究较少的 RNA 结合蛋白的 RNA 序列偏好。
BMC Bioinformatics. 2018 Mar 12;19(1):96. doi: 10.1186/s12859-018-2091-8.
6
RNAcontext: a new method for learning the sequence and structure binding preferences of RNA-binding proteins.RNAcontext:一种学习 RNA 结合蛋白序列和结构结合偏好的新方法。
PLoS Comput Biol. 2010 Jul 1;6(7):e1000832. doi: 10.1371/journal.pcbi.1000832.
7
Prediction of binding property of RNA-binding proteins using multi-sized filters and multi-modal deep convolutional neural network.基于多尺寸滤波器和多模态深度卷积神经网络的 RNA 结合蛋白结合特性预测。
PLoS One. 2019 Apr 26;14(4):e0216257. doi: 10.1371/journal.pone.0216257. eCollection 2019.
8
RNA-binding protein recognition based on multi-view deep feature and multi-label learning.基于多视图深度特征和多标签学习的 RNA 结合蛋白识别。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa174.
9
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
10
circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier.基于多视图深度学习、子空间学习和多视图分类器的 circRNA 结合蛋白位点预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab394.

引用本文的文献

1
Research on Plant RNA-Binding Protein Prediction Method Based on Improved Ensemble Learning.基于改进集成学习的植物RNA结合蛋白预测方法研究
Biology (Basel). 2025 Jun 10;14(6):672. doi: 10.3390/biology14060672.
2
RBPsuite 2.0: an updated RNA-protein binding site prediction suite with high coverage on species and proteins based on deep learning.RBPsuite 2.0:一个基于深度学习的、在物种和蛋白质上具有高覆盖率的更新版RNA-蛋白质结合位点预测套件。
BMC Biol. 2025 Mar 11;23(1):74. doi: 10.1186/s12915-025-02182-2.
3
Big data and deep learning for RNA biology.

本文引用的文献

1
Integrative Data Analysis of Multi-Platform Cancer Data with a Multimodal Deep Learning Approach.基于多模态深度学习方法的多平台癌症数据综合数据分析
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):928-37. doi: 10.1109/TCBB.2014.2377729.
2
Stabilization of the G-quadruplex at the VEGF IRES represses cap-independent translation.VEGF内部核糖体进入位点处G-四链体的稳定抑制了不依赖帽子的翻译。
RNA Biol. 2015;12(3):320-9. doi: 10.1080/15476286.2015.1017236.
3
CLIPdb: a CLIP-seq database for protein-RNA interactions.CLIPdb:一个用于蛋白质-RNA相互作用的CLIP-seq数据库。
大数据和深度学习在 RNA 生物学中的应用。
Exp Mol Med. 2024 Jun;56(6):1293-1321. doi: 10.1038/s12276-024-01243-w. Epub 2024 Jun 14.
4
Deep Learning for Elucidating Modifications to RNA-Status and Challenges Ahead.深度学习解析 RNA 状态修饰及其面临的挑战。
Genes (Basel). 2024 May 15;15(5):629. doi: 10.3390/genes15050629.
5
Novel applications of Convolutional Neural Networks in the age of Transformers.卷积神经网络在Transformer时代的新应用。
Sci Rep. 2024 May 1;14(1):10000. doi: 10.1038/s41598-024-60709-z.
6
Seq-RBPPred: Predicting RNA-Binding Proteins from Sequence.Seq-RBPPred:从序列预测RNA结合蛋白。
ACS Omega. 2024 Mar 4;9(11):12734-12742. doi: 10.1021/acsomega.3c08381. eCollection 2024 Mar 19.
7
Role of Optimization in RNA-Protein-Binding Prediction.优化在RNA-蛋白质结合预测中的作用。
Curr Issues Mol Biol. 2024 Feb 4;46(2):1360-1373. doi: 10.3390/cimb46020087.
8
Graphylo: A deep learning approach for predicting regulatory DNA and RNA sites from whole-genome multiple alignments.Graphylo:一种用于从全基因组多序列比对中预测调控DNA和RNA位点的深度学习方法。
iScience. 2024 Jan 26;27(2):109002. doi: 10.1016/j.isci.2024.109002. eCollection 2024 Feb 16.
9
DeepFusion: A deep bimodal information fusion network for unraveling protein-RNA interactions using in vivo RNA structures.深度融合:一种用于利用体内RNA结构揭示蛋白质-RNA相互作用的深度双峰信息融合网络。
Comput Struct Biotechnol J. 2023 Dec 30;23:617-625. doi: 10.1016/j.csbj.2023.12.040. eCollection 2024 Dec.
10
Review of Computational Methods and Database Sources for Predicting the Effects of Coding Frameshift Small Insertion and Deletion Variations.预测编码移码小插入和缺失变异效应的计算方法及数据库来源综述
ACS Omega. 2024 Jan 3;9(2):2032-2047. doi: 10.1021/acsomega.3c07662. eCollection 2024 Jan 16.
BMC Genomics. 2015 Feb 5;16(1):51. doi: 10.1186/s12864-015-1273-2.
4
Role of the N6-methyladenosine RNA mark in gene regulation and its implications on development and disease.N6-甲基腺嘌呤RNA标记在基因调控中的作用及其对发育和疾病的影响。
Brief Funct Genomics. 2015 May;14(3):169-79. doi: 10.1093/bfgp/elu039. Epub 2014 Oct 10.
5
Deep learning of the tissue-regulated splicing code.深度学习组织调控的剪接代码。
Bioinformatics. 2014 Jun 15;30(12):i121-9. doi: 10.1093/bioinformatics/btu277.
6
Structures of human ALKBH5 demethylase reveal a unique binding mode for specific single-stranded N6-methyladenosine RNA demethylation.人类ALKBH5去甲基化酶的结构揭示了特定单链N6-甲基腺苷RNA去甲基化的独特结合模式。
J Biol Chem. 2014 Jun 20;289(25):17299-311. doi: 10.1074/jbc.M114.550350. Epub 2014 Apr 28.
7
MOV10 Is a 5' to 3' RNA helicase contributing to UPF1 mRNA target degradation by translocation along 3' UTRs.MOV10 是一种 5' 到 3' RNA 解旋酶,通过沿 3'UTR 的易位促进 UPF1mRNA 靶标降解。
Mol Cell. 2014 May 22;54(4):573-85. doi: 10.1016/j.molcel.2014.03.017. Epub 2014 Apr 10.
8
Identifying mRNA sequence elements for target recognition by human Argonaute proteins.鉴定人类 Argonaute 蛋白靶识别的 mRNA 序列元件。
Genome Res. 2014 May;24(5):775-85. doi: 10.1101/gr.162230.113. Epub 2014 Mar 24.
9
GraphProt: modeling binding preferences of RNA-binding proteins.GraphProt:RNA结合蛋白结合偏好性建模
Genome Biol. 2014 Jan 22;15(1):R17. doi: 10.1186/gb-2014-15-1-r17.
10
CapR: revealing structural specificities of RNA-binding protein target recognition using CLIP-seq data.CapR:利用CLIP-seq数据揭示RNA结合蛋白靶点识别的结构特异性
Genome Biol. 2014 Jan 21;15(1):R16. doi: 10.1186/gb-2014-15-1-r16.