• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于深度增强学习的方法,用于从高通量CLIP-seq数据中捕获RNA结合蛋白的序列结合偏好。

A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data.

作者信息

Li Shuya, Dong Fanghong, Wu Yuexin, Zhang Sai, Zhang Chen, Liu Xiao, Jiang Tao, Zeng Jianyang

机构信息

School of Life Sciences, Tsinghua University, Beijing 100084, China.

Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China.

出版信息

Nucleic Acids Res. 2017 Aug 21;45(14):e129. doi: 10.1093/nar/gkx492.

DOI:10.1093/nar/gkx492
PMID:28575488
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5737578/
Abstract

Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false negative issue. Here, we develop a deep boosting based machine learning approach, called DeBooster, to accurately model the binding sequence preferences and identify the corresponding binding targets of RBPs from CLIP-seq data. Comprehensive validation tests have shown that DeBooster can outperform other state-of-the-art approaches in RBP target prediction. In addition, we have demonstrated that DeBooster may provide new insights into understanding the regulatory functions of RBPs, including the binding effects of the RNA helicase MOV10 on mRNA degradation, the potentially different ADAR1 binding behaviors related to its editing activity, as well as the antagonizing effect of RBP binding on miRNA repression. Moreover, DeBooster may provide an effective index to investigate the effect of pathogenic mutations in RBP binding sites, especially those related to splicing events. We expect that DeBooster will be widely applied to analyze large-scale CLIP-seq experimental data and can provide a practically useful tool for novel biological discoveries in understanding the regulatory mechanisms of RBPs. The source code of DeBooster can be downloaded from http://github.com/dongfanghong/deepboost.

摘要

表征RNA结合蛋白(RBP)的结合行为对于理解它们在基因表达调控中的功能作用至关重要。然而,当前用于识别RBP靶标的高通量实验方法,如CLIP-seq和RNAcompete,通常存在假阴性问题。在此,我们开发了一种基于深度增强的机器学习方法,称为DeBooster,以准确模拟结合序列偏好并从CLIP-seq数据中识别RBP的相应结合靶标。全面的验证测试表明,DeBooster在RBP靶标预测方面优于其他现有最先进的方法。此外,我们已经证明DeBooster可能为理解RBP的调控功能提供新的见解,包括RNA解旋酶MOV10对mRNA降解的结合作用、与其编辑活性相关的潜在不同的ADAR1结合行为,以及RBP结合对miRNA抑制的拮抗作用。此外,DeBooster可能为研究RBP结合位点中致病突变的影响提供一个有效的指标,特别是那些与剪接事件相关的突变。我们期望DeBooster将被广泛应用于分析大规模的CLIP-seq实验数据,并能为理解RBP调控机制的新生物学发现提供一个实际有用的工具。DeBooster的源代码可从http://github.com/dongfanghong/deepboost下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/37ccf05e67b2/gkx492fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/b0839ededfd2/gkx492fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/f1b98b270581/gkx492fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/cefab08452a7/gkx492fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/0e19cdc07a6d/gkx492fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/4479d81f0bd5/gkx492fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/02ec5b2927cd/gkx492fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/79eebc72f9e6/gkx492fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/553d058f52e6/gkx492fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/37ccf05e67b2/gkx492fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/b0839ededfd2/gkx492fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/f1b98b270581/gkx492fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/cefab08452a7/gkx492fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/0e19cdc07a6d/gkx492fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/4479d81f0bd5/gkx492fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/02ec5b2927cd/gkx492fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/79eebc72f9e6/gkx492fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/553d058f52e6/gkx492fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c70/5737578/37ccf05e67b2/gkx492fig9.jpg

相似文献

1
A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data.一种基于深度增强学习的方法,用于从高通量CLIP-seq数据中捕获RNA结合蛋白的序列结合偏好。
Nucleic Acids Res. 2017 Aug 21;45(14):e129. doi: 10.1093/nar/gkx492.
2
Mapping the Transcriptome-Wide Landscape of RBP Binding Sites Using gPAR-CLIP-seq: Bioinformatic Analysis.使用gPAR-CLIP-seq绘制全转录组范围的RBP结合位点图谱:生物信息学分析
Methods Mol Biol. 2016;1361:91-104. doi: 10.1007/978-1-4939-3079-1_6.
3
A combined sequence and structure based method for discovering enriched motifs in RNA from in vivo binding data.一种基于序列和结构相结合的方法,用于从体内结合数据中发现RNA中富集的基序。
Methods. 2017 Apr 15;118-119:73-81. doi: 10.1016/j.ymeth.2017.03.003. Epub 2017 Mar 6.
4
Leveraging cross-link modification events in CLIP-seq for motif discovery.利用CLIP-seq中的交联修饰事件进行基序发现。
Nucleic Acids Res. 2015 Jan;43(1):95-103. doi: 10.1093/nar/gku1288. Epub 2014 Dec 10.
5
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
6
Seten: a tool for systematic identification and comparison of processes, phenotypes, and diseases associated with RNA-binding proteins from condition-specific CLIP-seq profiles.Seten:一种用于从特定条件的CLIP-seq图谱中系统识别和比较与RNA结合蛋白相关的过程、表型和疾病的工具。
RNA. 2017 Jun;23(6):836-846. doi: 10.1261/rna.059089.116. Epub 2017 Mar 23.
7
CLIPdb: a CLIP-seq database for protein-RNA interactions.CLIPdb:一个用于蛋白质-RNA相互作用的CLIP-seq数据库。
BMC Genomics. 2015 Feb 5;16(1):51. doi: 10.1186/s12864-015-1273-2.
8
Mapping the Transcriptome-Wide Landscape of RBP Binding Sites Using gPAR-CLIP-seq: Experimental Procedures.使用gPAR-CLIP-seq绘制全转录组范围的RBP结合位点图谱:实验步骤
Methods Mol Biol. 2016;1361:77-90. doi: 10.1007/978-1-4939-3079-1_5.
9
Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。
BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.
10
Computational Analysis of RNA-Protein Interactions via Deep Sequencing.通过深度测序对RNA-蛋白质相互作用进行计算分析
Methods Mol Biol. 2018;1751:171-182. doi: 10.1007/978-1-4939-7710-9_12.

引用本文的文献

1
Data Science Issues in Studying Protein-RNA Interactions with CLIP Technologies.利用CLIP技术研究蛋白质-RNA相互作用中的数据科学问题
Annu Rev Biomed Data Sci. 2018 Jul 20;1(1):235-261. doi: 10.1146/annurev-biodatasci-080917-013525.
2
3D Modeling of Non-coding RNA Interactions.非编码RNA相互作用的三维建模
Adv Exp Med Biol. 2022;1385:281-317. doi: 10.1007/978-3-031-08356-3_11.
3
Computational tools to study RNA-protein complexes.用于研究RNA-蛋白质复合物的计算工具。

本文引用的文献

1
POSTAR: a platform for exploring post-transcriptional regulation coordinated by RNA-binding proteins.POSTAR:一个用于探索由RNA结合蛋白协调的转录后调控的平台。
Nucleic Acids Res. 2017 Jan 4;45(D1):D104-D114. doi: 10.1093/nar/gkw888. Epub 2016 Oct 5.
2
The multiple functions of RNA helicases as drivers and regulators of gene expression.RNA 解旋酶作为基因表达的驱动因子和调控因子的多种功能。
Nat Rev Mol Cell Biol. 2016 Jul;17(7):426-38. doi: 10.1038/nrm.2016.50. Epub 2016 Jun 2.
3
RNA mis-splicing in disease.疾病中的RNA错配剪接
Front Mol Biosci. 2022 Oct 7;9:954926. doi: 10.3389/fmolb.2022.954926. eCollection 2022.
4
Protein-RNA interaction prediction with deep learning: structure matters.基于深度学习的蛋白质-RNA 相互作用预测:结构很重要。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab540.
5
PRIME-3D2D is a 3D2D model to predict binding sites of protein-RNA interaction.PRIME-3D2D是一种用于预测蛋白质-RNA相互作用结合位点的3D2D模型。
Commun Biol. 2020 Jul 16;3(1):384. doi: 10.1038/s42003-020-1114-y.
6
ProbeRating: a recommender system to infer binding profiles for nucleic acid-binding proteins.ProbeRating:一种用于推断核酸结合蛋白结合谱的推荐系统。
Bioinformatics. 2020 Sep 15;36(18):4797-4804. doi: 10.1093/bioinformatics/btaa580.
7
Prediction of mRNA subcellular localization using deep recurrent neural networks.基于深度递归神经网络的 mRNA 亚细胞定位预测。
Bioinformatics. 2019 Jul 15;35(14):i333-i342. doi: 10.1093/bioinformatics/btz337.
8
Practical considerations on performing and analyzing CLIP-seq experiments to identify transcriptomic-wide RNA-protein interactions.关于执行和分析 CLIP-seq 实验以识别转录组范围内 RNA-蛋白质相互作用的实用考虑因素。
Methods. 2019 Feb 15;155:49-57. doi: 10.1016/j.ymeth.2018.12.002. Epub 2018 Dec 6.
Nat Rev Genet. 2016 Jan;17(1):19-32. doi: 10.1038/nrg.2015.3. Epub 2015 Nov 23.
4
A deep learning framework for modeling structural features of RNA-binding protein targets.一种用于对RNA结合蛋白靶点的结构特征进行建模的深度学习框架。
Nucleic Acids Res. 2016 Feb 29;44(4):e32. doi: 10.1093/nar/gkv1025. Epub 2015 Oct 13.
5
SRSF1-Regulated Alternative Splicing in Breast Cancer.SRSF1调控的乳腺癌中的可变剪接
Mol Cell. 2015 Oct 1;60(1):105-17. doi: 10.1016/j.molcel.2015.09.005.
6
Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.通过深度学习预测 DNA 和 RNA 结合蛋白的序列特异性。
Nat Biotechnol. 2015 Aug;33(8):831-8. doi: 10.1038/nbt.3300. Epub 2015 Jul 27.
7
Genomic analysis of ADAR1 binding and its involvement in multiple RNA processing pathways.ADAR1结合的基因组分析及其在多种RNA加工途径中的作用。
Nat Commun. 2015 Mar 9;6:6355. doi: 10.1038/ncomms7355.
8
COSMIC: exploring the world's knowledge of somatic mutations in human cancer.COSMIC:探索全球关于人类癌症体细胞突变的知识。
Nucleic Acids Res. 2015 Jan;43(Database issue):D805-11. doi: 10.1093/nar/gku1075. Epub 2014 Oct 29.
9
Computational Methods for CLIP-seq Data Processing.CLIP-seq数据处理的计算方法。
Bioinform Biol Insights. 2014 Oct 1;8:199-207. doi: 10.4137/BBI.S16803. eCollection 2014.
10
Structural and mechanistic insights into poly(uridine) tract recognition by the hnRNP C RNA recognition motif.hnRNP C RNA 识别模体识别多(尿嘧啶)序列的结构和机制见解。
J Am Chem Soc. 2014 Oct 15;136(41):14536-44. doi: 10.1021/ja507690d. Epub 2014 Sep 30.