一种用于学习内在蛋白-RNA 结合偏好的深度神经网络方法。

A deep neural network approach for learning intrinsic protein-RNA binding preferences.

机构信息

Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel.

Department of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer-Sheva, Israel.

出版信息

Bioinformatics. 2018 Sep 1;34(17):i638-i646. doi: 10.1093/bioinformatics/bty600.

DOI:10.1093/bioinformatics/bty600

PMID:30423078

Abstract

MOTIVATION

The complexes formed by binding of proteins to RNAs play key roles in many biological processes, such as splicing, gene expression regulation, translation and viral replication. Understanding protein-RNA binding may thus provide important insights to the functionality and dynamics of many cellular processes. This has sparked substantial interest in exploring protein-RNA binding experimentally, and predicting it computationally. The key computational challenge is to efficiently and accurately infer protein-RNA binding models that will enable prediction of novel protein-RNA interactions to additional transcripts of interest.

RESULTS

We developed DLPRB (Deep Learning for Protein-RNA Binding), a new deep neural network (DNN) approach for learning intrinsic protein-RNA binding preferences and predicting novel interactions. We present two different network architectures: a convolutional neural network (CNN), and a recurrent neural network (RNN). The novelty of our network hinges upon two key aspects: (i) the joint analysis of both RNA sequence and structure, which is represented as a probability vector of different RNA structural contexts; (ii) novel features in the architecture of the networks, such as the application of RNNs to RNA-binding prediction, and the combination of hundreds of variable-length filters in the CNN. Our results in inferring accurate RNA-binding models from high-throughput in vitro data exhibit substantial improvements, compared to all previous approaches for protein-RNA binding prediction (both DNN and non-DNN based). A more modest, yet statistically significant, improvement is achieved for in vivo binding prediction. When incorporating experimentally-measured RNA structure, compared to predicted one, the improvement on in vivo data increases. By visualizing the binding specificities, we can gain biological insights underlying the mechanism of protein RNA-binding.

AVAILABILITY AND IMPLEMENTATION

The source code is publicly available at https://github.com/ilanbb/dlprb.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

蛋白质与 RNA 结合形成的复合物在许多生物过程中发挥着关键作用，例如剪接、基因表达调控、翻译和病毒复制。因此，了解蛋白质-RNA 结合可能为许多细胞过程的功能和动态提供重要的见解。这激发了人们对实验探索蛋白质-RNA 结合以及计算预测的极大兴趣。关键的计算挑战是有效地和准确地推断出蛋白质-RNA 结合模型，从而能够预测到对其他感兴趣的转录本的新的蛋白质-RNA 相互作用。

结果

我们开发了 DLPRB（用于蛋白质-RNA 结合的深度学习），这是一种用于学习内在蛋白质-RNA 结合偏好并预测新相互作用的新的深度神经网络 (DNN) 方法。我们提出了两种不同的网络架构：卷积神经网络 (CNN) 和递归神经网络 (RNN)。我们的网络的新颖之处在于两个关键方面：(i) 对 RNA 序列和结构的联合分析，这表示为不同 RNA 结构环境的概率向量；(ii) 网络架构中的新特征，例如将 RNN 应用于 RNA 结合预测，以及在 CNN 中组合数百个可变长度滤波器。与所有以前的蛋白质-RNA 结合预测方法（基于 DNN 和非 DNN 的方法）相比，我们在从高通量体外数据推断准确的 RNA 结合模型方面取得了实质性的改进。在体内结合预测方面取得了更适度但具有统计学意义的改进。当将实验测量的 RNA 结构与预测的结构进行比较时，与体内数据相比，改进更为明显。通过可视化结合特异性，我们可以深入了解蛋白质 RNA 结合的机制。

可用性和实现

源代码可在 https://github.com/ilanbb/dlprb 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

A deep neural network approach for learning intrinsic protein-RNA binding preferences.

Bioinformatics. 2018 Sep 1;34(17):i638-i646. doi: 10.1093/bioinformatics/bty600.

Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks.

Bioinformatics. 2018 Oct 15;34(20):3427-3436. doi: 10.1093/bioinformatics/bty364.

Integrating thermodynamic and sequence contexts improves protein-RNA binding prediction.

PLoS Comput Biol. 2019 Sep 4;15(9):e1007283. doi: 10.1371/journal.pcbi.1007283. eCollection 2019 Sep.

Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities.

Bioinformatics. 2019 Jul 15;35(14):i269-i277. doi: 10.1093/bioinformatics/btz339.

RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.

BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.

DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network.

Bioinformatics. 2019 Dec 15;35(24):5128-5136. doi: 10.1093/bioinformatics/btz464.

Predicting protein-ligand binding residues with deep convolutional neural networks.

BMC Bioinformatics. 2019 Feb 26;20(1):93. doi: 10.1186/s12859-019-2672-1.

Deep neural networks for inferring binding sites of RNA-binding proteins by using distributed representations of RNA primary sequence and secondary structure.

BMC Genomics. 2020 Dec 17;21(Suppl 13):866. doi: 10.1186/s12864-020-07239-w.

Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences.

Bioinformatics. 2019 Jan 15;35(2):309-318. doi: 10.1093/bioinformatics/bty535.

Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.

BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.

引用本文的文献

rbpTransformer: A novel deep learning model for prediction of piRNA and mRNA bindings.

PLoS One. 2025 Jun 25;20(6):e0324462. doi: 10.1371/journal.pone.0324462. eCollection 2025.

Large-scale map of RNA-binding protein interactomes across the mRNA life cycle.

Mol Cell. 2024 Oct 3;84(19):3790-3809.e8. doi: 10.1016/j.molcel.2024.08.030. Epub 2024 Sep 19.

Optimizing protein sequence classification: integrating deep learning models with Bayesian optimization for enhanced biological analysis.

BMC Med Inform Decis Mak. 2024 Aug 27;24(1):236. doi: 10.1186/s12911-024-02631-y.

Mudskipper detects combinatorial RNA binding protein interactions in multiplexed CLIP data.

Cell Genom. 2024 Jul 10;4(7):100603. doi: 10.1016/j.xgen.2024.100603. Epub 2024 Jul 1.

Big data and deep learning for RNA biology.

Exp Mol Med. 2024 Jun;56(6):1293-1321. doi: 10.1038/s12276-024-01243-w. Epub 2024 Jun 14.

Sequence based model using deep neural network and hybrid features for identification of 5-hydroxymethylcytosine modification.

Sci Rep. 2024 Apr 20;14(1):9116. doi: 10.1038/s41598-024-59777-y.

RNA Metabolism Governs Immune Function and Response.

Adv Exp Med Biol. 2024;1444:145-161. doi: 10.1007/978-981-99-9781-7_10.

Databases and computational methods for the identification of piRNA-related molecules: A survey.

Comput Struct Biotechnol J. 2024 Jan 22;23:813-833. doi: 10.1016/j.csbj.2024.01.011. eCollection 2024 Dec.

DeepFusion: A deep bimodal information fusion network for unraveling protein-RNA interactions using in vivo RNA structures.

Comput Struct Biotechnol J. 2023 Dec 30;23:617-625. doi: 10.1016/j.csbj.2023.12.040. eCollection 2024 Dec.

Transfer Learning Allows Accurate RBP Target Site Prediction with Limited Sample Sizes.

Biology (Basel). 2023 Sep 25;12(10):1276. doi: 10.3390/biology12101276.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于学习内在蛋白-RNA 结合偏好的深度神经网络方法。

A deep neural network approach for learning intrinsic protein-RNA binding preferences.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现

补充信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献