RMDNet：基于RNA感知蜣螂优化算法的多分支整合网络用于RNA-蛋白质结合位点预测

RMDNet: RNA-aware dung beetle optimization-based multi-branch integration network for RNA-protein binding sites prediction.

作者信息

Zhang Jiangbo, Peng Yunhui, Cui Feifei, Zhang Zilong, Yan Shankai, Zhang Qingchen

机构信息

School of Computer Science and Technology, Hainan University, Haikou, 570100, Hainan, China.

School of Physics Science and Technology, Central China Normal University, Wuhan, 430000, Hubei, China.

出版信息

BMC Bioinformatics. 2025 Jul 11;26(1):176. doi: 10.1186/s12859-025-06197-y.

DOI:10.1186/s12859-025-06197-y

PMID:40646507

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12247420/

Abstract

RNA-binding proteins (RBPs) play crucial roles in gene regulation. Their dysregulation has been increasingly linked to neurodegenerative diseases, liver cancer, and lung cancer. Although experimental methods like CLIP-seq accurately identify RNA-protein binding sites, they are time-consuming and costly. To address this, we propose RMDNet-a deep learning framework that integrates CNN, CNN-Transformer, and ResNet branches to capture features at multiple sequence scales. These features are fused with structural representations derived from RNA secondary structure graphs. The graphs are processed using a graph neural network with DiffPool. To optimize feature integration, we incorporate an improved dung beetle optimization algorithm, which adaptively assigns fusion weights during inference. Evaluations on the RBP-24 benchmark show that RMDNet outperforms state-of-the-art models including GraphProt, DeepRKE, and DeepDW across multiple metrics. On the RBP-31 dataset, it demonstrates strong generalization ability, while ablation studies on RBPsuite2.0 validate the contributions of individual modules. We assess biological interpretability by extracting candidate binding motifs from the first-layer CNN kernels. Several motifs closely match experimentally validated RBP motifs, confirming the model's capacity to learn biologically meaningful patterns. A downstream case study on YTHDF1 focuses on analyzing interpretable spatial binding patterns, using a large-scale prediction dataset and CLIP-seq peak alignment. The results confirm that the model captures localized binding signals and spatial consistency with experimental annotations. Overall, RMDNet is a robust and interpretable tool for predicting RNA-protein binding sites. It has broad potential in disease mechanism research and therapeutic target discovery. The source code is available https://github.com/cskyan/RMDNet .

摘要

RNA结合蛋白（RBPs）在基因调控中发挥着关键作用。它们的失调与神经退行性疾病、肝癌和肺癌的关联日益增加。尽管像CLIP-seq这样的实验方法能够准确识别RNA-蛋白质结合位点，但它们既耗时又昂贵。为了解决这个问题，我们提出了RMDNet——一个深度学习框架，它整合了卷积神经网络（CNN）、卷积神经网络-Transformer（CNN-Transformer）和残差网络（ResNet）分支，以在多个序列尺度上捕捉特征。这些特征与从RNA二级结构图派生的结构表示进行融合。这些图使用带有DiffPool的图神经网络进行处理。为了优化特征整合，我们引入了一种改进的蜣螂优化算法，该算法在推理过程中自适应地分配融合权重。在RBP-24基准测试中的评估表明，RMDNet在多个指标上优于包括GraphProt、DeepRKE和DeepDW在内的现有最先进模型。在RBP-31数据集上，它展示了强大的泛化能力，而在RBPsuite2.0上的消融研究验证了各个模块的贡献。我们通过从第一层CNN内核中提取候选结合基序来评估生物学可解释性。几个基序与实验验证的RBP基序紧密匹配，证实了该模型学习生物学有意义模式的能力。关于YTHDF1的下游案例研究聚焦于使用大规模预测数据集和CLIP-seq峰比对来分析可解释的空间结合模式。结果证实该模型捕获了局部结合信号以及与实验注释的空间一致性。总体而言，RMDNet是一种用于预测RNA-蛋白质结合位点的强大且可解释的工具。它在疾病机制研究和治疗靶点发现方面具有广阔的潜力。源代码可在https://github.com/cskyan/RMDNet获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/353c/12247420/3b53068729d8/12859_2025_6197_Fig1_HTML.jpg

相似文献

RMDNet: RNA-aware dung beetle optimization-based multi-branch integration network for RNA-protein binding sites prediction.

BMC Bioinformatics. 2025 Jul 11;26(1):176. doi: 10.1186/s12859-025-06197-y.

GraphPro: An interpretable graph neural network-based model for identifying promoters in multiple species.

Comput Biol Med. 2024 Sep;180:108974. doi: 10.1016/j.compbiomed.2024.108974. Epub 2024 Aug 2.

Short-Term Memory Impairment

iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.

Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.

A deep learning model for predicting systemic lupus erythematosus-associated epitopes.

BMC Med Inform Decis Mak. 2025 Jul 1;25(1):230. doi: 10.1186/s12911-025-03056-x.

A 4D tensor-enhanced multi-dimensional convolutional neural network for accurate prediction of protein-ligand binding affinity.

Mol Divers. 2024 Dec 23. doi: 10.1007/s11030-024-11044-y.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

EM-PLA: environment-aware heterogeneous graph-based multimodal protein-ligand binding affinity prediction.

Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf298.

Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.

Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.

Uncertainty Quantification and Temperature Scaling Calibration for Protein-RNA Binding Site Prediction.

J Chem Inf Model. 2025 Jun 23;65(12):6310-6321. doi: 10.1021/acs.jcim.5c00556. Epub 2025 Jun 2.

本文引用的文献

RBPsuite 2.0: an updated RNA-protein binding site prediction suite with high coverage on species and proteins based on deep learning.

BMC Biol. 2025 Mar 11;23(1):74. doi: 10.1186/s12915-025-02182-2.

Research progress on prediction of RNA-protein binding sites in the past five years.

Anal Biochem. 2024 Aug;691:115535. doi: 10.1016/j.ab.2024.115535. Epub 2024 Apr 20.

Recent Advances in Deep Learning for Protein-Protein Interaction Analysis: A Comprehensive Review.

Molecules. 2023 Jul 2;28(13):5169. doi: 10.3390/molecules28135169.

WVDL: Weighted Voting Deep Learning Model for Predicting RNA-Protein Binding Sites.

IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):3322-3328. doi: 10.1109/TCBB.2023.3252276. Epub 2023 Oct 9.

A Comprehensive Survey of Deep Learning Techniques in Protein Function Prediction.

IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2291-2301. doi: 10.1109/TCBB.2023.3247634. Epub 2023 Jun 5.

Editorial: Recent advances in molecular properties of DNA-protein interactions, chromatin and their biological roles.

Front Mol Biosci. 2023 Mar 7;10:1171714. doi: 10.3389/fmolb.2023.1171714. eCollection 2023.

Protein-protein interaction prediction with deep learning: A comprehensive review.

Comput Struct Biotechnol J. 2022 Sep 19;20:5316-5341. doi: 10.1016/j.csbj.2022.08.070. eCollection 2022.

MCNN: Multiple Convolutional Neural Networks for RNA-Protein Binding Sites Prediction.

IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1180-1187. doi: 10.1109/TCBB.2022.3170367. Epub 2023 Apr 3.

The Pivotal Role of Major Chromosomes of Sub-Genomes A and D in Fiber Quality Traits of Cotton.

Front Genet. 2022 Mar 24;12:642595. doi: 10.3389/fgene.2021.642595. eCollection 2021.

Time-Aware Multi-Type Data Fusion Representation Learning Framework for Risk Prediction of Cardiovascular Diseases.

IEEE/ACM Trans Comput Biol Bioinform. 2021 Oct 7;PP. doi: 10.1109/TCBB.2021.3118418.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

RMDNet：基于RNA感知蜣螂优化算法的多分支整合网络用于RNA-蛋白质结合位点预测

RMDNet: RNA-aware dung beetle optimization-based multi-branch integration network for RNA-protein binding sites prediction.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献