CircSSNN：使用带有预归一化的序列自注意力神经网络进行 circRNA 结合位点预测。

CircSSNN: circRNA-binding site prediction via sequence self-attention neural networks with pre-normalization.

机构信息

School of Computer Science and Technology, Guangxi University of Science and Technology, Liuzhou, China.

Key Laboratory of Guangxi Universities on Intelligent Computing and Distributed Information Processing, Guangxi University of Science and Technology, Liuzhou, China.

出版信息

BMC Bioinformatics. 2023 May 30;24(1):220. doi: 10.1186/s12859-023-05352-7.

DOI:10.1186/s12859-023-05352-7

PMID:37254080

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10230723/

Abstract

BACKGROUND

Circular RNAs (circRNAs) play a significant role in some diseases by acting as transcription templates. Therefore, analyzing the interaction mechanism between circRNA and RNA-binding proteins (RBPs) has far-reaching implications for the prevention and treatment of diseases. Existing models for circRNA-RBP identification usually adopt convolution neural network (CNN), recurrent neural network (RNN), or their variants as feature extractors. Most of them have drawbacks such as poor parallelism, insufficient stability, and inability to capture long-term dependencies.

METHODS

In this paper, we propose a new method completely using the self-attention mechanism to capture deep semantic features of RNA sequences. On this basis, we construct a CircSSNN model for the cirRNA-RBP identification. The proposed model constructs a feature scheme by fusing circRNA sequence representations with statistical distributions, static local contexts, and dynamic global contexts. With a stable and efficient network architecture, the distance between any two positions in a sequence is reduced to a constant, so CircSSNN can quickly capture the long-term dependencies and extract the deep semantic features.

RESULTS

Experiments on 37 circRNA datasets show that the proposed model has overall advantages in stability, parallelism, and prediction performance. Keeping the network structure and hyperparameters unchanged, we directly apply the CircSSNN to linRNA datasets. The favorable results show that CircSSNN can be transformed simply and efficiently without task-oriented tuning.

CONCLUSIONS

In conclusion, CircSSNN can serve as an appealing circRNA-RBP identification tool with good identification performance, excellent scalability, and wide application scope without the need for task-oriented fine-tuning of parameters, which is expected to reduce the professional threshold required for hyperparameter tuning in bioinformatics analysis.

摘要

背景

环状 RNA（circRNA）作为转录模板在一些疾病中发挥重要作用。因此，分析 circRNA 与 RNA 结合蛋白（RBP）的相互作用机制对疾病的预防和治疗具有深远意义。现有的 circRNA-RBP 识别模型通常采用卷积神经网络（CNN）、递归神经网络（RNN）或它们的变体作为特征提取器。它们大多存在并行性差、稳定性不足、无法捕获长时依赖等缺点。

方法

本文提出了一种完全使用自注意力机制的新方法，用于捕获 RNA 序列的深层语义特征。在此基础上，我们构建了一个用于 circRNA-RBP 识别的 CircSSNN 模型。该模型通过融合 circRNA 序列表示与统计分布、静态局部上下文和动态全局上下文来构建特征方案。CircSSNN 具有稳定高效的网络架构，可将序列中任意两个位置之间的距离缩小到一个常数，从而能够快速捕获长时依赖并提取深层语义特征。

结果

在 37 个 circRNA 数据集上的实验表明，所提出的模型在稳定性、并行性和预测性能方面具有整体优势。在保持网络结构和超参数不变的情况下，我们直接将 CircSSNN 应用于 linRNA 数据集。有利的结果表明，CircSSNN 可以简单有效地转换，无需面向任务的参数调整。

结论

总之，CircSSNN 可以作为一种有吸引力的 circRNA-RBP 识别工具，具有良好的识别性能、出色的可扩展性和广泛的应用范围，无需面向任务的参数微调，有望降低生物信息学分析中对超参数调整的专业门槛。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5dc7/10230723/3ea0df907bf4/12859_2023_5352_Fig1_HTML.jpg

相似文献

CircSSNN: circRNA-binding site prediction via sequence self-attention neural networks with pre-normalization.

BMC Bioinformatics. 2023 May 30;24(1):220. doi: 10.1186/s12859-023-05352-7.

CRIECNN: Ensemble convolutional neural network and advanced feature extraction methods for the precise forecasting of circRNA-RBP binding sites.

Comput Biol Med. 2024 May;174:108466. doi: 10.1016/j.compbiomed.2024.108466. Epub 2024 Apr 10.

CRIP: predicting circRNA-RBP-binding sites using a codon-based encoding and hybrid deep neural networks.

RNA. 2019 Dec;25(12):1604-1615. doi: 10.1261/rna.070565.119. Epub 2019 Sep 19.

Predicting circRNA-RBP Binding Sites Using a Hybrid Deep Neural Network.

Interdiscip Sci. 2024 Sep;16(3):635-648. doi: 10.1007/s12539-024-00616-z. Epub 2024 Feb 21.

iCRBP-LKHA: Large convolutional kernel and hybrid channel-spatial attention for identifying circRNA-RBP interaction sites.

PLoS Comput Biol. 2024 Aug 22;20(8):e1012399. doi: 10.1371/journal.pcbi.1012399. eCollection 2024 Aug.

circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier.

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab394.

CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.

PLoS Comput Biol. 2022 Jan 20;18(1):e1009798. doi: 10.1371/journal.pcbi.1009798. eCollection 2022 Jan.

iCircRBP-DHN: identification of circRNA-RBP interaction sites using deep hierarchical network.

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa274.

SSCRB: Predicting circRNA-RBP Interaction Sites Using a Sequence and Structural Feature-Based Attention Model.

IEEE J Biomed Health Inform. 2024 Mar;28(3):1762-1772. doi: 10.1109/JBHI.2024.3354121. Epub 2024 Mar 6.

MSTCRB: Predicting circRNA-RBP interaction by extracting multi-scale features based on transformer and attention mechanism.

Int J Biol Macromol. 2024 Oct;278(Pt 2):134805. doi: 10.1016/j.ijbiomac.2024.134805. Epub 2024 Aug 15.

引用本文的文献

Auxiliary Diagnosis of Pulmonary Nodules' Benignancy and Malignancy Based on Machine Learning: A Retrospective Study.

J Multidiscip Healthc. 2025 Jun 27;18:3735-3748. doi: 10.2147/JMDH.S518166. eCollection 2025.

CR-deal: Explainable Neural Network for circRNA-RBP Binding Site Recognition and Interpretation.

Interdiscip Sci. 2025 Mar 27. doi: 10.1007/s12539-025-00694-7.

Circular RNAs in neurological conditions - computational identification, functional validation, and potential clinical applications.

Mol Psychiatry. 2025 Apr;30(4):1652-1675. doi: 10.1038/s41380-025-02925-1. Epub 2025 Feb 17.

RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.

Heliyon. 2025 Jan 6;11(2):e41488. doi: 10.1016/j.heliyon.2024.e41488. eCollection 2025 Jan 30.

CRBPSA: CircRNA-RBP interaction sites identification using sequence structural attention model.

BMC Biol. 2024 Nov 14;22(1):260. doi: 10.1186/s12915-024-02055-0.

Predicting circRNA-miRNA interactions utilizing transformer-based RNA sequential learning and high-order proximity preserved embedding.

iScience. 2023 Nov 29;27(1):108592. doi: 10.1016/j.isci.2023.108592. eCollection 2024 Jan 19.

CircSI-SSL: circRNA-binding site identification based on self-supervised learning.

Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae004.

Nucleotide-level prediction of CircRNA-protein binding based on fully convolutional neural network.

Front Genet. 2023 Oct 6;14:1283404. doi: 10.3389/fgene.2023.1283404. eCollection 2023.

本文引用的文献

HCRNet: high-throughput circRNA-binding event identification from CLIP-seq data using deep temporal convolutional network.

Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac027.

A Survey on Vision Transformer.

IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):87-110. doi: 10.1109/TPAMI.2022.3152247. Epub 2022 Dec 5.

CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.

PLoS Comput Biol. 2022 Jan 20;18(1):e1009798. doi: 10.1371/journal.pcbi.1009798. eCollection 2022 Jan.

circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier.

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab394.

CircPTPRA blocks the recognition of RNA N-methyladenosine through interacting with IGF2BP1 to suppress bladder cancer progression.

Mol Cancer. 2021 Apr 14;20(1):68. doi: 10.1186/s12943-021-01359-x.

DeCban: Prediction of circRNA-RBP Interaction Sites by Using Double Embeddings and Cross-Branch Attention Networks.

Front Genet. 2021 Jan 22;11:632861. doi: 10.3389/fgene.2020.632861. eCollection 2020.

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome.

Bioinformatics. 2021 Aug 9;37(15):2112-2120. doi: 10.1093/bioinformatics/btab083.

Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture.

BMC Bioinformatics. 2021 Jan 7;22(1):19. doi: 10.1186/s12859-020-03942-3.

iCircRBP-DHN: identification of circRNA-RBP interaction sites using deep hierarchical network.

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa274.

PASSION: an ensemble neural network approach for identifying the binding sites of RBPs on circRNAs.

Bioinformatics. 2020 Aug 1;36(15):4276-4282. doi: 10.1093/bioinformatics/btaa522.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

CircSSNN：使用带有预归一化的序列自注意力神经网络进行 circRNA 结合位点预测。

CircSSNN: circRNA-binding site prediction via sequence self-attention neural networks with pre-normalization.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献