• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于自注意力机制的神经网络用于预测RNA-蛋白质结合位点

Self-Attention Based Neural Network for Predicting RNA-Protein Binding Sites.

作者信息

Wang Xinyi, Zhang Mingyang, Long Chunlin, Yao Lin, Zhu Min

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1469-1479. doi: 10.1109/TCBB.2022.3204661. Epub 2023 Apr 3.

DOI:10.1109/TCBB.2022.3204661
PMID:36067103
Abstract

Proteins binding to Ribonucleic Acid (RNA) inside cells are called RNA-binding proteins (RBP), which play a crucial role in gene regulation. The identification of RNA-protein binding sites helps to understand the function of RBP better. Although many computational methods have been developed to predict RNA-protein binding sites, their prediction accuracy on small sample datasets needs improvement. To overcome this limitation, we propose a novel model called SA-Net, which utilizes k-mer embedding to encode RNA sequences and a self-attention-based neural network to extract sequence features. K-mer embedding assists the model to discover significant subsequence fragments associated with binding sites. The self-attention mechanism captures contextual information from the entire input sequence globally, performing well in small sample sequence learning. Experimental results demonstrate that SA-Net attains state-of-the-art results on the RBP-24 dataset. We find that 4-mer embedding aids the model to achieve optimal performance. We also show that the self-attention network outperforms the commonly used CNN and CNN-BLSTM models in sequence feature extraction.

摘要

细胞内与核糖核酸(RNA)结合的蛋白质被称为RNA结合蛋白(RBP),其在基因调控中起着至关重要的作用。RNA-蛋白质结合位点的识别有助于更好地理解RBP的功能。尽管已经开发了许多计算方法来预测RNA-蛋白质结合位点,但它们在小样本数据集上的预测准确性仍有待提高。为了克服这一局限性,我们提出了一种名为SA-Net的新型模型,该模型利用k-mer嵌入对RNA序列进行编码,并使用基于自注意力的神经网络来提取序列特征。k-mer嵌入有助于模型发现与结合位点相关的重要子序列片段。自注意力机制全局捕获整个输入序列的上下文信息,在小样本序列学习中表现良好。实验结果表明,SA-Net在RBP-24数据集上取得了最优结果。我们发现4-mer嵌入有助于模型实现最优性能。我们还表明,自注意力网络在序列特征提取方面优于常用的CNN和CNN-BLSTM模型。

相似文献

1
Self-Attention Based Neural Network for Predicting RNA-Protein Binding Sites.基于自注意力机制的神经网络用于预测RNA-蛋白质结合位点
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):1469-1479. doi: 10.1109/TCBB.2022.3204661. Epub 2023 Apr 3.
2
Deep neural networks for inferring binding sites of RNA-binding proteins by using distributed representations of RNA primary sequence and secondary structure.利用 RNA 一级序列和二级结构的分布式表示来推断 RNA 结合蛋白结合位点的深度神经网络。
BMC Genomics. 2020 Dec 17;21(Suppl 13):866. doi: 10.1186/s12864-020-07239-w.
3
Predicting RBP Binding Sites of RNA With High-Order Encoding Features and CNN-BLSTM Hybrid Model.基于高阶编码特征和 CNN-BLSTM 混合模型预测 RNA 的 RBP 结合位点。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2409-2419. doi: 10.1109/TCBB.2021.3083930. Epub 2022 Aug 8.
4
Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。
BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.
5
Prediction of the RBP binding sites on lncRNAs using the high-order nucleotide encoding convolutional neural network.使用高阶核苷酸编码卷积神经网络预测长链非编码RNA上的RBP结合位点
Anal Biochem. 2019 Oct 15;583:113364. doi: 10.1016/j.ab.2019.113364. Epub 2019 Jul 16.
6
CRIECNN: Ensemble convolutional neural network and advanced feature extraction methods for the precise forecasting of circRNA-RBP binding sites.CRIECNN:用于 circRNA-RBP 结合位点精确预测的集成卷积神经网络和高级特征提取方法。
Comput Biol Med. 2024 May;174:108466. doi: 10.1016/j.compbiomed.2024.108466. Epub 2024 Apr 10.
7
DeepA-RBPBS: A hybrid convolution and recurrent neural network combined with attention mechanism for predicting RBP binding site.DeepA-RBPBS:一种结合注意力机制的卷积和循环神经网络混合模型,用于预测 RBP 结合位点。
J Biomol Struct Dyn. 2022 Jun;40(9):4250-4258. doi: 10.1080/07391102.2020.1854861. Epub 2020 Dec 4.
8
CRIP: predicting circRNA-RBP-binding sites using a codon-based encoding and hybrid deep neural networks.CRIP:基于密码子编码和混合深度神经网络的 circRNA-RBP 结合位点预测。
RNA. 2019 Dec;25(12):1604-1615. doi: 10.1261/rna.070565.119. Epub 2019 Sep 19.
9
CRMSNet: A deep learning model that uses convolution and residual multi-head self-attention block to predict RBPs for RNA sequence.CRMSNet:一种深度学习模型,使用卷积和残差多头自注意力块来预测 RNA 序列的 RBPs。
Proteins. 2023 Aug;91(8):1032-1041. doi: 10.1002/prot.26489. Epub 2023 Mar 28.
10
DeepPN: a deep parallel neural network based on convolutional neural network and graph convolutional network for predicting RNA-protein binding sites.DeepPN:一种基于卷积神经网络和图卷积网络的深度并行神经网络,用于预测 RNA-蛋白质结合位点。
BMC Bioinformatics. 2022 Jun 29;23(1):257. doi: 10.1186/s12859-022-04798-5.

引用本文的文献

1
Ge-SAND: an explainable deep learning-driven framework for disease risk prediction by uncovering complex genetic interactions in parallel.Ge-SAND:一个通过并行揭示复杂基因相互作用来进行疾病风险预测的可解释深度学习驱动框架。
BMC Genomics. 2025 May 1;26(1):432. doi: 10.1186/s12864-025-11588-9.
2
A review on the applications of Transformer-based language models for nucleotide sequence analysis.基于Transformer的语言模型在核苷酸序列分析中的应用综述。
Comput Struct Biotechnol J. 2025 Mar 18;27:1244-1254. doi: 10.1016/j.csbj.2025.03.024. eCollection 2025.
3
Deep learning-based MVIT-MLKA model for accurate classification of pancreatic lesions: a multicenter retrospective cohort study.
基于深度学习的MVIT-MLKA模型用于胰腺病变的准确分类:一项多中心回顾性队列研究
Radiol Med. 2025 Apr;130(4):508-523. doi: 10.1007/s11547-025-01949-5. Epub 2025 Jan 20.
4
Deep Learning for Elucidating Modifications to RNA-Status and Challenges Ahead.深度学习解析 RNA 状态修饰及其面临的挑战。
Genes (Basel). 2024 May 15;15(5):629. doi: 10.3390/genes15050629.
5
Cross-and-Diagonal Networks: An Indirect Self-Attention Mechanism for Image Classification.交叉与对角网络:一种用于图像分类的间接自注意力机制
Sensors (Basel). 2024 Mar 23;24(7):2055. doi: 10.3390/s24072055.