• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DRFormer:一种用于RNA序列下游任务的基准模型。

DRFormer: A Benchmark Model for RNA Sequence Downstream Tasks.

作者信息

Fu Jianqi, Li Haohao, Kang Yanlei, Zhu Hancan, Huang Tiren, Li Zhong

机构信息

School of Information Engineering, Huzhou University, Huzhou 313000, China.

College of Science, Zhejiang Sci-Tech University, Hangzhou 310018, China.

出版信息

Genes (Basel). 2025 Feb 26;16(3):284. doi: 10.3390/genes16030284.

DOI:10.3390/genes16030284
PMID:40149436
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11942477/
Abstract

RNA research is critical for understanding gene regulation, disease mechanisms, and therapeutic development. Constructing effective RNA benchmark models for accurate downstream analysis has become a significant research challenge. The objective of this study is to propose a robust benchmark model, DRFormer, for RNA sequence downstream tasks. The DRFormer model utilizes RNA sequences to construct novel vision features based on secondary structure and sequence distance. These features are pre-trained using the SWIN model to develop a SWIN-RNA submodel. This submodel is then integrated with an RNA sequence model to construct a multimodal model for downstream analysis. : We conducted experiments on various RNA downstream tasks. In the sequence classification task, the MCC reached 94.4%, surpassing the state-of-the-art RNAErnie model by 1.2%. In the protein-RNA interaction prediction, DRFormer achieved an MCC of 0.492, outperforming advanced models like BERT-RBP and PrismNet. In RNA secondary structure prediction, the F1 score was 0.690, exceeding the widely used SPOT-RNA model by 1%. Additionally, generalization experiments on DNA tasks yielded satisfactory results. DRFormer is the first RNA sequence downstream analysis model that leverages structural features to construct a vision model and integrates sequence and vision models in a multimodal manner. This approach yields excellent prediction and analysis results, making it a valuable contribution to RNA research.

摘要

RNA研究对于理解基因调控、疾病机制和治疗发展至关重要。构建有效的RNA基准模型以进行准确的下游分析已成为一项重大研究挑战。本研究的目的是为RNA序列下游任务提出一个强大的基准模型DRFormer。DRFormer模型利用RNA序列基于二级结构和序列距离构建新颖的视觉特征。这些特征使用SWIN模型进行预训练以开发一个SWIN-RNA子模型。然后将该子模型与RNA序列模型集成以构建用于下游分析的多模态模型。我们在各种RNA下游任务上进行了实验。在序列分类任务中,马修斯相关系数(MCC)达到94.4%,比最先进的RNAErnie模型高出1.2%。在蛋白质-RNA相互作用预测中,DRFormer的MCC为0.492,优于BERT-RBP和PrismNet等先进模型。在RNA二级结构预测中,F1分数为0.690,比广泛使用的SPOT-RNA模型高出1%。此外,在DNA任务上的泛化实验也产生了令人满意的结果。DRFormer是第一个利用结构特征构建视觉模型并以多模态方式整合序列和视觉模型的RNA序列下游分析模型。这种方法产生了出色的预测和分析结果,使其成为RNA研究的一项有价值的贡献。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/27cac0e7ed0b/genes-16-00284-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/106eec3e439b/genes-16-00284-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/53c60fd8051a/genes-16-00284-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/5f09649be126/genes-16-00284-g0A3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/997dd81a9aed/genes-16-00284-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/2e9615140ba5/genes-16-00284-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/986f83052cef/genes-16-00284-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/77e31389878a/genes-16-00284-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/435442b3402a/genes-16-00284-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/084a1f2ee5d4/genes-16-00284-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/e19b87955650/genes-16-00284-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/cce0d71c9fbc/genes-16-00284-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/13bf593749f1/genes-16-00284-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/27cac0e7ed0b/genes-16-00284-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/106eec3e439b/genes-16-00284-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/53c60fd8051a/genes-16-00284-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/5f09649be126/genes-16-00284-g0A3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/997dd81a9aed/genes-16-00284-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/2e9615140ba5/genes-16-00284-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/986f83052cef/genes-16-00284-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/77e31389878a/genes-16-00284-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/435442b3402a/genes-16-00284-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/084a1f2ee5d4/genes-16-00284-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/e19b87955650/genes-16-00284-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/cce0d71c9fbc/genes-16-00284-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/13bf593749f1/genes-16-00284-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f0e/11942477/27cac0e7ed0b/genes-16-00284-g010.jpg

相似文献

1
DRFormer: A Benchmark Model for RNA Sequence Downstream Tasks.DRFormer:一种用于RNA序列下游任务的基准模型。
Genes (Basel). 2025 Feb 26;16(3):284. doi: 10.3390/genes16030284.
2
How to benchmark RNA secondary structure prediction accuracy.如何评估 RNA 二级结构预测准确性。
Methods. 2019 Jun 1;162-163:60-67. doi: 10.1016/j.ymeth.2019.04.003. Epub 2019 Apr 2.
3
Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.基于能量的多序列比对中RNA共有二级结构预测
Methods Mol Biol. 2014;1097:125-41. doi: 10.1007/978-1-62703-709-9_7.
4
R3Design: deep tertiary structure-based RNA sequence design and beyond.R3设计:基于三级结构的深度RNA序列设计及其他
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae682.
5
Integrating thermodynamic and sequence contexts improves protein-RNA binding prediction.整合热力学和序列背景可提高蛋白质-RNA 结合预测。
PLoS Comput Biol. 2019 Sep 4;15(9):e1007283. doi: 10.1371/journal.pcbi.1007283. eCollection 2019 Sep.
6
The art of editing RNA structural alignments.编辑RNA结构比对的技术。
Methods Mol Biol. 2014;1097:379-94. doi: 10.1007/978-1-62703-709-9_17.
7
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.全面综述和评估基于 RNA 序列预测 RNA 转录后修饰位点的计算方法。
Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112.
8
Comprehensive benchmarking of large language models for RNA secondary structure prediction.用于RNA二级结构预测的大语言模型的全面基准测试。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf137.
9
Sequence Design Using RNAstructure.使用 RNAstructure 进行序列设计。
Methods Mol Biol. 2025;2847:17-31. doi: 10.1007/978-1-0716-4079-1_2.
10
REDalign: accurate RNA structural alignment using residual encoder-decoder network.REDalign:使用残差编码器-解码器网络进行精确的 RNA 结构比对。
BMC Bioinformatics. 2024 Nov 5;25(1):346. doi: 10.1186/s12859-024-05956-7.

本文引用的文献

1
LncLSTA: a versatile predictor unveiling subcellular localization of lncRNAs through long-short term attention.LncLSTA:一种通过长短期注意力揭示lncRNA亚细胞定位的多功能预测工具。
Bioinform Adv. 2024 Nov 22;5(1):vbae173. doi: 10.1093/bioadv/vbae173. eCollection 2025.
2
Nucleotide Transformer: building and evaluating robust foundation models for human genomics.核苷酸变换器:构建和评估用于人类基因组学的强大基础模型。
Nat Methods. 2025 Feb;22(2):287-297. doi: 10.1038/s41592-024-02523-z. Epub 2024 Nov 28.
3
Multiple sequence alignment-based RNA language model and its application to structural inference.
基于多重序列比对的 RNA 语言模型及其在结构推断中的应用。
Nucleic Acids Res. 2024 Jan 11;52(1):e3. doi: 10.1093/nar/gkad1031.
4
PrismNet: predicting protein-RNA interaction using in vivo RNA structural information.PrismNet:利用体内 RNA 结构信息预测蛋白质-RNA 相互作用。
Nucleic Acids Res. 2023 Jul 5;51(W1):W468-W477. doi: 10.1093/nar/gkad353.
5
Prediction of RNA-protein interactions using a nucleotide language model.使用核苷酸语言模型预测RNA-蛋白质相互作用。
Bioinform Adv. 2022 Apr 7;2(1):vbac023. doi: 10.1093/bioadv/vbac023. eCollection 2022.
6
Informative RNA base embedding for RNA structural alignment and clustering by deep representation learning.通过深度表示学习进行RNA结构比对和聚类的信息性RNA碱基嵌入
NAR Genom Bioinform. 2022 Feb 22;4(1):lqac012. doi: 10.1093/nargab/lqac012. eCollection 2022 Mar.
7
UFold: fast and accurate RNA secondary structure prediction with deep learning.UFold:使用深度学习进行快速准确的 RNA 二级结构预测。
Nucleic Acids Res. 2022 Feb 22;50(3):e14. doi: 10.1093/nar/gkab1074.
8
Predicting dynamic cellular protein-RNA interactions by deep learning using in vivo RNA structures.通过使用体内 RNA 结构的深度学习来预测动态细胞蛋白-RNA 相互作用。
Cell Res. 2021 May;31(5):495-516. doi: 10.1038/s41422-021-00476-y. Epub 2021 Feb 23.
9
RNA secondary structure prediction using deep learning with thermodynamic integration.使用热力学积分的深度学习进行 RNA 二级结构预测。
Nat Commun. 2021 Feb 11;12(1):941. doi: 10.1038/s41467-021-21194-4.
10
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome.DNABERT:用于基因组中DNA语言的基于变换器的预训练双向编码器表征模型。
Bioinformatics. 2021 Aug 9;37(15):2112-2120. doi: 10.1093/bioinformatics/btab083.