• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用预训练语言模型的注意力预测RNA结构

Predicting RNA Structure Utilizing Attention from Pretrained Language Models.

作者信息

Papazoglou Ioannis, Chatzigoulas Alexios, Tsekenis George, Cournia Zoe

机构信息

Biomedical Research Foundation, Academy of Athens, Athens 11527, Greece.

Department of Biology, National and Kapodistrian University of Athens, Athens 15784, Greece.

出版信息

J Chem Inf Model. 2025 Jul 14;65(13):6483-6498. doi: 10.1021/acs.jcim.4c02094. Epub 2025 Jul 2.

DOI:10.1021/acs.jcim.4c02094
PMID:40601389
Abstract

RNA possesses functional significance that extends beyond the transport of genetic information. The functional roles of noncoding RNA can be mediated through their tertiary and secondary structure, and thus, predicting RNA structure holds great promise for unleashing their applications in diagnostics and therapeutics. However, predicting the three-dimensional (3D) structure of RNA remains challenging. Applying artificial intelligence techniques in the context of natural language processing and large language models (LLMs) could incorporate evolutionary information to RNA 3D structure predictions and address both resource and data scarcity limitations. This approach could achieve faster inference times, while keeping similar accuracy outcomes compared to employing time-consuming multiple sequence alignment schemes, akin to its successful application in protein structure prediction. Herein, we evaluate the suitability of currently available pretrained nucleic acid language models (RNABERT, ERNIE-RNA, RNA Foundational Model (RNA-FM), RiboNucleic Acid Language Model (RiNALMo), and DNABERT) to predict secondary and tertiary RNA structures. We demonstrate that current nucleic acid language models do not effectively capture structural information, mainly due to architectural constraints.

摘要

RNA具有超越遗传信息传递的功能意义。非编码RNA的功能作用可通过其三级和二级结构介导,因此,预测RNA结构对于释放其在诊断和治疗中的应用具有巨大潜力。然而,预测RNA的三维(3D)结构仍然具有挑战性。在自然语言处理和大语言模型(LLM)的背景下应用人工智能技术,可以将进化信息纳入RNA 3D结构预测,并解决资源和数据稀缺的限制。与采用耗时的多序列比对方案相比,这种方法可以实现更快的推理时间,同时保持相似的准确性结果,类似于其在蛋白质结构预测中的成功应用。在此,我们评估了目前可用的预训练核酸语言模型(RNABERT、ERNIE-RNA、RNA基础模型(RNA-FM)、核糖核酸语言模型(RiNALMo)和DNABERT)对预测RNA二级和三级结构的适用性。我们证明,目前的核酸语言模型不能有效地捕捉结构信息,主要是由于架构限制。

相似文献

1
Predicting RNA Structure Utilizing Attention from Pretrained Language Models.利用预训练语言模型的注意力预测RNA结构
J Chem Inf Model. 2025 Jul 14;65(13):6483-6498. doi: 10.1021/acs.jcim.4c02094. Epub 2025 Jul 2.
2
Unveiling the evolution of policies for enhancing protein structure predictions: A comprehensive analysis.揭示增强蛋白质结构预测政策的演变:全面分析。
Comput Biol Med. 2024 Sep;179:108815. doi: 10.1016/j.compbiomed.2024.108815. Epub 2024 Jul 11.
3
Examining the Role of Large Language Models in Orthopedics: Systematic Review.检查大型语言模型在骨科中的作用:系统评价。
J Med Internet Res. 2024 Nov 15;26:e59607. doi: 10.2196/59607.
4
Short-Term Memory Impairment短期记忆障碍
5
The first step is the hardest: pitfalls of representing and tokenizing temporal data for large language models.第一步是最困难的:为大型语言模型表示和标记时间数据的陷阱。
J Am Med Inform Assoc. 2024 Sep 1;31(9):2151-2158. doi: 10.1093/jamia/ocae090.
6
Predicting Drug-Side Effect Relationships From Parametric Knowledge Embedded in Biomedical BERT Models: Methodological Study With a Natural Language Processing Approach.从生物医学BERT模型中嵌入的参数知识预测药物副作用关系:一种自然语言处理方法的方法学研究
JMIR Med Inform. 2025 Jul 10;13:e67513. doi: 10.2196/67513.
7
Large Language Model Architectures in Health Care: Scoping Review of Research Perspectives.医疗保健中的大语言模型架构:研究视角的范围综述
J Med Internet Res. 2025 Jun 19;27:e70315. doi: 10.2196/70315.
8
Use of Large Language Models to Classify Epidemiological Characteristics in Synthetic and Real-World Social Media Posts About Conjunctivitis Outbreaks: Infodemiology Study.利用大语言模型对合成及真实世界社交媒体上有关结膜炎爆发的帖子中的流行病学特征进行分类:信息流行病学研究
J Med Internet Res. 2025 Jul 2;27:e65226. doi: 10.2196/65226.
9
Utilizing large language models for detecting hospital-acquired conditions: an empirical study on pulmonary embolism.利用大语言模型检测医院获得性疾病:关于肺栓塞的实证研究
J Am Med Inform Assoc. 2025 May 1;32(5):876-884. doi: 10.1093/jamia/ocaf048.
10
Nucleic Acid Nanocapsules as a New Platform to Deliver Therapeutic Nucleic Acids for Gene Regulation.核酸纳米胶囊作为用于基因调控的治疗性核酸递送新平台。
Acc Chem Res. 2025 Jul 1;58(13):1951-1962. doi: 10.1021/acs.accounts.5c00126. Epub 2025 Jun 9.

本文引用的文献

1
RiNALMo: general-purpose RNA language models can generalize well on structure prediction tasks.RiNALMo:通用RNA语言模型在结构预测任务上能很好地泛化。
Nat Commun. 2025 Jul 1;16(1):5671. doi: 10.1038/s41467-025-60872-5.
2
Accurate RNA 3D structure prediction using a language model-based deep learning approach.使用基于语言模型的深度学习方法进行准确的RNA三维结构预测。
Nat Methods. 2024 Dec;21(12):2287-2298. doi: 10.1038/s41592-024-02487-0. Epub 2024 Nov 21.
3
Non-coding RNA notations, regulations and interactive resources.非编码 RNA 符号、规范和交互资源。
Funct Integr Genomics. 2024 Nov 18;24(6):217. doi: 10.1007/s10142-024-01494-w.
4
UniProt: the Universal Protein Knowledgebase in 2025.通用蛋白质知识库(UniProt):2025年的情况
Nucleic Acids Res. 2025 Jan 6;53(D1):D609-D617. doi: 10.1093/nar/gkae1010.
5
Protein-small molecule binding site prediction based on a pre-trained protein language model with contrastive learning.基于带有对比学习的预训练蛋白质语言模型的蛋白质-小分子结合位点预测
J Cheminform. 2024 Nov 6;16(1):125. doi: 10.1186/s13321-024-00920-2.
6
State-of-the-RNArt: benchmarking current methods for RNA 3D structure prediction.RNA领域现状:RNA三维结构预测当前方法的基准测试
NAR Genom Bioinform. 2024 May 14;6(2):lqae048. doi: 10.1093/nargab/lqae048. eCollection 2024 Jun.
7
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
8
Generalized biomolecular modeling and design with RoseTTAFold All-Atom.基于 RoseTTAFold All-Atom 的广义生物分子建模与设计。
Science. 2024 Apr 19;384(6693):eadl2528. doi: 10.1126/science.adl2528.
9
The Nucleic Acid Knowledgebase: a new portal for 3D structural information about nucleic acids.核酸知识库:核酸 3D 结构信息的新门户。
Nucleic Acids Res. 2024 Jan 5;52(D1):D245-D254. doi: 10.1093/nar/gkad957.
10
trRosettaRNA: automated prediction of RNA 3D structure with transformer network.trRosettaRNA:基于 Transformer 网络的 RNA 三维结构自动预测。
Nat Commun. 2023 Nov 9;14(1):7266. doi: 10.1038/s41467-023-42528-4.