• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用热力学积分的深度学习进行 RNA 二级结构预测。

RNA secondary structure prediction using deep learning with thermodynamic integration.

机构信息

Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Japan.

出版信息

Nat Commun. 2021 Feb 11;12(1):941. doi: 10.1038/s41467-021-21194-4.

DOI:10.1038/s41467-021-21194-4
PMID:33574226
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7878809/
Abstract

Accurate predictions of RNA secondary structures can help uncover the roles of functional non-coding RNAs. Although machine learning-based models have achieved high performance in terms of prediction accuracy, overfitting is a common risk for such highly parameterized models. Here we show that overfitting can be minimized when RNA folding scores learnt using a deep neural network are integrated together with Turner's nearest-neighbor free energy parameters. Training the model with thermodynamic regularization ensures that folding scores and the calculated free energy are as close as possible. In computational experiments designed for newly discovered non-coding RNAs, our algorithm (MXfold2) achieves the most robust and accurate predictions of RNA secondary structures without sacrificing computational efficiency compared to several other algorithms. The results suggest that integrating thermodynamic information could help improve the robustness of deep learning-based predictions of RNA secondary structure.

摘要

准确预测 RNA 二级结构有助于揭示功能非编码 RNA 的作用。虽然基于机器学习的模型在预测准确性方面取得了很高的性能,但对于这种高度参数化的模型来说,过拟合是一个常见的风险。在这里,我们表明,当使用深度神经网络学习的 RNA 折叠分数与 Turner 的最近邻自由能参数集成在一起时,可以最大限度地减少过拟合。使用热力学正则化训练模型可确保折叠分数和计算出的自由能尽可能接近。在针对新发现的非编码 RNA 设计的计算实验中,与其他几种算法相比,我们的算法(MXfold2)在不牺牲计算效率的情况下,实现了对 RNA 二级结构最稳健和准确的预测。结果表明,整合热力学信息可以帮助提高基于深度学习的 RNA 二级结构预测的稳健性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/1b5f0524151b/41467_2021_21194_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/22f319250a9b/41467_2021_21194_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/966b0c0f7985/41467_2021_21194_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/f8aa921f6bdc/41467_2021_21194_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/effc3492c4ec/41467_2021_21194_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/1b5f0524151b/41467_2021_21194_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/22f319250a9b/41467_2021_21194_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/966b0c0f7985/41467_2021_21194_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/f8aa921f6bdc/41467_2021_21194_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/effc3492c4ec/41467_2021_21194_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d9d/7878809/1b5f0524151b/41467_2021_21194_Fig5_HTML.jpg

相似文献

1
RNA secondary structure prediction using deep learning with thermodynamic integration.使用热力学积分的深度学习进行 RNA 二级结构预测。
Nat Commun. 2021 Feb 11;12(1):941. doi: 10.1038/s41467-021-21194-4.
2
Predicting RNA SHAPE scores with deep learning.用深度学习预测 RNA SHAPE 分数。
RNA Biol. 2020 Sep;17(9):1324-1330. doi: 10.1080/15476286.2020.1760534. Epub 2020 May 31.
3
A max-margin training of RNA secondary structure prediction integrated with the thermodynamic model.一种结合热力学模型的RNA二级结构预测的最大间隔训练。
J Bioinform Comput Biol. 2018 Dec;16(6):1840025. doi: 10.1142/S0219720018400255.
4
A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more.一系列复杂的 RNA 二级结构预测概率模型,包括最近邻模型等。
RNA. 2012 Feb;18(2):193-212. doi: 10.1261/rna.030049.111. Epub 2011 Dec 22.
5
Integrating thermodynamic and sequence contexts improves protein-RNA binding prediction.整合热力学和序列背景可提高蛋白质-RNA 结合预测。
PLoS Comput Biol. 2019 Sep 4;15(9):e1007283. doi: 10.1371/journal.pcbi.1007283. eCollection 2019 Sep.
6
Analysis of RNA nearest neighbor parameters reveals interdependencies and quantifies the uncertainty in RNA secondary structure prediction.分析 RNA 近邻参数揭示了相互关系,并量化了 RNA 二级结构预测中的不确定性。
RNA. 2018 Nov;24(11):1568-1582. doi: 10.1261/rna.065102.117. Epub 2018 Aug 13.
7
RNA Secondary Structure Prediction Based on Energy Models.基于能量模型的RNA二级结构预测
Methods Mol Biol. 2023;2586:89-105. doi: 10.1007/978-1-0716-2768-6_6.
8
RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning.使用二维深度神经网络集成和迁移学习进行 RNA 二级结构预测。
Nat Commun. 2019 Nov 27;10(1):5407. doi: 10.1038/s41467-019-13395-9.
9
sincFold: end-to-end learning of short- and long-range interactions in RNA secondary structure.sincFold:RNA 二级结构中短程和远程相互作用的端到端学习。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae271.
10
DEBFold: Computational Identification of RNA Secondary Structures for Sequences across Structural Families Using Deep Learning.DEBFold:使用深度学习对跨结构家族的序列进行 RNA 二级结构的计算识别。
J Chem Inf Model. 2024 May 13;64(9):3756-3766. doi: 10.1021/acs.jcim.4c00458. Epub 2024 Apr 22.

引用本文的文献

1
Fitness landscapes and thermodynamic approaches to development of nucleic acids enzymes: from classical methods to AI integration.核酸酶开发的适应度景观与热力学方法:从经典方法到人工智能整合
RSC Chem Biol. 2025 Aug 21. doi: 10.1039/d5cb00105f.
2
Enhanced Generalizability of RNA Secondary Structure Prediction via Convolutional Block Attention Network and Ensemble Learning.通过卷积块注意力网络和集成学习提高RNA二级结构预测的泛化能力
Molecules. 2025 Aug 21;30(16):3447. doi: 10.3390/molecules30163447.
3
ECSFinder: optimized prediction of evolutionarily conserved RNA secondary structures from genome sequences.

本文引用的文献

1
Accelerated cryo-EM-guided determination of three-dimensional RNA-only structures.加速冷冻电镜引导的三维 RNA 结构测定。
Nat Methods. 2020 Jul;17(7):699-707. doi: 10.1038/s41592-020-0878-9. Epub 2020 Jul 2.
2
RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning.使用二维深度神经网络集成和迁移学习进行 RNA 二级结构预测。
Nat Commun. 2019 Nov 27;10(1):5407. doi: 10.1038/s41467-019-13395-9.
3
LinearFold: linear-time approximate RNA folding by 5'-to-3' dynamic programming and beam search.
ECSFinder:从基因组序列中对进化保守RNA二级结构进行优化预测。
Nucleic Acids Res. 2025 Aug 11;53(15). doi: 10.1093/nar/gkaf780.
4
Advances in Protein-RNA aptamer recognition and modeling: Current trends and future perspectives.蛋白质-RNA适配体识别与建模的进展:当前趋势与未来展望。
Curr Opin Struct Biol. 2025 Aug 14;94:103133. doi: 10.1016/j.sbi.2025.103133.
5
Graph-CRISPR: a gene editing efficiency prediction model based on graph neural network with integrated sequence and secondary structure feature extraction.Graph-CRISPR:一种基于具有整合序列和二级结构特征提取功能的图神经网络的基因编辑效率预测模型。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf410.
6
What does it take to learn the rules of RNA base pairing? A lot less than you may think.学习RNA碱基配对规则需要什么?比你想象的要少得多。
bioRxiv. 2025 Aug 2:2025.07.31.668042. doi: 10.1101/2025.07.31.668042.
7
Deep generative model of RNAs based on variational autoencoder with context-free grammar.基于具有上下文无关语法的变分自编码器的RNA深度生成模型。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf427.
8
Modeling RNA duplex dynamics with Gibbs sampling enhances base-pair prediction accuracy and reveals structural activity profiles.使用吉布斯采样对RNA双链体动力学进行建模可提高碱基对预测准确性并揭示结构活性概况。
NAR Genom Bioinform. 2025 Jul 17;7(3):lqaf099. doi: 10.1093/nargab/lqaf099. eCollection 2025 Sep.
9
mRNABench: A curated benchmark for mature mRNA property and function prediction.mRNABench:用于成熟mRNA特性和功能预测的精选基准。
bioRxiv. 2025 Jul 8:2025.07.05.662870. doi: 10.1101/2025.07.05.662870.
10
Improving RNA Secondary Structure Prediction Through Expanded Training Data.通过扩展训练数据改进RNA二级结构预测
bioRxiv. 2025 May 3:2025.05.03.652028. doi: 10.1101/2025.05.03.652028.
LinearFold:通过 5'-to-3' 动态规划和束搜索进行线性时间近似 RNA 折叠。
Bioinformatics. 2019 Jul 15;35(14):i295-i304. doi: 10.1093/bioinformatics/btz375.
4
A max-margin training of RNA secondary structure prediction integrated with the thermodynamic model.一种结合热力学模型的RNA二级结构预测的最大间隔训练。
J Bioinform Comput Biol. 2018 Dec;16(6):1840025. doi: 10.1142/S0219720018400255.
5
bpRNA: large-scale automated annotation and analysis of RNA secondary structure.bpRNA:大规模自动化 RNA 二级结构注释和分析。
Nucleic Acids Res. 2018 Jun 20;46(11):5381-5394. doi: 10.1093/nar/gky285.
6
Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families.RFAM 13.0:转向以基因组为中心的非编码 RNA 家族资源
Nucleic Acids Res. 2018 Jan 4;46(D1):D335-D342. doi: 10.1093/nar/gkx1038.
7
TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs.TurboFold II:基于多个同源物的RNA结构比对与二级结构预测
Nucleic Acids Res. 2017 Nov 16;45(20):11570-11581. doi: 10.1093/nar/gkx815.
8
Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions.通过对基因间区域特定子集的比较分析检测224个候选结构化RNA。
Nucleic Acids Res. 2017 Oct 13;45(18):10811-10823. doi: 10.1093/nar/gkx699.
9
Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures.环状结构形成概率的精确计算可识别RNA二级结构中的折叠基序。
RNA. 2016 Dec;22(12):1808-1818. doi: 10.1261/rna.053694.115. Epub 2016 Oct 19.
10
Rfam 12.0: updates to the RNA families database.Rfam 12.0:RNA家族数据库的更新
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7. doi: 10.1093/nar/gku1063. Epub 2014 Nov 11.