• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过结构感知深度学习预测 RNA 序列-结构可能性。

Predicting RNA sequence-structure likelihood via structure-aware deep learning.

机构信息

School of Computing and Augmented Intelligence, Arizona State University, 699 S Mill Ave, Tempe, AZ, 85281, USA.

ASU-Mayo Center for Innovative Imaging, Arizona State University, 699 S Mill Ave, Tempe, AZ, 85281, USA.

出版信息

BMC Bioinformatics. 2024 Sep 30;25(1):316. doi: 10.1186/s12859-024-05916-1.

DOI:10.1186/s12859-024-05916-1
PMID:39350066
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11443715/
Abstract

BACKGROUND

The active functionalities of RNA are recognized to be heavily dependent on the structure and sequence. Therefore, a model that can accurately evaluate a design by giving RNA sequence-structure pairs would be a valuable tool for many researchers. Machine learning methods have been explored to develop such tools, showing promising results. However, two key issues remain. Firstly, the performance of machine learning models is affected by the features used to characterize RNA. Currently, there is no consensus on which features are the most effective for characterizing RNA sequence-structure pairs. Secondly, most existing machine learning methods extract features describing entire RNA molecule. We argue that it is essential to define additional features that characterize nucleotides and specific sections of RNA structure to enhance the overall efficacy of the RNA design process.

RESULTS

We develop two deep learning models for evaluating RNA sequence-secondary structure pairs. The first model, NU-ResNet, uses a convolutional neural network architecture that solves the aforementioned problems by explicitly encoding RNA sequence-structure information into a 3D matrix. Building upon NU-ResNet, our second model, NUMO-ResNet, incorporates additional information derived from the characterizations of RNA, specifically the 2D folding motifs. In this work, we introduce an automated method to extract these motifs based on fundamental secondary structure descriptions. We evaluate the performance of both models on an independent testing dataset. Our proposed models outperform the models from literatures in this independent testing dataset. To assess the robustness of our models, we conduct 10-fold cross validation. To evaluate the generalization ability of NU-ResNet and NUMO-ResNet across different RNA families, we train and test our proposed models in different RNA families. Our proposed models show superior performance compared to the models from literatures when being tested across different independent RNA families.

CONCLUSIONS

In this study, we propose two deep learning models, NU-ResNet and NUMO-ResNet, to evaluate RNA sequence-secondary structure pairs. These two models expand the field of data-driven approaches for learning RNA. Furthermore, these two models provide the new method to encode RNA sequence-secondary structure pairs.

摘要

背景

人们认识到,RNA 的活性功能在很大程度上依赖于结构和序列。因此,能够通过提供 RNA 序列-结构对来准确评估设计的模型将是许多研究人员的宝贵工具。已经探索了机器学习方法来开发此类工具,显示出有希望的结果。然而,仍然存在两个关键问题。首先,机器学习模型的性能受到用于描述 RNA 的特征的影响。目前,对于哪些特征最有效地描述 RNA 序列-结构对,还没有共识。其次,大多数现有的机器学习方法提取描述整个 RNA 分子的特征。我们认为,定义描述 RNA 核苷酸和特定结构部分的额外特征对于提高 RNA 设计过程的整体效果至关重要。

结果

我们开发了两种用于评估 RNA 序列-二级结构对的深度学习模型。第一个模型 NU-ResNet 使用卷积神经网络架构,通过将 RNA 序列-结构信息明确地编码到 3D 矩阵中来解决上述问题。在 NU-ResNet 的基础上,我们的第二个模型 NUMO-ResNet 结合了来自 RNA 特征的附加信息,特别是 2D 折叠基序。在这项工作中,我们引入了一种基于基本二级结构描述自动提取这些基序的方法。我们在独立测试数据集上评估了这两种模型的性能。我们提出的模型在这个独立测试数据集上优于文献中的模型。为了评估我们的模型的稳健性,我们进行了 10 倍交叉验证。为了评估 NU-ResNet 和 NUMO-ResNet 在不同 RNA 家族中的泛化能力,我们在不同的 RNA 家族中训练和测试我们提出的模型。与文献中的模型相比,当在不同的独立 RNA 家族中进行测试时,我们提出的模型表现出优越的性能。

结论

在这项研究中,我们提出了两种深度学习模型,NU-ResNet 和 NUMO-ResNet,用于评估 RNA 序列-二级结构对。这两种模型扩展了基于数据的学习 RNA 的方法领域。此外,这两种模型提供了编码 RNA 序列-二级结构对的新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/84544882efa2/12859_2024_5916_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/c675db75f817/12859_2024_5916_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/1a61c4232d2c/12859_2024_5916_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/a2a8101b0539/12859_2024_5916_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/5b026ae1b623/12859_2024_5916_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/84544882efa2/12859_2024_5916_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/c675db75f817/12859_2024_5916_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/1a61c4232d2c/12859_2024_5916_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/a2a8101b0539/12859_2024_5916_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/5b026ae1b623/12859_2024_5916_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b397/11443715/84544882efa2/12859_2024_5916_Fig5_HTML.jpg

相似文献

1
Predicting RNA sequence-structure likelihood via structure-aware deep learning.通过结构感知深度学习预测 RNA 序列-结构可能性。
BMC Bioinformatics. 2024 Sep 30;25(1):316. doi: 10.1186/s12859-024-05916-1.
2
sincFold: end-to-end learning of short- and long-range interactions in RNA secondary structure.sincFold:RNA 二级结构中短程和远程相互作用的端到端学习。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae271.
3
Predicting RNA SHAPE scores with deep learning.用深度学习预测 RNA SHAPE 分数。
RNA Biol. 2020 Sep;17(9):1324-1330. doi: 10.1080/15476286.2020.1760534. Epub 2020 May 31.
4
DEBFold: Computational Identification of RNA Secondary Structures for Sequences across Structural Families Using Deep Learning.DEBFold:使用深度学习对跨结构家族的序列进行 RNA 二级结构的计算识别。
J Chem Inf Model. 2024 May 13;64(9):3756-3766. doi: 10.1021/acs.jcim.4c00458. Epub 2024 Apr 22.
5
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.全面综述和评估基于 RNA 序列预测 RNA 转录后修饰位点的计算方法。
Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112.
6
CRMSNet: A deep learning model that uses convolution and residual multi-head self-attention block to predict RBPs for RNA sequence.CRMSNet:一种深度学习模型,使用卷积和残差多头自注意力块来预测 RNA 序列的 RBPs。
Proteins. 2023 Aug;91(8):1032-1041. doi: 10.1002/prot.26489. Epub 2023 Mar 28.
7
Application of Imaging Examination Based on Deep Learning in the Diagnosis of Viral Senile Pneumonia.基于深度学习的影像学检查在病毒性老年肺炎诊断中的应用。
Contrast Media Mol Imaging. 2022 May 31;2022:6964283. doi: 10.1155/2022/6964283. eCollection 2022.
8
RNA3DB: A structurally-dissimilar dataset split for training and benchmarking deep learning models for RNA structure prediction.RNA3DB:一个结构不同的数据集,用于训练和基准测试深度学习模型进行 RNA 结构预测。
J Mol Biol. 2024 Sep 1;436(17):168552. doi: 10.1016/j.jmb.2024.168552. Epub 2024 Mar 27.
9
DeepHE: Accurately predicting human essential genes based on deep learning.DeepHE:基于深度学习的人类必需基因精准预测。
PLoS Comput Biol. 2020 Sep 16;16(9):e1008229. doi: 10.1371/journal.pcbi.1008229. eCollection 2020 Sep.
10
Prediction of RNA-protein sequence and structure binding preferences using deep convolutional and recurrent neural networks.使用深度卷积和递归神经网络预测 RNA-蛋白质序列和结构的结合偏好。
BMC Genomics. 2018 Jul 3;19(1):511. doi: 10.1186/s12864-018-4889-1.

引用本文的文献

1
Angle-controllable RNA tiles for programable array assembly and RNA sensing.用于可编程阵列组装和RNA传感的角度可控RNA瓦片
Nat Commun. 2025 Apr 19;16(1):3728. doi: 10.1038/s41467-025-58938-5.

本文引用的文献

1
RNA secondary structure packages evaluated and improved by high-throughput experiments.通过高通量实验评估和改进的 RNA 二级结构包。
Nat Methods. 2022 Oct;19(10):1234-1242. doi: 10.1038/s41592-022-01605-0. Epub 2022 Oct 3.
2
Deep learning models for RNA secondary structure prediction (probably) do not generalize across families.深度学习模型预测 RNA 二级结构(可能)不能跨家族泛化。
Bioinformatics. 2022 Aug 10;38(16):3892-3899. doi: 10.1093/bioinformatics/btac415.
3
PreRBP-TL: prediction of species-specific RNA-binding proteins based on transfer learning.
PreRBP-TL:基于迁移学习的物种特异性 RNA 结合蛋白预测。
Bioinformatics. 2022 Apr 12;38(8):2135-2143. doi: 10.1093/bioinformatics/btac106.
4
RNAcmap: a fully automatic pipeline for predicting contact maps of RNAs by evolutionary coupling analysis.RNAcmap:一种通过进化耦合分析预测RNA接触图的全自动流程。
Bioinformatics. 2021 Oct 25;37(20):3494-3500. doi: 10.1093/bioinformatics/btab391.
5
Improved RNA secondary structure and tertiary base-pairing prediction using evolutionary profile, mutational coupling and two-dimensional transfer learning.利用进化谱、突变耦合和二维迁移学习改进RNA二级结构和三级碱基配对预测。
Bioinformatics. 2021 Sep 9;37(17):2589-2600. doi: 10.1093/bioinformatics/btab165.
6
RNA secondary structure prediction using deep learning with thermodynamic integration.使用热力学积分的深度学习进行 RNA 二级结构预测。
Nat Commun. 2021 Feb 11;12(1):941. doi: 10.1038/s41467-021-21194-4.
7
LinearPartition: linear-time approximation of RNA folding partition function and base-pairing probabilities.LinearPartition:RNA 折叠分区函数和碱基配对概率的线性时间逼近。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i258-i267. doi: 10.1093/bioinformatics/btaa460.
8
RNA-Puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools.RNA-Puzzles 工具包:一个 RNA 三维结构基准数据集、结构操作和评估工具的计算资源。
Nucleic Acids Res. 2020 Jan 24;48(2):576-588. doi: 10.1093/nar/gkz1108.
9
RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning.使用二维深度神经网络集成和迁移学习进行 RNA 二级结构预测。
Nat Commun. 2019 Nov 27;10(1):5407. doi: 10.1038/s41467-019-13395-9.
10
ENTRNA: a framework to predict RNA foldability.ENTRNA:一种预测 RNA 折叠能力的框架。
BMC Bioinformatics. 2019 Jul 3;20(1):373. doi: 10.1186/s12859-019-2948-5.