通过整合深度学习和结构上下文分析增强RNA二级结构预测

Enhanced RNA secondary structure prediction through integrative deep learning and structural context analysis.

作者信息

Wang Yongtian, Shen Yewei, Li Jiahao, Wang Tao, Peng Jiajie, Shang Xuequn

机构信息

School of Computer Science, Northwestern Polytechnical University, 1 Dongxiang Rd, Xi'an 710129, China.

Shenzhen Research Institute of Northwestern Polytechnical University, Sanhang Science & Technology Building, No. 45th, Gaoxin South 9th Road, Nanshan District, Shenzhen City 518057, China.

出版信息

Nucleic Acids Res. 2025 Jun 6;53(11). doi: 10.1093/nar/gkaf533.

DOI:10.1093/nar/gkaf533

PMID:40530692

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12203912/

Abstract

Analyzing RNA secondary structures plays a crucial role in elucidating the functional mechanisms of RNA. Despite advances in RNA structure determination, these methods are low throughout and resource-intensive. While machine learning-based models have achieved remarkable performance in terms of prediction accuracy, challenges such as data scarcity and overfitting remain common. Here, we introduce a phased learning strategy that integrates RNA sequence and structural context information to mitigate the risk of overfitting and employs pairing constraints to train the model on folding scores. This approach effectively addresses both local and long-range nucleotide interactions, substantially improving the robustness of RNA secondary structure predictions. Our comprehensive analysis across multiple benchmarking datasets demonstrated that the performance of our model (DSRNAFold) was superior to that of existing methods, especially in pseudoknot recognition and chemical mapping activity prediction, where our approach showed positive performance.

摘要

分析RNA二级结构在阐明RNA的功能机制中起着至关重要的作用。尽管RNA结构测定取得了进展，但这些方法通量低且资源密集。虽然基于机器学习的模型在预测准确性方面取得了显著性能，但数据稀缺和过拟合等挑战仍然很常见。在此，我们引入一种分阶段学习策略，该策略整合RNA序列和结构上下文信息以降低过拟合风险，并采用配对约束在折叠分数上训练模型。这种方法有效地解决了局部和长程核苷酸相互作用问题，大幅提高了RNA二级结构预测的稳健性。我们对多个基准数据集的综合分析表明，我们的模型（DSRNAFold）的性能优于现有方法，特别是在假结识别和化学映射活性预测方面，我们的方法表现出良好性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b557/12203912/5d58e9fc232d/gkaf533figgra1.jpg

相似文献

Enhanced RNA secondary structure prediction through integrative deep learning and structural context analysis.通过整合深度学习和结构上下文分析增强RNA二级结构预测

Nucleic Acids Res. 2025 Jun 6;53(11). doi: 10.1093/nar/gkaf533.

Unveiling the evolution of policies for enhancing protein structure predictions: A comprehensive analysis.揭示增强蛋白质结构预测政策的演变：全面分析。

Comput Biol Med. 2024 Sep;179:108815. doi: 10.1016/j.compbiomed.2024.108815. Epub 2024 Jul 11.

Advancing respiratory disease diagnosis: A deep learning and vision transformer-based approach with a novel X-ray dataset.推进呼吸系统疾病诊断：一种基于深度学习和视觉Transformer的方法及新型X射线数据集

Comput Biol Med. 2025 Aug;194:110501. doi: 10.1016/j.compbiomed.2025.110501. Epub 2025 Jun 9.

Predicting cognitive decline: Deep-learning reveals subtle brain changes in pre-MCI stage.预测认知衰退：深度学习揭示轻度认知障碍前阶段大脑的细微变化。

J Prev Alzheimers Dis. 2025 May;12(5):100079. doi: 10.1016/j.tjpad.2025.100079. Epub 2025 Feb 6.

A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。

Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Measures implemented in the school setting to contain the COVID-19 pandemic.学校为控制 COVID-19 疫情而采取的措施。

Cochrane Database Syst Rev. 2022 Jan 17;1(1):CD015029. doi: 10.1002/14651858.CD015029.

TARNAS: A Software Tool for Abstracting and Translating RNA Secondary Structures.TARNAS：一种用于提取和翻译RNA二级结构的软件工具。

Int J Mol Sci. 2025 Jun 15;26(12):5728. doi: 10.3390/ijms26125728.

Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果：一种针对特定个体见解的新型验证方法。

Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.

Pre-deployment programmes for building resilience in military and frontline emergency service personnel.军事和一线应急服务人员的韧性建设部署前方案。

Cochrane Database Syst Rev. 2021 Dec 6;12(12):CD013242. doi: 10.1002/14651858.CD013242.pub2.

引用本文的文献

Smart IoT with the hybrid evolutionary method and image processing for tumor detection.基于混合进化方法和图像处理的智能物联网用于肿瘤检测。

Sci Rep. 2025 Aug 25;15(1):31156. doi: 10.1038/s41598-025-16042-0.

Machine Learning-Powered Smart Healthcare Systems in the Era of Big Data: Applications, Diagnostic Insights, Challenges, and Ethical Implications.大数据时代基于机器学习的智能医疗系统：应用、诊断见解、挑战及伦理影响

Diagnostics (Basel). 2025 Jul 30;15(15):1914. doi: 10.3390/diagnostics15151914.

AttnW2V-Enhancer: Leveraging attention and Word2Vec for enhanced enhancer prediction.注意力加权词向量增强器：利用注意力机制和词向量进行增强的增强子预测。

Comput Struct Biotechnol J. 2025 Jul 23;27:3275-3284. doi: 10.1016/j.csbj.2025.07.008. eCollection 2025.

Exploring the Role of Artificial Intelligence in Smart Healthcare: A Capability and Function-Oriented Review.探索人工智能在智能医疗中的作用：一项基于能力和功能的综述。

Healthcare (Basel). 2025 Jul 8;13(14):1642. doi: 10.3390/healthcare13141642.

本文引用的文献

Comprehensive benchmarking of large language models for RNA secondary structure prediction.用于RNA二级结构预测的大语言模型的全面基准测试。

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf137.

Integrative Graph-Based Framework for Predicting circRNA Drug Resistance Using Disease Contextualization and Deep Learning.基于整合图谱的框架，利用疾病情境化和深度学习预测环状RNA耐药性

IEEE J Biomed Health Inform. 2024 Sep 10;PP. doi: 10.1109/JBHI.2024.3457271.

sincFold: end-to-end learning of short- and long-range interactions in RNA secondary structure.sincFold：RNA 二级结构中短程和远程相互作用的端到端学习。

Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae271.

Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure prediction and RNA drug discovery.RNA 信息学的最新趋势：机器学习和深度学习在 RNA 二级结构预测和 RNA 药物发现中的应用综述。

Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad186.

Collaborative deep learning improves disease-related circRNA prediction based on multi-source functional information.基于多源功能信息的协作深度学习改进疾病相关环状RNA预测

Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad069.

Predicting RNA secondary structure by a neural network: what features may be learned?通过神经网络预测 RNA 二级结构：可以学习哪些特征？

PeerJ. 2022 Dec 13;10:e14335. doi: 10.7717/peerj.14335. eCollection 2022.

RNA secondary structure packages evaluated and improved by high-throughput experiments.通过高通量实验评估和改进的 RNA 二级结构包。

Nat Methods. 2022 Oct;19(10):1234-1242. doi: 10.1038/s41592-022-01605-0. Epub 2022 Oct 3.

Deep learning models for RNA secondary structure prediction (probably) do not generalize across families.深度学习模型预测 RNA 二级结构（可能）不能跨家族泛化。

Bioinformatics. 2022 Aug 10;38(16):3892-3899. doi: 10.1093/bioinformatics/btac415.

Crowdsourced RNA design discovers diverse, reversible, efficient, self-contained molecular switches.众包 RNA 设计发现多样、可逆、高效、自给自足的分子开关。

Proc Natl Acad Sci U S A. 2022 May 3;119(18):e2112979119. doi: 10.1073/pnas.2112979119. Epub 2022 Apr 26.

UFold: fast and accurate RNA secondary structure prediction with deep learning.UFold：使用深度学习进行快速准确的 RNA 二级结构预测。

Nucleic Acids Res. 2022 Feb 22;50(3):e14. doi: 10.1093/nar/gkab1074.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过整合深度学习和结构上下文分析增强RNA二级结构预测

Enhanced RNA secondary structure prediction through integrative deep learning and structural context analysis.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献