• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

提高 RNA 分支预测的准确性:进展与局限。

Improving RNA Branching Predictions: Advances and Limitations.

机构信息

School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634, USA.

School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30308, USA.

出版信息

Genes (Basel). 2021 Mar 25;12(4):469. doi: 10.3390/genes12040469.

DOI:10.3390/genes12040469
PMID:33805944
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8064352/
Abstract

Minimum free energy prediction of RNA secondary structures is based on the Nearest Neighbor Thermodynamics Model. While such predictions are typically good, the accuracy can vary widely even for short sequences, and the branching thermodynamics are an important factor in this variance. Recently, the simplest model for multiloop energetics-a linear function of the number of branches and unpaired nucleotides-was found to be the best. Subsequently, a parametric analysis demonstrated that per family accuracy can be improved by changing the weightings in this linear function. However, the extent of improvement was not known due to the ad hoc method used to find the new parameters. Here we develop a branch-and-bound algorithm that finds the set of optimal parameters with the highest average accuracy for a given set of sequences. Our analysis shows that the previous ad hoc parameters are nearly optimal for tRNA and 5S rRNA sequences on both training and testing sets. Moreover, cross-family improvement is possible but more difficult because competing parameter regions favor different families. The results also indicate that restricting the unpaired nucleotide penalty to small values is warranted. This reduction makes analyzing longer sequences using the present techniques more feasible.

摘要

RNA 二级结构的最小自由能预测基于最近邻热力学模型。虽然此类预测通常较为准确,但即使对于短序列,准确性也可能存在很大差异,而分支热力学是这种差异的重要因素。最近,人们发现多环能量的最简单模型是一个线性函数,与分支数和未配对核苷酸数成正比。随后,参数分析表明,通过改变该线性函数中的权重,可以提高每个家族的准确性。但是,由于使用了特定的方法来寻找新的参数,因此不知道改进的程度。在这里,我们开发了一种分支定界算法,可以为给定的序列集找到具有最高平均准确性的最佳参数集。我们的分析表明,对于 tRNA 和 5S rRNA 序列,以前的特定参数在训练集和测试集上都几乎是最优的。此外,跨家族的改进是可能的,但更困难,因为竞争的参数区域有利于不同的家族。结果还表明,将未配对核苷酸的罚分限制在较小的值是合理的。这种减少使得使用当前技术分析更长的序列变得更加可行。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58c9/8064352/35d2a7e6663c/genes-12-00469-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58c9/8064352/3387acdd045f/genes-12-00469-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58c9/8064352/35d2a7e6663c/genes-12-00469-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58c9/8064352/3387acdd045f/genes-12-00469-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/58c9/8064352/35d2a7e6663c/genes-12-00469-g002.jpg

相似文献

1
Improving RNA Branching Predictions: Advances and Limitations.提高 RNA 分支预测的准确性:进展与局限。
Genes (Basel). 2021 Mar 25;12(4):469. doi: 10.3390/genes12040469.
2
Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction.使用最近邻能量参数的自由能最小化对RNA二级结构预测适用性的评估。
BMC Bioinformatics. 2004 Aug 5;5:105. doi: 10.1186/1471-2105-5-105.
3
The challenge of RNA branching prediction: a parametric analysis of multiloop initiation under thermodynamic optimization.RNA 分支预测的挑战:热力学优化下多环起始的参数分析。
J Struct Biol. 2020 Apr 1;210(1):107475. doi: 10.1016/j.jsb.2020.107475. Epub 2020 Feb 4.
4
RNA secondary structure prediction based on free energy and phylogenetic analysis.基于自由能和系统发育分析的RNA二级结构预测
J Mol Biol. 1999 Jun 18;289(4):935-47. doi: 10.1006/jmbi.1999.2801.
5
Moments of the Boltzmann distribution for RNA secondary structures.RNA二级结构的玻尔兹曼分布矩。
Bull Math Biol. 2005 Sep;67(5):1031-47. doi: 10.1016/j.bulm.2004.12.003.
6
Analysis of energy-based algorithms for RNA secondary structure prediction.基于能量的 RNA 二级结构预测算法分析。
BMC Bioinformatics. 2012 Feb 1;13:22. doi: 10.1186/1471-2105-13-22.
7
Advanced multi-loop algorithms for RNA secondary structure prediction reveal that the simplest model is best.用于RNA二级结构预测的先进多环算法表明,最简单的模型是最好的。
Nucleic Acids Res. 2017 Aug 21;45(14):8541-8550. doi: 10.1093/nar/gkx512.
8
A set of nearest neighbor parameters for predicting the enthalpy change of RNA secondary structure formation.一组用于预测RNA二级结构形成焓变的最近邻参数。
Nucleic Acids Res. 2006;34(17):4912-24. doi: 10.1093/nar/gkl472. Epub 2006 Sep 18.
9
Dynalign: an algorithm for finding the secondary structure common to two RNA sequences.Dynalign:一种用于寻找两个RNA序列共有的二级结构的算法。
J Mol Biol. 2002 Mar 22;317(2):191-203. doi: 10.1006/jmbi.2001.5351.
10
The 3D arrangement of the 23 S and 5 S rRNA in the Escherichia coli 50 S ribosomal subunit based on a cryo-electron microscopic reconstruction at 7.5 A resolution.基于7.5埃分辨率的冷冻电子显微镜重建,大肠杆菌50S核糖体亚基中23S和5S rRNA的三维排列。
J Mol Biol. 2000 Apr 21;298(1):35-59. doi: 10.1006/jmbi.2000.3635.

本文引用的文献

1
RNA secondary structure prediction using deep learning with thermodynamic integration.使用热力学积分的深度学习进行 RNA 二级结构预测。
Nat Commun. 2021 Feb 11;12(1):941. doi: 10.1038/s41467-021-21194-4.
2
The challenge of RNA branching prediction: a parametric analysis of multiloop initiation under thermodynamic optimization.RNA 分支预测的挑战:热力学优化下多环起始的参数分析。
J Struct Biol. 2020 Apr 1;210(1):107475. doi: 10.1016/j.jsb.2020.107475. Epub 2020 Feb 4.
3
Determining parameters for non-linear models of multi-loop free energy change.
确定多环自由能变化非线性模型的参数。
Bioinformatics. 2019 Nov 1;35(21):4298-4306. doi: 10.1093/bioinformatics/btz222.
4
Advanced multi-loop algorithms for RNA secondary structure prediction reveal that the simplest model is best.用于RNA二级结构预测的先进多环算法表明,最简单的模型是最好的。
Nucleic Acids Res. 2017 Aug 21;45(14):8541-8550. doi: 10.1093/nar/gkx512.
5
Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures.环状结构形成概率的精确计算可识别RNA二级结构中的折叠基序。
RNA. 2016 Dec;22(12):1808-1818. doi: 10.1261/rna.053694.115. Epub 2016 Oct 19.
6
Computational analysis of conserved RNA secondary structure in transcriptomes and genomes.转录组和基因组中保守 RNA 二级结构的计算分析。
Annu Rev Biophys. 2014;43:433-56. doi: 10.1146/annurev-biophys-051013-022950.
7
A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more.一系列复杂的 RNA 二级结构预测概率模型,包括最近邻模型等。
RNA. 2012 Feb;18(2):193-212. doi: 10.1261/rna.030049.111. Epub 2011 Dec 22.
8
RNAstructure: software for RNA secondary structure prediction and analysis.RNAstructure:用于 RNA 二级结构预测和分析的软件。
BMC Bioinformatics. 2010 Mar 15;11:129. doi: 10.1186/1471-2105-11-129.
9
NNDB: the nearest neighbor parameter database for predicting stability of nucleic acid secondary structure.NNDB:用于预测核酸二级结构稳定性的最近邻参数数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D280-2. doi: 10.1093/nar/gkp892. Epub 2009 Oct 30.
10
UNAFold: software for nucleic acid folding and hybridization.UNAFold:用于核酸折叠和杂交的软件。
Methods Mol Biol. 2008;453:3-31. doi: 10.1007/978-1-60327-429-6_1.