Computational RNA secondary structure design: empirical complexity and improved methods.

Authors

Aguirre-Hernández Rosalía, Hoos Holger H, Condon Anne

Affiliation

Institute of Applied Mathematics, University of British Columbia, Vancouver, BC, Canada.

Publication information

BMC Bioinformatics. 2007 Jan 31;8:34. doi: 10.1186/1471-2105-8-34.

DOI: 10.1186/1471-2105-8-34

PMID: 17266771

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1808480/

Abstract

BACKGROUND

We investigate the empirical complexity of the RNA secondary structure design problem, that is, the scaling of the typical difficulty of the design task for various classes of RNA structures as the size of the target structure is increased. The purpose of this work is to understand better the factors that make RNA structures hard to design for existing, high-performance algorithms. Such understanding provides the basis for improving the performance of one of the best algorithms for this problem, RNA-SSD, and for characterising its limitations.
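To make the design task concrete, here is a minimal sketch of a necessary (but not sufficient) condition on any candidate solution: the sequence must be able to form every base pair of the target structure. The dot-bracket notation, the function names `parse_dot_bracket` and `is_compatible`, and the choice of Watson-Crick plus G-U wobble pairs are assumptions made for illustration; none of them comes from the abstract. An actual design algorithm must additionally confirm, with a thermodynamic folding routine that is not shown here, that the target is the minimum free energy structure of the sequence.

```python
# Illustrative assumption: pairs allowed are Watson-Crick plus G-U wobble.
ALLOWED_PAIRS = {
    ("A", "U"), ("U", "A"),
    ("G", "C"), ("C", "G"),
    ("G", "U"), ("U", "G"),
}


def parse_dot_bracket(structure):
    """Return the sorted list of (i, j) base pairs in a dot-bracket string."""
    stack, pairs = [], []
    for j, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(j)
        elif symbol == ")":
            if not stack:
                raise ValueError("unbalanced ')' at position %d" % j)
            pairs.append((stack.pop(), j))
        elif symbol != ".":
            raise ValueError("unexpected symbol %r at position %d" % (symbol, j))
    if stack:
        raise ValueError("unbalanced '(' at position %d" % stack[-1])
    return sorted(pairs)


def is_compatible(sequence, structure):
    """Check that every paired position holds an allowed base pair.

    This is only a necessary condition for a design: it says nothing
    about whether the structure is the minimum free energy fold.
    """
    if len(sequence) != len(structure):
        return False
    return all(
        (sequence[i], sequence[j]) in ALLOWED_PAIRS
        for i, j in parse_dot_bracket(structure)
    )
```

For example, `is_compatible("GCAAAGC", "((...))")` accepts the sequence because both stem positions can pair, while replacing the second base with `A` makes the inner pair A-G and the check fails.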

RESULTS

To gain insights into the practical complexity of the problem, we present a scaling analysis on random and biologically motivated structures using an improved version of the RNA-SSD algorithm, and also the RNAinverse algorithm from the Vienna package. Since primary structure constraints are relevant for designing RNA structures, we also investigate the correlation between the number and the location of the primary structure constraints when designing structures and the performance of the RNA-SSD algorithm. The scaling analysis on random and biologically motivated structures supports the hypothesis that the running time of both algorithms scales polynomially with the size of the structure. We also found that the algorithms are in general faster when constraints are placed only on paired bases in the structure. Furthermore, we prove that, according to the standard thermodynamic model, for some structures that the RNA-SSD algorithm was unable to design, there exists no sequence whose minimum free energy structure is the target structure.
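One common way to test a polynomial-scaling hypothesis is to fit a power law, time ≈ a · n^b, by least squares on log-transformed data. The sketch below shows that technique. The abstract does not describe the fitting procedure the authors used, so the function name `fit_power_law` and the approach itself are assumptions for illustration, and the numbers in the usage note are synthetic, not results from the paper.

```python
import math


def fit_power_law(sizes, times):
    """Fit time = a * size**b by linear least squares in log-log space.

    Returns (a, b). A good straight-line fit in log-log space is
    consistent with polynomial scaling of degree roughly b.
    """
    if len(sizes) != len(times) or len(sizes) < 2:
        raise ValueError("need at least two (size, time) observations")
    xs = [math.log(n) for n in sizes]
    ys = [math.log(t) for t in times]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("all sizes are identical; slope is undefined")
    b = sxy / sxx                        # slope = estimated exponent
    a = math.exp(mean_y - b * mean_x)    # intercept = log of the prefactor
    return a, b
```

With synthetic run times generated as `0.001 * n**2` for structure sizes 50, 100, 200 and 400, the fit recovers an exponent close to 2. On real measurements one would typically fit a robust statistic, such as the median run time per size, and inspect the residuals before drawing conclusions.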

CONCLUSION

Our analysis helps to better understand the strengths and limitations of both the RNA-SSD and RNAinverse algorithms, and suggests ways in which the performance of these algorithms can be further improved.

Similar articles

Computational RNA secondary structure design: empirical complexity and improved methods.

BMC Bioinformatics. 2007 Jan 31;8:34. doi: 10.1186/1471-2105-8-34.

A new algorithm for RNA secondary structure design.

J Mol Biol. 2004 Feb 20;336(3):607-24. doi: 10.1016/j.jmb.2003.12.041.

INFO-RNA--a fast approach to inverse RNA folding.

Bioinformatics. 2006 Aug 1;22(15):1823-31. doi: 10.1093/bioinformatics/btl194. Epub 2006 May 18.

RNA secondary structure analysis using the Vienna RNA package.

Curr Protoc Bioinformatics. 2004 Feb;Chapter 12:Unit 12.2. doi: 10.1002/0471250953.bi1202s04.

RNA secondary structure design.

Phys Rev E Stat Nonlin Soft Matter Phys. 2007 Feb;75(2 Pt 1):021920. doi: 10.1103/PhysRevE.75.021920. Epub 2007 Feb 28.

Efficient parameter estimation for RNA secondary structure prediction.

Bioinformatics. 2007 Jul 1;23(13):i19-28. doi: 10.1093/bioinformatics/btm223.

Memory efficient folding algorithms for circular RNA secondary structures.

Bioinformatics. 2006 May 15;22(10):1172-6. doi: 10.1093/bioinformatics/btl023. Epub 2006 Feb 1.

INFO-RNA--a server for fast inverse RNA folding satisfying sequence constraints.

Nucleic Acids Res. 2007 Jul;35(Web Server issue):W310-3. doi: 10.1093/nar/gkm218. Epub 2007 Apr 22.

Principles for Predicting RNA Secondary Structure Design Difficulty.

J Mol Biol. 2016 Feb 27;428(5 Pt A):748-757. doi: 10.1016/j.jmb.2015.11.013. Epub 2016 Feb 17.

Can Clustal-style progressive pairwise alignment of multiple sequences be used in RNA secondary structure prediction?

BMC Bioinformatics. 2007 Jun 8;8:190. doi: 10.1186/1471-2105-8-190.

Cited by

START: A Versatile Platform for Bacterial Ligand Sensing with Programmable Performances.

Adv Sci (Weinh). 2024 Sep;11(36):e2402029. doi: 10.1002/advs.202402029. Epub 2024 Jul 29.

Design of RNAs: comparing programs for inverse RNA folding.

Brief Bioinform. 2018 Mar 1;19(2):350-358. doi: 10.1093/bib/bbw120.

incaRNAfbinv: a web server for the fragment-based design of RNA sequences.

Nucleic Acids Res. 2016 Jul 8;44(W1):W308-14. doi: 10.1093/nar/gkw440. Epub 2016 May 16.

Inverse RNA folding solution based on multi-objective genetic algorithm and Gibbs sampling method.

EXCLI J. 2013 Jun 17;12:546-55. eCollection 2013.

ERD: a fast and reliable tool for RNA design including constraints.

BMC Bioinformatics. 2015 Jan 28;16:20. doi: 10.1186/s12859-014-0444-5.

In silico design and enzymatic synthesis of functional RNA nanoparticles.

Acc Chem Res. 2014 Jun 17;47(6):1731-41. doi: 10.1021/ar400329z. Epub 2014 Apr 23.

A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution.

Bioinformatics. 2013 Jul 1;29(13):i308-15. doi: 10.1093/bioinformatics/btt217.

Frnakenstein: multiple target inverse RNA folding.

BMC Bioinformatics. 2012 Oct 9;13:260. doi: 10.1186/1471-2105-13-260.

A global sampling approach to designing and reengineering RNA secondary structures.

Nucleic Acids Res. 2012 Nov 1;40(20):10041-52. doi: 10.1093/nar/gks768. Epub 2012 Aug 31.

Multistrand RNA secondary structure prediction and nanostructure design including pseudoknots.

ACS Nano. 2011 Dec 27;5(12):9542-51. doi: 10.1021/nn202666w. Epub 2011 Nov 17.

References

INFO-RNA--a fast approach to inverse RNA folding.

Bioinformatics. 2006 Aug 1;22(15):1823-31. doi: 10.1093/bioinformatics/btl194. Epub 2006 May 18.

RNA-RNA interaction prediction and antisense RNA target search.

J Comput Biol. 2006 Mar;13(2):267-82. doi: 10.1089/cmb.2006.13.267.

Secondary structure prediction of interacting RNA molecules.

J Mol Biol. 2005 Feb 4;345(5):987-1001. doi: 10.1016/j.jmb.2004.10.082. Epub 2004 Dec 16.

A new algorithm for RNA secondary structure design.

J Mol Biol. 2004 Feb 20;336(3):607-24. doi: 10.1016/j.jmb.2003.12.041.

Paradigms for computational nucleic acid design.

Nucleic Acids Res. 2004 Feb 27;32(4):1392-403. doi: 10.1093/nar/gkh291. Print 2004.

Ribozymes: recent advances in the development of RNA tools.

FEMS Microbiol Rev. 2003 Apr;27(1):75-97. doi: 10.1016/S0168-6445(03)00020-2.

The 3'-terminal structure required for replication of Barley yellow dwarf virus RNA contains an embedded 3' end.

Virology. 2002 Jan 5;292(1):114-26. doi: 10.1006/viro.2001.1268.

The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs.

BMC Bioinformatics. 2002;3:2. doi: 10.1186/1471-2105-3-2. Epub 2002 Jan 17.

TectoRNA: modular assembly units for the construction of RNA nano-objects.

Nucleic Acids Res. 2001 Jan 15;29(2):455-63. doi: 10.1093/nar/29.2.455.

Logical computation using algorithmic self-assembly of DNA triple-crossover molecules.

Nature. 2000 Sep 28;407(6803):493-6. doi: 10.1038/35035038.
