Rfold：一种用于计算局部碱基配对概率的精确算法。

Rfold: an exact algorithm for computing local base pairing probabilities.

作者信息

Kiryu Hisanori, Kin Taishin, Asai Kiyoshi

机构信息

Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-42 Aomi, Koto-ku, Tokyo, Japan.

出版信息

Bioinformatics. 2008 Feb 1;24(3):367-73. doi: 10.1093/bioinformatics/btm591. Epub 2007 Dec 4.

DOI:10.1093/bioinformatics/btm591

PMID:18056736

Abstract

MOTIVATION

Base pairing probability matrices have been frequently used for the analyses of structural RNA sequences. Recently, there has been a growing need for computing these probabilities for long DNA sequences by constraining the maximal span of base pairs to a limited value. However, none of the existing programs can exactly compute the base pairing probabilities associated with the energy model of secondary structures under such a constraint.

RESULTS

We present an algorithm that exactly computes the base pairing probabilities associated with the energy model under the constraint on the maximal span W of base pairs. The complexity of our algorithm is given by O(NW2) in time and O(N+W2) in memory, where N is the sequence length. We show that our algorithm has a higher sensitivity to the true base pairs as compared to that of RNAplfold. We also present an algorithm that predicts a mutually consistent set of local secondary structures by maximizing the expected accuracy function. The comparison of the local secondary structure predictions with those of RNALfold indicates that our algorithm is more accurate. Our algorithms are implemented in the software named 'Rfold.'

AVAILABILITY

The C++ source code of the Rfold software and the test dataset used in this study are available at http://www.ncrna.org/software/Rfold/.

摘要

动机

碱基配对概率矩阵已被频繁用于分析结构性RNA序列。最近，通过将碱基对的最大跨度限制在一个有限值来计算长DNA序列的这些概率的需求日益增长。然而，现有的程序都无法在这种约束下精确计算与二级结构能量模型相关的碱基配对概率。

结果

我们提出了一种算法，该算法能在碱基对最大跨度W的约束下精确计算与能量模型相关的碱基配对概率。我们算法的时间复杂度为O(NW²)，内存复杂度为O(N + W²)，其中N是序列长度。我们表明，与RNAplfold相比，我们的算法对真实碱基对具有更高的敏感性。我们还提出了一种算法，通过最大化期望准确度函数来预测一组相互一致的局部二级结构。将局部二级结构预测结果与RNALfold的结果进行比较表明，我们的算法更准确。我们的算法在名为“Rfold”的软件中实现。

可用性

Rfold软件的C++源代码以及本研究中使用的测试数据集可在http://www.ncrna.org/software/Rfold/获取。

相似文献

Rfold: an exact algorithm for computing local base pairing probabilities.

Bioinformatics. 2008 Feb 1;24(3):367-73. doi: 10.1093/bioinformatics/btm591. Epub 2007 Dec 4.

Robust prediction of consensus secondary structures using averaged base pairing probability matrices.

Bioinformatics. 2007 Feb 15;23(4):434-41. doi: 10.1093/bioinformatics/btl636. Epub 2006 Dec 20.

Murlet: a practical multiple alignment tool for structural RNA sequences.

Bioinformatics. 2007 Jul 1;23(13):1588-98. doi: 10.1093/bioinformatics/btm146. Epub 2007 Apr 25.

Local RNA base pairing probabilities in large sequences.

Bioinformatics. 2006 Mar 1;22(5):614-5. doi: 10.1093/bioinformatics/btk014. Epub 2005 Dec 20.

Alignment of RNA base pairing probability matrices.

Bioinformatics. 2004 Sep 22;20(14):2222-7. doi: 10.1093/bioinformatics/bth229. Epub 2004 Apr 8.

GUUGle: a utility for fast exact matching under RNA complementary rules including G-U base pairing.

Bioinformatics. 2006 Mar 15;22(6):762-4. doi: 10.1093/bioinformatics/btk041. Epub 2006 Jan 10.

CentroidAlign: fast and accurate aligner for structured RNAs by maximizing expected sum-of-pairs score.

Bioinformatics. 2009 Dec 15;25(24):3236-43. doi: 10.1093/bioinformatics/btp580. Epub 2009 Oct 6.

MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing.

Bioinformatics. 2007 Dec 15;23(24):3304-11. doi: 10.1093/bioinformatics/btm525. Epub 2007 Nov 15.

Boltzmann probability of RNA structural neighbors and riboswitch detection.

Bioinformatics. 2007 Aug 15;23(16):2054-62. doi: 10.1093/bioinformatics/btm314. Epub 2007 Jun 14.

Predicting a set of minimal free energy RNA secondary structures common to two sequences.

Bioinformatics. 2005 May 15;21(10):2246-53. doi: 10.1093/bioinformatics/bti349. Epub 2005 Feb 24.

引用本文的文献

RNAelem: an algorithm for discovering sequence-structure motifs in RNA bound by RNA-binding proteins.

Bioinform Adv. 2024 Sep 28;4(1):vbae144. doi: 10.1093/bioadv/vbae144. eCollection 2024.

LinearCoFold and LinearCoPartition: linear-time algorithms for secondary structure prediction of interacting RNA molecules.

Nucleic Acids Res. 2023 Oct 13;51(18):e94. doi: 10.1093/nar/gkad664.

RNA Secondary Structure Alteration Caused by Single Nucleotide Variants.

Methods Mol Biol. 2023;2586:107-120. doi: 10.1007/978-1-0716-2768-6_7.

Genome-Wide RNA Secondary Structure Prediction.

Methods Mol Biol. 2023;2586:35-48. doi: 10.1007/978-1-0716-2768-6_3.

Linear-Time Algorithms for RNA Structure Prediction.

Methods Mol Biol. 2023;2586:15-34. doi: 10.1007/978-1-0716-2768-6_2.

LinAliFold and CentroidLinAliFold: fast RNA consensus secondary structure prediction for aligned sequences using beam search methods.

Bioinform Adv. 2022 Oct 22;2(1):vbac078. doi: 10.1093/bioadv/vbac078. eCollection 2022.

UFold: fast and accurate RNA secondary structure prediction with deep learning.

Nucleic Acids Res. 2022 Feb 22;50(3):e14. doi: 10.1093/nar/gkab1074.

Presence of DNA from Chlamydia-like organisms in the nasal cavities of grey seal pups (Halichoerus grypus) and three different substrates present in a breeding colony.

BMC Vet Res. 2021 Oct 13;17(1):328. doi: 10.1186/s12917-021-03032-3.

RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application.

Genes (Basel). 2020 Dec 24;12(1):14. doi: 10.3390/genes12010014.

RNA structure prediction using positive and negative evolutionary information.

PLoS Comput Biol. 2020 Oct 30;16(10):e1008387. doi: 10.1371/journal.pcbi.1008387. eCollection 2020 Oct.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Rfold：一种用于计算局部碱基配对概率的精确算法。

Rfold: an exact algorithm for computing local base pairing probabilities.

作者信息

Kiryu Hisanori, Kin Taishin, Asai Kiyoshi

机构信息

Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-42 Aomi, Koto-ku, Tokyo, Japan.

出版信息

Bioinformatics. 2008 Feb 1;24(3):367-73. doi: 10.1093/bioinformatics/btm591. Epub 2007 Dec 4.

DOI:10.1093/bioinformatics/btm591

PMID:18056736

Abstract

MOTIVATION

RESULTS

AVAILABILITY

The C++ source code of the Rfold software and the test dataset used in this study are available at http://www.ncrna.org/software/Rfold/.

摘要

动机

结果

可用性

Rfold软件的C++源代码以及本研究中使用的测试数据集可在http://www.ncrna.org/software/Rfold/获取。

Rfold：一种用于计算局部碱基配对概率的精确算法。

Rfold: an exact algorithm for computing local base pairing probabilities.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Rfold：一种用于计算局部碱基配对概率的精确算法。

Rfold: an exact algorithm for computing local base pairing probabilities.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献