RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment.

Authors

Xu Xing, Ji Yongmei, Stormo Gary D

Affiliation

Department of Genetics, Washington University, School of Medicine, St. Louis, MO 63110, USA.

Publication information

Bioinformatics. 2007 Aug 1;23(15):1883-91. doi: 10.1093/bioinformatics/btm272. Epub 2007 May 30.

DOI:10.1093/bioinformatics/btm272

PMID:17537756

Abstract

MOTIVATION

Non-coding RNA genes and RNA structural regulatory motifs play important roles in gene regulation and other cellular functions. They are often characterized by specific secondary structures that are critical to their functions and are often conserved in phylogenetically or functionally related sequences. Predicting common RNA secondary structures in multiple unaligned sequences remains a challenge in bioinformatics research.

METHODS AND RESULTS

We present a new sampling-based algorithm to predict common RNA secondary structures in multiple unaligned sequences. Our algorithm finds the common structure between two sequences by probabilistically sampling aligned stems based on stem conservation calculated from intrasequence base pairing probabilities and intersequence base alignment probabilities. It iteratively updates these probabilities based on sampled structures and subsequently recalculates stem conservation using the updated probabilities. The iterative process terminates upon convergence of the sampled structures. We extend the algorithm to multiple sequences by a consistency-based method, which iteratively incorporates and reinforces consistent structure information from pairwise comparisons into consensus structures. The algorithm places no restriction on predicting pseudoknots. In extensive testing on real sequence data, our algorithm outperformed other leading RNA structure prediction methods in both sensitivity and specificity while running reasonably fast. It also generated better structural alignments than other programs for sequences across a wide range of identities, alignments that more accurately represent RNA secondary structure conservation.
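The pairwise sampling step described above can be illustrated with a minimal Python sketch. It is not the RNA Sampler implementation (which is written in C), and the abstract does not give the scoring formula: the conservation score used here (a sum over aligned base pairs of the product of the two pairing probabilities and the two alignment probabilities) is an assumption made for illustration. The function names `stem_conservation` and `sample_aligned_stems` and all data structures are hypothetical, and the iterative probability update and convergence check are omitted.

```python
import random


def stem_conservation(stem_a, stem_b, pair_a, pair_b, align):
    """Assumed (hypothetical) conservation score for two aligned stems.

    stem_a, stem_b: equal-length lists of (i, j) base pairs in sequences A and B.
    pair_a, pair_b: dicts mapping (i, j) -> intrasequence base pairing probability.
    align: dict mapping (pos_in_a, pos_in_b) -> intersequence alignment probability.
    """
    score = 0.0
    for (i, j), (k, l) in zip(stem_a, stem_b):
        score += (pair_a.get((i, j), 0.0) * pair_b.get((k, l), 0.0)
                  * align.get((i, k), 0.0) * align.get((j, l), 0.0))
    return score


def positions(stem):
    """Set of sequence positions that a stem occupies."""
    return {p for pair in stem for p in pair}


def sample_aligned_stems(candidates, scores, rng):
    """Sample aligned stems with probability proportional to their score.

    After each draw, candidates that reuse an occupied position in either
    sequence are discarded. Crossing stems are not excluded, so
    pseudoknot-like arrangements remain possible.
    """
    remaining = [(c, s) for c, s in zip(candidates, scores) if s > 0]
    chosen, used_a, used_b = [], set(), set()
    while remaining:
        weights = [s for _, s in remaining]
        picked, _ = rng.choices(remaining, weights=weights, k=1)[0]
        chosen.append(picked)
        used_a |= positions(picked[0])
        used_b |= positions(picked[1])
        remaining = [
            (c, s) for c, s in remaining
            if c is not picked
            and not (positions(c[0]) & used_a)
            and not (positions(c[1]) & used_b)
        ]
    return chosen
```

In a fuller version, the sampled stems would feed back into the pairing and alignment probabilities, and the loop would repeat until the sampled structure stops changing, as the abstract describes.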

AVAILABILITY

The algorithm is implemented in a C program, RNA Sampler, which is available at http://ural.wustl.edu/software.html

Similar articles

A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences.

Bioinformatics. 2004 Jul 10;20(10):1591-602. doi: 10.1093/bioinformatics/bth131. Epub 2004 Feb 12.

Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.

Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.

Murlet: a practical multiple alignment tool for structural RNA sequences.

Bioinformatics. 2007 Jul 1;23(13):1588-98. doi: 10.1093/bioinformatics/btm146. Epub 2007 Apr 25.

Robust prediction of consensus secondary structures using averaged base pairing probability matrices.

Bioinformatics. 2007 Feb 15;23(4):434-41. doi: 10.1093/bioinformatics/btl636. Epub 2006 Dec 20.

RNA structure alignment by a unit-vector approach.

Bioinformatics. 2008 Aug 15;24(16):i112-8. doi: 10.1093/bioinformatics/btn288.

Mining frequent stem patterns from unaligned RNA sequences.

Bioinformatics. 2006 Oct 15;22(20):2480-7. doi: 10.1093/bioinformatics/btl431. Epub 2006 Aug 14.

Alignment of RNA base pairing probability matrices.

Bioinformatics. 2004 Sep 22;20(14):2222-7. doi: 10.1093/bioinformatics/bth229. Epub 2004 Apr 8.

Predicting a set of minimal free energy RNA secondary structures common to two sequences.

Bioinformatics. 2005 May 15;21(10):2246-53. doi: 10.1093/bioinformatics/bti349. Epub 2005 Feb 24.

Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.

Bioinformatics. 2005 Jun 1;21(11):2611-7. doi: 10.1093/bioinformatics/bti385. Epub 2005 Mar 22.

Cited by

A Hitchhiker's guide to RNA-RNA structure and interaction prediction tools.

Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad421.

Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families.

Curr Protoc Bioinformatics. 2016 Jun 20;54:12.13.1-12.13.25. doi: 10.1002/cpbi.4.

A Dynamic 3D Graphical Representation for RNA Structure Analysis and Its Application in Non-Coding RNA Classification.

PLoS One. 2016 May 23;11(5):e0152238. doi: 10.1371/journal.pone.0152238. eCollection 2016.

Effective alignment of RNA pseudoknot structures using partition function posterior log-odds scores.

BMC Bioinformatics. 2015 Feb 6;16:39. doi: 10.1186/s12859-015-0464-9.

Effects of using coding potential, sequence conservation and mRNA structure conservation for predicting pyrrolysine containing genes.

BMC Bioinformatics. 2013 Apr 4;14:118. doi: 10.1186/1471-2105-14-118.

CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction.

Nucleic Acids Res. 2013 Apr;41(7):4307-23. doi: 10.1093/nar/gkt101. Epub 2013 Feb 21.

Alternative polyadenylation in glioblastoma multiforme and changes in predicted RNA binding protein profiles.

OMICS. 2013 Mar;17(3):136-49. doi: 10.1089/omi.2012.0098. Epub 2013 Feb 19.

TurboKnot: rapid prediction of conserved RNA secondary structures including pseudoknots.

Bioinformatics. 2012 Mar 15;28(6):792-8. doi: 10.1093/bioinformatics/bts044. Epub 2012 Jan 27.

Statistical evaluation of improvement in RNA secondary structure prediction.

Nucleic Acids Res. 2012 Feb;40(4):e26. doi: 10.1093/nar/gkr1081. Epub 2011 Dec 1.

RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences.

Bioinformatics. 2011 Sep 15;27(18):2486-93. doi: 10.1093/bioinformatics/btr421. Epub 2011 Jul 24.
