Suppr超能文献

结构化RNA的特定比对:随机语法与序列退火

Specific alignment of structured RNA: stochastic grammars and sequence annealing.

作者信息

Bradley Robert K, Pachter Lior, Holmes Ian

机构信息

Biophysics Graduate Group, Department of Mathematics and Department of Bioengineering, University of California, Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2008 Dec 1;24(23):2677-83. doi: 10.1093/bioinformatics/btn495. Epub 2008 Sep 16.

Abstract

MOTIVATION

Whole-genome screens suggest that eukaryotic genomes are dense with non-coding RNAs (ncRNAs). We introduce a novel approach to RNA multiple alignment which couples a generative probabilistic model of sequence and structure with an efficient sequence annealing approach for exploring the space of multiple alignments. This leads to a new software program, Stemloc-AMA, that is both accurate and specific in the alignment of multiple related RNA sequences.

RESULTS

When tested on the benchmark datasets BRalibase II and BRalibase 2.1, Stemloc-AMA has comparable sensitivity to and better specificity than the best competing methods. We use a large-scale random sequence experiment to show that while most alignment programs maximize sensitivity at the expense of specificity, even to the point of giving complete alignments of non-homologous sequences, Stemloc-AMA aligns only sequences with detectable homology and leaves unrelated sequences largely unaligned. Such accurate and specific alignments are crucial for comparative-genomics analysis, from inferring phylogeny to estimating substitution rates across different lineages.

AVAILABILITY

Stemloc-AMA is available from http://biowiki.org/StemLocAMA as part of the dart software package for sequence analysis.

摘要

动机

全基因组筛选表明真核生物基因组中充满了非编码RNA(ncRNA)。我们引入了一种新的RNA多序列比对方法,该方法将序列和结构的生成概率模型与一种用于探索多序列比对空间的高效序列退火方法相结合。这产生了一个新的软件程序Stemloc-AMA,它在多个相关RNA序列的比对中既准确又具有特异性。

结果

在基准数据集BRalibase II和BRalibase 2.1上进行测试时,Stemloc-AMA与最佳竞争方法相比具有相当的灵敏度和更好的特异性。我们通过大规模随机序列实验表明,虽然大多数比对程序以牺牲特异性为代价来最大化灵敏度,甚至达到对非同源序列进行完全比对的程度,但Stemloc-AMA只比对具有可检测同源性的序列,而让不相关的序列基本不进行比对。这种准确且具有特异性的比对对于比较基因组学分析至关重要,从推断系统发育到估计不同谱系间的替换率。

可用性

Stemloc-AMA可从http://biowiki.org/StemLocAMA获取,作为用于序列分析的dart软件包的一部分。

相似文献

1
Specific alignment of structured RNA: stochastic grammars and sequence annealing.结构化RNA的特定比对:随机语法与序列退火
Bioinformatics. 2008 Dec 1;24(23):2677-83. doi: 10.1093/bioinformatics/btn495. Epub 2008 Sep 16.
2
A local multiple alignment method for detection of non-coding RNA sequences.一种用于检测非编码RNA序列的局部多重比对方法。
Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.
5
R-Coffee: a method for multiple alignment of non-coding RNA.R-Coffee:一种非编码RNA多重比对的方法。
Nucleic Acids Res. 2008 May;36(9):e52. doi: 10.1093/nar/gkn174. Epub 2008 Apr 17.
10
A memory efficient method for structure-based RNA multiple alignment.基于结构的 RNA 多重比对的一种内存高效方法。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):1-11. doi: 10.1109/TCBB.2011.86. Epub 2011 Apr 29.

引用本文的文献

本文引用的文献

4
Computational RNomics of drosophilids.果蝇的计算核糖核酸组学
BMC Genomics. 2007 Nov 8;8:406. doi: 10.1186/1471-2164-8-406.
7
Clustal W and Clustal X version 2.0.Clustal W和Clustal X 2.0版本
Bioinformatics. 2007 Nov 1;23(21):2947-8. doi: 10.1093/bioinformatics/btm404. Epub 2007 Sep 10.
9
Murlet: a practical multiple alignment tool for structural RNA sequences.Murlet:一种用于结构RNA序列的实用多序列比对工具。
Bioinformatics. 2007 Jul 1;23(13):1588-98. doi: 10.1093/bioinformatics/btm146. Epub 2007 Apr 25.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验