Suppr超能文献

通过伪能量最小化预测RNA比对的共有结构

Predicting consensus structures for RNA alignments via pseudo-energy minimization.

作者信息

Spirollari Junilda, Wang Jason T L, Zhang Kaizhong, Bellofatto Vivian, Park Yongkyu, Shapiro Bruce A

机构信息

Bioinformatics Program, Department of Computer Science, New Jersey Institute of Technology, University Heights, Newark, NJ 07102, U.S.A.

出版信息

Bioinform Biol Insights. 2009 Jun 3;3:51-69. doi: 10.4137/bbi.s2578.

Abstract

Thermodynamic processes with free energy parameters are often used in algorithms that solve the free energy minimization problem to predict secondary structures of single RNA sequences. While results from these algorithms are promising, an observation is that single sequence-based methods have moderate accuracy and more information is needed to improve on RNA secondary structure prediction, such as covariance scores obtained from multiple sequence alignments. We present in this paper a new approach to predicting the consensus secondary structure of a set of aligned RNA sequences via pseudo-energy minimization. Our tool, called RSpredict, takes into account sequence covariation and employs effective heuristics for accuracy improvement. RSpredict accepts, as input data, a multiple sequence alignment in FASTA or ClustalW format and outputs the consensus secondary structure of the input sequences in both the Vienna style Dot Bracket format and the Connectivity Table format. Our method was compared with some widely used tools including KNetFold, Pfold and RNAalifold. A comprehensive test on different datasets including Rfam sequence alignments and a multiple sequence alignment obtained from our study on the Drosophila X chromosome reveals that RSpredict is competitive with the existing tools on the tested datasets. RSpredict is freely available online as a web server and also as a jar file for download at http://datalab.njit.edu/biology/RSpredict.

摘要

具有自由能参数的热力学过程常用于解决自由能最小化问题以预测单条RNA序列二级结构的算法中。虽然这些算法的结果很有前景,但一个观察结果是,基于单序列的方法准确性一般,需要更多信息来改进RNA二级结构预测,比如从多序列比对中获得的协方差分数。我们在本文中提出了一种通过伪能量最小化来预测一组比对RNA序列的共有二级结构的新方法。我们的工具名为RSpredict,它考虑了序列共变,并采用有效的启发式方法来提高准确性。RSpredict接受FASTA或ClustalW格式的多序列比对作为输入数据,并以维也纳风格的点括号格式和连通性表格式输出输入序列的共有二级结构。我们的方法与一些广泛使用的工具进行了比较,包括KNetFold、Pfold和RNAalifold。对不同数据集进行的全面测试,包括Rfam序列比对以及我们对果蝇X染色体研究中获得的多序列比对,结果表明RSpredict在测试数据集上与现有工具具有竞争力。RSpredict可作为网络服务器免费在线获取,也可作为jar文件从http://datalab.njit.edu/biology/RSpredict下载。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验