Washietl Stefan, Hofacker Ivo L
Institut für Theoretische Chemie und Molekulare Strukturbiologie, Universität Wien, Währingerstrasse 17, A-1090, Austria.
J Mol Biol. 2004 Sep 3;342(1):19-30. doi: 10.1016/j.jmb.2004.07.018.
Facing the ever-growing list of newly discovered classes of functional RNAs, it can be expected that further types of functional RNAs are still hidden in recently completed genomes. The computational identification of such RNA genes is, therefore, of major importance. While most known functional RNAs have characteristic secondary structures, their free energies are generally not statistically significant enough to distinguish RNA genes from the genomic background. Additional information is required. Considering the wide availability of new genomic data of closely related species, comparative studies seem to be the most promising approach. Here, we show that prediction of consensus structures of aligned sequences can be a significant measure to detect functional RNAs. We report a new method to test multiple sequence alignments for the existence of an unusually structured and conserved fold. We show for alignments of six types of well-known functional RNA that an energy score consisting of free energy and a covariation term significantly improves sensitivity compared to single sequence predictions. We further test our method on a number of non-coding RNAs from Caenorhabditis elegans/Caenorhabditis briggsae and seven Saccharomyces species. Most RNAs can be detected with high significance. We provide a Perl implementation that can be used readily to score single alignments and discuss how the methods described here can be extended to allow for efficient genome-wide screens.
面对新发现的功能性RNA种类不断增加的情况,可以预期仍有更多类型的功能性RNA隐藏在最近完成测序的基因组中。因此,通过计算方法识别此类RNA基因至关重要。虽然大多数已知的功能性RNA具有特征性的二级结构,但其自由能通常在统计学上不足以将RNA基因与基因组背景区分开来。还需要其他信息。考虑到密切相关物种的新基因组数据广泛可得,比较研究似乎是最有前景的方法。在这里,我们表明预测比对序列的共有结构可以作为检测功能性RNA的一项重要措施。我们报告了一种新方法,用于测试多序列比对中是否存在异常结构化和保守的折叠。我们针对六种已知功能性RNA的比对结果表明,与单序列预测相比,由自由能和共变项组成的能量得分显著提高了灵敏度。我们进一步用来自秀丽隐杆线虫/布里格氏秀丽隐杆线虫以及七个酿酒酵母物种的一些非编码RNA测试了我们的方法。大多数RNA都能被高度显著地检测到。我们提供了一个Perl实现,可方便地用于对单比对进行评分,并讨论了如何扩展这里描述的方法以实现高效的全基因组筛选。