Eggenhofer Florian, Hofacker Ivo L, Höner Zu Siederdissen Christian
Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, A-1090 Vienna, Austria Bioinformatics Group, Department of Computer Science University of Freiburg, Georges-Köhler-Allee, 79110 Freiburg, Germany
Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, A-1090 Vienna, Austria Research Group Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, A-1090 Vienna, Austria.
Nucleic Acids Res. 2016 Sep 30;44(17):8433-41. doi: 10.1093/nar/gkw558. Epub 2016 Jun 21.
Determining the function of a non-coding RNA requires costly and time-consuming wet-lab experiments. For this reason, computational methods which ascertain the homology of a sequence and thereby deduce functionality and family membership are often exploited. In this fashion, newly sequenced genomes can be annotated in a completely computational way. Covariance models are commonly used to assign novel RNA sequences to a known RNA family. However, to construct such models several examples of the family have to be already known. Moreover, model building is the work of experts who manually edit the necessary RNA alignment and consensus structure. Our method, RNAlien, starting from a single input sequence collects potential family member sequences by multiple iterations of homology search. RNA family models are fully automatically constructed for the found sequences. We have tested our method on a subset of the Rfam RNA family database. RNAlien models are a starting point to construct models of comparable sensitivity and specificity to manually curated ones from the Rfam database. RNAlien Tool and web server are available at http://rna.tbi.univie.ac.at/rnalien/.
确定非编码RNA的功能需要耗费成本且耗时的湿实验室实验。因此,常常会利用通过确定序列同源性从而推断功能和家族成员关系的计算方法。通过这种方式,可以完全以计算的方式对新测序的基因组进行注释。协方差模型通常用于将新的RNA序列归入已知的RNA家族。然而,要构建这样的模型,必须已经知道该家族的几个实例。此外,模型构建是由专家手动编辑必要的RNA比对和共有结构的工作。我们的方法RNAlien从单个输入序列开始,通过多次同源性搜索迭代收集潜在的家族成员序列。针对找到的序列完全自动构建RNA家族模型。我们已经在Rfam RNA家族数据库的一个子集中测试了我们的方法。RNAlien模型是构建与Rfam数据库中人工策划的模型具有可比灵敏度和特异性的模型的起点。可在http://rna.tbi.univie.ac.at/rnalien/获取RNAlien工具和网络服务器。