Kalvari Ioanna, Nawrocki Eric P, Argasinska Joanna, Quinones-Olvera Natalia, Finn Robert D, Bateman Alex, Petrov Anton I
European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom.
National Center for Biotechnology Information, National Institutes of Health, Department of Health and Human Services, Bethesda, Maryland.
Curr Protoc Bioinformatics. 2018 Jun;62(1):e51. doi: 10.1002/cpbi.51. Epub 2018 Jun 5.
Rfam is a database of non-coding RNA families in which each family is represented by a multiple sequence alignment, a consensus secondary structure, and a covariance model. Using a combination of manual and literature-based curation and a custom software pipeline, Rfam converts descriptions of RNA families found in the scientific literature into computational models that can be used to annotate RNAs belonging to those families in any DNA or RNA sequence. Valuable research outputs that are often locked up in figures and supplementary information files are encapsulated in Rfam entries and made accessible through the Rfam Web site. The data produced by Rfam have a broad application, from genome annotation to providing training sets for algorithm development. This article gives an overview of how to search and navigate the Rfam Web site, and how to annotate sequences with RNA families. The Rfam database is freely available at http://rfam.org. © 2018 by John Wiley & Sons, Inc.
Rfam是一个非编码RNA家族数据库,其中每个家族都由一个多序列比对、一个共有二级结构和一个协方差模型表示。通过结合人工整理和基于文献的筛选以及一个定制软件管道,Rfam将科学文献中发现的RNA家族描述转化为计算模型,可用于注释任何DNA或RNA序列中属于这些家族的RNA。通常锁定在图表和补充信息文件中的有价值研究成果被封装在Rfam条目中,并可通过Rfam网站获取。Rfam产生的数据具有广泛应用,从基因组注释到为算法开发提供训练集。本文概述了如何搜索和浏览Rfam网站,以及如何用RNA家族注释序列。Rfam数据库可从http://rfam.org免费获取。© 2018约翰·威利父子公司版权所有