Suppr超能文献

Rfam 15:2025年的RNA家族数据库。

Rfam 15: RNA families database in 2025.

作者信息

Ontiveros Nancy, Cooke Emma, Nawrocki Eric P, Triebel Sandra, Marz Manja, Rivas Elena, Griffiths-Jones Sam, Petrov Anton I, Bateman Alex, Sweeney Blake

机构信息

European Molecular Biology Laboratory, Wellcome Genome Campus, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, UK.

SciBite, Cambridge, UK.

出版信息

bioRxiv. 2024 Sep 24:2024.09.23.614430. doi: 10.1101/2024.09.23.614430.

Abstract

The Rfam database, a widely-used repository of non-coding RNA (ncRNA) families, has undergone significant updates in release 15.0. This paper introduces major improvements, including the expansion of Rfamseq to 26,106 genomes, a 76% increase, incorporating the latest UniProt reference proteomes and additional viral genomes. Sixty-five RNA families were enhanced using experimentally determined 3D structures, improving the accuracy of consensus secondary structures and annotations. R-scape covariation analysis was used to refine structural predictions in 26 families. Gene Ontology and Sequence Ontology annotations were comprehensively updated, increasing GO term coverage to 75% of families. The release adds 14 new Hepatitis C Virus RNA families and completes microRNA family synchronisation with miRBase, resulting in 1,603 microRNA families. New data types, including FULL alignments, have been implemented. Integration with APICURON for improved curator attribution and multiple website enhancements further improve user experience. These updates significantly expand Rfam's coverage and improve annotation quality, reinforcing its critical role in RNA research, genome annotation, and the development of machine learning models. Rfam is freely available at https://rfam.org.

摘要

Rfam数据库是一个广泛使用的非编码RNA(ncRNA)家族知识库,在15.0版本中经历了重大更新。本文介绍了主要改进内容,包括将Rfamseq扩展到26106个基因组,增加了76%,纳入了最新的UniProt参考蛋白质组和额外的病毒基因组。利用实验确定的3D结构对65个RNA家族进行了改进,提高了共有二级结构和注释的准确性。使用R-scape共变分析对26个家族的结构预测进行了优化。全面更新了基因本体论和序列本体论注释,使GO术语覆盖率提高到75%的家族。该版本增加了14个新的丙型肝炎病毒RNA家族,并完成了与miRBase的微小RNA家族同步,产生了1603个微小RNA家族。已经实现了包括完全比对在内的新数据类型。与APICURON集成以改进策展人归属,以及对多个网站的增强进一步改善了用户体验。这些更新显著扩展了Rfam的覆盖范围并提高了注释质量,加强了其在RNA研究、基因组注释和机器学习模型开发中的关键作用。Rfam可在https://rfam.org免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63eb/11451735/70426079c9be/nihpp-2024.09.23.614430v1-f0002.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验