Suppr超能文献

闭环:环状 RNA 数据库的现状与展望。

Closing the circle: current state and perspectives of circular RNA databases.

机构信息

department of Biomolecular Medicine at Ghent University and a member of the Cancer Research Institute Ghent.

department of Biomolecular Medicine at Ghent University and a group leader at the Cancer Research Institute Ghent.

出版信息

Brief Bioinform. 2021 Jan 18;22(1):288-297. doi: 10.1093/bib/bbz175.

Abstract

Circular RNAs (circRNAs) are covalently closed RNA molecules that have been linked to various diseases, including cancer. However, a precise function and working mechanism are lacking for the larger majority. Following many different experimental and computational approaches to identify circRNAs, multiple circRNA databases were developed as well. Unfortunately, there are several major issues with the current circRNA databases, which substantially hamper progression in the field. First, as the overlap in content is limited, a true reference set of circRNAs is lacking. This results from the low abundance and highly specific expression of circRNAs, and varying sequencing methods, data-analysis pipelines, and circRNA detection tools. A second major issue is the use of ambiguous nomenclature. Thus, redundant or even conflicting names for circRNAs across different databases contribute to the reproducibility crisis. Third, circRNA databases, in essence, rely on the position of the circRNA back-splice junction, whereas alternative splicing could result in circRNAs with different length and sequence. To uniquely identify a circRNA molecule, the full circular sequence is required. Fourth, circRNA databases annotate circRNAs' microRNA binding and protein-coding potential, but these annotations are generally based on presumed circRNA sequences. Finally, several databases are not regularly updated, contain incomplete data or suffer from connectivity issues. In this review, we present a comprehensive overview of the current circRNA databases and their content, features, and usability. In addition to discussing the current issues regarding circRNA databases, we come with important suggestions to streamline further research in this growing field.

摘要

环状 RNA(circRNAs)是共价闭合的 RNA 分子,与包括癌症在内的多种疾病有关。然而,绝大多数 circRNA 的精确功能和工作机制尚不清楚。通过采用多种不同的实验和计算方法来鉴定 circRNAs,开发了多个 circRNA 数据库。不幸的是,当前的 circRNA 数据库存在几个主要问题,严重阻碍了该领域的发展。首先,由于内容的重叠有限,缺乏真正的 circRNA 参考集。这是由于 circRNAs 的丰度低且表达特异性高,以及不同的测序方法、数据分析管道和 circRNA 检测工具所致。第二个主要问题是使用模糊的命名法。因此,不同数据库中 circRNAs 的冗余甚至冲突名称导致了重现性危机。第三,circRNA 数据库本质上依赖于 circRNA 反向剪接连接点的位置,而选择性剪接可能导致具有不同长度和序列的 circRNAs。为了唯一识别 circRNA 分子,需要完整的圆形序列。第四,circRNA 数据库注释 circRNAs 的 microRNA 结合和蛋白编码潜力,但这些注释通常基于假定的 circRNA 序列。最后,一些数据库没有定期更新,包含不完整的数据或存在连接问题。在这篇综述中,我们全面介绍了当前的 circRNA 数据库及其内容、特点和可用性。除了讨论 circRNA 数据库当前存在的问题外,我们还提出了重要的建议,以简化该不断发展领域的进一步研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ae6/7820840/309324565989/bbz175f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验