Computational Genomics Lab, Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 100101, China.
University of Chinese Academy of Sciences, Beijing, 100049, China.
Genome Biol. 2020 Apr 28;21(1):101. doi: 10.1186/s13059-020-02018-y.
Existing circular RNA (circRNA) databases have become essential for transcriptomics. However, most are unsuitable for mining the in-depth information needed to prioritize candidate circRNAs. To address this, we integrate circular transcript collections to develop the circAtlas database, based on 1070 RNA-seq samples collected from 19 normal tissues across six vertebrate species. This database contains 1,007,087 highly reliable circRNAs, of which over 81.3% have been assembled into full-length sequences. We profile their expression patterns, conservation, and functional annotation. We describe a novel multiple conservation score, together with co-expression and regulatory networks, for circRNA annotation and prioritization. CircAtlas can be accessed at http://circatlas.biols.ac.cn/.