Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610010, Israel.
LifeMap Sciences Inc., Alameda, CA 94501, USA.
J Mol Biol. 2021 May 28;433(11):166913. doi: 10.1016/j.jmb.2021.166913. Epub 2021 Mar 4.
Non-coding RNA (ncRNA) genes are of increasing biological importance and are associated with a growing number of diseases. Many ncRNA sources are transcript-centric, but for non-coding variant analysis and disease decipherment it is essential to transform this information into a comprehensive set of genome-mapped ncRNA genes. We present GeneCaRNA, a new all-inclusive gene-centric ncRNA database within the GeneCards Suite. GeneCaRNA information is integrated from four community-backed data structures: the major transcript database RNAcentral with its 20 encompassed databases, and the ncRNA entries of three major gene resources, HGNC, Ensembl and NCBI Gene. GeneCaRNA presents 219,587 ncRNA gene pages, a 7-fold increase over the number available in our three gene mining sources. Each ncRNA gene has wide-ranging annotation, mined from >100 worldwide sources, which provides a powerful GeneCards-leveraged search. This search in turn empowers VarElect, our disease-gene interpretation tool, allowing one to systematically decipher ncRNA variants. The combined power of GeneCaRNA and GeneHancer, our regulatory elements database, facilitates wide-ranging scrutiny of the non-coding terra incognita of gene networks and whole genome analyses.