Suppr超能文献

基因表达星云 (GEN):一个综合性的数据门户,整合了多个物种在 bulk 和单细胞水平的转录组谱。

Gene Expression Nebulas (GEN): a comprehensive data portal integrating transcriptomic profiles across multiple species at both bulk and single-cell levels.

机构信息

National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.

China National Center for Bioinformation, Beijing 100101, China.

出版信息

Nucleic Acids Res. 2022 Jan 7;50(D1):D1016-D1024. doi: 10.1093/nar/gkab878.

Abstract

Transcriptomic profiling is critical to uncovering functional elements from transcriptional and post-transcriptional aspects. Here, we present Gene Expression Nebulas (GEN, https://ngdc.cncb.ac.cn/gen/), an open-access data portal integrating transcriptomic profiles under various biological contexts. GEN features a curated collection of high-quality bulk and single-cell RNA sequencing datasets by using standardized data processing pipelines and a structured curation model. Currently, GEN houses a large number of gene expression profiles from 323 datasets (157 bulk and 166 single-cell), covering 50 500 samples and 15 540 169 cells across 30 species, which are further categorized into six biological contexts. Moreover, GEN integrates a full range of transcriptomic profiles on expression, RNA editing and alternative splicing for 10 bulk datasets, providing opportunities for users to conduct integrative analysis at both transcriptional and post-transcriptional levels. In addition, GEN provides abundant gene annotations based on value-added curation of transcriptomic profiles and delivers online services for data analysis and visualization. Collectively, GEN presents a comprehensive collection of transcriptomic profiles across multiple species, thus serving as a fundamental resource for better understanding genetic regulatory architecture and functional mechanisms from tissues to cells.

摘要

转录组谱分析对于从转录和转录后方面揭示功能元件至关重要。在这里,我们介绍了基因表达星云(GEN,https://ngdc.cncb.ac.cn/gen/),这是一个开放获取的数据门户,整合了各种生物背景下的转录组谱。GEN 采用标准化的数据处理流程和结构化的策展模型,以精选的高质量批量和单细胞 RNA 测序数据集为特色。目前,GEN 拥有来自 323 个数据集(157 个批量和 166 个单细胞)的大量基因表达谱,涵盖 30 个物种的 50500 个样本和 15540169 个细胞,这些数据集进一步分为六个生物学背景。此外,GEN 整合了 10 个批量数据集在表达、RNA 编辑和可变剪接方面的全转录组谱,为用户在转录和转录后水平进行综合分析提供了机会。此外,GEN 提供了丰富的基因注释,这些注释是基于对转录组谱的增值策展,还提供了数据分析和可视化的在线服务。总之,GEN 提供了多个物种的综合转录组谱集,因此是更好地理解从组织到细胞的遗传调控结构和功能机制的基本资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6066/8728231/f3aead3616fb/gkab878fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验