Rahman Mahbuba, Boughorbel Sabri, Presnell Scott, Quinn Charlie, Cugno Chiara, Chaussabel Damien, Marr Nico
Sidra Medical and Research Center, Doha, Qatar.
Benaroya Research Institute, Seattle, WA, USA.
F1000Res. 2016 Mar 30;5:414. doi: 10.12688/f1000research.8375.1. eCollection 2016.
Compendia of large-scale datasets made available in public repositories provide an opportunity to identify and fill gaps in biomedical knowledge. But first, these data need to be made readily accessible to research investigators for interpretation. Here we make available a collection of transcriptome datasets to investigate the functional programming of human hematopoietic cells in early life. Thirty two datasets were retrieved from the NCBI Gene Expression Omnibus (GEO) and loaded in a custom web application called the Gene Expression Browser (GXB), which was designed for interactive query and visualization of integrated large-scale data. Quality control checks were performed. Multiple sample groupings and gene rank lists were created allowing users to reveal age-related differences in transcriptome profiles, changes in the gene expression of neonatal hematopoietic cells to a variety of immune stimulators and modulators, as well as during cell differentiation. Available demographic, clinical, and cell phenotypic information can be overlaid with the gene expression data and used to sort samples. Web links to customized graphical views can be generated and subsequently inserted in manuscripts to report novel findings. GXB also enables browsing of a single gene across projects, thereby providing new perspectives on age- and developmental stage-specific expression of a given gene across the human hematopoietic system. This dataset collection is available at: http://developmentalimmunology.gxbsidra.org/dm3/geneBrowser/list.
公共存储库中提供的大规模数据集汇编为识别和填补生物医学知识空白提供了机会。但首先,这些数据需要让研究人员能够轻松获取以便进行解读。在此,我们提供了一组转录组数据集,用于研究人类造血细胞在生命早期的功能编程。从NCBI基因表达综合数据库(GEO)中检索到32个数据集,并加载到一个名为基因表达浏览器(GXB)的自定义网络应用程序中,该应用程序旨在对整合的大规模数据进行交互式查询和可视化。进行了质量控制检查。创建了多个样本分组和基因排名列表,使用户能够揭示转录组图谱中与年龄相关的差异、新生儿造血细胞对各种免疫刺激剂和调节剂的基因表达变化,以及细胞分化过程中的变化。可用的人口统计学、临床和细胞表型信息可以与基因表达数据叠加,并用于对样本进行分类。可以生成指向定制图形视图的网络链接,并随后插入到稿件中以报告新发现。GXB还能够跨项目浏览单个基因,从而为特定基因在人类造血系统中的年龄和发育阶段特异性表达提供新的视角。该数据集集合可在以下网址获取:http://developmentalimmunology.gxbsidra.org/dm3/geneBrowser/list。