High-Performance Computing and Networking Institute, National Research Council of Italy, Via P. Castellino, 111, 80131 Napoli, Italy.
Department of Medicine, Immunology and Allergy Unit, Karolinska Institutet, 171 76 Stockholm, Sweden.
Int J Mol Sci. 2019 Dec 3;20(23):6098. doi: 10.3390/ijms20236098.
The comparison of high throughput gene expression datasets obtained from different experimental conditions is a challenging task. It provides an opportunity to explore the cellular response to various biological events such as disease, environmental conditions, and drugs. There is a need for tools that allow the integration and analysis of such data. We developed the "RankerGUI pipeline", a user-friendly web application for the biological community. It allows users to use various rank based statistical approaches for the comparison of full differential gene expression profiles between the same or different biological states obtained from different sources. The pipeline modules are an integration of various open-source packages, a few of which are modified for extended functionality. The main modules include rank rank hypergeometric overlap, enriched rank rank hypergeometric overlap and distance calculations. Additionally, preprocessing steps such as merging differential expression profiles of multiple independent studies can be added before running the main modules. Output plots show the strength, pattern, and trends among complete differential expression profiles. In this paper, we describe the various modules and functionalities of the developed pipeline. We also present a case study that demonstrates how the pipeline can be used for the comparison of differential expression profiles obtained from multiple platforms' data of the Gene Expression Omnibus. Using these comparisons, we investigate gene expression patterns in kidney and lung cancers.
比较不同实验条件下获得的高通量基因表达数据集是一项具有挑战性的任务。它提供了一个机会来探索细胞对各种生物事件(如疾病、环境条件和药物)的反应。需要有工具来允许整合和分析这些数据。我们开发了“RankerGUI 管道”,这是一个面向生物界的用户友好的 Web 应用程序。它允许用户使用各种基于排名的统计方法来比较来自不同来源的相同或不同生物状态的全差异基因表达谱。该管道模块集成了各种开源软件包,其中一些经过修改以扩展功能。主要模块包括排名超几何重叠、富集排名超几何重叠和距离计算。此外,在运行主模块之前,可以添加合并多个独立研究的差异表达谱等预处理步骤。输出图显示了完整差异表达谱之间的强度、模式和趋势。在本文中,我们描述了开发的管道的各个模块和功能。我们还展示了一个案例研究,演示了如何使用该管道比较来自基因表达综合数据库多个平台数据的差异表达谱。使用这些比较,我们研究了肾脏和肺癌中的基因表达模式。
BMC Bioinformatics. 2005-3-17
Brief Funct Genomics. 2015-3
BMC Genomics. 2013-10-7
BMC Genomics. 2015
Life Sci Alliance. 2024-2
Cancers (Basel). 2020-12-28
Nucleic Acids Res. 2017-7-3
Bioinformatics. 2016-1-15
Nucleic Acids Res. 2013-4-24
Bioinformatics. 2011-7-6
Proc Natl Acad Sci U S A. 2010-8-2