GSEA-InContext: identifying novel and common patterns in expression experiments.

Author information

Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Department of Pharmacology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Publication information

Bioinformatics. 2018 Jul 1;34(13):i555-i564. doi: 10.1093/bioinformatics/bty271.

DOI:10.1093/bioinformatics/bty271

PMID:29950010

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6022535/

Abstract

MOTIVATION

Gene Set Enrichment Analysis (GSEA) is routinely used to analyze and interpret coordinate pathway-level changes in transcriptomics experiments. For an experiment where fewer than seven samples per condition are compared, GSEA employs a competitive null hypothesis to test significance. A gene set enrichment score is tested against a null distribution of enrichment scores generated from permuted gene sets, where genes are randomly selected from the input experiment. Looking across a variety of biological conditions, however, genes are not randomly distributed, with many showing consistent patterns of up- or down-regulation. As a result, common patterns of positively and negatively enriched gene sets are observed across experiments. Placing a single experiment into the context of a relevant set of background experiments allows us to identify both the common and experiment-specific patterns of gene set enrichment.
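The competitive, gene-set-permutation null described above can be illustrated with a minimal Python sketch. It is not the GSEA reference implementation: the function names `enrichment_score` and `permutation_pvalue`, the weighting exponent `p`, the permutation count, and the toy data are all assumptions made for illustration. It computes a weighted running-sum enrichment score on a ranked gene list and compares it with scores of random gene sets of the same size drawn from the same experiment.

```python
import numpy as np

def enrichment_score(ranked_scores, in_set, p=1.0):
    """Weighted running-sum enrichment score over a ranked gene list.

    ranked_scores: gene-level scores sorted in descending order.
    in_set: boolean mask, True where the ranked gene belongs to the gene set.
    """
    ranked_scores = np.asarray(ranked_scores, dtype=float)
    in_set = np.asarray(in_set, dtype=bool)
    hit = np.where(in_set, np.abs(ranked_scores) ** p, 0.0)
    n_miss = np.count_nonzero(~in_set)
    if hit.sum() == 0 or n_miss == 0:
        return 0.0
    # Step up at gene-set members (weighted), step down at all other genes.
    running = np.cumsum(hit / hit.sum() - (~in_set) / n_miss)
    # The score is the running-sum value with the largest absolute deviation from zero.
    return float(running[np.argmax(np.abs(running))])

def permutation_pvalue(ranked_scores, in_set, n_perm=1000, seed=0):
    """Nominal p-value from random gene sets drawn uniformly from the input experiment."""
    rng = np.random.default_rng(seed)
    in_set = np.asarray(in_set, dtype=bool)
    observed = enrichment_score(ranked_scores, in_set)
    n, k = in_set.size, int(in_set.sum())
    null = np.empty(n_perm)
    for i in range(n_perm):
        mask = np.zeros(n, dtype=bool)
        mask[rng.choice(n, size=k, replace=False)] = True
        null[i] = enrichment_score(ranked_scores, mask)
    # Assumed convention: compare only against null scores of the same sign.
    same_sign = null[null >= 0] if observed >= 0 else null[null < 0]
    extreme = np.count_nonzero(np.abs(same_sign) >= abs(observed))
    return observed, (extreme + 1) / (same_sign.size + 1)

# Toy experiment: 200 genes, with the gene set occupying the top 15 ranks.
scores = np.linspace(3.0, -3.0, 200)
top_set = np.arange(200) < 15
es, pval = permutation_pvalue(scores, top_set)
```

Because every random gene set here is drawn uniformly from the input experiment, this null cannot tell whether a gene set is enriched in most experiments or only in this one, which is the limitation the abstract raises.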

RESULTS

We compiled a compendium of 442 small molecule transcriptomic experiments and used GSEA to characterize common patterns of positively and negatively enriched gene sets. To identify experiment-specific gene set enrichment, we developed the GSEA-InContext method that accounts for gene expression patterns within a background set of experiments to identify statistically significantly enriched gene sets. We evaluated GSEA-InContext on experiments using small molecules with known targets to show that it successfully prioritizes gene sets that are specific to each experiment, thus providing valuable insights that complement standard GSEA analysis.
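As a rough illustration of placing an experiment in the context of a background set, the sketch below scores one gene set in the query experiment and in each background experiment, then asks how often the background is at least as extreme. This is a deliberately simplified stand-in and not the GSEA-InContext algorithm, whose null construction is defined in the paper and the repository. The names `running_sum_es` and `score_in_context`, the unweighted score, and the synthetic data are assumptions for illustration only.

```python
import numpy as np

def running_sum_es(ranked_genes, gene_set):
    """Unweighted running-sum enrichment score for one ranked gene list."""
    in_set = np.array([g in gene_set for g in ranked_genes])
    n_hit = int(in_set.sum())
    n_miss = in_set.size - n_hit
    if n_hit == 0 or n_miss == 0:
        return 0.0
    running = np.cumsum(np.where(in_set, 1.0 / n_hit, -1.0 / n_miss))
    return float(running[np.argmax(np.abs(running))])

def score_in_context(experiment, background, gene_set):
    """Compare a gene set's score in one experiment with its scores across a background.

    Returns the observed score and the fraction of background experiments whose
    score is at least as extreme in the same direction (with a +1 correction).
    """
    observed = running_sum_es(experiment, gene_set)
    bg = np.array([running_sum_es(ranked, gene_set) for ranked in background])
    extreme = np.sum(bg >= observed) if observed >= 0 else np.sum(bg <= observed)
    return observed, (extreme + 1) / (bg.size + 1)

# Synthetic data: 100 genes; "common" genes sit at the top of every background
# experiment, while "specific" genes are at the top only in the query experiment.
rng = np.random.default_rng(0)
genes = [f"g{i}" for i in range(100)]
common = set(genes[:10])
specific = set(genes[50:60])
rest = [g for g in genes if g not in common]

background = [genes[:10] + list(rng.permutation(rest)) for _ in range(50)]
others = [g for g in genes if g not in common and g not in specific]
query = genes[50:60] + genes[:10] + others

es_specific, p_specific = score_in_context(query, background, specific)
es_common, p_common = score_in_context(query, background, common)
```

In this toy setup both gene sets score highly in the query experiment, but only the experiment-specific set stands out against the background; the commonly enriched set scores just as highly in every background experiment.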

AVAILABILITY AND IMPLEMENTATION

GSEA-InContext (implemented in Python), the Supplementary results and the background expression compendium are available at: https://github.com/CostelloLab/GSEA-InContext.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2085/6022535/4878d656e675/bty271f1.jpg
