• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

双基因集富集分析(dualGSEA);一种R函数,可从转录组学数据中实现更可靠的生物学发现和临床前模型比对。

Dual gene set enrichment analysis (dualGSEA); an R function that enables more robust biological discovery and pre-clinical model alignment from transcriptomics data.

作者信息

Bull Courtney, Byrne Ryan M, Fisher Natalie C, Corry Shania M, Amirkhah Raheleh, Edwards Jessica, Hillson Lily V S, Lawler Mark, Ryan Aideen E, Lamrock Felicity, Dunne Philip D, Malla Sudhir B

机构信息

The Patrick G Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, UK.

School of Cancer Sciences, University of Glasgow, Glasgow, UK.

出版信息

Sci Rep. 2024 Dec 4;14(1):30202. doi: 10.1038/s41598-024-80534-8.

DOI:10.1038/s41598-024-80534-8
PMID:39632911
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11618328/
Abstract

Gene set enrichment analysis (GSEA) tools can identify biological insights within gene expression-based studies. Although their statistical performance has been compared, the downstream biological implications that arise when choosing between the range of pairwise or single sample forms of GSEA methods remain understudied. We compare the statistical and biological results obtained from various pre-ranking methods/options for pairwise GSEA, followed by a stand-alone comparison of GSEA, single sample GSEA (ssGSEA) and gene set variation analysis (GSVA). Pairwise GSEA and fGSEA provide similar results when deployed using a range of gene pre-ranking methods. However, pairwise GSEA can overgeneralise biological enrichment, as when the most statistically significant signatures were assessed using single sample approaches, there was a complete absence of biological distinction between these groups. To avoid these issues, we developed a new dualGSEA tool, which provides users with multiple statistics and visuals to aid interpretation of results. This new tool removes the possibility of users inadvertently interpreting statistical findings as equating to biological distinction between samples within groups-of-interest. dualGSEA provides a more robust basis for discovery research, one which allows user to compare both statistical significance alongside biological distinctions in their data.

摘要

基因集富集分析(GSEA)工具能够在基于基因表达的研究中识别生物学见解。尽管已对其统计性能进行了比较,但在GSEA方法的成对或单样本形式范围内进行选择时所产生的下游生物学影响仍未得到充分研究。我们比较了从成对GSEA的各种预排序方法/选项中获得的统计和生物学结果,随后对GSEA、单样本GSEA(ssGSEA)和基因集变异分析(GSVA)进行了单独比较。当成对GSEA使用一系列基因预排序方法进行部署时,成对GSEA和fGSEA提供了相似的结果。然而,成对GSEA可能会过度概括生物学富集情况,因为当使用单样本方法评估最具统计学意义的特征时,这些组之间完全没有生物学差异。为避免这些问题,我们开发了一种新的dualGSEA工具,它为用户提供多种统计信息和可视化手段,以帮助解释结果。这个新工具消除了用户无意中将统计结果解释为等同于感兴趣组内样本之间生物学差异的可能性。dualGSEA为发现性研究提供了一个更可靠的基础,它允许用户在数据中同时比较统计显著性和生物学差异。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/5e6a2c429b08/41598_2024_80534_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/1203c9b8e32c/41598_2024_80534_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/767d63588c12/41598_2024_80534_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/84ebe1cc05c6/41598_2024_80534_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/5e6a2c429b08/41598_2024_80534_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/1203c9b8e32c/41598_2024_80534_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/767d63588c12/41598_2024_80534_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/84ebe1cc05c6/41598_2024_80534_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3926/11618328/5e6a2c429b08/41598_2024_80534_Fig4_HTML.jpg

相似文献

1
Dual gene set enrichment analysis (dualGSEA); an R function that enables more robust biological discovery and pre-clinical model alignment from transcriptomics data.双基因集富集分析(dualGSEA);一种R函数,可从转录组学数据中实现更可靠的生物学发现和临床前模型比对。
Sci Rep. 2024 Dec 4;14(1):30202. doi: 10.1038/s41598-024-80534-8.
2
Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management.基于基因集富集分析的 clear cell 肾细胞癌基因表达分析用于生物统计学管理。
BJU Int. 2011 Jul;108(2 Pt 2):E29-35. doi: 10.1111/j.1464-410X.2010.09794.x. Epub 2011 Mar 16.
3
BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses.BubbleGUM:表型分子特征的自动提取及多种基因集富集分析的综合可视化
BMC Genomics. 2015 Oct 19;16:814. doi: 10.1186/s12864-015-2012-4.
4
RNA-Seq and Gene Set Enrichment Analysis (GSEA) in Peripheral Blood Mononuclear Cells (PBMCs).外周血单个核细胞(PBMCs)中的RNA测序(RNA-Seq)和基因集富集分析(GSEA)
Methods Mol Biol. 2025;2880:179-192. doi: 10.1007/978-1-0716-4276-4_8.
5
multiGSEA: a GSEA-based pathway enrichment analysis for multi-omics data.multiGSEA:一种基于 GSEA 的多组学数据通路富集分析方法。
BMC Bioinformatics. 2020 Dec 7;21(1):561. doi: 10.1186/s12859-020-03910-x.
6
Confero: an integrated contrast data and gene set platform for computational analysis and biological interpretation of omics data.Confero:一个集成的对比数据和基因集平台,用于计算分析和生物学解释组学数据。
BMC Genomics. 2013 Jul 29;14:514. doi: 10.1186/1471-2164-14-514.
7
Comparative study of gene set enrichment methods.基因集富集方法的比较研究。
BMC Bioinformatics. 2009 Sep 2;10:275. doi: 10.1186/1471-2105-10-275.
8
Assessment of Gene Set Enrichment Analysis using curated RNA-seq-based benchmarks.基于 RNA-seq 验证集的基因集富集分析评估。
PLoS One. 2024 May 16;19(5):e0302696. doi: 10.1371/journal.pone.0302696. eCollection 2024.
9
Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates.利用小样本重复改进RNA测序数据的基因集富集分析
PLoS One. 2016 Nov 9;11(11):e0165919. doi: 10.1371/journal.pone.0165919. eCollection 2016.
10
PAGE: parametric analysis of gene set enrichment.PAGE:基因集富集的参数分析
BMC Bioinformatics. 2005 Jun 8;6:144. doi: 10.1186/1471-2105-6-144.

引用本文的文献

1
Biomaterial-mediated Cell Atlas: an insight from single-cell and spatial transcriptomics.生物材料介导的细胞图谱:来自单细胞和空间转录组学的见解
Bioact Mater. 2025 Aug 8;54:1-33. doi: 10.1016/j.bioactmat.2025.07.047. eCollection 2025 Dec.
2
Optimizing Model Performance and Interpretability: Application to Biological Data Classification.优化模型性能与可解释性:在生物数据分类中的应用
Genes (Basel). 2025 Feb 28;16(3):297. doi: 10.3390/genes16030297.

本文引用的文献

1
Author Correction: Pathway level subtyping identifies a slow-cycling biological phenotype associated with poor clinical outcomes in colorectal cancer.作者更正:通路水平亚型分类识别出一种与结直肠癌临床预后不良相关的慢循环生物学表型。
Nat Genet. 2024 Jun;56(6):1321. doi: 10.1038/s41588-024-01809-4.
2
Pathway level subtyping identifies a slow-cycling biological phenotype associated with poor clinical outcomes in colorectal cancer.通路水平亚分型确定了一种与结直肠癌临床预后不良相关的慢周期生物学表型。
Nat Genet. 2024 Mar;56(3):458-472. doi: 10.1038/s41588-024-01654-5. Epub 2024 Feb 13.
3
Longitudinal whole blood transcriptomic analysis characterizes neutrophil activation and interferon signaling in moderate and severe COVID-19.
纵向全血转录组分析描绘了中重度 COVID-19 患者中性粒细胞激活和干扰素信号转导特征。
Sci Rep. 2023 Jun 26;13(1):10368. doi: 10.1038/s41598-023-37606-y.
4
Integrated Machine Learning and Single-Sample Gene Set Enrichment Analysis Identifies a TGF-Beta Signaling Pathway Derived Score in Headneck Squamous Cell Carcinoma.整合机器学习与单样本基因集富集分析鉴定头颈部鳞状细胞癌中源自转化生长因子-β信号通路的评分
J Oncol. 2022 Sep 1;2022:3140263. doi: 10.1155/2022/3140263. eCollection 2022.
5
Activation of innate-adaptive immune machinery by poly(I:C) exposes a therapeutic vulnerability to prevent relapse in stroma-rich colon cancer.聚肌苷酸-聚胞苷酸(poly(I:C))激活固有-适应性免疫机制,为预防富含基质的结直肠癌复发提供了治疗上的弱点。
Gut. 2022 Dec;71(12):2502-2517. doi: 10.1136/gutjnl-2021-326183. Epub 2022 Apr 27.
6
clusterProfiler 4.0: A universal enrichment tool for interpreting omics data.clusterProfiler 4.0:用于解释组学数据的通用富集工具。
Innovation (Camb). 2021 Jul 1;2(3):100141. doi: 10.1016/j.xinn.2021.100141. eCollection 2021 Aug 28.
7
A hypothesis is a liability.假设是一种负担。
Genome Biol. 2020 Sep 3;21(1):231. doi: 10.1186/s13059-020-02133-w.
8
Gene Set Analysis: Challenges, Opportunities, and Future Research.基因集分析:挑战、机遇与未来研究
Front Genet. 2020 Jun 30;11:654. doi: 10.3389/fgene.2020.00654. eCollection 2020.
9
ssGSEA score-based Ras dependency indexes derived from gene expression data reveal potential Ras addiction mechanisms with possible clinical implications.基于 ssGSEA 评分的 Ras 依赖性指数来源于基因表达数据,揭示了潜在的 Ras 成瘾机制,并具有潜在的临床意义。
Sci Rep. 2020 Jun 24;10(1):10258. doi: 10.1038/s41598-020-66986-8.
10
The FAIR Guiding Principles for scientific data management and stewardship.科学数据管理和保存的 FAIR 指导原则。
Sci Data. 2016 Mar 15;3:160018. doi: 10.1038/sdata.2016.18.