• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GSVA:用于微阵列和 RNA-seq 数据的基因集变异分析。

GSVA: gene set variation analysis for microarray and RNA-seq data.

机构信息

Research Program on Biomedical Informatics, Hospital del Mar Medical Research Institute, Barcelona, Catalonia, Spain.

出版信息

BMC Bioinformatics. 2013 Jan 16;14:7. doi: 10.1186/1471-2105-14-7.

DOI:10.1186/1471-2105-14-7
PMID:23323831
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3618321/
Abstract

BACKGROUND

Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets.

RESULTS

To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments.

CONCLUSIONS

GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.

摘要

背景

基因集富集(GSE)分析是一种将基因表达谱信息浓缩为途径或特征摘要的流行框架。与单个基因分析相比,这种方法的优势包括降噪和降维,以及更大的生物学可解释性。随着分子分析实验超越简单的病例对照研究,需要稳健且灵活的 GSE 方法来对高度异质数据集内的途径活性进行建模。

结果

为了解决这一挑战,我们引入了基因集变异分析(GSVA),这是一种 GSE 方法,能够以无监督的方式估计样本群体中途径活性的变化。我们通过与当前最先进的样本富集方法进行比较,证明了 GSVA 的稳健性。此外,我们还提供了其在差异途径活性和生存分析中的应用示例。最后,我们展示了 GSVA 如何与微阵列和 RNA-seq 实验数据类似地工作。

结论

与相应的方法相比,GSVA 提供了在样本群体中检测细微途径活性变化的能力。虽然 GSE 方法通常被视为生物信息学分析的终点,但 GSVA 构成了构建以途径为中心的生物学模型的起点。此外,GSVA 有助于满足当前对 RNA-seq 数据的 GSE 方法的需求。GSVA 是一个用于 R 的开源软件包,它是 Bioconductor 项目的一部分,可以在 http://www.bioconductor.org 下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/4fb4b19a290f/1471-2105-14-7-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/896963265f7e/1471-2105-14-7-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/252822d30bdc/1471-2105-14-7-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/42050a2fa9a0/1471-2105-14-7-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/83790343a6d6/1471-2105-14-7-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/626275724c94/1471-2105-14-7-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/f8b4fcb30b2f/1471-2105-14-7-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/4fb4b19a290f/1471-2105-14-7-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/896963265f7e/1471-2105-14-7-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/252822d30bdc/1471-2105-14-7-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/42050a2fa9a0/1471-2105-14-7-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/83790343a6d6/1471-2105-14-7-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/626275724c94/1471-2105-14-7-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/f8b4fcb30b2f/1471-2105-14-7-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/216d/3618321/4fb4b19a290f/1471-2105-14-7-7.jpg

相似文献

1
GSVA: gene set variation analysis for microarray and RNA-seq data.GSVA:用于微阵列和 RNA-seq 数据的基因集变异分析。
BMC Bioinformatics. 2013 Jan 16;14:7. doi: 10.1186/1471-2105-14-7.
2
A flexible count data model to fit the wide diversity of expression profiles arising from extensively replicated RNA-seq experiments.一种灵活的计数数据模型,可适用于广泛复制的 RNA-seq 实验所产生的广泛多样化的表达谱。
BMC Bioinformatics. 2013 Aug 21;14:254. doi: 10.1186/1471-2105-14-254.
3
Importing ArrayExpress datasets into R/Bioconductor.将 ArrayExpress 数据集导入 R/Bioconductor。
Bioinformatics. 2009 Aug 15;25(16):2092-4. doi: 10.1093/bioinformatics/btp354. Epub 2009 Jun 8.
4
Combining multiple tools outperforms individual methods in gene set enrichment analyses.在基因集富集分析中,结合多种工具比单独使用方法表现更优。
Bioinformatics. 2017 Feb 1;33(3):414-424. doi: 10.1093/bioinformatics/btw623.
5
SeqGSEA: a Bioconductor package for gene set enrichment analysis of RNA-Seq data integrating differential expression and splicing.SeqGSEA:一个用于 RNA-Seq 数据基因集富集分析的 Bioconductor 软件包,集成了差异表达和剪接分析。
Bioinformatics. 2014 Jun 15;30(12):1777-9. doi: 10.1093/bioinformatics/btu090. Epub 2014 Feb 17.
6
DeepGSEA: explainable deep gene set enrichment analysis for single-cell transcriptomic data.DeepGSEA:单细胞转录组数据的可解释深度基因集富集分析。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae434.
7
Importing MAGE-ML format microarray data into BioConductor.将MAGE-ML格式的微阵列数据导入生物导体(BioConductor)。
Bioinformatics. 2004 Dec 12;20(18):3641-2. doi: 10.1093/bioinformatics/bth396. Epub 2004 Jul 15.
8
ExpressionPlot: a web-based framework for analysis of RNA-Seq and microarray gene expression data.表达谱图:一个基于网络的 RNA-Seq 和微阵列基因表达数据分析框架。
Genome Biol. 2011 Jul 28;12(7):R69. doi: 10.1186/gb-2011-12-7-r69.
9
puma 3.0: improved uncertainty propagation methods for gene and transcript expression analysis.puma 3.0:改进了基因和转录本表达分析的不确定性传播方法。
BMC Bioinformatics. 2013 Feb 5;14:39. doi: 10.1186/1471-2105-14-39.
10
DEGseq: an R package for identifying differentially expressed genes from RNA-seq data.DEGseq:一个用于从 RNA-seq 数据中识别差异表达基因的 R 包。
Bioinformatics. 2010 Jan 1;26(1):136-8. doi: 10.1093/bioinformatics/btp612. Epub 2009 Oct 24.

引用本文的文献

1
A multi-tissue single-cell expression atlas in cattle.牛的多组织单细胞表达图谱。
Nat Genet. 2025 Sep 5. doi: 10.1038/s41588-025-02329-5.
2
A framework to mine laser microdissection-based omics data and uncover regulators of pancreatic cancer heterogeneity.一种挖掘基于激光显微切割的组学数据并揭示胰腺癌异质性调节因子的框架。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf101.
3
Developing a prognostic model of glutamine metabolism-related genes associated with clinical features and immune status in melanoma.构建与黑色素瘤临床特征和免疫状态相关的谷氨酰胺代谢相关基因的预后模型。

本文引用的文献

1
The limitations of simple gene set enrichment analysis assuming gene independence.假设基因独立性的简单基因集富集分析的局限性。
Stat Methods Med Res. 2016 Feb;25(1):472-87. doi: 10.1177/0962280212460441. Epub 2012 Oct 14.
2
Bayesian Kernel Mixtures for Counts.用于计数的贝叶斯核混合模型
J Am Stat Assoc. 2011 Dec 1;106(496):1528-1539. doi: 10.1198/jasa.2011.tm10552. Epub 2012 Jan 24.
3
Ten years of pathway analysis: current approaches and outstanding challenges.十年的通路分析:当前方法和突出挑战。
Front Oncol. 2025 Aug 20;15:1485006. doi: 10.3389/fonc.2025.1485006. eCollection 2025.
4
Exploring immunological alterations of B cells in peripheral immunity via single-cell RNA sequencing: insights into primary membranous nephropathy.通过单细胞RNA测序探索外周免疫中B细胞的免疫改变:对原发性膜性肾病的见解
Front Immunol. 2025 Aug 19;16:1622395. doi: 10.3389/fimmu.2025.1622395. eCollection 2025.
5
Selective targeting of TBXT with DARPins identifies regulatory networks and therapeutic vulnerabilities in chordoma.用抗肌动蛋白重复结构域蛋白选择性靶向TBXT可确定脊索瘤中的调控网络和治疗弱点。
Sci Adv. 2025 Sep 5;11(36):eadu2796. doi: 10.1126/sciadv.adu2796. Epub 2025 Sep 3.
6
Uncovering key biomarkers, potential therapeutic targets and development of deep learning model in heart failure.揭示心力衰竭中的关键生物标志物、潜在治疗靶点及深度学习模型的开发。
PLoS One. 2025 Sep 3;20(9):e0330780. doi: 10.1371/journal.pone.0330780. eCollection 2025.
7
Identification and validation of PSMB7 as a novel biomarker for prognosis and immune infiltrates of lung adenocarcinoma.鉴定和验证PSMB7作为肺腺癌预后和免疫浸润的新型生物标志物。
PeerJ. 2025 Aug 29;13:e19958. doi: 10.7717/peerj.19958. eCollection 2025.
8
A Novel Mouse Model of Granular Corneal Dystrophy Type II Reveals Impaired Autophagy and Recapitulates Human Pathogenesis.一种新型的II型颗粒状角膜营养不良小鼠模型揭示了自噬受损并重现了人类发病机制。
Invest Ophthalmol Vis Sci. 2025 Sep 2;66(12):7. doi: 10.1167/iovs.66.12.7.
9
Regulation of m7G methylation in long COVID: Expression profiles and early predictive value of key genes.长新冠中m7G甲基化的调控:关键基因的表达谱及早期预测价值
Medicine (Baltimore). 2025 Aug 29;104(35):e44209. doi: 10.1097/MD.0000000000044209.
10
Deep learning-based histomorphological subtyping and risk stratification of small cell lung cancer from hematoxylin and eosin-stained whole slide images.基于深度学习的苏木精和伊红染色全切片图像对小细胞肺癌进行组织形态学亚型分类和风险分层
Genome Med. 2025 Sep 2;17(1):98. doi: 10.1186/s13073-025-01526-5.
PLoS Comput Biol. 2012;8(2):e1002375. doi: 10.1371/journal.pcbi.1002375. Epub 2012 Feb 23.
4
Removing technical variability in RNA-seq data using conditional quantile normalization.使用条件分位数归一化去除 RNA-seq 数据中的技术变异性。
Biostatistics. 2012 Apr;13(2):204-16. doi: 10.1093/biostatistics/kxr054. Epub 2012 Jan 27.
5
Gene set enrichment analysis: performance evaluation and usage guidelines.基因集富集分析:性能评估和使用指南。
Brief Bioinform. 2012 May;13(3):281-91. doi: 10.1093/bib/bbr049. Epub 2011 Sep 7.
6
Integrated genomic analyses of ovarian carcinoma.卵巢癌的综合基因组分析。
Nature. 2011 Jun 29;474(7353):609-15. doi: 10.1038/nature10166.
7
Comparison of global tests for functional gene sets in two-group designs and selection of potentially effect-causing genes.两组设计中功能基因集的全局检验比较及潜在效应基因的选择。
Bioinformatics. 2011 May 15;27(10):1377-83. doi: 10.1093/bioinformatics/btr152. Epub 2011 Mar 26.
8
Joint genetic analysis of gene expression data with inferred cellular phenotypes.联合基因表达数据与推断的细胞表型的遗传分析。
PLoS Genet. 2011 Jan 20;7(1):e1001276. doi: 10.1371/journal.pgen.1001276.
9
COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer.COSMIC:在癌症体细胞突变目录中挖掘完整的癌症基因组。
Nucleic Acids Res. 2011 Jan;39(Database issue):D945-50. doi: 10.1093/nar/gkq929. Epub 2010 Oct 15.
10
Common inherited variation in mitochondrial genes is not enriched for associations with type 2 diabetes or related glycemic traits.常见的线粒体基因遗传变异与 2 型糖尿病或相关血糖特征的关联没有富集。
PLoS Genet. 2010 Aug 12;6(8):e1001058. doi: 10.1371/journal.pgen.1001058.