• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于微阵列的基因集分析:当前方法的比较。

Microarray-based gene set analysis: a comparison of current methods.

作者信息

Song Sarah, Black Michael A

机构信息

Department of Biochemistry, University of Otago, Dunedin, New Zealand.

出版信息

BMC Bioinformatics. 2008 Nov 27;9:502. doi: 10.1186/1471-2105-9-502.

DOI:10.1186/1471-2105-9-502
PMID:19038052
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2607289/
Abstract

BACKGROUND

The analysis of gene sets has become a popular topic in recent times, with researchers attempting to improve the interpretability and reproducibility of their microarray analyses through the inclusion of supplementary biological information. While a number of options for gene set analysis exist, no consensus has yet been reached regarding which methodology performs best, and under what conditions. The goal of this work was to examine the performance characteristics of a collection of existing gene set analysis methods, on both simulated and real microarray data sets. Of particular interest was the potential utility gained through the incorporation of inter-gene correlation into the analysis process.

RESULTS

Each of six gene set analysis methods was applied to both simulated and publicly available microarray data sets. Overall, the various methodologies were all found to be better at detecting gene sets that moved from non-active (i.e., genes not expressed) to active states (or vice versa), rather than those that simply changed their level of activity. Methods which incorporate correlation structures were found to provide increased ability to detect altered gene sets in some settings.

CONCLUSION

Based on the results obtained through the analysis of simulated data, it is clear that the performance of gene set analysis methods is strongly influenced by the features of the data set in question, and that methods which incorporate correlation structures into the analysis process tend to achieve better performance, relative to methods which rely on univariate test statistics.

摘要

背景

近年来,基因集分析已成为一个热门话题,研究人员试图通过纳入补充生物学信息来提高其微阵列分析的可解释性和可重复性。虽然存在多种基因集分析选项,但对于哪种方法表现最佳以及在何种条件下表现最佳尚未达成共识。这项工作的目标是在模拟和真实微阵列数据集上检验一系列现有基因集分析方法的性能特征。特别感兴趣的是通过将基因间相关性纳入分析过程所获得的潜在效用。

结果

六种基因集分析方法中的每一种都应用于模拟和公开可用的微阵列数据集。总体而言,发现各种方法在检测从非活跃(即未表达的基因)转变为活跃状态(反之亦然)的基因集方面比那些只是简单改变其活性水平的基因集表现更好。发现在某些情况下,纳入相关结构的方法能够增强检测改变的基因集的能力。

结论

基于通过模拟数据分析获得的结果,很明显基因集分析方法的性能受到所讨论数据集特征的强烈影响,并且相对于依赖单变量检验统计量的方法,将相关结构纳入分析过程的方法往往能取得更好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/fbf5d7c62946/1471-2105-9-502-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/c3dd606a304c/1471-2105-9-502-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/b0cf90296e12/1471-2105-9-502-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/216fe8f64716/1471-2105-9-502-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/fbf5d7c62946/1471-2105-9-502-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/c3dd606a304c/1471-2105-9-502-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/b0cf90296e12/1471-2105-9-502-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/216fe8f64716/1471-2105-9-502-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff80/2607289/fbf5d7c62946/1471-2105-9-502-4.jpg

相似文献

1
Microarray-based gene set analysis: a comparison of current methods.基于微阵列的基因集分析:当前方法的比较。
BMC Bioinformatics. 2008 Nov 27;9:502. doi: 10.1186/1471-2105-9-502.
2
Improving gene set analysis of microarray data by SAM-GS.通过SAM-GS改进微阵列数据的基因集分析
BMC Bioinformatics. 2007 Jul 5;8:242. doi: 10.1186/1471-2105-8-242.
3
Comparisons and validation of statistical clustering techniques for microarray gene expression data.微阵列基因表达数据统计聚类技术的比较与验证
Bioinformatics. 2003 Mar 1;19(4):459-66. doi: 10.1093/bioinformatics/btg025.
4
Multivariate hierarchical Bayesian model for differential gene expression analysis in microarray experiments.用于微阵列实验中差异基因表达分析的多元分层贝叶斯模型。
BMC Bioinformatics. 2008;9 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-9-S1-S9.
5
Gene set enrichment analysis for non-monotone association and multiple experimental categories.针对非单调关联和多个实验类别的基因集富集分析。
BMC Bioinformatics. 2008 Nov 14;9:481. doi: 10.1186/1471-2105-9-481.
6
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.
7
A new outlier removal approach for cDNA microarray normalization.一种用于cDNA微阵列标准化的新离群值去除方法。
Biotechniques. 2009 Aug;47(2):691-2, 694-700. doi: 10.2144/000113195.
8
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.在表达谱分析中交互式优化信噪比:Affymetrix微阵列中特定项目的算法选择和检测p值加权
Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29.
9
Nonparametric methods for identifying differentially expressed genes in microarray data.用于识别微阵列数据中差异表达基因的非参数方法。
Bioinformatics. 2002 Nov;18(11):1454-61. doi: 10.1093/bioinformatics/18.11.1454.
10
Outcome prediction based on microarray analysis: a critical perspective on methods.基于微阵列分析的结果预测:对方法的批判性观点
BMC Bioinformatics. 2009 Feb 7;10:53. doi: 10.1186/1471-2105-10-53.

引用本文的文献

1
Robustness evaluations of pathway activity inference methods on gene expression data.基于基因表达数据的通路活性推断方法的稳健性评估。
BMC Bioinformatics. 2024 Jan 12;25(1):23. doi: 10.1186/s12859-024-05632-w.
2
Identification of highly connected and differentially expressed gene subnetworks in metastasizing endometrial cancer.鉴定转移性子宫内膜癌中高度连接和差异表达的基因子网络。
PLoS One. 2018 Nov 1;13(11):e0206665. doi: 10.1371/journal.pone.0206665. eCollection 2018.
3
Integrative gene set enrichment analysis utilizing isoform-specific expression.

本文引用的文献

1
Comparative evaluation of gene-set analysis methods.基因集分析方法的比较评估
BMC Bioinformatics. 2007 Nov 7;8:431. doi: 10.1186/1471-2105-8-431.
2
Improving gene set analysis of microarray data by SAM-GS.通过SAM-GS改进微阵列数据的基因集分析
BMC Bioinformatics. 2007 Jul 5;8:242. doi: 10.1186/1471-2105-8-242.
3
Analyzing gene expression data in terms of gene sets: methodological issues.从基因集角度分析基因表达数据:方法学问题。
利用异构体特异性表达的整合基因集富集分析。
Genet Epidemiol. 2017 Sep;41(6):498-510. doi: 10.1002/gepi.22052. Epub 2017 Jun 4.
4
Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates.利用小样本重复改进RNA测序数据的基因集富集分析
PLoS One. 2016 Nov 9;11(11):e0165919. doi: 10.1371/journal.pone.0165919. eCollection 2016.
5
Genetics of structural connectivity and information processing in the brain.大脑结构连接性与信息处理的遗传学
Brain Struct Funct. 2016 Dec;221(9):4643-4661. doi: 10.1007/s00429-016-1194-0. Epub 2016 Feb 6.
6
Transferring genomics to the clinic: distinguishing Burkitt and diffuse large B cell lymphomas.将基因组学应用于临床:鉴别伯基特淋巴瘤和弥漫性大B细胞淋巴瘤
Genome Med. 2015 Jul 1;7(1):64. doi: 10.1186/s13073-015-0187-6. eCollection 2015.
7
Comparison of High-Level Microarray Analysis Methods in the Context of Result Consistency.在结果一致性背景下对高级微阵列分析方法的比较
PLoS One. 2015 Jun 9;10(6):e0128845. doi: 10.1371/journal.pone.0128845. eCollection 2015.
8
What mRNA Abundances Can Tell us about Metabolism.信使核糖核酸丰度能告诉我们关于新陈代谢的哪些信息。
Metabolites. 2012 Sep 12;2(3):614-31. doi: 10.3390/metabo2030614.
9
EDDY: a novel statistical gene set test method to detect differential genetic dependencies.EDDY:一种新型的统计基因集测试方法,用于检测差异的遗传依赖性。
Nucleic Acids Res. 2014 Apr;42(7):e60. doi: 10.1093/nar/gku099. Epub 2014 Feb 5.
10
Hierarchical clustering of high-throughput expression data based on general dependences.基于广义相关性的高通量表达数据的层次聚类。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Jul-Aug;10(4):1080-5. doi: 10.1109/TCBB.2013.99.
Bioinformatics. 2007 Apr 15;23(8):980-7. doi: 10.1093/bioinformatics/btm051. Epub 2007 Feb 15.
4
A multivariate approach for integrating genome-wide expression data and biological knowledge.一种整合全基因组表达数据与生物学知识的多变量方法。
Bioinformatics. 2006 Oct 1;22(19):2373-80. doi: 10.1093/bioinformatics/btl401. Epub 2006 Jul 28.
5
Linear models and empirical bayes methods for assessing differential expression in microarray experiments.用于评估微阵列实验中差异表达的线性模型和经验贝叶斯方法。
Stat Appl Genet Mol Biol. 2004;3:Article3. doi: 10.2202/1544-6115.1027. Epub 2004 Feb 12.
6
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.基因集富集分析:一种基于知识的方法用于解读全基因组表达谱。
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50. doi: 10.1073/pnas.0506580102. Epub 2005 Sep 30.
7
Discovering statistically significant pathways in expression profiling studies.在基因表达谱研究中发现具有统计学意义的通路。
Proc Natl Acad Sci U S A. 2005 Sep 20;102(38):13544-9. doi: 10.1073/pnas.0506577102. Epub 2005 Sep 8.
8
Testing differential gene expression in functional groups. Goeman's global test versus an ANCOVA approach.测试功能组中的差异基因表达。戈曼全局检验与协方差分析方法的比较。
Methods Inf Med. 2005;44(3):449-53.
9
Significance analysis of functional categories in gene expression studies: a structured permutation approach.基因表达研究中功能类别的显著性分析:一种结构化置换方法。
Bioinformatics. 2005 May 1;21(9):1943-9. doi: 10.1093/bioinformatics/bti260. Epub 2005 Jan 12.
10
Bioconductor: open software development for computational biology and bioinformatics.生物导体:用于计算生物学和生物信息学的开源软件开发。
Genome Biol. 2004;5(10):R80. doi: 10.1186/gb-2004-5-10-r80. Epub 2004 Sep 15.