Suppr超能文献

基于独立检验的基因集分析:复杂的零假设和特定备择假设。

Gene set analysis for self-contained tests: complex null and specific alternative hypotheses.

机构信息

Division of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA.

出版信息

Bioinformatics. 2012 Dec 1;28(23):3073-80. doi: 10.1093/bioinformatics/bts579. Epub 2012 Oct 7.

Abstract

MOTIVATION

The analysis of differentially expressed gene sets became a routine in the analyses of gene expression data. There is a multitude of tests available, ranging from aggregation tests that summarize gene-level statistics for a gene set to true multivariate tests, accounting for intergene correlations. Most of them detect complex departures from the null hypothesis but when the null hypothesis is rejected, the specific alternative leading to the rejection is not easily identifiable.

RESULTS

In this article we compare the power and Type I error rates of minimum-spanning tree (MST)-based non-parametric multivariate tests with several multivariate and aggregation tests, which are frequently used for pathway analyses. In our simulation study, we demonstrate that MST-based tests have power that is for many settings comparable with the power of conventional approaches, but outperform them in specific regions of the parameter space corresponding to biologically relevant configurations. Further, we find for simulated and for gene expression data that MST-based tests discriminate well against shift and scale alternatives. As a general result, we suggest a two-step practical analysis strategy that may increase the interpretability of experimental data: first, apply the most powerful multivariate test to find the subset of pathways for which the null hypothesis is rejected and second, apply MST-based tests to these pathways to select those that support specific alternative hypotheses.

CONTACT

gvglazko@uams.edu or yrahmatallah@uams.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

差异表达基因集的分析已成为基因表达数据分析的常规方法。有多种测试方法可供选择,从汇总基因集基因水平统计信息的聚合测试到真正的多变量测试,都可以考虑基因间相关性。大多数方法都可以检测到复杂的偏离零假设的情况,但当零假设被拒绝时,导致拒绝的具体替代方案不容易识别。

结果

在本文中,我们比较了基于最小生成树(MST)的非参数多变量测试与几种常用的通路分析多变量和聚合测试的功效和 I 型错误率。在我们的模拟研究中,我们证明基于 MST 的测试具有功效,对于许多情况来说,其功效与传统方法相当,但在对应于生物学相关配置的特定参数空间区域中表现优于它们。此外,我们发现对于模拟数据和基因表达数据,基于 MST 的测试可以很好地区分偏移和缩放替代方案。总的来说,我们建议采用一种两步实用分析策略,可以提高实验数据的可解释性:首先,应用最强大的多变量测试来找到拒绝零假设的通路子集,其次,将基于 MST 的测试应用于这些通路,以选择支持特定替代假设的通路。

联系方式

gvglazko@uams.eduyrahmatallah@uams.edu

补充信息

补充数据可在 Bioinformatics 在线获取。

相似文献

5
A new gene selection procedure based on the covariance distance.基于协方差距离的新基因选择过程。
Bioinformatics. 2010 Feb 1;26(3):348-54. doi: 10.1093/bioinformatics/btp672. Epub 2009 Dec 8.
10
Statistical methods for gene set co-expression analysis.基因集共表达分析的统计方法。
Bioinformatics. 2009 Nov 1;25(21):2780-6. doi: 10.1093/bioinformatics/btp502. Epub 2009 Aug 18.

引用本文的文献

7
Gene set analysis using sufficient dimension reduction.使用充分降维的基因集分析。
BMC Bioinformatics. 2016 Feb 6;17:74. doi: 10.1186/s12859-016-0928-6.

本文引用的文献

4
Linear combination test for hierarchical gene set analysis.用于分层基因集分析的线性组合检验
Stat Appl Genet Mol Biol. 2011;10:Article 13. doi: 10.2202/1544-6115.1641.
7
ROAST: rotation gene set tests for complex microarray experiments.ROAST:用于复杂微阵列实验的旋转基因集检验。
Bioinformatics. 2010 Sep 1;26(17):2176-82. doi: 10.1093/bioinformatics/btq401. Epub 2010 Jul 7.
8
Independent filtering increases detection power for high-throughput experiments.独立过滤提高了高通量实验的检测能力。
Proc Natl Acad Sci U S A. 2010 May 25;107(21):9546-51. doi: 10.1073/pnas.0914005107. Epub 2010 May 11.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验