Nonparametric methods for microarray data based on exchangeability and borrowed power.

Author information

Lee Mei-Ling Ting, Whitmore G A, Björkbacka Harry, Freeman Mason W

Affiliation

Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA.

Publication information

J Biopharm Stat. 2005;15(5):783-97. doi: 10.1081/BIP-200067778.

DOI:10.1081/BIP-200067778

PMID:16078385

Abstract

This article proposes nonparametric inference procedures for analyzing microarray gene expression data that are reliable, robust, and simple to implement. They are conceptually transparent and require no special-purpose software. The analysis begins by normalizing gene expression data in a unique way. The resulting adjusted observations consist of gene-treatment interaction terms (representing differential expression) and error terms. The error terms are considered to be exchangeable, which is the only substantial assumption. Thus, under a family null hypothesis of no differential expression, the adjusted observations are exchangeable and all permutations of the observations are equally probable. The investigator may use the adjusted observations directly in a distribution-free test method or use their ranks in a rank-based method, where the ranking is taken over the whole data set. For the latter, the essential steps are as follows: (1) Calculate a Wilcoxon rank-sum difference or a corresponding Kruskal-Wallis rank statistic for each gene. (2) Randomly permute the observations and repeat the previous step. (3) Independently repeat the random permutation a suitable number of times. Under the exchangeability assumption, the permutation statistics are independent random draws from a null cumulative distribution function (c.d.f.) approximated by the empirical c.d.f. Reference to the empirical c.d.f. tells if the test statistic for a gene is outlying and, hence, shows differential expression. This feature is judged by using an appropriate rejection region or computing a p-value for each test statistic, taking into account multiple testing. The distribution-free analog of the rank-based approach is also available and has parallel steps which are described in the article. The proposed nonparametric analysis tends to give good results with no additional refinement, although a few refinements are presented that may interest some investigators. The implementation is illustrated with a case application involving differential gene expression in wild-type and knockout mice of an E. coli lipopolysaccharide (LPS) endotoxin treatment, relative to a baseline untreated condition.
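The three rank-based steps in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the normalization, so the input is taken to be already-adjusted observations; the "Wilcoxon rank-sum difference" is read here as the difference in mean rank between two groups; and the permutation statistics are pooled across all genes to form the empirical null c.d.f., which is one reading of "borrowed power". The function name `rank_permutation_pvalues` and its parameters are hypothetical, and no multiple-testing adjustment is applied.

```python
import numpy as np


def rank_permutation_pvalues(adjusted, groups, n_perm=200, seed=0):
    """Rank-based permutation sketch for a two-group comparison.

    adjusted : (n_genes, n_arrays) array of already-normalized observations
               (the normalization itself is assumed done elsewhere).
    groups   : length n_arrays array of 0/1 group labels.
    Returns the per-gene statistic and an unadjusted two-sided p-value.
    """
    rng = np.random.default_rng(seed)
    adjusted = np.asarray(adjusted, dtype=float)
    groups = np.asarray(groups)
    n_genes, n_arrays = adjusted.shape

    # Rank every observation over the whole data set (ties broken arbitrarily).
    ranks = np.empty(adjusted.size)
    ranks[np.argsort(adjusted, axis=None)] = np.arange(1, adjusted.size + 1)
    ranks = ranks.reshape(n_genes, n_arrays)

    def statistic(r):
        # Assumed form of the statistic: per-gene difference in mean rank.
        return r[:, groups == 1].mean(axis=1) - r[:, groups == 0].mean(axis=1)

    # Step 1: statistic for each gene on the observed ranks.
    observed = statistic(ranks)

    # Steps 2 and 3: permute all ranks across the whole matrix, recompute the
    # per-gene statistics, and repeat n_perm times.
    null = np.empty((n_perm, n_genes))
    flat = ranks.ravel()
    for b in range(n_perm):
        null[b] = statistic(rng.permutation(flat).reshape(n_genes, n_arrays))

    # Pool all permutation statistics into one empirical null distribution.
    pooled = np.sort(np.abs(null.ravel()))

    # Two-sided p-value: share of pooled null statistics at least as extreme,
    # with a +1 correction so that no p-value is exactly zero.
    exceed = pooled.size - np.searchsorted(pooled, np.abs(observed), side="left")
    pvalues = (exceed + 1) / (pooled.size + 1)
    return observed, pvalues
```

Calling `rank_permutation_pvalues(data, labels)` on a genes-by-arrays matrix returns one statistic and one unadjusted p-value per gene. A multiple-testing procedure would still have to be applied to those p-values, as the abstract notes.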


Similar articles

Construction of null statistics in permutation-based multiple testing for multi-factorial microarray experiments.

Bioinformatics. 2006 Jun 15;22(12):1486-94. doi: 10.1093/bioinformatics/btl109. Epub 2006 Mar 30.

A new efficient statistical test for detecting variability in the gene expression data.

Stat Methods Med Res. 2008 Aug;17(4):405-19. doi: 10.1177/0962280206078643. Epub 2007 Aug 14.

A moment-based method for estimating the proportion of true null hypotheses and its application to microarray gene expression data.

Biostatistics. 2007 Oct;8(4):744-55. doi: 10.1093/biostatistics/kxm002. Epub 2007 Jan 22.

Empirical Bayes screening of many p-values with applications to microarray studies.

Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.

An improved nonparametric approach for detecting differentially expressed genes with replicated microarray data.

Stat Appl Genet Mol Biol. 2006;5:Article30. doi: 10.2202/1544-6115.1246. Epub 2007 Jan 2.

Parametric and nonparametric FDR estimation revisited.

Biometrics. 2006 Sep;62(3):735-44. doi: 10.1111/j.1541-0420.2006.00531.x.

Testing the prediction error difference between 2 predictors.

Biostatistics. 2009 Jul;10(3):550-60. doi: 10.1093/biostatistics/kxp011. Epub 2009 Apr 20.

A nonparametric approach to the analysis of longitudinal data via a set of level crossing problems with application to the analysis of microarray time course experiments.

Biostatistics. 2005 Apr;6(2):271-8. doi: 10.1093/biostatistics/kxi008.

Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.

BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.

Cited by

Discovering monotonic stemness marker genes from time-series stem cell microarray data.

BMC Genomics. 2015;16 Suppl 2(Suppl 2):S2. doi: 10.1186/1471-2164-16-S2-S2. Epub 2015 Jan 21.

Bioinformatic approaches to metabolic pathways analysis.

Methods Mol Biol. 2011;756:99-130. doi: 10.1007/978-1-61779-160-4_5.

Biological assessment of robust noise models in microarray data analysis.

Bioinformatics. 2011 Mar 15;27(6):807-14. doi: 10.1093/bioinformatics/btr018. Epub 2011 Jan 19.
