一种用于识别微阵列数据中差异表达模式的实用错误发现率方法。

A practical false discovery rate approach to identifying patterns of differential expression in microarray data.

作者信息

Grant Gregory R, Liu Junmin, Stoeckert Christian J

机构信息

Center for Bioinformatics, University of Pennsylvania, 1429 Blockley Hall, 423 Guardian Drive, Philadelphia, PA 19104-6021, USA.

出版信息

Bioinformatics. 2005 Jun 1;21(11):2684-90. doi: 10.1093/bioinformatics/bti407. Epub 2005 Mar 29.

DOI:10.1093/bioinformatics/bti407

PMID:15797908

Abstract

Searching for differentially expressed genes is one of the most common applications for microarrays, yet statistically there are difficult hurdles to achieving adequate rigor and practicality. False discovery rate (FDR) approaches have become relatively standard; however, how to define and control the FDR has been hotly debated. Permutation estimation approaches such as SAM and PaGE can be effective; however, they leave much room for improvement. We pursue the permutation estimation method and describe a convenient definition for the FDR that can be estimated in a straightforward manner. We then discuss issues regarding the choice of statistic and data transformation. It is impossible to optimize the power of any statistic for thousands of genes simultaneously, and we look at the practical consequences of this. For example, the log transform can both help and hurt at the same time, depending on the gene. We examine issues surrounding the SAM 'fudge factor' parameter, and how to handle these issues by optimizing with respect to power.

摘要

寻找差异表达基因是微阵列最常见的应用之一，但从统计学角度来看，要达到足够的严谨性和实用性存在诸多困难。错误发现率（FDR）方法已相对标准化；然而，如何定义和控制FDR一直是激烈争论的焦点。诸如SAM和PaGE等排列估计方法可能有效；然而，它们仍有很大的改进空间。我们采用排列估计方法，并描述了一种便于定义的FDR，它可以通过直接的方式进行估计。然后我们讨论了关于统计量选择和数据转换的问题。不可能同时针对数千个基因优化任何统计量的功效，我们探讨了这一情况的实际影响。例如，对数变换可能同时产生帮助和造成损害，这取决于具体基因。我们研究了围绕SAM“调整因子”参数的问题，以及如何通过优化功效来处理这些问题。

相似文献

A practical false discovery rate approach to identifying patterns of differential expression in microarray data.

Bioinformatics. 2005 Jun 1;21(11):2684-90. doi: 10.1093/bioinformatics/bti407. Epub 2005 Mar 29.

A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data.

Bioinformatics. 2005 Dec 1;21(23):4280-8. doi: 10.1093/bioinformatics/bti685. Epub 2005 Sep 27.

Construction of null statistics in permutation-based multiple testing for multi-factorial microarray experiments.

Bioinformatics. 2006 Jun 15;22(12):1486-94. doi: 10.1093/bioinformatics/btl109. Epub 2006 Mar 30.

Estimating the false discovery rate using nonparametric deconvolution.

Biometrics. 2007 Sep;63(3):806-15. doi: 10.1111/j.1541-0420.2006.00736.x.

Multidimensional local false discovery rate for microarray studies.

Bioinformatics. 2006 Mar 1;22(5):556-65. doi: 10.1093/bioinformatics/btk013. Epub 2005 Dec 20.

Sample size for FDR-control in microarray data analysis.

Bioinformatics. 2005 Jul 15;21(14):3097-104. doi: 10.1093/bioinformatics/bti456. Epub 2005 Apr 21.

Practical FDR-based sample size calculations in microarray experiments.

Bioinformatics. 2005 Aug 1;21(15):3264-72. doi: 10.1093/bioinformatics/bti519. Epub 2005 Jun 2.

Bias in the estimation of false discovery rate in microarray studies.

Bioinformatics. 2005 Oct 15;21(20):3865-72. doi: 10.1093/bioinformatics/bti626. Epub 2005 Aug 16.

Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.

BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.

False discovery rate, sensitivity and sample size for microarray studies.

Bioinformatics. 2005 Jul 1;21(13):3017-24. doi: 10.1093/bioinformatics/bti448. Epub 2005 Apr 19.

引用本文的文献

Uncovering the complexity of childhood undernutrition through strain-level analysis of the gut microbiome.

BMC Microbiol. 2024 Mar 5;24(1):73. doi: 10.1186/s12866-024-03211-w.

Deciphering associations between three RNA splicing-related genetic variants and lung cancer risk.

NPJ Precis Oncol. 2022 Jun 30;6(1):48. doi: 10.1038/s41698-022-00281-9.

Longitudinal virological changes and underlying pathogenesis in hospitalized COVID-19 patients in Guangzhou, China.

Sci China Life Sci. 2021 Dec;64(12):2129-2143. doi: 10.1007/s11427-020-1921-5. Epub 2021 Apr 28.

Intestinal sp. Imbalance Associated With the Occurrence of Childhood Undernutrition in China.

Front Microbiol. 2019 Nov 29;10:2635. doi: 10.3389/fmicb.2019.02635. eCollection 2019.

Identification of serum exosomal microRNAs in acute spinal cord injured rats.

Exp Biol Med (Maywood). 2019 Oct;244(14):1149-1161. doi: 10.1177/1535370219872759. Epub 2019 Aug 26.

Lipocalin-Like Prostaglandin D Synthase but Not Hemopoietic Prostaglandin D Synthase Deletion Causes Hypertension and Accelerates Thrombogenesis in Mice.

J Pharmacol Exp Ther. 2018 Dec;367(3):425-432. doi: 10.1124/jpet.118.250936. Epub 2018 Oct 10.

Spatial phenotyping of the endocardial endothelium as a function of intracardiac hemodynamic shear stress.

J Biomech. 2017 Jan 4;50:11-19. doi: 10.1016/j.jbiomech.2016.11.018. Epub 2016 Nov 16.

Investigation of the functional role of human Interleukin-8 gene haplotypes by CRISPR/Cas9 mediated genome editing.

Sci Rep. 2016 Aug 8;6:31180. doi: 10.1038/srep31180.

Poly(A) code analyses reveal key determinants for tissue-specific mRNA alternative polyadenylation.

RNA. 2016 Jun;22(6):813-21. doi: 10.1261/rna.055681.115. Epub 2016 Apr 19.

Integrated Regional Cardiac Hemodynamic Imaging and RNA Sequencing Reveal Corresponding Heterogeneity of Ventricular Wall Shear Stress and Endocardial Transcriptome.

J Am Heart Assoc. 2016 Apr 18;5(4):e003170. doi: 10.1161/JAHA.115.003170.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于识别微阵列数据中差异表达模式的实用错误发现率方法。

A practical false discovery rate approach to identifying patterns of differential expression in microarray data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献