控制家族性错误率和错误发现率的功率增强型多重决策函数

POWER-ENHANCED MULTIPLE DECISION FUNCTIONS CONTROLLING FAMILY-WISE ERROR AND FALSE DISCOVERY RATES.

作者信息

Peña Edsel A, Habiger Joshua D, Wu Wensong

机构信息

University of South Carolina, Columbia, Oklahoma State University and University of South Carolina, Columbia.

出版信息

Ann Stat. 2011 Feb;39(1):556-583. doi: 10.1214/10-aos844.

DOI:10.1214/10-aos844

PMID:25018568

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4091923/

Abstract

Improved procedures, in terms of smaller missed discovery rates (MDR), for performing multiple hypotheses testing with weak and strong control of the family-wise error rate (FWER) or the false discovery rate (FDR) are developed and studied. The improvement over existing procedures such as the Šidák procedure for FWER control and the Benjamini-Hochberg (BH) procedure for FDR control is achieved by exploiting possible differences in the powers of the individual tests. Results signal the need to take into account the powers of the individual tests and to have multiple hypotheses decision functions which are not limited to simply using the individual -values, as is the case, for example, with the Šidák, Bonferroni, or BH procedures. They also enhance understanding of the role of the powers of individual tests, or more precisely the receiver operating characteristic (ROC) functions of decision processes, in the search for better multiple hypotheses testing procedures. A decision-theoretic framework is utilized, and through auxiliary randomizers the procedures could be used with discrete or mixed-type data or with rank-based nonparametric tests. This is in contrast to existing -value based procedures whose theoretical validity is contingent on each of these -value statistics being stochastically equal to or greater than a standard uniform variable under the null hypothesis. Proposed procedures are relevant in the analysis of high-dimensional "large , small " data sets arising in the natural, physical, medical, economic and social sciences, whose generation and creation is accelerated by advances in high-throughput technology, notably, but not limited to, microarray technology.

摘要

开发并研究了在控制族系错误率（FWER）或错误发现率（FDR）方面具有更强控制能力且漏检率（MDR）更小的改进程序，用于进行多重假设检验。通过利用各个检验功效的可能差异，相对于现有程序（如用于控制FWER的Šidák程序和用于控制FDR的Benjamini-Hochberg（BH）程序）实现了改进。结果表明，需要考虑各个检验的功效，并拥有不限于简单使用各个p值的多重假设决策函数，例如Šidák、Bonferroni或BH程序就是如此。它们还增进了对各个检验功效（或更准确地说是决策过程的接收者操作特征（ROC）函数）在寻找更好的多重假设检验程序中的作用的理解。利用了决策理论框架，并且通过辅助随机化器，这些程序可用于离散或混合型数据或基于秩的非参数检验。这与现有的基于p值的程序形成对比，后者的理论有效性取决于在原假设下每个p值统计量随机等于或大于标准均匀变量。所提出的程序在分析自然科学、物理科学、医学、经济学和社会科学中出现的高维“大p，小n”数据集时具有相关性，这些数据集的生成和创建因高通量技术（特别是但不限于微阵列技术）的进步而加速。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae3f/4091923/f69b0d5b3e94/nihms585692f1.jpg

相似文献

POWER-ENHANCED MULTIPLE DECISION FUNCTIONS CONTROLLING FAMILY-WISE ERROR AND FALSE DISCOVERY RATES.控制家族性错误率和错误发现率的功率增强型多重决策函数

Ann Stat. 2011 Feb;39(1):556-583. doi: 10.1214/10-aos844.

Classes of Multiple Decision Functions Strongly Controlling FWER and FDR.强控制族错误率（FWER）和错误发现率（FDR）的多重决策函数类别。

Metrika. 2015 Jul 1;78(5):563-595. doi: 10.1007/s00184-014-0516-6.

Weighted multiple hypothesis testing procedures.加权多重假设检验程序。

Stat Appl Genet Mol Biol. 2009;8(1):Article23. doi: 10.2202/1544-6115.1437. Epub 2009 Apr 16.

Resampling-based empirical Bayes multiple testing procedures for controlling generalized tail probability and expected value error rates: focus on the false discovery rate and simulation study.基于重采样的经验贝叶斯多重检验程序，用于控制广义尾概率和期望值错误率：聚焦于错误发现率及模拟研究

Biom J. 2008 Oct;50(5):716-44. doi: 10.1002/bimj.200710473.

Controlling the false discovery rate with constraints: the Newman-Keuls test revisited.带约束条件控制错误发现率：重新审视纽曼-基尔斯检验。

Biom J. 2007 Feb;49(1):136-43. doi: 10.1002/bimj.200610297.

Modifying the false discovery rate procedure based on the information theory under arbitrary correlation structure and its performance in high-dimensional genomic data.基于信息论在任意相关结构下修改错误发现率程序及其在高维基因组数据中的性能。

BMC Bioinformatics. 2024 Feb 5;25(1):57. doi: 10.1186/s12859-024-05678-w.

Multiple testing with discrete data: Proportion of true null hypotheses and two adaptive FDR procedures.离散数据的多重检验：真零假设的比例及两种自适应错误发现率程序

Biom J. 2018 Jul;60(4):761-779. doi: 10.1002/bimj.201700157. Epub 2018 May 11.

SOME STEP-DOWN PROCEDURES CONTROLLING THE FALSE DISCOVERY RATE UNDER DEPENDENCE.一些在相依性下控制错误发现率的逐步降阶程序。

Stat Sin. 2008;18(3):881-904.

Gaining power in multiple testing of interval hypotheses via conditionalization.通过条件化在区间假设的多重检验中获得权力。

Biostatistics. 2020 Apr 1;21(2):e65-e79. doi: 10.1093/biostatistics/kxy042.

A generalized Sidak-Holm procedure and control of generalized error rates under independence.一种广义的西达克 - 霍尔姆方法及独立条件下广义错误率的控制。

Stat Appl Genet Mol Biol. 2007;6:Article3. doi: 10.2202/1544-6115.1247. Epub 2007 Jan 25.

引用本文的文献

False Discovery Rate Control for Lesion-Symptom Mapping With Heterogeneous Data via Weighted p-Values.基于加权 p 值的异质数据病变-症状映射的假发现率控制。

Biom J. 2024 Sep;66(6):e202300198. doi: 10.1002/bimj.202300198.

Mapping potential pathways from polygenic liability through brain structure to psychological problems across the transition to adolescence.绘制从多基因风险通过大脑结构到青少年过渡期间心理问题的潜在途径图。

J Child Psychol Psychiatry. 2024 Aug;65(8):1047-1060. doi: 10.1111/jcpp.13944. Epub 2024 Jan 7.

Sex-specific analysis of traumatic brain injury events: applying computational and data visualization techniques to inform prevention and management.创伤性脑损伤事件的性别特异性分析：应用计算和数据可视化技术为预防和管理提供信息。

BMC Med Res Methodol. 2022 Jan 30;22(1):30. doi: 10.1186/s12874-021-01493-6.

Weighted False Discovery Rate Control in Large-Scale Multiple Testing.大规模多重检验中的加权错误发现率控制

J Am Stat Assoc. 2018;113(523):1172-1183. doi: 10.1080/01621459.2017.1336443. Epub 2018 Jun 12.

Weighted mining of massive collections of [Formula: see text]-values by convex optimization.通过凸优化对大量[公式：见文本]值集合进行加权挖掘。

Inf inference. 2018 Jun;7(2):251-275. doi: 10.1093/imaiai/iax013. Epub 2017 Dec 8.

Data-driven hypothesis weighting increases detection power in genome-scale multiple testing.数据驱动的假设加权提高了基因组规模多重检验中的检测能力。

Nat Methods. 2016 Jul;13(7):577-80. doi: 10.1038/nmeth.3885. Epub 2016 May 30.

Optimal multiple testing under a Gaussian prior on the effect sizes.效应量的高斯先验下的最优多重检验。

Biometrika. 2015 Dec;102(4):753-766. doi: 10.1093/biomet/asv050. Epub 2015 Nov 4.

Classes of Multiple Decision Functions Strongly Controlling FWER and FDR.强控制族错误率（FWER）和错误发现率（FDR）的多重决策函数类别。

Metrika. 2015 Jul 1;78(5):563-595. doi: 10.1007/s00184-014-0516-6.

Bayes multiple decision functions.贝叶斯多重决策函数。

Electron J Stat. 2013;7(1):1272-1300. doi: 10.1214/13-EJS813.

Compound -value statistics for multiple testing procedures.多重检验程序的复合值统计

J Multivar Anal. 2014 Apr 1;126:153-166. doi: 10.1016/j.jmva.2014.01.007.

本文引用的文献

Randomised -values and nonparametric procedures in multiple testing.多重检验中的随机化值和非参数方法。

J Nonparametr Stat. 2011;23(3):583-604. doi: 10.1080/10485252.2010.482154.

A Bayesian Discovery Procedure.一种贝叶斯发现程序。

J R Stat Soc Series B Stat Methodol. 2009 Nov 1;71(5):905-925. doi: 10.1111/j.1467-9868.2009.00714.x.

Weighted multiple hypothesis testing procedures.加权多重假设检验程序。

Stat Appl Genet Mol Biol. 2009;8(1):Article23. doi: 10.2202/1544-6115.1437. Epub 2009 Apr 16.

Biom J. 2008 Oct;50(5):716-44. doi: 10.1002/bimj.200710473.

Multiple testing with minimal assumptions.在最小假设条件下进行多重检验。

Biom J. 2008 Oct;50(5):745-55. doi: 10.1002/bimj.200710456.

A method to increase the power of multiple testing procedures through sample splitting.一种通过样本分割提高多重检验程序功效的方法。

Stat Appl Genet Mol Biol. 2006;5:Article19. doi: 10.2202/1544-6115.1148. Epub 2006 Aug 1.

The optimal discovery procedure for large-scale significance testing, with applications to comparative microarray experiments.大规模显著性检验的最优发现程序及其在比较微阵列实验中的应用

Biostatistics. 2007 Apr;8(2):414-32. doi: 10.1093/biostatistics/kxl019. Epub 2006 Aug 23.

Using prior information to allocate significance levels for multiple endpoints.利用先验信息为多个终点分配显著性水平。

Stat Med. 1998 Sep 30;17(18):2107-19. doi: 10.1002/(sici)1097-0258(19980930)17:18<2107::aid-sim910>3.0.co;2-w.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验