• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于经验贝叶斯和重采样的多重检验程序,用于控制假阳性比例的尾概率。

Empirical Bayes and resampling based multiple testing procedure controlling tail probability of the proportion of false positives.

作者信息

van der Laan Mark J, Birkner Merrill D, Hubbard Alan E

机构信息

Division of Biostatistics, School of Public Health, University of California, Berkeley, USA.

出版信息

Stat Appl Genet Mol Biol. 2005;4:Article29. doi: 10.2202/1544-6115.1143. Epub 2005 Oct 7.

DOI:10.2202/1544-6115.1143
PMID:16646847
Abstract

Simultaneously testing a collection of null hypotheses about a data generating distribution based on a sample of independent and identically distributed observations is a fundamental and important statistical problem involving many applications. In this article we propose a new re-sampling based multiple testing procedure asymptotically controlling the probability that the proportion of false positives among the set of rejections exceeds q at level alpha, where q and alpha are user supplied numbers. The procedure involves 1) specifying a conditional distribution for a guessed set of true null hypotheses, given the data, which asymptotically is degenerate at the true set of null hypotheses, and 2) specifying a generally valid null distribution for the vector of test-statistics proposed in Pollard & van der Laan (2003), and generalized in our subsequent article Dudoit, van der Laan, & Pollard (2004), van der Laan, Dudoit, & Pollard (2004), and van der Laan, Dudoit, & Pollard (2004b). Ingredient 1) is established by fitting the empirical Bayes two component mixture model (Efron (2001b)) to the data to obtain an upper bound for marginal posterior probabilities of the null being true, given the data. We establish the finite sample rational behind our proposal, and prove that this new multiple testing procedure asymptotically controls the wished tail probability for the proportion of false positives under general data generating distributions. In addition, we provide simulation studies establishing that this method is generally more powerful in finite samples than our previously proposed augmentation multiple testing procedure (van der Laan, Dudoit, & Pollard (2004b)) and competing procedures from the literature. Finally, we illustrate our methodology with a data analysis.

摘要

基于独立同分布观测样本,同时检验关于数据生成分布的一组原假设,是一个涉及许多应用的基本且重要的统计问题。在本文中,我们提出了一种基于重采样的新多重检验程序,该程序能在渐近意义下控制拒绝集中假阳性比例超过q的概率,其水平为α,其中q和α是用户给定的数值。该程序包括:1)给定数据,为一组猜测的真原假设指定一个条件分布,在渐近意义下,该分布在真原假设集处退化;2)为Pollard和van der Laan(2003)中提出并在我们后续文章Dudoit、van der Laan和Pollard(2004)、van der Laan、Dudoit和Pollard(2004)以及van der Laan、Dudoit和Pollard(2004b)中推广的检验统计量向量指定一个普遍有效的原分布。成分1)是通过将经验贝叶斯双成分混合模型(Efron(2001b))拟合到数据,以获得给定数据时原假设为真的边际后验概率的上界来建立的。我们确立了我们提议背后的有限样本合理性,并证明了这种新的多重检验程序在一般数据生成分布下渐近地控制了假阳性比例的期望尾概率。此外,我们提供了模拟研究,证明该方法在有限样本中通常比我们之前提出的增强多重检验程序(van der Laan、Dudoit和Pollard(2004b))以及文献中的竞争程序更具功效。最后,我们用一个数据分析来说明我们的方法。

相似文献

1
Empirical Bayes and resampling based multiple testing procedure controlling tail probability of the proportion of false positives.基于经验贝叶斯和重采样的多重检验程序,用于控制假阳性比例的尾概率。
Stat Appl Genet Mol Biol. 2005;4:Article29. doi: 10.2202/1544-6115.1143. Epub 2005 Oct 7.
2
Quantile-function based null distribution in resampling based multiple testing.基于重采样的多重检验中基于分位数函数的零分布。
Stat Appl Genet Mol Biol. 2006;5:Article14. doi: 10.2202/1544-6115.1199. Epub 2006 May 21.
3
Augmentation procedures for control of the generalized family-wise error rate and tail probabilities for the proportion of false positives.用于控制广义族错误率和假阳性比例的尾概率的增强程序。
Stat Appl Genet Mol Biol. 2004;3:Article15. doi: 10.2202/1544-6115.1042. Epub 2004 Jun 15.
4
Multiple testing. Part I. Single-step procedures for control of general type I error rates.多重检验。第一部分。控制一般I型错误率的单步程序。
Stat Appl Genet Mol Biol. 2004;3:Article13. doi: 10.2202/1544-6115.1040. Epub 2004 Jun 9.
5
Multiple testing and data adaptive regression: an application to HIV-1 sequence data.多重检验与数据自适应回归:在HIV-1序列数据中的应用
Stat Appl Genet Mol Biol. 2005;4:Article8. doi: 10.2202/1544-6115.1110. Epub 2005 Apr 18.
6
A method to increase the power of multiple testing procedures through sample splitting.一种通过样本分割提高多重检验程序功效的方法。
Stat Appl Genet Mol Biol. 2006;5:Article19. doi: 10.2202/1544-6115.1148. Epub 2006 Aug 1.
7
Resampling-based empirical Bayes multiple testing procedures for controlling generalized tail probability and expected value error rates: focus on the false discovery rate and simulation study.基于重采样的经验贝叶斯多重检验程序,用于控制广义尾概率和期望值错误率:聚焦于错误发现率及模拟研究
Biom J. 2008 Oct;50(5):716-44. doi: 10.1002/bimj.200710473.
8
Wavelet thresholding with bayesian false discovery rate control.采用贝叶斯错误发现率控制的小波阈值处理
Biometrics. 2005 Mar;61(1):25-35. doi: 10.1111/j.0006-341X.2005.031102.x.
9
Multiple testing. Part II. Step-down procedures for control of the family-wise error rate.多重检验。第二部分。用于控制族系错误率的逐步下调程序。
Stat Appl Genet Mol Biol. 2004;3:Article14. doi: 10.2202/1544-6115.1041. Epub 2004 Jun 14.
10
Semi-parametric differential expression analysis via partial mixture estimation.通过部分混合估计进行半参数差异表达分析。
Stat Appl Genet Mol Biol. 2008;7(1):Article15. doi: 10.2202/1544-6115.1333. Epub 2008 Apr 28.

引用本文的文献

1
New metrics for multiple testing with correlated outcomes.用于相关结果多重检验的新指标。
Front Appl Math Stat. 2023;9. doi: 10.3389/fams.2023.1151314. Epub 2023 May 1.
2
Genome-Wide DNA Methylation in Peripheral Blood and Long-Term Exposure to Source-Specific Transportation Noise and Air Pollution: The SAPALDIA Study.基于外周血的全基因组 DNA 甲基化与特定来源交通噪声和空气污染的长期暴露:SAPALDIA 研究。
Environ Health Perspect. 2020 Jun;128(6):67003. doi: 10.1289/EHP6174. Epub 2020 Jun 1.
3
Targeted deletion of NFAT-Interacting-Protein-(NIP) 45 resolves experimental asthma by inhibiting Innate Lymphoid Cells group 2 (ILC2).
靶向删除 NFAT 相互作用蛋白(NIP)45 通过抑制固有淋巴细胞 2 组(ILC2)来解决实验性哮喘。
Sci Rep. 2019 Oct 30;9(1):15695. doi: 10.1038/s41598-019-51690-z.
4
DNA Methylation in Inflammatory Pathways Modifies the Association between BMI and Adult-Onset Non-Atopic Asthma.炎症通路中的 DNA 甲基化改变了 BMI 与成人非特应性哮喘之间的关联。
Int J Environ Res Public Health. 2019 Feb 19;16(4):600. doi: 10.3390/ijerph16040600.
5
Neural correlates of trial-to-trial fluctuations in response caution.反应谨慎中试次间波动的神经关联。
J Neurosci. 2011 Nov 30;31(48):17488-95. doi: 10.1523/JNEUROSCI.2924-11.2011.
6
Testing for associations with missing high-dimensional categorical covariates.检验与缺失的高维分类协变量的关联。
Int J Biostat. 2008 Sep 29;4(1):Article 18. doi: 10.2202/1557-4679.1102.
7
Detecting intergene correlation changes in microarray analysis: a new approach to gene selection.检测微阵列分析中的基因间相关性变化:一种新的基因选择方法。
BMC Bioinformatics. 2009 Jan 15;10:20. doi: 10.1186/1471-2105-10-20.
8
Resampling-based empirical Bayes multiple testing procedures for controlling generalized tail probability and expected value error rates: focus on the false discovery rate and simulation study.基于重采样的经验贝叶斯多重检验程序,用于控制广义尾概率和期望值错误率:聚焦于错误发现率及模拟研究
Biom J. 2008 Oct;50(5):716-44. doi: 10.1002/bimj.200710473.