Han Seungbong, Andrei Adin-Cristian, Tsui Kam-Wah
Department of Statistics, University of Wisconsin-Madison, Medical Science Center 1300 University Avenue, Madison, WI 53706, USA.
Biom J. 2010 Apr;52(2):222-32. doi: 10.1002/bimj.200900177.
When drawing large-scale simultaneous inference, such as in genomics and imaging problems, multiplicity adjustments should be made, since, otherwise, one would be faced with an inflated type I error. Numerous methods are available to estimate the proportion of true null hypotheses pi(0), among a large number of hypotheses tested. Many methods implicitly assume that the pi(0) is large, that is, close to 1. However, in practice, mid-range pi(0) values are frequently encountered and many of the widely used methods tend to produce highly variable or biased estimates of pi(0). As a remedy in such situations, we propose a hierarchical Bayesian model that produces an estimator of pi(0) that exhibits considerably less bias and is more stable. Simulation studies seem indicative of good method performance even when low-to-moderate correlation exists among test statistics. Method performance is assessed in simulated settings and its practical usefulness is illustrated in an application to a type II diabetes study.
在进行大规模同时推断时,例如在基因组学和成像问题中,应该进行多重性调整,因为否则会面临第一类错误膨胀的问题。有许多方法可用于估计在大量检验的假设中真零假设(\pi(0))的比例。许多方法隐含地假设(\pi(0))很大,即接近1。然而,在实际中,经常会遇到中等范围的(\pi(0))值,并且许多广泛使用的方法往往会产生高度可变或有偏差的(\pi(0))估计值。作为这种情况的补救措施,我们提出了一种分层贝叶斯模型,该模型产生的(\pi(0))估计量偏差明显较小且更稳定。模拟研究表明,即使检验统计量之间存在低到中等程度的相关性,该方法也具有良好的性能。在模拟环境中评估了方法性能,并在一项II型糖尿病研究的应用中说明了其实用性。