Su Ting-Li, Glimm Ekkehard, Whitehead John, Branson Mike
Medical and Pharmaceutical Statistics Research Unit, Department of Mathematics and Statistics, Lancaster University, Lancaster, UK.
Pharm Stat. 2012 Mar-Apr;11(2):107-17. doi: 10.1002/pst.504. Epub 2012 Feb 15.
The issues and dangers involved in testing multiple hypotheses are well recognised within the pharmaceutical industry. In reporting clinical trials, strenuous efforts are taken to avoid the inflation of type I error, with procedures such as the Bonferroni adjustment and its many elaborations and refinements being widely employed. Typically, such methods are conservative. They tend to be accurate if the multiple test statistics involved are mutually independent and achieve less than the type I error rate specified if these statistics are positively correlated. An alternative approach is to estimate the correlations between the test statistics and to perform a test that is conditional on those estimates being the true correlations. In this paper, we begin by assuming that test statistics are normally distributed and that their correlations are known. Under these circumstances, we explore several approaches to multiple testing, adapt them so that type I error is preserved exactly and then compare their powers over a range of true parameter values. For simplicity, the explorations are confined to the bivariate case. Having described the relative strengths and weaknesses of the approaches under study, we use simulation to assess the accuracy of the approximate theory developed when the correlations are estimated from the study data rather than being known in advance and when data are binary so that test statistics are only approximately normally distributed.
制药行业充分认识到了检验多个假设所涉及的问题和风险。在报告临床试验时,人们会做出巨大努力来避免第一类错误的膨胀,广泛采用诸如邦费罗尼校正及其众多改进方法等程序。通常,这些方法较为保守。如果所涉及的多个检验统计量相互独立,它们往往是准确的;而如果这些统计量呈正相关,它们实现的第一类错误率会低于规定值。另一种方法是估计检验统计量之间的相关性,并基于这些估计值为真实相关性进行检验。在本文中,我们首先假设检验统计量呈正态分布且其相关性已知。在这种情况下,我们探索多种多重检验方法,对其进行调整以使第一类错误率得到精确保持,然后在一系列真实参数值范围内比较它们的检验效能。为了简便起见,探索仅限于双变量情形。在描述了所研究方法的相对优缺点之后,我们通过模拟来评估当从研究数据中估计相关性而非事先已知相关性,以及数据为二元数据从而检验统计量仅近似呈正态分布时所发展的近似理论的准确性。