Berger R L, Sidik K
Statistics Department, North Carolina State University, Raleigh, NC 27695-8203, USA.
Stat Methods Med Res. 2003 Mar;12(2):91-108. doi: 10.1191/0962280203sm312ra.
The problem of comparing two proportions in a 2 x 2 matched-pairs design with binary responses is considered. We consider one-sided null and alternative hypotheses. The problem has two nuisance parameters. Using the monotonicity of the multinomial distribution, four exact unconditional tests based on p-values are proposed by reducing the dimension of the nuisance parameter space from two to one in computation. The size and power of the four exact tests and two other tests, the exact conditional binomial test and the asymptotic McNemar's test, are considered. It is shown that the tests based on the confidence interval p-value are more powerful than the tests based on the standard p-value. In addition, it is found that the exact conditional binomial test is conservative and not powerful for testing the hypothesis. Moreover, the asymptotic McNemar's test is shown to have incorrect size; that is, its size is larger than the nominal level of the test. Overall, the test based on McNemar's statistic and the confidence interval p-value is found to be the most powerful test with the correct size among the tests in this comparison.
考虑在具有二元响应的2×2匹配对设计中比较两个比例的问题。我们考虑单侧原假设和备择假设。该问题有两个干扰参数。利用多项分布的单调性,通过在计算中将干扰参数空间的维度从二维降至一维,提出了基于p值的四个精确无条件检验。考虑了这四个精确检验以及另外两个检验(精确条件二项式检验和渐近McNemar检验)的大小和功效。结果表明,基于置信区间p值的检验比基于标准p值的检验更具功效。此外,发现精确条件二项式检验对于检验该假设是保守的且功效不足。而且,渐近McNemar检验的大小被证明是不正确的;即,其大小大于检验的名义水平。总体而言,在该比较中的检验中,基于McNemar统计量和置信区间p值的检验被发现是具有正确大小且最具功效的检验。