Tian Lili, Cappelleri Joseph C
Department of Statistics, Biostatistics Division, University of Florida, Gainesville 32610, USA.
Stat Med. 2004 Jul 15;23(13):2125-35. doi: 10.1002/sim.1782.
We consider the problems of interval estimation and hypothesis testing for the intraclass correlation coefficient in an interrater reliability study when both raters and subjects are assumed to be randomly selected from populations of raters and subjects, respectively. We propose a novel approach for the confidence interval estimation and hypothesis testing using the concepts of generalized confidence interval (GCI) and generalized P-values. A simulation study is conducted to investigate the coverage probabilities of the GCI approach relative to the modified large sample (MLS) approach. Both methods tend to provide somewhat conservative coverage. Relative to the MLS approach, the GCI approach is closer to the correct (nominal) coverage for a two-sided interval, but farther to the correct coverage for a one-sided lower interval. Unlike the MLS approach, the GCI approach can also easily provide P-values. The fact that the GCI approach is suitable for confidence interval estimation and obtaining P-values makes the GCI approach a suitable candidate for making inference about interrater reliability.
在评分者信度研究中,当假定评分者和受试者分别从评分者总体和受试者总体中随机选取时,我们考虑组内相关系数的区间估计和假设检验问题。我们提出一种使用广义置信区间(GCI)和广义P值概念进行置信区间估计和假设检验的新方法。进行了一项模拟研究,以调查GCI方法相对于修正大样本(MLS)方法的覆盖概率。两种方法往往都提供了较为保守的覆盖范围。相对于MLS方法,GCI方法对于双侧区间更接近正确(名义)覆盖范围,但对于单侧下限区间则离正确覆盖范围更远。与MLS方法不同,GCI方法还可以轻松提供P值。GCI方法适用于置信区间估计和获得P值这一事实,使得GCI方法成为对评分者信度进行推断的合适候选方法。