Chen Guanmin, Faris Peter, Hemmelgarn Brenda, Walker Robin L, Quan Hude
Department of Community Health Sciences, University of Calgary, Calgary, Alberta, Canada.
BMC Med Res Methodol. 2009 Jan 21;9:5. doi: 10.1186/1471-2288-9-5.
Kappa is commonly used when assessing the agreement of conditions with reference standard, but has been criticized for being highly dependent on the prevalence. To overcome this limitation, a prevalence-adjusted and bias-adjusted kappa (PABAK) has been developed. The purpose of this study is to demonstrate the performance of Kappa and PABAK, and assess the agreement between hospital discharge administrative data and chart review data conditions.
The agreement was compared for random sampling, restricted sampling by conditions, and case-control sampling from the four teaching hospitals in Alberta, Canada from ICD10 administrative data during January 1, 2003 and June 30, 2003. A total of 4,008 hospital discharge records and chart view, linked for personal unique identifier and admission date, for 32 conditions of random sampling were analyzed. The restricted sample for hypertension, myocardial infarction and congestive heart failure, and case-control sample for those three conditions were extracted from random sample. The prevalence, kappa, PABAK, positive agreement, negative agreement for the condition was compared for each of three samples.
The prevalence of each condition was highly dependent on the sampling method, and this variation in prevalence had a significant effect on both kappa and PABAK. PABAK values were obviously high for certain conditions with low kappa values. The gap between these two statistical values for the same condition narrowed as the prevalence of the condition approached 50%.
Kappa values varied more widely than PABAK values across the 32 conditions. PABAK values should usually not be interpreted as measuring the same agreement as kappa in administrative data, particular for the condition with low prevalence. There is no single statistic measuring agreement that captures the desired information for validity of administrative data. Researchers should report kappa, the prevalence, positive agreement, negative agreement, and the relative frequency in each cell (i.e. a, b, c and d) to enable the reader to judge the validity of administrative data from multiple aspects.
在评估条件与参考标准的一致性时,kappa常用于此,但因其高度依赖患病率而受到批评。为克服这一局限性,已开发出患病率调整和偏差调整kappa(PABAK)。本研究的目的是展示kappa和PABAK的性能,并评估医院出院管理数据与病历审查数据条件之间的一致性。
对2003年1月1日至2003年6月30日期间加拿大艾伯塔省四家教学医院的ICD10管理数据进行随机抽样、按条件限制抽样和病例对照抽样,并比较其一致性。分析了总共4008份医院出院记录和病历视图,这些记录通过个人唯一标识符和入院日期进行关联,涉及32种随机抽样条件。从随机样本中提取高血压、心肌梗死和充血性心力衰竭的限制样本以及这三种疾病的病例对照样本。比较了三个样本中每种疾病的患病率、kappa、PABAK、阳性一致性和阴性一致性。
每种疾病的患病率高度依赖于抽样方法,患病率的这种变化对kappa和PABAK均有显著影响。对于某些kappa值较低的疾病,PABAK值明显较高。当疾病患病率接近50%时,同一疾病的这两个统计值之间的差距缩小。
在32种疾病中,kappa值的变化比PABAK值更为广泛。PABAK值通常不应被解释为与管理数据中的kappa测量相同的一致性,特别是对于患病率较低的疾病。没有单一的统计量来测量一致性,能够获取管理数据有效性所需的信息。研究人员应报告kappa、患病率、阳性一致性、阴性一致性以及每个单元格中的相对频率(即a、b、c和d),以便读者能够从多个方面判断管理数据的有效性。