作为一致性度量的科恩kappa系数的精确单侧置信限。

Exact one-sided confidence limits for Cohen's kappa as a measurement of agreement.

作者信息

Shan Guogen, Wang Weizhen

机构信息

1 Epidemiology and Biostatistics Program, Department of Environmental and Occupational Health, School of Community Health Sciences, University of Nevada Las Vegas, Las Vegas, USA.

2 College of Applied Sciences, Beijing University of Technology, Beijing, PR China.

出版信息

Stat Methods Med Res. 2017 Apr;26(2):615-632. doi: 10.1177/0962280214552881. Epub 2014 Oct 6.

DOI:10.1177/0962280214552881

PMID:25288510

Abstract

Cohen's kappa coefficient, κ, is a statistical measure of inter-rater agreement or inter-annotator agreement for qualitative items. In this paper, we focus on interval estimation of κ in the case of two raters and binary items. So far, only asymptotic and bootstrap intervals are available for κ due to its complexity. However, there is no guarantee that such intervals will capture κ with the desired nominal level 1- α. In other words, the statistical inferences based on these intervals are not reliable. We apply the Buehler method to obtain exact confidence intervals based on four widely used asymptotic intervals, three Wald-type confidence intervals and one interval constructed from a profile variance. These exact intervals are compared with regard to coverage probability and length for small to medium sample sizes. The exact intervals based on the Garner interval and the Lee and Tu interval are generally recommended for use in practice due to good performance in both coverage probability and length.

摘要

科恩kappa系数κ是用于定性项目的评分者间一致性或注释者间一致性的一种统计量度。在本文中，我们聚焦于两名评分者和二元项目情形下κ的区间估计。到目前为止，由于κ的复杂性，仅有渐近区间和自助法区间可用于κ。然而，无法保证此类区间能以期望的名义水平1-α包含κ。换句话说，基于这些区间的统计推断并不可靠。我们应用比勒方法，基于四个广泛使用的渐近区间、三个瓦尔德型置信区间以及一个由轮廓方差构建的区间来获得精确置信区间。针对中小样本量，对这些精确区间在覆盖概率和区间长度方面进行了比较。基于加纳区间以及李和涂区间的精确区间，因其在覆盖概率和区间长度方面均表现良好，通常建议在实际中使用。

相似文献

Exact one-sided confidence limits for Cohen's kappa as a measurement of agreement.

Stat Methods Med Res. 2017 Apr;26(2):615-632. doi: 10.1177/0962280214552881. Epub 2014 Oct 6.

Interval estimation for Cohen's kappa as a measure of agreement.

Stat Med. 2000 Mar 15;19(5):723-41. doi: 10.1002/(sici)1097-0258(20000315)19:5<723::aid-sim379>3.0.co;2-a.

Homogeneity score test of AC statistics and estimation of common AC in multiple or stratified inter-rater agreement studies.

BMC Med Res Methodol. 2020 Feb 5;20(1):20. doi: 10.1186/s12874-019-0887-5.

Exact one-sided confidence limits for the difference between two correlated proportions.

Stat Med. 2007 Aug 15;26(18):3369-84. doi: 10.1002/sim.2708.

Measuring inter-rater reliability for nominal data - which coefficients and confidence intervals are appropriate?

BMC Med Res Methodol. 2016 Aug 5;16:93. doi: 10.1186/s12874-016-0200-9.

Confidence intervals based on some weighting functions for the difference of two binomial proportions.

Stat Med. 2014 Jun 15;33(13):2288-96. doi: 10.1002/sim.6147. Epub 2014 Mar 19.

Fully specified bootstrap confidence intervals for the difference of two independent binomial proportions based on the median unbiased estimator.

Stat Med. 2009 Oct 15;28(23):2876-90. doi: 10.1002/sim.3670.

A comparison of confidence interval methods for the intraclass correlation coefficient in community-based cluster randomization trials with a binary outcome.

Clin Trials. 2016 Apr;13(2):180-7. doi: 10.1177/1740774515606377. Epub 2015 Sep 28.

Confidence intervals for the difference between independent binomial proportions: comparison using a graphical approach and moving averages.

Pharm Stat. 2014 Sep-Oct;13(5):294-308. doi: 10.1002/pst.1631. Epub 2014 Aug 27.

Efficient confidence limits for adaptive one-arm two-stage clinical trials with binary endpoints.

BMC Med Res Methodol. 2017 Feb 6;17(1):22. doi: 10.1186/s12874-017-0297-5.

引用本文的文献

Response adaptive randomization design for a two-stage study with binary response.

J Biopharm Stat. 2023 Sep 3;33(5):575-585. doi: 10.1080/10543406.2023.2170399. Epub 2023 Feb 3.

New Confidence Intervals for Relative Risk of Two Correlated Proportions.

Stat Biosci. 2023;15(1):1-30. doi: 10.1007/s12561-022-09345-7. Epub 2022 May 20.

Correlation Coefficients for a Study with Repeated Measures.

Comput Math Methods Med. 2020 Mar 26;2020:7398324. doi: 10.1155/2020/7398324. eCollection 2020.

Two-stage optimal designs with survival endpoint when the follow-up time is restricted.

BMC Med Res Methodol. 2019 Apr 3;19(1):74. doi: 10.1186/s12874-019-0696-x.

Statistical advances in clinical trials and clinical research.

Alzheimers Dement (N Y). 2018 Jun 14;4:366-371. doi: 10.1016/j.trci.2018.04.006. eCollection 2018.

Fisher's exact approach for post hoc analysis of a chi-squared test.

PLoS One. 2017 Dec 20;12(12):e0188709. doi: 10.1371/journal.pone.0188709. eCollection 2017.

Efficient Noninferiority Testing Procedures for Simultaneously Assessing Sensitivity and Specificity of Two Diagnostic Tests.

Comput Math Methods Med. 2015;2015:128930. doi: 10.1155/2015/128930. Epub 2015 Aug 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

作为一致性度量的科恩kappa系数的精确单侧置信限。

Exact one-sided confidence limits for Cohen's kappa as a measurement of agreement.

作者信息

Shan Guogen, Wang Weizhen

机构信息

1 Epidemiology and Biostatistics Program, Department of Environmental and Occupational Health, School of Community Health Sciences, University of Nevada Las Vegas, Las Vegas, USA.

2 College of Applied Sciences, Beijing University of Technology, Beijing, PR China.

出版信息

Stat Methods Med Res. 2017 Apr;26(2):615-632. doi: 10.1177/0962280214552881. Epub 2014 Oct 6.

DOI:10.1177/0962280214552881

PMID:25288510

Abstract

摘要

作为一致性度量的科恩kappa系数的精确单侧置信限。

Exact one-sided confidence limits for Cohen's kappa as a measurement of agreement.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

作为一致性度量的科恩kappa系数的精确单侧置信限。

Exact one-sided confidence limits for Cohen's kappa as a measurement of agreement.

作者信息

机构信息

出版信息

相似文献

引用本文的文献