Suppr超能文献

组织形态学诊断的可重复性:特别提及kappa统计量

Reproducibility of histomorphologic diagnoses with special reference to the kappa statistic.

作者信息

Svanholm H, Starklint H, Gundersen H J, Fabricius J, Barlebo H, Olsen S

机构信息

Dept. of Pathology, University Hospital, Odense, Denmark.

出版信息

APMIS. 1989 Aug;97(8):689-98. doi: 10.1111/j.1699-0463.1989.tb00464.x.

Abstract

Systems for classification and grading used in pathology should ideally be biologically meaningful and at least be reproducible from one pathologist to another. A statistical method to evaluate reproducibility (non-chance agreement) for several observers using nominal or ordinal categories has been developed and refined over the past few decades--the kappa statistic. A high level of observed agreement among different pathologists can either signify a high level of reproducibility, if agreement by chance is low, or express a low level of reproducibility, if agreement by chance is almost as high as the observed agreement. Therefore, the observed agreement says nothing in itself, unless it is low. The kappa value, however, indicates how much better the observers are compared to a throw of the dice, and therefore gives the real credit to the agreement which was found. We have developed a user-friendly computer program for calculating inter- and intra-observer agreement of 2 or more observers. By calculating associations between different categories and different observers, the statistic furthermore obtains a function close to the parameter of accuracy. We recommend the use of the above method before a set of nominal or rank scale parameters are used for deciding prognosis and treatment of patients. By submitting a diskette the computer program will be available at no cost.

摘要

病理学中使用的分类和分级系统理想情况下应具有生物学意义,并且至少在不同病理学家之间具有可重复性。在过去几十年中,已经开发并完善了一种用于评估多名观察者使用名义或有序类别时的可重复性(非偶然一致性)的统计方法——kappa统计量。如果偶然一致性较低,不同病理学家之间观察到的高度一致性可能表示高度的可重复性;如果偶然一致性几乎与观察到的一致性一样高,则表示可重复性较低。因此,观察到的一致性本身并不能说明什么,除非它很低。然而,kappa值表明观察者比掷骰子的情况要好多少,因此真正体现了所发现的一致性。我们开发了一个用户友好的计算机程序,用于计算两名或更多观察者之间以及观察者内部的一致性。通过计算不同类别和不同观察者之间的关联,该统计量还获得了一个接近准确性参数的函数。我们建议在使用一组名义或等级量表参数来决定患者的预后和治疗之前使用上述方法。通过提交一张软盘,该计算机程序将免费提供。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验