Carlsson M, Löfström L, Ahlfeldt H
Department of Biomedical Engineering, Medical Informatics, Linköping University, Sweden.
J Med Syst. 2001 Feb;25(1):47-61. doi: 10.1023/a:1005636432502.
This paper reports a study of the reliability of coding surgical procedures in the domain of thoracic surgery. Reliability is measured as inter-coder variability, expressed as agreement. Four physicians applied four classifications to 100 patient cases. The classifications, which differ in granularity and structure, were analyzed using a statistical method (kappa). The results are discussed in relation to the differences between the classifications. One topic of discussion is how granularity affects the degree of agreement, and how this relates to the usefulness of a classification. The use of formal methods for representing classifications is also discussed, along with how this would affect the design and use of classifications.
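The abstract does not say which kappa variant was used. As a minimal sketch, assuming a multi-rater setting like the one described (several coders assigning one code per case), Fleiss' kappa is one common choice and can be computed as below. The function name `fleiss_kappa` and the codes "P1" and "P2" are hypothetical and purely illustrative; they are not data or methods from the study.

```python
from collections import Counter


def fleiss_kappa(ratings):
    """Fleiss' kappa for a list of cases, each a list with one code per rater.

    Assumes every case is coded by the same number of raters (at least two).
    """
    n_cases = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(case) != n_raters for case in ratings):
        raise ValueError("each case needs the same number (>= 2) of ratings")

    category_totals = Counter()
    agreement_sum = 0.0
    for case in ratings:
        counts = Counter(case)
        category_totals.update(counts)
        # Share of agreeing rater pairs for this case.
        agreeing = sum(c * c for c in counts.values()) - n_raters
        agreement_sum += agreeing / (n_raters * (n_raters - 1))

    observed = agreement_sum / n_cases
    total = n_cases * n_raters
    # Agreement expected by chance from the overall code frequencies.
    expected = sum((c / total) ** 2 for c in category_totals.values())
    if expected == 1.0:
        return float("nan")  # only one code ever used: kappa is undefined
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy data: 3 cases, 4 coders, made-up codes.
toy = [
    ["P1", "P1", "P1", "P1"],
    ["P2", "P2", "P2", "P2"],
    ["P1", "P1", "P2", "P2"],
]
print(round(fleiss_kappa(toy), 3))
```

A finer-grained classification spreads ratings over more codes, which tends to lower observed agreement. Under this sketch, that is the mechanism by which granularity would influence kappa.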