Velupillai Sumithra, Dalianis Hercules, Kvist Maria
Dept. of Computer and Systems Sciences (DSV), Stockholm University, Forum 100, SE-164 40 Kista, Sweden.
Stud Health Technol Inform. 2011;169:559-63.
Different levels of knowledge certainty, or factuality levels, are expressed in clinical health record documentation. This information is currently not fully exploited, as the subtleties expressed in natural language cannot easily be machine analyzed. Extracting relevant information from knowledge-intensive resources such as electronic health records can be used for improving health care in general by e.g. building automated information access systems. We present an annotation model of six factuality levels linked to diagnoses in Swedish clinical assessments from an emergency ward. Our main findings are that overall agreement is fairly high (0.7/0.58 F-measure, 0.73/0.6 Cohen's κ, Intra/Inter). These distinctions are important for knowledge models, since only approx. 50% of the diagnoses are affirmed with certainty. Moreover, our results indicate that there are patterns inherent in the diagnosis expressions themselves conveying factuality levels, showing that certainty is not only dependent on context cues.
临床健康记录文档中表达了不同程度的知识确定性,即事实性水平。目前这些信息尚未得到充分利用,因为自然语言中表达的细微差别难以进行机器分析。从电子健康记录等知识密集型资源中提取相关信息,可用于通过构建自动信息访问系统等方式总体上改善医疗保健。我们提出了一个与急诊病房瑞典临床评估中的诊断相关的六个事实性水平的注释模型。我们的主要发现是总体一致性相当高(F值为0.7/0.58,科恩kappa系数为0.73/0.6,组内/组间)。这些区别对知识模型很重要,因为只有约50%的诊断被确定肯定。此外,我们的结果表明,诊断表达本身存在传达事实性水平的固有模式,这表明确定性不仅取决于上下文线索。