Héja Gergely, Surján György, Lukácsy Gergely, Pallinger Péter, Gergely Miklós
Budapest University of Technology and Economics, Department of Measurement and Information Systems, Budapest, Hungary.
Int J Med Inform. 2007 Feb-Mar;76(2-3):118-23. doi: 10.1016/j.ijmedinf.2006.07.008. Epub 2006 Oct 4.
The main objective is to create a knowledge-intensive coding support tool for the International Classification of Diseases (ICD10), based on a formal representation of the ICD10 categories. Beyond this task, the resulting ontology could be reused in various ways. Decidability is an important issue for computer-assisted coding; consequently, the ontology should be represented in description logic.
The meaning of the ICD10 categories is represented using the GALEN Core Reference Model. Because of deficiencies in its representation language (GRAIL), the ontology is transformed to the quasi-standard OWL. A test system that extracts disease concepts and classifies them into ICD10 categories has been implemented in Prolog to verify the feasibility of the approach.
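The idea of classifying an extracted disease concept into ICD10 categories can be illustrated with a minimal sketch. This is not the authors' Prolog system, nor the GALEN or OWL model: the is-a hierarchy, the attribute names (`pathology`, `location`), and the category definitions below are all hypothetical and chosen only to show subsumption-style matching, in which a concept falls under every category whose criteria it satisfies.

```python
# Toy is-a hierarchy (child -> parent). All names are illustrative assumptions,
# not taken from GALEN, OWL, or the authors' ontology.
IS_A = {
    "Stomach": "DigestiveOrgan",
    "Colon": "DigestiveOrgan",
    "DigestiveOrgan": "BodyStructure",
    "MalignantNeoplasm": "Neoplasm",
    "Neoplasm": "Pathology",
}


def subsumes(general, specific):
    """Return True if `general` equals `specific` or is one of its ancestors."""
    node = specific
    while node is not None:
        if node == general:
            return True
        node = IS_A.get(node)
    return False


# Hypothetical category definitions: attribute -> most general allowed value.
CATEGORIES = {
    "C15-C26": {"pathology": "MalignantNeoplasm", "location": "DigestiveOrgan"},
    "C16": {"pathology": "MalignantNeoplasm", "location": "Stomach"},
    "C18": {"pathology": "MalignantNeoplasm", "location": "Colon"},
}


def classify(concept):
    """Return the sorted codes of all categories whose criteria the concept meets."""
    matches = [
        code
        for code, criteria in CATEGORIES.items()
        if all(
            attr in concept and subsumes(value, concept[attr])
            for attr, value in criteria.items()
        )
    ]
    return sorted(matches)


# A disease concept as it might be extracted from a medical record.
stomach_cancer = {"pathology": "MalignantNeoplasm", "location": "Stomach"}
print(classify(stomach_cancer))  # ['C15-C26', 'C16']
```

In this sketch the concept matches both the block (C15-C26) and the more specific category (C16); a real coding tool would additionally need to pick the most specific match and handle ICD10 exclusion rules, which the toy model ignores.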
The formal representation of the first two chapters of ICD10 (infectious diseases and neoplasms) is almost complete. The constructed ontology has been converted to OWL DL. The test system identified diseases in medical records from gastrointestinal oncology with 84% recall but only 45% precision. The classifier module is still under development. Based on experience gained during modelling, future work will use the Foundational Model of Anatomy (FMA) as the anatomical reference ontology.
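To make the reported figures concrete, the sketch below shows how recall and precision are computed from counts of extracted disease mentions. The abstract gives only the percentages; the counts used here (63 true positives, 77 false positives, 12 false negatives) are hypothetical and were chosen solely because they reproduce 84% recall and 45% precision.

```python
def precision_recall_f1(tp, fp, fn):
    """Compute precision, recall and F1 from true/false positive and false negative counts."""
    precision = tp / (tp + fp)  # share of extracted mentions that are correct
    recall = tp / (tp + fn)     # share of true mentions that were extracted
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts, not reported in the source.
precision, recall, f1 = precision_recall_f1(tp=63, fp=77, fn=12)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

With these rates, more than half of the extracted disease mentions would be spurious, which is why low precision matters for a coding support tool even when recall is high.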