Leitich Harald, Adlassnig Klaus-Peter, Kolarz Gernot
Department of Medical Computer Sciences, Section of Medical Expert and Knowledge-Based Systems, University of Vienna Medical School, Spitalgasse 23, A-1090 Vienna, Austria.
Artif Intell Med. 2002 Jul;25(3):215-25. doi: 10.1016/s0933-3657(02)00025-8.
As part of a plan to promote semi-automatic knowledge acquisition for the medical consultant system CADIAG-II/RHEUMA, this study sought to explore and cope with the variability of results that may be anticipated when performing knowledge acquisition with patient data from different patient settings. Patient data were drawn both from a published study for the classification of rheumatoid arthritis (RA) and from a large database of rheumatological patient charts developed for the CADIAG-II/RHEUMA system. An analysis of the relationships between RA and selected CADIAG-II/RHEUMA symptoms was done using two models. In one of them, we controlled for the differences in baseline frequencies of symptoms and diseases in the two study populations as an important factor influencing the results of the calculations. Other factors that were identified included inconsistent definitions of symptoms and diseases, and the different composition of study groups in the two study populations. By eliminating differences in baseline frequencies as the most important bias, the results obtained from the two different knowledge sources became more consistent. All remaining inconsistencies and uncertainties about the contribution and relative importance of the factors were formalized using fuzzy intervals.
作为促进医学咨询系统CADIAG-II/RHEUMA半自动知识获取计划的一部分,本研究旨在探索并应对在使用来自不同患者群体的患者数据进行知识获取时可能预期的结果变异性。患者数据既取自一项关于类风湿性关节炎(RA)分类的已发表研究,也取自为CADIAG-II/RHEUMA系统开发的大型风湿病患者病历数据库。使用两种模型对RA与选定的CADIAG-II/RHEUMA症状之间的关系进行了分析。在其中一种模型中,我们将两个研究群体中症状和疾病的基线频率差异作为影响计算结果的重要因素加以控制。识别出的其他因素包括症状和疾病定义不一致,以及两个研究群体中研究组的不同构成。通过消除作为最重要偏差的基线频率差异,来自两种不同知识源的结果变得更加一致。关于这些因素的贡献和相对重要性的所有剩余不一致性和不确定性都使用模糊区间进行了形式化。