Krebs Jonathan, Bittrich Max, Dietrich Georg, Ertl Maximilian, Fette Georg, Kaspar Mathias, Liman Leon, Einsele Hermann, Puppe Frank, Knop Stefan
Chair for Artificial Intelligence and Applied Informatics, University of Würzburg.
Department of Medicine II, University Hospital Würzburg.
Stud Health Technol Inform. 2018;253:160-164.
Finding patient cases with extremely rare pathologies is a laborious task. To decrease time spent on manually searching through thousands of discharge letters and reports, a data warehouse with a fast fulltext search index was queried. Our use case is to find "macrofocal myeloma", i.e. Multiple Myeloma patients with few large lesions. We guessed the number of those patients in the University Hospital Würzburg at about 20. Most criteria were available in the data warehouse in an unstructured form requiring information extraction. 8 patient cases were found by searching for different spellings of "macrofocal myeloma" in discharge letters directly. With an indirect search combining several criteria, we found additional 23 candidate patient cases, from which 10 were classified by a domain expert as correct. The most difficult criteria were determining the degree of bone marrow infiltration. We achieved an F1 score of 93.2 % for this task. The number of patient cases to be screened manually for this disease decreased from about 25000 to 23.
寻找患有极其罕见病理的患者病例是一项艰巨的任务。为了减少在手动查阅数千份出院信件和报告上花费的时间,我们查询了一个具有快速全文搜索索引的数据仓库。我们的用例是查找“巨灶性骨髓瘤”,即患有少量大病灶的多发性骨髓瘤患者。我们猜测维尔茨堡大学医院中这类患者的数量约为20例。大多数标准以非结构化形式存在于数据仓库中,需要进行信息提取。通过直接在出院信件中搜索“巨灶性骨髓瘤”的不同拼写,找到了8例患者病例。通过结合多个标准进行间接搜索,我们又找到了另外23例候选患者病例,其中10例被领域专家判定为正确。最困难的标准是确定骨髓浸润程度。这项任务的F1分数达到了93.2%。针对这种疾病需要手动筛查的患者病例数量从约25000例减少到了23例。