Suppr超能文献

大海捞针:借助数据仓库和信息提取识别多发性骨髓瘤罕见亚型患者

Finding Needles in the Haystack: Identifying Patients with Rare Subtype of Multiple Myeloma Supported by a Data Warehouse and Information Extraction.

作者信息

Krebs Jonathan, Bittrich Max, Dietrich Georg, Ertl Maximilian, Fette Georg, Kaspar Mathias, Liman Leon, Einsele Hermann, Puppe Frank, Knop Stefan

机构信息

Chair for Artificial Intelligence and Applied Informatics, University of Würzburg.

Department of Medicine II, University Hospital Würzburg.

出版信息

Stud Health Technol Inform. 2018;253:160-164.

Abstract

Finding patient cases with extremely rare pathologies is a laborious task. To decrease time spent on manually searching through thousands of discharge letters and reports, a data warehouse with a fast fulltext search index was queried. Our use case is to find "macrofocal myeloma", i.e. Multiple Myeloma patients with few large lesions. We guessed the number of those patients in the University Hospital Würzburg at about 20. Most criteria were available in the data warehouse in an unstructured form requiring information extraction. 8 patient cases were found by searching for different spellings of "macrofocal myeloma" in discharge letters directly. With an indirect search combining several criteria, we found additional 23 candidate patient cases, from which 10 were classified by a domain expert as correct. The most difficult criteria were determining the degree of bone marrow infiltration. We achieved an F1 score of 93.2 % for this task. The number of patient cases to be screened manually for this disease decreased from about 25000 to 23.

摘要

寻找患有极其罕见病理的患者病例是一项艰巨的任务。为了减少在手动查阅数千份出院信件和报告上花费的时间,我们查询了一个具有快速全文搜索索引的数据仓库。我们的用例是查找“巨灶性骨髓瘤”,即患有少量大病灶的多发性骨髓瘤患者。我们猜测维尔茨堡大学医院中这类患者的数量约为20例。大多数标准以非结构化形式存在于数据仓库中,需要进行信息提取。通过直接在出院信件中搜索“巨灶性骨髓瘤”的不同拼写,找到了8例患者病例。通过结合多个标准进行间接搜索,我们又找到了另外23例候选患者病例,其中10例被领域专家判定为正确。最困难的标准是确定骨髓浸润程度。这项任务的F1分数达到了93.2%。针对这种疾病需要手动筛查的患者病例数量从约25000例减少到了23例。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验