Ananda-Rajah Michelle R, Bergmeir Christoph, Petitjean François, Slavin Monica A, Thursky Karin A, Webb Geoffrey I
Michelle R. Ananda-Rajah, Alfred Health; Michelle R. Ananda-Rajah, Christoph Bergmeir, François Petitjean, and Geoffrey I. Webb, Monash University; and Monica A. Slavin and Karin A. Thursky, Peter Doherty Centre for Infection and Immunity; University of Melbourne, Melbourne, Victoria, Australia.
JCO Clin Cancer Inform. 2017 Nov;1:1-10. doi: 10.1200/CCI.17.00011.
Prospective epidemiologic surveillance of invasive mold disease (IMD) in hematology patients is hampered by the absence of a reliable laboratory prompt. This study develops an expert system for electronic surveillance of IMD that combines probabilities using natural language processing (NLP) of computed tomography (CT) reports with microbiology and antifungal drug data to improve prediction of IMD.
Microbiology indicators and antifungal drug-dispensing data were extracted from hospital information systems at three tertiary hospitals for 123 hematology-oncology patients. Of this group, 64 case patients had 26 probable/proven IMD according to international definitions, and 59 patients were uninfected controls. Derived probabilities from NLP combined with medical expertise identified patients at high likelihood of IMD, with remaining patients processed by a machine-learning classifier trained on all available features.
Compared with the baseline text classifier, the expert system that incorporated the best performing algorithm (naïve Bayes) improved specificity from 50.8% (95% CI, 37.5% to 64.1%) to 74.6% (95% CI, 61.6% to 85.0%), reducing false positives by 48% from 29 to 15; improved sensitivity slightly from 96.9% (95% CI, 89.2% to 99.6%) to 98.4% (95% CI, 91.6% to 100%); and improved receiver operating characteristic area from 73.9% (95% CI, 67.1% to 80.6%) to 92.8% (95% CI, 88% to 97.5%).
An expert system that uses multiple sources of data (CT reports, microbiology, antifungal drug dispensing) is a promising approach to continuous prospective surveillance of IMD in the hospital, and demonstrates reduced false notifications (positives) compared with NLP of CT reports alone. Our expert system could provide decision support for IMD surveillance, which is critical to antifungal stewardship and improving supportive care in cancer.
由于缺乏可靠的实验室提示,血液学患者侵袭性霉菌病(IMD)的前瞻性流行病学监测受到阻碍。本研究开发了一种用于IMD电子监测的专家系统,该系统结合了使用计算机断层扫描(CT)报告的自然语言处理(NLP)得出的概率与微生物学和抗真菌药物数据,以改善对IMD的预测。
从三家三级医院的医院信息系统中提取了123例血液肿瘤患者的微生物学指标和抗真菌药物配药数据。在这组患者中,根据国际定义,64例病例患者有26例可能/确诊的IMD,59例患者为未感染对照。通过NLP结合医学专业知识得出的概率识别出IMD可能性高的患者,其余患者由基于所有可用特征训练的机器学习分类器进行处理。
与基线文本分类器相比,纳入表现最佳算法(朴素贝叶斯)的专家系统将特异性从50.8%(95%CI,37.5%至64.1%)提高到74.6%(95%CI,61.6%至85.0%),假阳性从29例减少48%至15例;敏感性从96.9%(95%CI,89.2%至99.6%)略有提高至98.4%(95%CI,91.6%至100%);受试者工作特征曲线下面积从73.9%(95%CI,67.1%至80.6%)提高到92.8%(95%CI,88%至97.5%)。
使用多源数据(CT报告、微生物学、抗真菌药物配药)的专家系统是医院对IMD进行持续前瞻性监测的一种有前景的方法,与仅对CT报告进行NLP相比,显示出减少了错误通知(阳性)。我们的专家系统可为IMD监测提供决策支持,这对抗真菌管理和改善癌症支持性护理至关重要。