Janssen Research & Development, Observational Health Data Analytics, Raritan, New Jersey, USA.
Janssen Research & Development, Infectious Diseases and Vaccines, Beerse, Belgium.
Pharmacoepidemiol Drug Saf. 2022 Sep;31(9):983-991. doi: 10.1002/pds.5505. Epub 2022 Jul 10.
Evaluation of novel code-based algorithms to identify invasive Escherichia coli disease (IED) among patients in healthcare databases.
Inpatient visits with microbiological evidence of invasive bacterial disease were extracted from the Optum© electronic health record database between January 1, 2016 and June 30, 2020. Six algorithms, derived from diagnosis and drug exposure codes associated to infectious diseases and Escherichia coli, were developed to identify IED. The performance characteristics of algorithms were assessed using a reference standard derived from microbiology data.
Among 97 194 eligible records, 25 310 (26.0%) were classified as IED. Algorithm 1 (diagnosis code for infectious invasive disease due to E. coli) had the highest positive predictive value (PPV; 96.0%) and lowest sensitivity (60.4%). Algorithm 2, which additionally included patients with diagnosis codes for infectious invasive disease due to an unspecified organism, had the highest sensitivity (95.5%) and lowest PPV (27.8%). Algorithm 4, which required patients with a diagnosis code for infectious invasive disease due to unspecified organism to have no diagnosis code for non-E. coli infections, achieved the most balanced performance characteristics (PPV, 93.6%; sensitivity, 78.1%; F score, 85.1%). Finally, adding exposure to antibiotics in the treatment of E. coli had limited impact on performance algorithms 5 and 6.
Algorithm 4, which achieved the most balanced performance characteristics, offers a useful tool to identify patients with IED and assess the burden of IED in healthcare databases.
评估新型基于代码的算法,以在医疗保健数据库中识别侵袭性大肠杆菌病(IED)患者。
从 Optum©电子健康记录数据库中提取了 2016 年 1 月 1 日至 2020 年 6 月 30 日期间具有侵袭性细菌疾病微生物学证据的住院患者就诊记录。基于与传染病和大肠杆菌相关的诊断和药物暴露代码,开发了 6 种算法来识别 IED。使用源自微生物学数据的参考标准评估算法的性能特征。
在 97194 份合格记录中,25310 份(26.0%)被归类为 IED。算法 1(由大肠杆菌引起的感染性侵袭性疾病的诊断代码)具有最高的阳性预测值(PPV;96.0%)和最低的灵敏度(60.4%)。算法 2 另外包括诊断代码为未指明病原体的感染性侵袭性疾病的患者,具有最高的灵敏度(95.5%)和最低的 PPV(27.8%)。算法 4 要求具有未指明病原体的感染性侵袭性疾病诊断代码的患者没有非大肠杆菌感染的诊断代码,实现了最平衡的性能特征(PPV,93.6%;灵敏度,78.1%;F 分数,85.1%)。最后,在治疗大肠杆菌时添加抗生素的暴露对算法 5 和 6 的性能影响有限。
算法 4 实现了最平衡的性能特征,为识别 IED 患者和评估 IED 在医疗保健数据库中的负担提供了有用的工具。