ConcertAI, LLC, Cambridge, MA.
Tulane University, New Orleans, LA.
JCO Clin Cancer Inform. 2023 Sep;7:e2300014. doi: 10.1200/CCI.23.00014.
This study evaluated the relative improvements in mortality data capture of adding different external data to enriched electronic medical records (EMRs) for patients with melanoma.
An enriched EMR database, containing structured and unstructured data, was used to evaluate the incremental mortality data capture of the following external data sources: Social Security Administration (SSA), public obituary, and an administrative open-claims database for the claims data set. Overall survival (OS) was assessed for each data set and the composite data set using the Kaplan-Meier method.
A total of 3,882 patients were included in the study. The enriched EMR data set identified 1,085 patients with a death record. The SSA data set identified 213 patients (73 unique when combined with enriched EMR) with a death record, while the obituary data set identified 1,127 patients (241 unique). The administrative claims data set identified 378 patients (73 unique) with a death record; however, all these unique patients were already accounted for in the combined SSA and obituary data set. The composite data set yielded a median OS of 13.39 years, about 4 years shorter than the enriched EMR data set alone (17.63 years).
When the enriched EMR data set was augmented with one external data set, the obituary data set provided the most additional value, followed by claims, and then SSA. The augmentation of all the data sources had a significant impact on the OS results compared with enriched EMR alone.
本研究评估了为黑色素瘤患者添加不同外部数据来改善电子病历(EMR)中死亡率数据捕获的相对效果。
使用包含结构化和非结构化数据的丰富 EMR 数据库来评估以下外部数据源对死亡率数据捕获的增量效果:社会保障管理局(SSA)、公共讣告和索赔数据集的开放索赔行政数据库。使用 Kaplan-Meier 方法评估每个数据集和综合数据集的总生存期(OS)。
共有 3882 名患者纳入研究。丰富的 EMR 数据集确定了 1085 名有死亡记录的患者。SSA 数据集确定了 213 名(与丰富的 EMR 结合时有 73 名是唯一的)有死亡记录的患者,而讣告数据集确定了 1127 名(241 名是唯一的)有死亡记录的患者。行政索赔数据集确定了 378 名(73 名是唯一的)有死亡记录的患者;然而,所有这些唯一的患者已经在 SSA 和讣告数据集的综合数据中被记录。综合数据集的中位 OS 为 13.39 年,比仅使用丰富的 EMR 数据集短约 4 年(17.63 年)。
当丰富的 EMR 数据集与一个外部数据集结合时,讣告数据集提供了最大的附加价值,其次是索赔数据集,然后是 SSA。与仅使用丰富的 EMR 相比,所有数据源的增加对 OS 结果有显著影响。