Flatiron Health Inc, New York, New York, USA.
Health Serv Res. 2021 Dec;56(6):1281-1287. doi: 10.1111/1475-6773.13669. Epub 2021 May 17.
We expanded the previous assessment of a mortality variable suited for real-world evidence-focused oncology research.
We used a nationwide electronic health record (EHR)-derived de-identified database.
We included patients with at least 1 of 18 cancer types between January 1, 2011 and December 31, 2017. Patient-level structured data (EHRs, obituaries, and Social Security Death Index) and unstructured EHR data (abstracted) were linked to generate a composite mortality variable.
We benchmarked sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and ±15-day agreement against the National Death Index (NDI). Real-world overall survival (rwOS) was estimated using the Kaplan-Meier method. We performed sensitivity analyses using a smaller patient cohort that underwent next-generation sequencing testing.
Compared with the NDI across 18 cancer types (overall N = 160 436): sensitivity, 83.9%-91.5% (17/18 cancer types had sensitivity ≥85.0%); specificity, 93.5%-99.7%; PPV, 96.3%-98.3%; NPV, 75.0%-98.7%; ±15-day agreement, 95.6%-97.6%; and median rwOS estimates ranging from 2.8% to 12.7% greater. Sensitivity analysis results (n = 17 540) were consistent with the main analysis.
Across all cancer types analyzed, this composite mortality variable showed high sensitivity, specificity, PPV, NPV, and ±15-day agreement, and yielded median rwOS values modestly overestimated when compared to NDI-based results.
我们扩展了之前对适用于以真实世界证据为重点的肿瘤学研究的死亡率变量的评估。
我们使用了一个全国性的电子健康记录(EHR)衍生的去识别数据库。
我们纳入了 2011 年 1 月 1 日至 2017 年 12 月 31 日期间至少患有 18 种癌症之一的患者。患者水平的结构化数据(EHR、讣告和社会保障死亡指数)和非结构化的 EHR 数据(摘要)被链接起来,以生成一个综合的死亡率变量。
我们将敏感性、特异性、阳性预测值(PPV)、阴性预测值(NPV)和±15 天的一致性与国家死亡指数(NDI)进行了基准测试。使用 Kaplan-Meier 方法估计真实世界的总体生存率(rwOS)。我们使用接受下一代测序检测的较小患者队列进行了敏感性分析。
与 18 种癌症类型的 NDI 相比(总体 N=160436):敏感性为 83.9%-91.5%(18 种癌症类型中有 17 种敏感性≥85.0%);特异性为 93.5%-99.7%;PPV 为 96.3%-98.3%;NPV 为 75.0%-98.7%;±15 天的一致性为 95.6%-97.6%;以及中位数 rwOS 估计值相差 2.8%-12.7%。敏感性分析结果(n=17540)与主要分析结果一致。
在分析的所有癌症类型中,这种综合死亡率变量表现出高敏感性、特异性、PPV、NPV 和±15 天的一致性,并且与基于 NDI 的结果相比,产生的中位数 rwOS 值略有高估。