Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA; Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA.
Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA; Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA.
Ann Epidemiol. 2022 Dec;76:143-149. doi: 10.1016/j.annepidem.2022.07.007. Epub 2022 Jul 23.
Electronic health record (EHR) discontinuity (missing out-of-network encounters) can lead to information bias. We sought to construct an algorithm that identifies high EHR-continuity among oncology patients.
Using a linked Medicare-EHR database and regression, we sought to 1) measure how often Medicare claims for outpatient encounters were substantiated by visits recorded in the EHR, and 2) predict continuity ratio, defined as the yearly proportion of outpatient encounters reported to Medicare that were captured by EHR data. The prediction model...s performance was evaluated with the coefficient of determination and Spearman...s correlation. We quantified variable misclassification by decile of continuity ratio using standardized difference and sensitivity.
A total of 79,678 subjects met all eligibility criteria. Predicted and observed continuity was highly correlated (σ=0.86). On average across all variables measured, MSD was reduced by a factor of 1/7 and sensitivity was improved 35-fold comparing subjects in the highest vs. lowest decile of CR.
In the oncology population, restricting EHR-based study cohorts to subjects with high continuity may reduce misclassification without greatly impacting representativeness. Further work is needed to elucidate the best manner of implementing continuity prediction rules in cohort studies.
电子健康记录(EHR)不连续性(漏记网外就诊)可能导致信息偏倚。我们试图构建一种算法,以确定肿瘤患者的 EHR 连续性较高。
我们使用链接的医疗保险电子健康记录数据库和回归分析,旨在 1)衡量医疗保险对外科门诊就诊记录的索赔与 EHR 中记录的就诊之间相符的频率,以及 2)预测连续性比,定义为每年报告给医疗保险的门诊就诊次数中被 EHR 数据捕获的比例。使用决定系数和斯皮尔曼相关性来评估预测模型的性能。我们使用标准化差异和敏感性来衡量连续性比的十分位数变量分类错误。
共有 79678 名患者符合所有入选标准。预测的连续性和观察的连续性高度相关(σ=0.86)。在所有测量的变量中,平均而言,与最高与最低连续性百分位组的患者相比,MSD 降低了 1/7,敏感性提高了 35 倍。
在肿瘤患者中,将基于 EHR 的研究队列限制在连续性较高的患者中,可能会减少分类错误,而不会对代表性产生重大影响。还需要进一步的工作来阐明在队列研究中实施连续性预测规则的最佳方式。