Martin Shelby, Wagner Jesse, Lupulescu-Mann Nicoleta, Ramsey Katrina, Cohen Aaron, Graven Peter, Weiskopf Nicole G, Dorr David A
David A. Dorr, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, 3181 S.W. Sam Jackson Park Rd., MDYMICE, Portland, OR 97239-3098, USA, Email:
Appl Clin Inform. 2017 Aug 2;8(3):794-809. doi: 10.4338/ACI-2016-12-RA-0210.
To measure variation among four different Electronic Health Record (EHR) system documentation locations versus 'gold standard' manual chart review for risk stratification in patients with multiple chronic illnesses.
Adults seen in primary care with EHR evidence of at least one of 13 conditions were included. EHRs were manually reviewed to determine presence of active diagnoses, and risk scores were calculated using three different methodologies and five EHR documentation locations. Claims data were used to assess cost and utilization for the following year. Descriptive and diagnostic statistics were calculated for each EHR location. Criterion validity testing compared the gold standard verified diagnoses versus other EHR locations and risk scores in predicting future cost and utilization.
Nine hundred patients had 2,179 probable diagnoses. About 70% of the diagnoses from the EHR were verified by gold standard. For a subset of patients having baseline and prediction year data (n=750), modeling showed that the gold standard was the best predictor of outcomes on average for a subset of patients that had these data. However, combining all data sources together had nearly equivalent performance for prediction as the gold standard.
EHR data locations were inaccurate 30% of the time, leading to improvement in overall modeling from a gold standard from chart review for individual diagnoses. However, the impact on identification of the highest risk patients was minor, and combining data from different EHR locations was equivalent to gold standard performance. The reviewer's ability to identify a diagnosis as correct was influenced by a variety of factors, including completeness, temporality, and perceived accuracy of chart data.
比较四种不同电子健康记录(EHR)系统文档位置与“金标准”手工病历审查在多重慢性病患者风险分层方面的差异。
纳入在初级保健机构就诊且EHR中有13种疾病中至少一种证据的成年人。对手工审查EHR以确定现行诊断的存在情况,并使用三种不同方法和五个EHR文档位置计算风险评分。使用索赔数据评估次年的成本和利用率。对每个EHR位置计算描述性和诊断性统计数据。标准效度测试比较了金标准验证诊断与其他EHR位置及风险评分在预测未来成本和利用率方面的情况。
900名患者有2179个可能的诊断。EHR中的诊断约70%经金标准验证。对于有基线和预测年数据的患者子集(n = 750),建模显示,对于有这些数据的患者子集,金标准平均是结果的最佳预测指标。然而,将所有数据源结合在一起在预测方面的表现与金标准几乎相当。
EHR数据位置有30%的时间不准确,这导致从个体诊断的病历审查金标准进行的整体建模有所改进。然而,对识别最高风险患者的影响较小,并且将来自不同EHR位置的数据结合起来与金标准表现相当。审查者将诊断识别为正确的能力受多种因素影响,包括图表数据的完整性、时间性和感知准确性。