Children's Health Services Research, Department of Pediatrics, Indiana University School of Medicine, Indianapolis, Indiana, USA.
J Am Med Inform Assoc. 2014 Mar-Apr;21(2):345-52. doi: 10.1136/amiajnl-2013-001942. Epub 2013 Oct 10.
Electronic health records databases are increasingly used for identifying cohort populations, covariates, or outcomes, but discerning such clinical 'phenotypes' accurately is an ongoing challenge. We developed a flexible method using overlapping (Venn diagram) queries. Here we describe this approach to find patients hospitalized with acute congestive heart failure (CHF), a sampling strategy for one-by-one 'gold standard' chart review, and calculation of positive predictive value (PPV) and sensitivities, with SEs, across different definitions.
We used retrospective queries of hospitalizations (2002-2011) in the Indiana Network for Patient Care with any CHF ICD-9 diagnoses, a primary diagnosis, an echocardiogram performed, a B-natriuretic peptide (BNP) drawn, or BNP >500 pg/mL. We used a hybrid between proportional sampling by Venn zone and over-sampling non-overlapping zones. The acute CHF (presence/absence) outcome was based on expert chart review using a priori criteria.
Among 79,091 hospitalizations, we reviewed 908. A query for any ICD-9 code for CHF had PPV 42.8% (SE 1.5%) for acute CHF and sensitivity 94.3% (1.3%). Primary diagnosis of 428 and BNP >500 pg/mL had PPV 90.4% (SE 2.4%) and sensitivity 28.8% (1.1%). PPV was <10% when there was no echocardiogram, no BNP, and no primary diagnosis. 'False positive' hospitalizations were for other heart disease, lung disease, or other reasons.
This novel method successfully allowed flexible application and validation of queries for patients hospitalized with acute CHF.
电子健康记录数据库越来越多地被用于识别队列人群、协变量或结果,但准确识别这些临床“表型”仍然是一个持续的挑战。我们开发了一种使用重叠(Venn 图)查询的灵活方法。在这里,我们描述了这种方法,用于查找因急性充血性心力衰竭(CHF)住院的患者,一种逐个“金标准”图表审查的抽样策略,以及不同定义下的阳性预测值(PPV)和敏感度的计算,同时计算标准误(SE)。
我们使用印第安纳州患者护理网络中 2002 年至 2011 年的住院记录进行回顾性查询,这些记录具有任何 CHF ICD-9 诊断、主要诊断、进行的超声心动图检查、抽取的 B 型利钠肽(BNP)或 BNP>500pg/mL。我们使用 Venn 区的比例抽样和非重叠区的过采样的混合方法。急性 CHF(存在/不存在)的结果是基于使用先验标准的专家图表审查。
在 79091 次住院中,我们审查了 908 次。任何 CHF ICD-9 代码的查询对急性 CHF 的 PPV 为 42.8%(SE 为 1.5%),敏感度为 94.3%(1.3%)。428 次主要诊断和 BNP>500pg/mL 的查询的 PPV 为 90.4%(SE 为 2.4%),敏感度为 28.8%(1.1%)。当没有进行超声心动图检查、没有 BNP 检查且没有主要诊断时,PPV<10%。“假阳性”住院是由于其他心脏病、肺部疾病或其他原因。
这种新方法成功地允许灵活应用和验证用于因急性 CHF 住院的患者的查询。