Borthwick Kenneth M, Smelser Diane T, Bock Jonathan A, Elmore James R, Ryer Evan J, Ye Zi, Pacheco Jennifer A, Carrell David S, Michalkiewicz Michael, Thompson William K, Pathak Jyotishman, Bielinski Suzette J, Denny Joshua C, Linneman James G, Peissig Peggy L, Kho Abel N, Gottesman Omri, Parmar Harpreet, Kullo Iftikhar J, McCarty Catherine A, Böttinger Erwin P, Larson Eric B, Jarvik Gail P, Harley John B, Bajwa Tanvir, Franklin David P, Carey David J, Kuivaniemi Helena, Tromp Gerard
The Sigfried and Janet Weis Center for Research, Geisinger Health System, Danville, PA, USA.
Department of Vascular and Endovascular Surgery, Geisinger Health System, Danville, PA, USA.
Int J Biomed Data Min. 2015 Dec;4(1). Epub 2015 Jul 30.
We designed an algorithm to identify abdominal aortic aneurysm cases and controls from electronic health records to be shared and executed within the "electronic Medical Records and Genomics" (eMERGE) Network.
Structured Query Language, was used to script the algorithm utilizing "Current Procedural Terminology" and "International Classification of Diseases" codes, with demographic and encounter data to classify individuals as case, control, or excluded. The algorithm was validated using blinded manual chart review at three eMERGE Network sites and one non-eMERGE Network site. Validation comprised evaluation of an equal number of predicted cases and controls selected at random from the algorithm predictions. After validation at the three eMERGE Network sites, the remaining eMERGE Network sites performed verification only. Finally, the algorithm was implemented as a workflow in the Konstanz Information Miner, which represented the logic graphically while retaining intermediate data for inspection at each node. The algorithm was configured to be independent of specific access to data and was exportable (without data) to other sites.
The algorithm demonstrated positive predictive values (PPV) of 92.8% (CI: 86.8-96.7) and 100% (CI: 97.0-100) for cases and controls, respectively. It performed well also outside the eMERGE Network. Implementation of the transportable executable algorithm as a Konstanz Information Miner workflow required much less effort than implementation from pseudo code, and ensured that the logic was as intended.
This ePhenotyping algorithm identifies abdominal aortic aneurysm cases and controls from the electronic health record with high case and control PPV necessary for research purposes, can be disseminated easily, and applied to high-throughput genetic and other studies.
我们设计了一种算法,用于从电子健康记录中识别腹主动脉瘤病例和对照,以便在“电子病历与基因组学”(eMERGE)网络中共享和执行。
使用结构化查询语言,利用“当前程序术语”和“国际疾病分类”代码编写算法脚本,并结合人口统计学和就诊数据将个体分类为病例、对照或排除对象。该算法在三个eMERGE网络站点和一个非eMERGE网络站点通过盲法人工病历审查进行验证。验证包括对从算法预测中随机选择的等量预测病例和对照进行评估。在三个eMERGE网络站点验证后,其余的eMERGE网络站点仅进行核查。最后,该算法在康斯坦茨信息挖掘器中作为工作流程实施,该工具以图形方式表示逻辑,同时保留中间数据以便在每个节点进行检查。该算法配置为独立于特定的数据访问方式,并且可以(不带数据)导出到其他站点。
该算法对病例和对照的阳性预测值(PPV)分别为92.8%(CI:86.8 - 96.7)和100%(CI:97.0 - 100)。它在eMERGE网络之外也表现良好。将可移植的可执行算法作为康斯坦茨信息挖掘器工作流程实施所需的工作量比从伪代码实施要少得多,并确保逻辑符合预期。
这种电子表型算法可从电子健康记录中识别腹主动脉瘤病例和对照,具有研究目的所需的高病例和对照PPV,易于传播,并可应用于高通量基因及其他研究。