Suppr超能文献

电子病历与基因组学(eMERGE)网络中腹主动脉瘤的电子表型分析:算法开发与康斯坦茨信息挖掘器工作流程

ePhenotyping for Abdominal Aortic Aneurysm in the Electronic Medical Records and Genomics (eMERGE) Network: Algorithm Development and Konstanz Information Miner Workflow.

作者信息

Borthwick Kenneth M, Smelser Diane T, Bock Jonathan A, Elmore James R, Ryer Evan J, Ye Zi, Pacheco Jennifer A, Carrell David S, Michalkiewicz Michael, Thompson William K, Pathak Jyotishman, Bielinski Suzette J, Denny Joshua C, Linneman James G, Peissig Peggy L, Kho Abel N, Gottesman Omri, Parmar Harpreet, Kullo Iftikhar J, McCarty Catherine A, Böttinger Erwin P, Larson Eric B, Jarvik Gail P, Harley John B, Bajwa Tanvir, Franklin David P, Carey David J, Kuivaniemi Helena, Tromp Gerard

机构信息

The Sigfried and Janet Weis Center for Research, Geisinger Health System, Danville, PA, USA.

Department of Vascular and Endovascular Surgery, Geisinger Health System, Danville, PA, USA.

出版信息

Int J Biomed Data Min. 2015 Dec;4(1). Epub 2015 Jul 30.

Abstract

BACKGROUND AND OBJECTIVE

We designed an algorithm to identify abdominal aortic aneurysm cases and controls from electronic health records to be shared and executed within the "electronic Medical Records and Genomics" (eMERGE) Network.

MATERIALS AND METHODS

Structured Query Language, was used to script the algorithm utilizing "Current Procedural Terminology" and "International Classification of Diseases" codes, with demographic and encounter data to classify individuals as case, control, or excluded. The algorithm was validated using blinded manual chart review at three eMERGE Network sites and one non-eMERGE Network site. Validation comprised evaluation of an equal number of predicted cases and controls selected at random from the algorithm predictions. After validation at the three eMERGE Network sites, the remaining eMERGE Network sites performed verification only. Finally, the algorithm was implemented as a workflow in the Konstanz Information Miner, which represented the logic graphically while retaining intermediate data for inspection at each node. The algorithm was configured to be independent of specific access to data and was exportable (without data) to other sites.

RESULTS

The algorithm demonstrated positive predictive values (PPV) of 92.8% (CI: 86.8-96.7) and 100% (CI: 97.0-100) for cases and controls, respectively. It performed well also outside the eMERGE Network. Implementation of the transportable executable algorithm as a Konstanz Information Miner workflow required much less effort than implementation from pseudo code, and ensured that the logic was as intended.

DISCUSSION AND CONCLUSION

This ePhenotyping algorithm identifies abdominal aortic aneurysm cases and controls from the electronic health record with high case and control PPV necessary for research purposes, can be disseminated easily, and applied to high-throughput genetic and other studies.

摘要

背景与目的

我们设计了一种算法,用于从电子健康记录中识别腹主动脉瘤病例和对照,以便在“电子病历与基因组学”(eMERGE)网络中共享和执行。

材料与方法

使用结构化查询语言,利用“当前程序术语”和“国际疾病分类”代码编写算法脚本,并结合人口统计学和就诊数据将个体分类为病例、对照或排除对象。该算法在三个eMERGE网络站点和一个非eMERGE网络站点通过盲法人工病历审查进行验证。验证包括对从算法预测中随机选择的等量预测病例和对照进行评估。在三个eMERGE网络站点验证后,其余的eMERGE网络站点仅进行核查。最后,该算法在康斯坦茨信息挖掘器中作为工作流程实施,该工具以图形方式表示逻辑,同时保留中间数据以便在每个节点进行检查。该算法配置为独立于特定的数据访问方式,并且可以(不带数据)导出到其他站点。

结果

该算法对病例和对照的阳性预测值(PPV)分别为92.8%(CI:86.8 - 96.7)和100%(CI:97.0 - 100)。它在eMERGE网络之外也表现良好。将可移植的可执行算法作为康斯坦茨信息挖掘器工作流程实施所需的工作量比从伪代码实施要少得多,并确保逻辑符合预期。

讨论与结论

这种电子表型算法可从电子健康记录中识别腹主动脉瘤病例和对照,具有研究目的所需的高病例和对照PPV,易于传播,并可应用于高通量基因及其他研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c41/4820287/b927483a5fa1/nihms723318f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验