Phelan Matthew, Bhavsar Nrupen A, Goldstein Benjamin A
Center for Predictive Medicine, Duke Clinical Research Institute, Duke University, Durham, NC.
Division of General Internal Medicine, Department of Medicine, Duke University School of Medicine, Durham, NC.
EGEMS (Wash DC). 2017 Dec 6;5(1):22. doi: 10.5334/egems.243.
Electronic health record (EHR) data are becoming a primary resource for clinical research. Compared to traditional research data, such as those from clinical trials and epidemiologic cohorts, EHR data have a number of appealing characteristics. However, because they do not have mechanisms set in place to ensure that the appropriate data are collected, they also pose a number of analytic challenges. In this paper, we illustrate that how a patient interacts with a health system influences which data are recorded in the EHR. These interactions are typically informative, potentially resulting in bias. We term the overall set of induced biases To illustrate this, we use examples from EHR based analyses. Specifically, we show that: 1) Where a patient receives services within a health facility can induce 2) Which health system a patient chooses for an encounter can result in and 3) Referral encounters can create an While often times addressing these biases can be straightforward, it is important to understand how they are induced in any EHR based analysis.
电子健康记录(EHR)数据正成为临床研究的主要资源。与传统研究数据(如来自临床试验和流行病学队列的数据)相比,EHR数据具有许多吸引人的特征。然而,由于它们没有设置确保收集适当数据的机制,因此也带来了一些分析挑战。在本文中,我们说明了患者与医疗系统的交互方式如何影响EHR中记录的数据。这些交互通常具有信息性,可能导致偏差。我们将所引发的偏差的总体集合称为 为了说明这一点,我们使用基于EHR分析的示例。具体而言,我们表明:1)患者在医疗机构内接受服务的地点可能会引发 2)患者为一次就诊选择的医疗系统可能会导致 以及3)转诊就诊可能会产生 虽然通常解决这些偏差可能很直接,但了解它们在任何基于EHR的分析中是如何引发的很重要。