Department of Biomedical Informatics, Columbia University, New York, NY, USA.
Clin Pharmacol Ther. 2012 Aug;92(2):228-34. doi: 10.1038/clpt.2012.54. Epub 2012 Jun 20.
Electronic health records (EHRs) are an important source of data for detection of adverse drug reactions (ADRs). However, adverse events are frequently due not to medications but to the patients' underlying conditions. Mining to detect ADRs from EHR data must account for confounders. We developed an automated method using natural-language processing (NLP) and a knowledge source to differentiate cases in which the patient's disease is responsible for the event rather than a drug. Our method was applied to 199,920 hospitalization records, concentrating on two serious ADRs: rhabdomyolysis (n = 687) and agranulocytosis (n = 772). Our method automatically identified 75% of the cases, those with disease etiology. The sensitivity and specificity were 93.8% (confidence interval: 88.9-96.7%) and 91.8% (confidence interval: 84.0-96.2%), respectively. The method resulted in considerable saving of time: for every 1 h spent in development, there was a saving of at least 20 h in manual review. The review of the remaining 25% of the cases therefore became more feasible, allowing us to identify the medications that had caused the ADRs.
电子健康记录 (EHR) 是检测药物不良反应 (ADR) 的重要数据来源。然而,不良事件通常不是由药物引起,而是由患者的基础疾病引起。从 EHR 数据中挖掘以检测 ADR 必须考虑混杂因素。我们使用自然语言处理 (NLP) 和知识源开发了一种自动方法,以区分患者疾病导致事件而不是药物导致事件的情况。我们的方法应用于 199920 份住院记录,集中研究两种严重的 ADR:横纹肌溶解症 (n = 687) 和粒细胞缺乏症 (n = 772)。我们的方法自动识别了 75%的病例,即具有病因的病例。敏感性和特异性分别为 93.8%(置信区间:88.9-96.7%)和 91.8%(置信区间:84.0-96.2%)。该方法大大节省了时间:每开发 1 小时,手动审查就可节省至少 20 小时。因此,对剩余 25%的病例的审查变得更加可行,使我们能够确定导致 ADR 的药物。