Wang Xiaoyan, Hripcsak George, Markatou Marianthi, Friedman Carol
Department of Biomedical Informatics, Columbia University, 622 West 168 Street, VC5, New York, NY 10032, USA.
J Am Med Inform Assoc. 2009 May-Jun;16(3):328-37. doi: 10.1197/jamia.M3028. Epub 2009 Mar 4.
OBJECTIVE It is vital to detect the full safety profile of a drug throughout its market life. Current pharmacovigilance systems still have substantial limitations, however. The objective of our work is to demonstrate the feasibility of using natural language processing (NLP), the comprehensive Electronic Health Record (EHR), and association statistics for pharmacovigilance purposes. DESIGN Narrative discharge summaries were collected from the Clinical Information System at New York Presbyterian Hospital (NYPH). MedLEE, an NLP system, was applied to the collection to identify medication events and entities which could be potential adverse drug events (ADEs). Co-occurrence statistics with adjusted volume tests were used to detect associations between the two types of entities, to calculate the strengths of the associations, and to determine their cutoff thresholds. Seven drugs/drug classes (ibuprofen, morphine, warfarin, bupropion, paroxetine, rosiglitazone, ACE inhibitors) with known ADEs were selected to evaluate the system. RESULTS One hundred thirty-two potential ADEs were found to be associated with the 7 drugs. Overall recall and precision were 0.75 and 0.31 for known ADEs respectively. Importantly, qualitative evaluation using historic roll back design suggested that novel ADEs could be detected using our system. CONCLUSIONS This study provides a framework for the development of active, high-throughput and prospective systems which could potentially unveil drug safety profiles throughout their entire market life. Our results demonstrate that the framework is feasible although there are some challenging issues. To the best of our knowledge, this is the first study using comprehensive unstructured data from the EHR for pharmacovigilance.
目的 在药物整个市场生命周期内检测其完整的安全性概况至关重要。然而,当前的药物警戒系统仍存在重大局限性。我们这项工作的目的是证明使用自然语言处理(NLP)、全面的电子健康记录(EHR)以及关联统计进行药物警戒的可行性。
设计 从纽约长老会医院(NYPH)的临床信息系统收集叙述性出院小结。将NLP系统MedLEE应用于该收集内容,以识别可能是潜在药物不良事件(ADE)的用药事件和实体。使用带有调整后数量测试的共现统计来检测这两种实体之间的关联,计算关联强度,并确定其截断阈值。选择七种已知有ADE的药物/药物类别(布洛芬、吗啡、华法林、安非他酮、帕罗西汀、罗格列酮、血管紧张素转换酶抑制剂)来评估该系统。
结果 发现132个潜在ADE与这7种药物相关。已知ADE的总体召回率和精确率分别为0.75和0.31。重要的是,使用历史回溯设计的定性评估表明,使用我们的系统可以检测到新的ADE。
结论 本研究为开发主动、高通量和前瞻性系统提供了一个框架,该系统有可能在药物整个市场生命周期内揭示其安全性概况。我们的结果表明该框架是可行的,尽管存在一些具有挑战性的问题。据我们所知,这是第一项使用来自EHR的全面非结构化数据进行药物警戒的研究。