Xu Rong, Wang Quanqiu
Medical Informatics Program, Center for Clinical Investigation, Case Western Reserve University, United States.
ThinTek, LLC, Palo Alto, CA, United States.
J Biomed Inform. 2014 Feb;47:171-7. doi: 10.1016/j.jbi.2013.10.008. Epub 2013 Oct 28.
Targeted drugs dramatically improve the treatment outcomes in cancer patients; however, these innovative drugs are often associated with unexpectedly high cardiovascular toxicity. Currently, cardiovascular safety represents both a challenging issue for drug developers, regulators, researchers, and clinicians and a concern for patients. While FDA drug labels have captured many of these events, spontaneous reporting systems are a main source for post-marketing drug safety surveillance in 'real-world' (outside of clinical trials) cancer patients. In this study, we present approaches to extracting, prioritizing, filtering, and confirming cardiovascular events associated with targeted cancer drugs from the FDA Adverse Event Reporting System (FAERS).
The dataset includes records of 4,285,097 patients from FAERS. We first extracted drug-cardiovascular event (drug-CV) pairs from FAERS through named entity recognition and mapping processes. We then compared six ranking algorithms in prioritizing true positive signals among extracted pairs using known drug-CV pairs derived from FDA drug labels. We also developed three filtering algorithms to further improve precision. Finally, we manually validated extracted drug-CV pairs using 21 million published MEDLINE records.
We extracted a total of 11,173 drug-CV pairs from FAERS. We showed that ranking by frequency is significantly more effective than by the five standard signal detection methods (246% improvement in precision for top-ranked pairs). The filtering algorithm we developed further improved overall precision by 91.3%. By manual curation using literature evidence, we show that about 51.9% of the 617 drug-CV pairs that appeared in both FAERS and MEDLINE sentences are true positives. In addition, 80.6% of these positive pairs have not been captured by FDA drug labeling.
The unique drug-CV association dataset that we created based on FAERS could facilitate our understanding and prediction of cardiotoxic events associated with targeted cancer drugs.
靶向药物显著改善了癌症患者的治疗效果;然而,这些创新药物往往伴有出人意料的高心血管毒性。目前,心血管安全性对药物开发者、监管机构、研究人员和临床医生而言都是一个具有挑战性的问题,同时也是患者所关心的问题。虽然美国食品药品监督管理局(FDA)的药品标签已涵盖了许多此类事件,但自发报告系统是“真实世界”(临床试验之外)癌症患者上市后药物安全性监测的主要来源。在本研究中,我们展示了从FDA不良事件报告系统(FAERS)中提取、排序、筛选和确认与靶向抗癌药物相关的心血管事件的方法。
数据集包括来自FAERS的4,285,097名患者的记录。我们首先通过命名实体识别和映射过程从FAERS中提取药物-心血管事件(药物-CV)对。然后,我们使用从FDA药品标签中获得的已知药物-CV对,比较了六种排序算法,以便在提取的药物-CV对中对真阳性信号进行排序。我们还开发了三种筛选算法以进一步提高精度。最后,我们使用2100万条已发表的医学文献数据库(MEDLINE)记录对手动提取的药物-CV对进行了验证。
我们从FAERS中总共提取了11,173对药物-CV对。我们发现,按频率排序比按五种标准信号检测方法排序显著更有效(排名靠前的药物-CV对的精度提高了246%)。我们开发的筛选算法进一步将总体精度提高了91.3%。通过使用文献证据进行人工筛选,我们发现FAERS和MEDLINE句子中均出现的617对药物-CV对中,约51.9%为真阳性。此外,这些阳性对中有80.6%未被FDA药品标签涵盖。
我们基于FAERS创建的独特药物-CV关联数据集有助于我们理解和预测与靶向抗癌药物相关的心脏毒性事件。