Bae Ji-Hwan, Baek Yeon-Hee, Lee Jeong-Eun, Song Inmyung, Lee Jee-Hyong, Shin Ju-Young
School of Pharmacy, Sungkyunkwan University, Suwon-si, South Korea.
Department of Health Administration, College of Nursing and Health, Kongju National University, Gongju-si, South Korea.
Front Pharmacol. 2021 Jan 14;11:602365. doi: 10.3389/fphar.2020.602365. eCollection 2020.
Various methods have been implemented to detect adverse drug reaction (ADR) signals. However, the applicability of machine learning methods has not yet been fully evaluated. To evaluate the feasibility of machine learning algorithms in detecting ADR signals of nivolumab and docetaxel, new and old anticancer agents. We conducted a safety surveillance study of nivolumab and docetaxel using the Korea national spontaneous reporting database from 2009 to 2018. We constructed a novel input dataset for each study drug comprised of known ADRs that were listed in the drug labels and unknown ADRs. Given the known ADRs, we trained machine learning algorithms and evaluated predictive performance in generating safety signals of machine learning algorithms (gradient boosting machine [GBM] and random forest [RF]) compared with traditional disproportionality analysis methods (reporting odds ratio [ROR] and information component [IC]) by using the area under the curve (AUC). Each method then was implemented to detect new safety signals from the unknown ADR datasets. Of all methods implemented, GBM achieved the best average predictive performance (AUC: 0.97 and 0.93 for nivolumab and docetaxel). The AUC achieved by each method was 0.95 and 0.92 (RF), 0.55 and 0.51 (ROR), and 0.49 and 0.48 (IC) for respective drug. GBM detected additional 24 and nine signals for nivolumab and 82 and 76 for docetaxel compared to ROR and IC, respectively, from the unknown ADR datasets. Machine learning algorithm based on GBM performed better and detected more new ADR signals than traditional disproportionality analysis methods.
已经实施了各种方法来检测药物不良反应(ADR)信号。然而,机器学习方法的适用性尚未得到充分评估。为了评估机器学习算法在检测纳武单抗和多西他赛(新旧抗癌药物)的ADR信号方面的可行性,我们使用2009年至2018年韩国国家自发报告数据库对纳武单抗和多西他赛进行了安全性监测研究。我们为每种研究药物构建了一个新颖的输入数据集,该数据集由药物标签中列出的已知ADR和未知ADR组成。鉴于已知的ADR,我们训练了机器学习算法,并通过使用曲线下面积(AUC)评估了机器学习算法(梯度提升机[GBM]和随机森林[RF])与传统的不成比例分析方法(报告比值比[ROR]和信息成分[IC])在生成安全信号方面的预测性能。然后,每种方法都被用于从未知ADR数据集中检测新的安全信号。在所有实施的方法中,GBM实现了最佳的平均预测性能(纳武单抗和多西他赛的AUC分别为0.97和0.93)。每种方法对于相应药物的AUC分别为0.95和0.92(RF)、0.55和0.51(ROR)以及0.49和0.48(IC)。与ROR和IC相比,GBM分别从未知ADR数据集中为纳武单抗检测到另外24个和9个信号,为多西他赛检测到82个和76个信号。基于GBM的机器学习算法比传统的不成比例分析方法表现更好,并且检测到更多新的ADR信号。