Jang Min-Gyo, Cha SangHun, Kim Seunghwak, Lee Sojung, Lee Kyeong Eun, Shin Kwang-Hee
College of Pharmacy, Research Institute of Pharmaceutical Sciences, Kyungpook National University, Daegu, Republic of Korea.
Department of Statistics, College of Natural Sciences, Kyungpook National University, Daegu, Republic of Korea.
Expert Opin Drug Saf. 2023 Jul-Dec;22(7):629-636. doi: 10.1080/14740338.2023.2181341. Epub 2023 Feb 23.
Safety issues for fluoroquinolones have been provided by regulatory agencies. This study was conducted to identify signals of fluoroquinolones reported in the Korea Adverse Event Reporting System (KAERS) using tree-based machine learning (ML) methods.
All adverse events (AEs) associated with the target drugs reported in the KAERS from 2013 to 2017 were matched with drug label information. A dataset containing label-positive and -negative AEs was arbitrarily divided into training and test sets. Decision tree, random forest (RF), bagging, and gradient boosting machine (GBM) were fitted on the training set with hyperparameters tuned using five-fold cross-validation and applied to the test set. The ML method with the highest area under the curve (AUC) scores was selected as the final ML model.
Bagging was selected as the final ML model for gemifloxacin (AUC score: 1) and levofloxacin (AUC: 0.9987). RF was selected in ciprofloxacin, moxifloxacin, and ofloxacin (AUC scores: 0.9859, 0.9974, and 0.9999 respectively). We found that the final ML methods detected additional signals that were not detected using the disproportionality analysis (DPA) methods.
The bagging-or-RF-based ML methods performed better than DPA and detected novel AE signals previously unidentified using the DPA methods.
监管机构已提供了氟喹诺酮类药物的安全性问题。本研究旨在使用基于树的机器学习(ML)方法识别韩国不良事件报告系统(KAERS)中报告的氟喹诺酮类药物的信号。
将2013年至2017年KAERS中报告的与目标药物相关的所有不良事件(AE)与药物标签信息进行匹配。一个包含标签阳性和阴性AE的数据集被任意分为训练集和测试集。决策树、随机森林(RF)、装袋法和梯度提升机(GBM)在训练集上进行拟合,使用五折交叉验证调整超参数,并应用于测试集。选择曲线下面积(AUC)得分最高的ML方法作为最终的ML模型。
装袋法被选为吉米沙星(AUC得分:1)和左氧氟沙星(AUC:0.9987)的最终ML模型。环丙沙星、莫西沙星和氧氟沙星选择了RF(AUC得分分别为:0.9859、0.9974和0.9999)。我们发现最终的ML方法检测到了使用不成比例分析(DPA)方法未检测到的其他信号。
基于装袋法或RF的ML方法比DPA表现更好,并检测到了DPA方法以前未识别的新的AE信号。