Evaluation of four machine learning models for signal detection.

Author information

Dauner Daniel G, Leal Eleazar, Adam Terrence J, Zhang Rui, Farley Joel F

Affiliations

Department of Pharmaceutical Care and Health Systems, College of Pharmacy, University of Minnesota Duluth, 232 Life Science, 1110 Kirby Drive, Duluth, MN 55812, USA.

Department of Computer Science, Swenson College of Science and Engineering, University of Minnesota Duluth, Duluth, MN, USA.

Publication information

Ther Adv Drug Saf. 2023 Dec 25;14:20420986231219472. doi: 10.1177/20420986231219472. eCollection 2023.

DOI:10.1177/20420986231219472

PMID:38157242

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10752114/

Abstract

BACKGROUND

Logistic regression-based signal detection algorithms have benefits over disproportionality analysis due to their ability to handle potential confounders and masking factors. Feature exploration and developing alternative machine learning algorithms can further strengthen signal detection.
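As a point of reference for the disproportionality analysis mentioned above, here is a minimal sketch of one widely used measure, the proportional reporting ratio (PRR), computed from a 2x2 table of report counts. The abstract does not say which disproportionality measure the authors used, so the choice of PRR, the function name and the counts in the usage note are illustrative assumptions only.

```python
from math import exp, log, sqrt


def proportional_reporting_ratio(a, b, c, d):
    """Return (PRR, lower, upper) with an approximate 95% confidence interval.

    Hypothetical 2x2 layout of spontaneous-report counts:
      a: reports with the drug of interest and the event of interest
      b: reports with the drug of interest and any other event
      c: reports with any other drug and the event of interest
      d: reports with any other drug and any other event
    """
    if min(a, b, c, d) < 0:
        raise ValueError("counts must be non-negative")
    if a == 0 or c == 0:
        raise ValueError("PRR is undefined when a or c is zero")

    # Proportion of the drug's reports that mention the event, divided by
    # the same proportion among all other drugs.
    prr = (a / (a + b)) / (c / (c + d))

    # Standard error of ln(PRR), then a normal-approximation 95% interval.
    se = sqrt(1 / a - 1 / (a + b) + 1 / c - 1 / (c + d))
    lower = exp(log(prr) - 1.96 * se)
    upper = exp(log(prr) + 1.96 * se)
    return prr, lower, upper
```

For example, `proportional_reporting_ratio(20, 80, 100, 9800)` uses made-up counts in which 20% of the drug's reports mention the event versus about 1% for all other drugs. A measure like this looks only at the 2x2 counts, which is why it cannot adjust for confounders or masking in the way a regression model can.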

OBJECTIVES

Our objective was to compare the signal detection performance of logistic regression, gradient-boosted trees, random forest and support vector machine models utilizing Food and Drug Administration adverse event reporting system data.

DESIGN

Cross-sectional study.

METHODS

Quarterly data extract files covering 1 October 2017 through 31 December 2020 were downloaded. Because the outcome was imbalanced, two training sets were used: one stratified on the outcome variable and another built with the Synthetic Minority Oversampling Technique (SMOTE). A crude model and a model with tuned hyperparameters were developed for each algorithm. Model performance was compared against a reference set using accuracy, precision, recall, F1 score, the area under the receiver operating characteristic curve (ROCAUC) and the area under the precision-recall curve (PRCAUC).
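The evaluation metrics listed in the methods can be illustrated with a small, dependency-free sketch. It is not the authors' code: the function names, the 0.5 decision threshold and the toy labels in the usage note are assumptions made for illustration. ROCAUC is computed here through its rank interpretation, the probability that a randomly chosen positive case scores higher than a randomly chosen negative one; PRCAUC is omitted for brevity.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1 for binary labels (1 = signal)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


def roc_auc(y_true, scores):
    """ROC AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy reference set and model scores (made-up values, not study data).
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.7, 0.2, 0.55]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
```

On this toy data, accuracy is 0.75, precision, recall and F1 are each 0.8, and the ROC AUC is 13/15. A ROC AUC of 0.5 corresponds to ranking positives and negatives no better than chance, which is the baseline against which ROCAUC values are usually read.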

RESULTS

Models trained on the balanced training set had higher accuracy, F1 score and recall compared to models trained on the SMOTE training set. When using the balanced training set, logistic regression, gradient-boosted trees, random forest and support vector machine models obtained similar performance evaluation metrics. The gradient-boosted trees hyperparameter tuned model had the highest ROCAUC (0.646) and the random forest crude model had the highest PRCAUC (0.839) when using the balanced training set.

CONCLUSION

All models trained on the balanced training set performed similarly. Logistic regression models had higher accuracy, precision and recall. Logistic regression, random forest and gradient-boosted trees hyperparameter tuned models had a PRCAUC ⩾ 0.8. All models had an ROCAUC ⩾ 0.5. Including both disproportionality analysis results and additional case report information in models resulted in higher performance evaluation metrics than disproportionality analysis alone.
