DBEI, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
Kazan Federal University, Kazan, Russia.
J Am Med Inform Assoc. 2021 Sep 18;28(10):2184-2192. doi: 10.1093/jamia/ocab114.
Research on pharmacovigilance from social media data has focused on mining adverse drug events (ADEs) using annotated datasets, with publications generally focusing on 1 of 3 tasks: ADE classification, named entity recognition for identifying the span of ADE mentions, and ADE mention normalization to standardized terminologies. While the common goal of such systems is to detect ADE signals that can be used to inform public policy, it has been impeded largely by limited end-to-end solutions for large-scale analysis of social media reports for different drugs.
We present a dataset for training and evaluation of ADE pipelines where the ADE distribution is closer to the average 'natural balance' with ADEs present in about 7% of the tweets. The deep learning architecture involves an ADE extraction pipeline with individual components for all 3 tasks.
The system presented achieved state-of-the-art performance on comparable datasets and scored a classification performance of F1 = 0.63, span extraction performance of F1 = 0.44 and an end-to-end entity resolution performance of F1 = 0.34 on the presented dataset.
The performance of the models continues to highlight multiple challenges when deploying pharmacovigilance systems that use social media data. We discuss the implications of such models in the downstream tasks of signal detection and suggest future enhancements.
Mining ADEs from Twitter posts using a pipeline architecture requires the different components to be trained and tuned based on input data imbalance in order to ensure optimal performance on the end-to-end resolution task.
从社交媒体数据中进行药物警戒研究主要集中在使用注释数据集挖掘不良药物事件(ADE),出版物通常侧重于以下 3 个任务中的 1 个:ADE 分类、识别 ADE 提及范围的命名实体识别,以及 ADE 提及到标准化术语的归一化。虽然这些系统的共同目标是检测可以用于告知公共政策的 ADE 信号,但由于缺乏针对不同药物的社交媒体报告进行大规模分析的端到端解决方案,这一目标在很大程度上受到了阻碍。
我们提出了一个用于训练和评估 ADE 管道的数据集,其中 ADE 分布更接近平均“自然平衡”,大约 7%的推文中存在 ADE。深度学习架构涉及一个 ADE 提取管道,其中包含所有 3 个任务的单独组件。
所提出的系统在可比数据集上实现了最先进的性能,在提出的数据集上,分类性能 F1 = 0.63,跨度提取性能 F1 = 0.44,端到端实体解析性能 F1 = 0.34。
模型的性能继续突出了在使用社交媒体数据部署药物警戒系统时面临的多个挑战。我们讨论了此类模型在信号检测的下游任务中的影响,并提出了未来的改进建议。
使用管道架构从 Twitter 帖子中挖掘 ADE 需要根据输入数据不平衡情况对不同组件进行训练和调优,以确保在端到端解析任务上达到最佳性能。