Suppr超能文献

提高社交媒体中药物不良反应抽取的稳健性:基于否定词和推测词的案例研究。

Increasing adverse drug events extraction robustness on social media: Case study on negation and speculation.

机构信息

Department of Mathematics, Computer Science and Physics, University of Udine, Udine 33100, Italy.

Università degli Studi di Napoli Federico II, Napoli 80138, Italy.

出版信息

Exp Biol Med (Maywood). 2022 Nov;247(22):2003-2014. doi: 10.1177/15353702221128577. Epub 2022 Oct 31.

Abstract

In the last decade, an increasing number of users have started reporting adverse drug events (ADEs) on social media platforms, blogs, and health forums. Given the large volume of reports, pharmacovigilance has focused on ways to use natural language processing (NLP) techniques to rapidly examine these large collections of text, detecting mentions of drug-related adverse reactions to trigger medical investigations. However, despite the growing interest in the task and the advances in NLP, the robustness of these models in face of linguistic phenomena such as negations and speculations is an open research question. Negations and speculations are pervasive phenomena in natural language and can severely hamper the ability of an automated system to discriminate between factual and non-factual statements in text. In this article, we take into consideration four state-of-the-art systems for ADE detection on social media texts. We introduce SNAX, a benchmark to test their performance against samples containing negated and speculated ADEs, showing their fragility against these phenomena. We then introduce two possible strategies to increase the robustness of these models, showing that both of them bring significant increases in performance, lowering the number of spurious entities predicted by the models by 60% for negation and 80% for speculations.

摘要

在过去的十年中,越来越多的用户开始在社交媒体平台、博客和健康论坛上报告药物不良事件 (ADE)。鉴于报告数量庞大,药物警戒已专注于使用自然语言处理 (NLP) 技术快速检查这些大量文本的方法,以检测与药物相关的不良反应的提及,从而触发医学调查。然而,尽管人们对这项任务越来越感兴趣,并且在 NLP 方面取得了进展,但这些模型在面对否定和推测等语言现象时的稳健性仍然是一个悬而未决的研究问题。否定和推测是自然语言中普遍存在的现象,它们会严重阻碍自动系统在文本中区分事实和非事实陈述的能力。在本文中,我们考虑了四个用于社交媒体文本中 ADE 检测的最先进系统。我们引入了 SNAX,这是一个基准测试,用于测试它们对包含否定和推测 ADE 的样本的性能,展示了它们对这些现象的脆弱性。然后,我们介绍了两种可能的策略来提高这些模型的稳健性,结果表明这两种策略都显著提高了性能,将模型预测的虚假实体数量分别降低了 60%(用于否定)和 80%(用于推测)。

相似文献

5
Comparison of text processing methods in social media-based signal detection.社交媒体信号检测中文本处理方法的比较。
Pharmacoepidemiol Drug Saf. 2019 Oct;28(10):1309-1317. doi: 10.1002/pds.4857. Epub 2019 Aug 7.

本文引用的文献

4
Cadec: A corpus of adverse drug event annotations.Cadec:一个药物不良事件注释语料库。
J Biomed Inform. 2015 Jun;55:73-81. doi: 10.1016/j.jbi.2015.03.010. Epub 2015 Mar 27.
6
Utilizing social media data for pharmacovigilance: A review.利用社交媒体数据进行药物警戒:综述
J Biomed Inform. 2015 Apr;54:202-12. doi: 10.1016/j.jbi.2015.02.004. Epub 2015 Feb 23.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验