利用多模态深度学习检测社交媒体中的医疗错误信息。

Detecting Medical Misinformation on Social Media Using Multimodal Deep Learning.

出版信息

IEEE J Biomed Health Inform. 2021 Jun;25(6):2193-2203. doi: 10.1109/JBHI.2020.3037027. Epub 2021 Jun 3.

DOI:10.1109/JBHI.2020.3037027

Abstract

In 2019, outbreaks of vaccine-preventable diseases reached the highest number in the US since 1992. Medical misinformation, such as antivaccine content propagating through social media, is associated with increases in vaccine delay and refusal. Our overall goal is to develop an automatic detector for antivaccine messages to counteract the negative impact that antivaccine messages have on the public health. Very few extant detection systems have considered multimodality of social media posts (images, texts, and hashtags), and instead focus on textual components, despite the rapid growth of photo-sharing applications (e.g., Instagram). As a result, existing systems are not sufficient for detecting antivaccine messages with heavy visual components (e.g., images) posted on these newer platforms. To solve this problem, we propose a deep learning network that leverages both visual and textual information. A new semantic- and task-level attention mechanism was created to help our model to focus on the essential contents of a post that signal antivaccine messages. The proposed model, which consists of three branches, can generate comprehensive fused features for predictions. Moreover, an ensemble method is proposed to further improve the final prediction accuracy. To evaluate the proposed model's performance, a real-world social media dataset that consists of more than 30,000 samples was collected from Instagram between January 2016 and October 2019. Our 30 experiment results demonstrate that the final network achieves above 97% testing accuracy and outperforms other relevant models, demonstrating that it can detect a large amount of antivaccine messages posted daily. The implementation code is available at https://github.com/wzhings/antivaccine_detection.

摘要

2019 年，美国疫苗可预防疾病的爆发数量达到了 1992 年以来的最高水平。医疗错误信息，如通过社交媒体传播的反疫苗内容，与疫苗接种延迟和拒绝率的上升有关。我们的总体目标是开发一种自动反疫苗信息探测器，以抵消反疫苗信息对公众健康的负面影响。很少有现有的检测系统考虑了社交媒体帖子的多模态性（图像、文本和标签），而只关注文本成分，尽管照片分享应用程序（如 Instagram）迅速发展。因此，现有的系统不足以检测在这些更新的平台上发布的具有大量视觉成分（如图像）的反疫苗信息。为了解决这个问题，我们提出了一个利用视觉和文本信息的深度学习网络。创建了一个新的语义和任务级注意力机制，以帮助我们的模型专注于帖子中表示反疫苗信息的关键内容。所提出的模型由三个分支组成，可以生成用于预测的综合融合特征。此外，还提出了一种集成方法来进一步提高最终的预测准确性。为了评估所提出模型的性能，从 2016 年 1 月到 2019 年 10 月，我们从 Instagram 上收集了一个包含 30000 多个样本的真实社交媒体数据集。我们的 30 个实验结果表明，最终网络的测试准确率超过 97%，优于其他相关模型，表明它可以检测到每天发布的大量反疫苗信息。实现代码可在 https://github.com/wzhings/antivaccine_detection 上获得。