利用深度学习识别促进或抑制反疫苗和支持疫苗内容在社交媒体上传播的语言特征。

Using Deep Learning to Identify Linguistic Features that Facilitate or Inhibit the Propagation of Anti- and Pro-Vaccine Content on Social Media.

作者信息

Argyris Young Anna, Zhang Nan, Bashyal Bidhan, Tan Pang-Ning

机构信息

Dept of Media and Information, Michigan State University, East Lansing, MI.

Dept of Advertising and Public Relations, Michigan State University, East Lansing, MI.

出版信息

2022 IEEE Int Conf Digit Health IEEE IDCH 2022 (2022). 2022 Jul;2022:107-116. doi: 10.1109/icdh55609.2022.00025. Epub 2022 Aug 24.

DOI:10.1109/icdh55609.2022.00025

PMID:37975063

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10652839/

Abstract

Anti-vaccine content is rapidly propagated via social media, fostering vaccine hesitancy, while pro-vaccine content has not replicated the opponent's successes. Despite this disparity in the dissemination of anti- and pro-vaccine posts, linguistic features that facilitate or inhibit the propagation of vaccine-related content remain less known. Moreover, most prior machine-learning algorithms classified social-media posts into binary categories (e.g., misinformation or not) and have rarely tackled a higher-order classification task based on divergent perspectives about vaccines (e.g., anti-vaccine, pro-vaccine, and neutral). Our objectives are (1) to identify sets of linguistic features that facilitate and inhibit the propagation of vaccine-related content and (2) to compare whether anti-vaccine, provaccine, and neutral tweets contain either set more frequently than the others. To achieve these goals, we collected a large set of social media posts (over 120 million tweets) between Nov. 15 and Dec. 15, 2021, coinciding with the Omicron variant surge. A two-stage framework was developed using a fine-tuned BERT classifier, demonstrating over 99 and 80 percent accuracy for binary and ternary classification. Finally, the Linguistic Inquiry Word Count text analysis tool was used to count linguistic features in each classified tweet. Our regression results show that anti-vaccine tweets are propagated (i.e., retweeted), while pro-vaccine tweets garner passive endorsements (i.e., favorited). Our results also yielded the two sets of linguistic features as facilitators and inhibitors of the propagation of vaccine-related tweets. Finally, our regression results show that anti-vaccine tweets tend to use the facilitators, while pro-vaccine counterparts employ the inhibitors. These findings and algorithms from this study will aid public health officials' efforts to counteract vaccine misinformation, thereby facilitating the delivery of preventive measures during pandemics and epidemics.

摘要

反疫苗内容通过社交媒体迅速传播，加剧了人们对疫苗的犹豫态度，而支持疫苗的内容却未能取得与反对者同样的传播成效。尽管反疫苗和支持疫苗的帖子在传播方面存在这种差异，但促进或抑制疫苗相关内容传播的语言特征仍鲜为人知。此外，大多数先前的机器学习算法将社交媒体帖子分为二元类别（例如，错误信息或非错误信息），很少处理基于对疫苗的不同观点（例如，反疫苗、支持疫苗和中立）的高阶分类任务。我们的目标是：（1）识别促进和抑制疫苗相关内容传播的语言特征集；（2）比较反疫苗、支持疫苗和中立的推文是否比其他推文更频繁地包含其中任何一组特征。为实现这些目标，我们收集了2021年11月15日至12月15日期间大量的社交媒体帖子（超过1.2亿条推文），这一时期恰逢奥密克戎变种激增。我们使用微调后的BERT分类器开发了一个两阶段框架，二元和三元分类的准确率分别超过99%和80%。最后，使用语言查询词频文本分析工具对每条分类后的推文的语言特征进行计数。我们的回归结果表明，反疫苗推文会被传播（即被转发），而支持疫苗的推文获得的是被动认可（即被点赞）。我们的结果还得出了两组作为疫苗相关推文传播促进因素和抑制因素的语言特征。最后，我们的回归结果表明，反疫苗推文倾向于使用促进因素，而支持疫苗的推文则使用抑制因素。本研究的这些发现和算法将有助于公共卫生官员努力对抗疫苗错误信息，从而在大流行和疫情期间促进预防措施的实施。

相似文献

Using Deep Learning to Identify Linguistic Features that Facilitate or Inhibit the Propagation of Anti- and Pro-Vaccine Content on Social Media.

2022 IEEE Int Conf Digit Health IEEE IDCH 2022 (2022). 2022 Jul;2022:107-116. doi: 10.1109/icdh55609.2022.00025. Epub 2022 Aug 24.

Using Machine Learning to Compare Provaccine and Antivaccine Discourse Among the Public on Social Media: Algorithm Development Study.

JMIR Public Health Surveill. 2021 Jun 24;7(6):e23105. doi: 10.2196/23105.

Public Officials' Engagement on Social Media During the Rollout of the COVID-19 Vaccine: Content Analysis of Tweets.

JMIR Infodemiology. 2023 Jul 20;3:e41582. doi: 10.2196/41582.

Vaccine sentiment analysis using BERT + NBSVM and geo-spatial approaches.

J Supercomput. 2023 May 7:1-31. doi: 10.1007/s11227-023-05319-8.

Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach.

J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705.

Emotions and Incivility in Vaccine Mandate Discourse: Natural Language Processing Insights.

JMIR Infodemiology. 2022 Sep 13;2(2):e37635. doi: 10.2196/37635. eCollection 2022 Jul-Dec.

"Thought I'd Share First" and Other Conspiracy Theory Tweets from the COVID-19 Infodemic: Exploratory Study.

JMIR Public Health Surveill. 2021 Apr 14;7(4):e26527. doi: 10.2196/26527.

An Analysis of French-Language Tweets About COVID-19 Vaccines: Supervised Learning Approach.

JMIR Med Inform. 2022 May 17;10(5):e37831. doi: 10.2196/37831.

COVID-19 Vaccine Hesitancy on Social Media: Building a Public Twitter Data Set of Antivaccine Content, Vaccine Misinformation, and Conspiracies.

JMIR Public Health Surveill. 2021 Nov 17;7(11):e30642. doi: 10.2196/30642.

ANTi-Vax: a novel Twitter dataset for COVID-19 vaccine misinformation detection.

Public Health. 2022 Feb;203:23-30. doi: 10.1016/j.puhe.2021.11.022. Epub 2021 Dec 7.

引用本文的文献

Identifying Misinformation About Unproven Cancer Treatments on Social Media Using User-Friendly Linguistic Characteristics: Content Analysis.

JMIR Infodemiology. 2025 Feb 12;5:e62703. doi: 10.2196/62703.

When Infodemic Meets Epidemic: Systematic Literature Review.

JMIR Public Health Surveill. 2025 Feb 3;11:e55642. doi: 10.2196/55642.

Vaccine rhetoric on social media and COVID-19 vaccine uptake rates: A triangulation using self-reported vaccine acceptance.

Soc Sci Med. 2024 May;348:116775. doi: 10.1016/j.socscimed.2024.116775. Epub 2024 Mar 15.

本文引用的文献

Characterizing Discourse about COVID-19 Vaccines: A Reddit Version of the Pandemic Story.

Health Data Sci. 2021 Aug 27;2021:9837856. doi: 10.34133/2021/9837856. eCollection 2021.

ANTi-Vax: a novel Twitter dataset for COVID-19 vaccine misinformation detection.

Public Health. 2022 Feb;203:23-30. doi: 10.1016/j.puhe.2021.11.022. Epub 2021 Dec 7.

CoAID-DEEP: An Optimized Intelligent Framework for Automated Detecting COVID-19 Misleading Information on Twitter.

IEEE Access. 2021 Feb 9;9:27840-27867. doi: 10.1109/ACCESS.2021.3058066. eCollection 2021.

COVID-19 Vaccine Tweets After Vaccine Rollout: Sentiment-Based Topic Modeling.

J Med Internet Res. 2022 Feb 8;24(2):e31726. doi: 10.2196/31726.

Using Machine Learning to Compare Provaccine and Antivaccine Discourse Among the Public on Social Media: Algorithm Development Study.

JMIR Public Health Surveill. 2021 Jun 24;7(6):e23105. doi: 10.2196/23105.

The mediating role of vaccine hesitancy between maternal engagement with anti- and pro-vaccine social media posts and adolescent HPV-vaccine uptake rates in the US: The perspective of loss aversion in emotion-laden decision circumstances.

Soc Sci Med. 2021 Aug;282:114043. doi: 10.1016/j.socscimed.2021.114043. Epub 2021 May 17.

An analysis of COVID-19 vaccine sentiments and opinions on Twitter.

Int J Infect Dis. 2021 Jul;108:256-262. doi: 10.1016/j.ijid.2021.05.059. Epub 2021 May 27.

Integrating Multimodal Information in Large Pretrained Transformers.

Proc Conf Assoc Comput Linguist Meet. 2020 Jul;2020:2359-2369. doi: 10.18653/v1/2020.acl-main.214.

The online competition between pro- and anti-vaccination views.

Nature. 2020 Jun;582(7811):230-233. doi: 10.1038/s41586-020-2281-1. Epub 2020 May 13.

A systematic literature review to examine the potential for social media to impact HPV vaccine uptake and awareness, knowledge, and attitudes about HPV and HPV vaccination.

Hum Vaccin Immunother. 2019;15(7-8):1465-1475. doi: 10.1080/21645515.2019.1581543. Epub 2019 Apr 11.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用深度学习识别促进或抑制反疫苗和支持疫苗内容在社交媒体上传播的语言特征。

Using Deep Learning to Identify Linguistic Features that Facilitate or Inhibit the Propagation of Anti- and Pro-Vaccine Content on Social Media.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献