• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过主题感知神经语言模型在大流行期间识别信息丰富的推文。

Identifying informative tweets during a pandemic via a topic-aware neural language model.

作者信息

Gao Wang, Li Lin, Tao Xiaohui, Zhou Jing, Tao Jun

机构信息

School of Artificial Intelligence, Jianghan University, 430056 Wuhan, China.

School of Computer Science and Artificial Intelligence, Wuhan University of Technology, 430070 Wuhan, China.

出版信息

World Wide Web. 2023;26(1):55-70. doi: 10.1007/s11280-022-01034-1. Epub 2022 Mar 16.

DOI:10.1007/s11280-022-01034-1
PMID:35308294
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8924578/
Abstract

Every epidemic affects the real lives of many people around the world and leads to terrible consequences. Recently, many tweets about the COVID-19 pandemic have been shared publicly on social media platforms. The analysis of these tweets is helpful for emergency response organizations to prioritize their tasks and make better decisions. However, most of these tweets are non-informative, which is a challenge for establishing an automated system to detect useful information in social media. Furthermore, existing methods ignore unlabeled data and topic background knowledge, which can provide additional semantic information. In this paper, we propose a novel Topic-Aware BERT (TABERT) model to solve the above challenges. TABERT first leverages a topic model to extract the latent topics of tweets. Secondly, a flexible framework is used to combine topic information with the output of BERT. Finally, we adopt adversarial training to achieve semi-supervised learning, and a large amount of unlabeled data can be used to improve inner representations of the model. Experimental results on the dataset of COVID-19 English tweets show that our model outperforms classic and state-of-the-art baselines.

摘要

每一次疫情都会影响全球许多人的真实生活,并导致可怕的后果。最近,许多关于新冠疫情的推文在社交媒体平台上被公开分享。对这些推文进行分析有助于应急响应组织确定任务优先级并做出更好的决策。然而,这些推文中大多数都没有实际信息,这对建立一个在社交媒体中检测有用信息的自动化系统来说是一项挑战。此外,现有方法忽略了未标记数据和主题背景知识,而这些可以提供额外的语义信息。在本文中,我们提出了一种新颖的主题感知BERT(TABERT)模型来解决上述挑战。TABERT首先利用主题模型提取推文的潜在主题。其次,使用一个灵活的框架将主题信息与BERT的输出相结合。最后,我们采用对抗训练来实现半监督学习,并且可以使用大量未标记数据来改进模型的内部表示。在新冠疫情英文推文数据集上的实验结果表明,我们的模型优于经典和最新的基线模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/a9ff305a1b83/11280_2022_1034_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/53fbba6cbee9/11280_2022_1034_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/515567e477ff/11280_2022_1034_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/387bb32dd2b1/11280_2022_1034_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/dda127b37498/11280_2022_1034_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/c555b7273843/11280_2022_1034_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/a9ff305a1b83/11280_2022_1034_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/53fbba6cbee9/11280_2022_1034_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/515567e477ff/11280_2022_1034_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/387bb32dd2b1/11280_2022_1034_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/dda127b37498/11280_2022_1034_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/c555b7273843/11280_2022_1034_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4ff/8924578/a9ff305a1b83/11280_2022_1034_Fig6_HTML.jpg

相似文献

1
Identifying informative tweets during a pandemic via a topic-aware neural language model.通过主题感知神经语言模型在大流行期间识别信息丰富的推文。
World Wide Web. 2023;26(1):55-70. doi: 10.1007/s11280-022-01034-1. Epub 2022 Mar 16.
2
COVID-19 outbreak: An ensemble pre-trained deep learning model for detecting informative tweets.新冠疫情:一种用于检测信息丰富推文的集成预训练深度学习模型。
Appl Soft Comput. 2021 Aug;107:107495. doi: 10.1016/j.asoc.2021.107495. Epub 2021 May 21.
3
Novel fuzzy deep learning approach for automated detection of useful COVID-19 tweets.用于自动检测有用 COVID-19 推文的新型模糊深度学习方法。
Artif Intell Med. 2023 Sep;143:102627. doi: 10.1016/j.artmed.2023.102627. Epub 2023 Jul 24.
4
Detection of Hate Speech in COVID-19-Related Tweets in the Arab Region: Deep Learning and Topic Modeling Approach.检测阿拉伯地区与 COVID-19 相关推文的仇恨言论:深度学习和主题建模方法。
J Med Internet Res. 2020 Dec 8;22(12):e22609. doi: 10.2196/22609.
5
Topics, Trends, and Sentiments of Tweets About the COVID-19 Pandemic: Temporal Infoveillance Study.关于新冠疫情的推文主题、趋势和情绪:时间信息监测研究
J Med Internet Res. 2020 Oct 23;22(10):e22624. doi: 10.2196/22624.
6
Identifying COVID-19 english informative tweets using limited labelled data.使用有限的标注数据识别关于新冠疫情的英文信息推文。
Soc Netw Anal Min. 2023;13(1):25. doi: 10.1007/s13278-023-01025-8. Epub 2023 Jan 17.
7
An Analysis of French-Language Tweets About COVID-19 Vaccines: Supervised Learning Approach.关于新冠疫苗的法语推文分析:监督学习方法
JMIR Med Inform. 2022 May 17;10(5):e37831. doi: 10.2196/37831.
8
Emergency Physician Twitter Use in the COVID-19 Pandemic as a Potential Predictor of Impending Surge: Retrospective Observational Study.《COVID-19 大流行期间急诊医师在 Twitter 上的使用情况可能预示着即将出现的疫情高峰:回顾性观察研究》
J Med Internet Res. 2021 Jul 14;23(7):e28615. doi: 10.2196/28615.
9
Top Concerns of Tweeters During the COVID-19 Pandemic: Infoveillance Study.新冠疫情期间推特用户的主要担忧:信息监测研究
J Med Internet Res. 2020 Apr 21;22(4):e19016. doi: 10.2196/19016.
10
A deep multi-view imbalanced learning approach for identifying informative COVID-19 tweets from social media.一种用于从社交媒体中识别有价值的 COVID-19 推文的深度多视图不平衡学习方法。
Comput Biol Med. 2023 Sep;164:107232. doi: 10.1016/j.compbiomed.2023.107232. Epub 2023 Jul 8.

引用本文的文献

1
Use of Large Language Models to Classify Epidemiological Characteristics in Synthetic and Real-World Social Media Posts About Conjunctivitis Outbreaks: Infodemiology Study.利用大语言模型对合成及真实世界社交媒体上有关结膜炎爆发的帖子中的流行病学特征进行分类:信息流行病学研究
J Med Internet Res. 2025 Jul 2;27:e65226. doi: 10.2196/65226.
2
Use of Large Language Models to Assess the Likelihood of Epidemics From the Content of Tweets: Infodemiology Study.利用大型语言模型从推文内容评估传染病爆发的可能性:信息流行病学研究。
J Med Internet Res. 2024 Mar 1;26:e49139. doi: 10.2196/49139.

本文引用的文献

1
NeedFull - a Tweet Analysis Platform to Study Human Needs During the COVID-19 Pandemic in New York State.NeedFull——一个用于研究纽约州新冠疫情期间人类需求的推文分析平台。
IEEE Access. 2020 Jul 22;8:136046-136055. doi: 10.1109/ACCESS.2020.3011123. eCollection 2020.
2
An exploratory study of COVID-19 misinformation on Twitter.关于推特上新冠疫情错误信息的探索性研究。
Online Soc Netw Media. 2021 Mar;22:100104. doi: 10.1016/j.osnem.2020.100104. Epub 2021 Feb 19.
3
Automated epilepsy detection techniques from electroencephalogram signals: a review study.
基于脑电图信号的自动癫痫检测技术:一项综述研究
Health Inf Sci Syst. 2020 Oct 12;8(1):33. doi: 10.1007/s13755-020-00129-1. eCollection 2020 Dec.
4
Automated detection of mild and multi-class diabetic eye diseases using deep learning.使用深度学习自动检测轻度和多类糖尿病眼病。
Health Inf Sci Syst. 2020 Oct 8;8(1):32. doi: 10.1007/s13755-020-00125-5. eCollection 2020 Dec.
5
Using online social networks to track a pandemic: A systematic review.利用在线社交网络追踪大流行病:一项系统综述。
J Biomed Inform. 2016 Aug;62:1-11. doi: 10.1016/j.jbi.2016.05.005. Epub 2016 May 17.