文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

对主题模型用于短文本社交媒体分析的系统综述。

A systematic review of the use of topic models for short text social media analysis.

作者信息

Laureate Caitlin Doogan Poet, Buntine Wray, Linger Henry

机构信息

Faculty of IT, Monash University, Wellington Rd, Clayton, VIC 3800 Australia.

College of Engineering and Computer Science, VinUniversity, Vinhomes Ocean Park, Gia Lam District, Hanoi 10000 Vietnam.

出版信息

Artif Intell Rev. 2023 May 1:1-33. doi: 10.1007/s10462-023-10471-x.


DOI:10.1007/s10462-023-10471-x
PMID:37362887
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10150353/
Abstract

UNLABELLED: Recently, research on short text topic models has addressed the challenges of social media datasets. These models are typically evaluated using automated measures. However, recent work suggests that these evaluation measures do not inform whether the topics produced can yield meaningful insights for those examining social media data. Efforts to address this issue, including gauging the alignment between automated and human evaluation tasks, are hampered by a lack of knowledge about how researchers use topic models. Further problems could arise if researchers do not construct topic models optimally or use them in a way that exceeds the models' limitations. These scenarios threaten the validity of topic model development and the insights produced by researchers employing topic modelling as a methodology. However, there is currently a lack of information about how and why topic models are used in applied research. As such, we performed a systematic literature review of 189 articles where topic modelling was used for social media analysis to understand how and why topic models are used for social media analysis. Our results suggest that the development of topic models is not aligned with the needs of those who use them for social media analysis. We have found that researchers use topic models sub-optimally. There is a lack of methodological support for researchers to build and interpret topics. We offer a set of recommendations for topic model researchers to address these problems and bridge the gap between development and applied research on short text topic models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10462-023-10471-x.

摘要

未标注:最近,关于短文本主题模型的研究已经解决了社交媒体数据集的挑战。这些模型通常使用自动化方法进行评估。然而,最近的研究表明,这些评估方法并不能说明所生成的主题是否能为研究社交媒体数据的人员提供有意义的见解。由于缺乏对研究人员如何使用主题模型的了解,解决这一问题的努力,包括衡量自动化评估任务与人工评估任务之间的一致性,受到了阻碍。如果研究人员没有以最佳方式构建主题模型,或者以超出模型限制的方式使用它们,可能会出现进一步的问题。这些情况威胁到主题模型开发的有效性以及将主题建模作为一种方法的研究人员所产生的见解。然而,目前缺乏关于主题模型在应用研究中如何以及为何被使用的信息。因此,我们对189篇将主题建模用于社交媒体分析的文章进行了系统的文献综述,以了解主题模型如何以及为何被用于社交媒体分析。我们的结果表明,主题模型的开发与将其用于社交媒体分析的人员的需求不一致。我们发现研究人员对主题模型的使用并不理想。研究人员在构建和解释主题方面缺乏方法上的支持。我们为主题模型研究人员提供了一套建议,以解决这些问题,并弥合短文本主题模型开发与应用研究之间的差距。 补充信息:在线版本包含可在10.1007/s10462-023-10471-x获取的补充材料。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/cb6fe63fd00a/10462_2023_10471_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/02849545fffc/10462_2023_10471_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/0ceb77b54d3b/10462_2023_10471_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/7a81d5d99d9e/10462_2023_10471_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/cb6fe63fd00a/10462_2023_10471_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/02849545fffc/10462_2023_10471_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/0ceb77b54d3b/10462_2023_10471_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/7a81d5d99d9e/10462_2023_10471_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2f92/10150353/cb6fe63fd00a/10462_2023_10471_Fig4_HTML.jpg

相似文献

[1]
A systematic review of the use of topic models for short text social media analysis.

Artif Intell Rev. 2023-5-1

[2]
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022-2-1

[3]
Short text topic modelling approaches in the context of big data: taxonomy, survey, and analysis.

Artif Intell Rev. 2023

[4]
The future of Cochrane Neonatal.

Early Hum Dev. 2020-11

[5]
Evaluation of clustering and topic modeling methods over health-related tweets and emails.

Artif Intell Med. 2021-7

[6]
Investigating the Efficient Use of Word Embedding with Neural-Topic Models for Interpretable Topics from Short Texts.

Sensors (Basel). 2022-1-23

[7]
Public Concern About Monitoring Twitter Users and Their Conversations to Recruit for Clinical Trials: Survey Study.

J Med Internet Res. 2019-10-30

[8]
Pseudo-document simulation for comparing LDA, GSDMM and GPM topic models on short and sparse text using Twitter data.

Comput Stat. 2023

[9]

2014-5

[10]
Social media based surveillance systems for healthcare using machine learning: A systematic review.

J Biomed Inform. 2020-8

引用本文的文献

[1]
Sentiment analysis and topic modeling of social media data to explore public discourse on irritable bowel syndrome.

Sci Rep. 2025-7-1

[2]
AI framework for DRIVE model based mental health detection in text: a case study on how coping strategies are expressed during COVID-19.

PeerJ Comput Sci. 2025-4-25

[3]
Towards Identifying Objectivity in Short Informal Text.

Entropy (Basel). 2025-5-30

[4]
Improving topic modeling performance on social media through semantic relationships within biomedical terminology.

PLoS One. 2025-2-21

[5]
Social media activism and women's health: Endometriosis awareness and support.

Digit Health. 2025-1-21

[6]
Experiences of Alzheimer's disease and related dementia family caregivers on Reddit communities: A topic modeling and sentiment analysis.

Artif Intell Health. 2024

[7]
Large Language Models Can Enable Inductive Thematic Analysis of a Social Media Corpus in a Single Prompt: Human Validation Study.

JMIR Infodemiology. 2024-8-29

[8]
Public perception of cultural ecosystem services in historic districts based on biterm topic model.

Sci Rep. 2024-5-22

[9]
Analyzing public demands on China's online government inquiry platform: A BERTopic-Based topic modeling study.

PLoS One. 2024

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索