• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

识别与痴呆症和新冠肺炎相关的X(原推特)帖子:机器学习方法

Identifying X (Formerly Twitter) Posts Relevant to Dementia and COVID-19: Machine Learning Approach.

作者信息

Azizi Mehrnoosh, Jamali Ali Akbar, Spiteri Raymond J

机构信息

Department of Computer Science, University of Saskatchewan, Saskatoon, SK, Canada.

出版信息

JMIR Form Res. 2024 Jun 4;8:e49562. doi: 10.2196/49562.

DOI:10.2196/49562
PMID:38833288
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11185906/
Abstract

BACKGROUND

During the pandemic, patients with dementia were identified as a vulnerable population. X (formerly Twitter) became an important source of information for people seeking updates on COVID-19, and, therefore, identifying posts (formerly tweets) relevant to dementia can be an important support for patients with dementia and their caregivers. However, mining and coding relevant posts can be daunting due to the sheer volume and high percentage of irrelevant posts.

OBJECTIVE

The objective of this study was to automate the identification of posts relevant to dementia and COVID-19 using natural language processing and machine learning (ML) algorithms.

METHODS

We used a combination of natural language processing and ML algorithms with manually annotated posts to identify posts relevant to dementia and COVID-19. We used 3 data sets containing more than 100,000 posts and assessed the capability of various algorithms in correctly identifying relevant posts.

RESULTS

Our results showed that (pretrained) transfer learning algorithms outperformed traditional ML algorithms in identifying posts relevant to dementia and COVID-19. Among the algorithms tested, the transfer learning algorithm A Lite Bidirectional Encoder Representations from Transformers (ALBERT) achieved an accuracy of 82.92% and an area under the curve of 83.53%. ALBERT substantially outperformed the other algorithms tested, further emphasizing the superior performance of transfer learning algorithms in the classification of posts.

CONCLUSIONS

Transfer learning algorithms such as ALBERT are highly effective in identifying topic-specific posts, even when trained with limited or adjacent data, highlighting their superiority over other ML algorithms and applicability to other studies involving analysis of social media posts. Such an automated approach reduces the workload of manual coding of posts and facilitates their analysis for researchers and policy makers to support patients with dementia and their caregivers and other vulnerable populations.

摘要

背景

在疫情期间,痴呆症患者被认定为弱势群体。X(前身为推特)成为了人们获取新冠病毒最新信息的重要来源,因此,识别与痴呆症相关的帖子(前身为推文)对痴呆症患者及其护理人员而言可能是一项重要的支持。然而,由于无关帖子数量庞大且占比高,挖掘和编码相关帖子可能具有挑战性。

目的

本研究的目的是使用自然语言处理和机器学习(ML)算法自动识别与痴呆症和新冠病毒相关的帖子。

方法

我们将自然语言处理和ML算法与人工标注的帖子相结合,以识别与痴呆症和新冠病毒相关的帖子。我们使用了3个包含超过10万条帖子的数据集,并评估了各种算法正确识别相关帖子的能力。

结果

我们的结果表明,(预训练)迁移学习算法在识别与痴呆症和新冠病毒相关的帖子方面优于传统ML算法。在测试的算法中,迁移学习算法“来自变换器的轻量级双向编码器表征”(ALBERT)的准确率达到82.92%,曲线下面积为83.53%。ALBERT的表现显著优于其他测试算法,进一步凸显了迁移学习算法在帖子分类中的卓越性能。

结论

像ALBERT这样的迁移学习算法在识别特定主题的帖子方面非常有效,即使使用有限或相邻数据进行训练也是如此,这突出了它们相对于其他ML算法的优越性以及对其他涉及社交媒体帖子分析的研究的适用性。这种自动化方法减少了帖子人工编码的工作量,并便于研究人员和政策制定者进行分析,以支持痴呆症患者及其护理人员以及其他弱势群体。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e73a/11185906/e0cd8a7d9192/formative_v8i1e49562_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e73a/11185906/f875440e28bc/formative_v8i1e49562_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e73a/11185906/87656cd807f9/formative_v8i1e49562_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e73a/11185906/e0cd8a7d9192/formative_v8i1e49562_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e73a/11185906/f875440e28bc/formative_v8i1e49562_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e73a/11185906/87656cd807f9/formative_v8i1e49562_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e73a/11185906/e0cd8a7d9192/formative_v8i1e49562_fig3.jpg

相似文献

1
Identifying X (Formerly Twitter) Posts Relevant to Dementia and COVID-19: Machine Learning Approach.识别与痴呆症和新冠肺炎相关的X(原推特)帖子:机器学习方法
JMIR Form Res. 2024 Jun 4;8:e49562. doi: 10.2196/49562.
2
Momentary Depressive Feeling Detection Using X (Formerly Twitter) Data: Contextual Language Approach.使用X(原推特)数据检测瞬间抑郁情绪:上下文语言方法。
JMIR AI. 2023 Nov 27;2:e49531. doi: 10.2196/49531.
3
Automatically Identifying Twitter Users for Interventions to Support Dementia Family Caregivers: Annotated Data Set and Benchmark Classification Models.自动识别用于支持痴呆症家庭照顾者干预措施的推特用户:带注释的数据集和基准分类模型
JMIR Aging. 2022 Sep 16;5(3):e39547. doi: 10.2196/39547.
4
Detecting Potentially Harmful and Protective Suicide-Related Content on Twitter: Machine Learning Approach.在 Twitter 上检测潜在有害和保护自杀相关内容:机器学习方法。
J Med Internet Res. 2022 Aug 17;24(8):e34705. doi: 10.2196/34705.
5
Developing an Automatic System for Classifying Chatter About Health Services on Twitter: Case Study for Medicaid.开发一个自动系统来对 Twitter 上有关医疗服务的闲聊进行分类:以医疗补助计划为例。
J Med Internet Res. 2021 May 3;23(5):e26616. doi: 10.2196/26616.
6
Comparing Emotional Valence Scores of Twitter Posts from Manual Coding and Machine Learning Algorithms to Gain Insights to Refine Interventions for Family Caregivers of Persons with Dementia.比较手动编码和机器学习算法的 Twitter 帖子的情感效价得分,以深入了解如何改进对痴呆症患者家属的干预措施。
Stud Health Technol Inform. 2022 Jun 29;295:253-256. doi: 10.3233/SHTI220710.
7
Social Media Monitoring of the COVID-19 Pandemic and Influenza Epidemic With Adaptation for Informal Language in Arabic Twitter Data: Qualitative Study.对新冠疫情和流感流行进行社交媒体监测,并针对阿拉伯语推特数据中的非正式语言进行调整:定性研究。
JMIR Med Inform. 2021 Sep 17;9(9):e27670. doi: 10.2196/27670.
8
Using twitter to examine smoking behavior and perceptions of emerging tobacco products.利用推特研究吸烟行为及对新兴烟草产品的认知。
J Med Internet Res. 2013 Aug 29;15(8):e174. doi: 10.2196/jmir.2534.
9
Vaccine sentiment analysis using BERT + NBSVM and geo-spatial approaches.使用BERT + NBSVM和地理空间方法的疫苗情绪分析。
J Supercomput. 2023 May 7:1-31. doi: 10.1007/s11227-023-05319-8.
10
Dynamics of the Negative Discourse Toward COVID-19 Vaccines: Topic Modeling Study and an Annotated Data Set of Twitter Posts.针对 COVID-19 疫苗的负面话语动态:主题建模研究与 Twitter 帖子的标注数据集。
J Med Internet Res. 2023 Apr 12;25:e41319. doi: 10.2196/41319.

本文引用的文献

1
Momentary Depressive Feeling Detection Using X (Formerly Twitter) Data: Contextual Language Approach.使用X(原推特)数据检测瞬间抑郁情绪:上下文语言方法。
JMIR AI. 2023 Nov 27;2:e49531. doi: 10.2196/49531.
2
Effectiveness of lifestyle medicine on cognitive functions in mild cognitive impairments and dementia: A systematic review on randomized controlled trials.生活方式医学对轻度认知障碍和痴呆患者认知功能的有效性:一项关于随机对照试验的系统评价
Ageing Res Rev. 2023 Apr;86:101886. doi: 10.1016/j.arr.2023.101886. Epub 2023 Feb 18.
3
Telehealth Support for Dementia Caregivers During the COVID-19 Pandemic: Lessons Learned From the NYU Family Support Program.
新冠疫情期间针对痴呆照护者的远程医疗支持:来自纽约大学家庭支持项目的经验教训。
Am J Geriatr Psychiatry. 2023 Jan;31(1):14-21. doi: 10.1016/j.jagp.2022.08.005. Epub 2022 Aug 30.
4
TClustVID: A novel machine learning classification model to investigate topics and sentiment in COVID-19 tweets.TClustVID:一种用于研究新冠疫情推文主题和情感的新型机器学习分类模型。
Knowl Based Syst. 2021 Aug 17;226:107126. doi: 10.1016/j.knosys.2021.107126. Epub 2021 May 6.
5
COVID-19 and dementia: Analyses of risk, disparity, and outcomes from electronic health records in the US.COVID-19 与痴呆:来自美国电子健康记录的风险、差异和结果分析。
Alzheimers Dement. 2021 Aug;17(8):1297-1306. doi: 10.1002/alz.12296. Epub 2021 Feb 9.
6
Using Twitter to Understand the COVID-19 Experiences of People With Dementia: Infodemiology Study.利用 Twitter 了解痴呆症患者的 COVID-19 体验:信息流行病学研究。
J Med Internet Res. 2021 Feb 3;23(2):e26254. doi: 10.2196/26254.
7
Impact of dementia on informal care: a systematic review of family caregivers' perceptions.痴呆症对非正式照护的影响:对家庭照护者认知的系统评价
BMJ Support Palliat Care. 2020 Oct 14. doi: 10.1136/bmjspcare-2020-002242.
8
Towards an Ethical Framework for Publishing Twitter Data in Social Research: Taking into Account Users' Views, Online Context and Algorithmic Estimation.构建社会研究中发布推特数据的伦理框架:兼顾用户观点、网络环境及算法评估
Sociology. 2017 Dec;51(6):1149-1168. doi: 10.1177/0038038517708140. Epub 2017 May 26.
9
Decision tree and ensemble learning algorithms with their applications in bioinformatics.决策树和集成学习算法及其在生物信息学中的应用。
Adv Exp Med Biol. 2011;696:191-9. doi: 10.1007/978-1-4419-7046-6_19.