Classifying and fact-checking health-related information about COVID-19 on Twitter/X using machine learning and deep learning models.

Authors

Sharifpoor Elham, Okhovati Maryam, Ghazizadeh-Ahsaee Mostafa, Avaz Beigi Mina

Affiliations

Medical Library and Information Sciences Department, Medical Informatics Research Center, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran.

Department of Computer Engineering, Shahid Bahonar University of Kerman, Kerman, Iran.

Publication

BMC Med Inform Decis Mak. 2025 Feb 11;25(1):73. doi: 10.1186/s12911-025-02895-y.

DOI:10.1186/s12911-025-02895-y
PMID:39934858
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11817542/
Abstract

BACKGROUND

Despite recent progress in misinformation detection, more robust fact-checking models are still needed, particularly ones that account for the unique challenges of health information sharing. This study aimed to identify the most effective approach for detecting and classifying reliable information versus misinformation in COVID-19-related health content shared on Twitter/X.

METHODS

We used seven machine learning and deep learning models. Tweets were collected using relevant keywords and hashtags, then processed, analyzed, and labeled into two distinct datasets: "Trustworthy information" and "Misinformation". The cosine similarity metric was employed in oversampling the minority "Trustworthy information" class, ensuring a more balanced representation of both classes for training and testing. Finally, the performance of the fact-checking models was compared using accuracy, precision, recall, F1 score, ROC curve, and AUC.
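The abstract does not state the text representation or how cosine similarity was applied during oversampling. As a minimal sketch only, assuming lowercase whitespace tokenization and raw term counts (both assumptions, not the authors' pipeline), the cosine similarity between two tweets could be computed as follows. The function name and example strings are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine similarity between the term-count vectors of two texts.

    Assumption: lowercase whitespace tokenization and raw counts; the
    study's actual features are not given in the abstract.
    """
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    # Only terms present in both texts contribute to the dot product.
    shared_terms = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[t] * counts_b[t] for t in shared_terms)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty text has no direction; treat it as dissimilar
    return dot / (norm_a * norm_b)


# Hypothetical example tweets sharing two of three terms.
similarity = cosine_similarity("masks reduce spread", "masks reduce risk")
print(round(similarity, 3))
```

A score near 1 indicates near-identical wording and a score of 0 indicates no shared terms. How such scores were thresholded or used to select samples for oversampling is not described in the abstract.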

RESULTS

For accuracy, precision, F1 score, and recall, the average values of TextConvoNet were 90.28, 90.28, 90.29, and 0.9030, respectively, and its ROC AUC was 0.901. For the "Trustworthy information" class, it achieved an accuracy of 85%, precision of 93%, recall of 86%, and F1 score of 89%. These values were higher than those of the other models. Its performance on the "Misinformation" class was even stronger, with an accuracy of 94%, precision of 88%, recall of 94%, and F1 score of 91%.
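The per-class F1 scores can be checked against the reported precision and recall, since F1 is their harmonic mean. The short sketch below uses only the per-class figures quoted in the abstract; the helper name `f1_score` is illustrative and not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both on a 0-1 scale)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Per-class (precision, recall) as reported in the abstract.
reported = {
    "Trustworthy information": (0.93, 0.86),
    "Misinformation": (0.88, 0.94),
}

f1_by_class = {name: f1_score(p, r) for name, (p, r) in reported.items()}
for name, f1 in f1_by_class.items():
    print(f"{name}: F1 = {f1:.2f}")
```

Both computed values round to the reported F1 scores (89% and 91%), so the per-class precision, recall, and F1 figures are mutually consistent.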

CONCLUSION

This study showed that TextConvoNet was the most effective model for detecting and classifying trustworthy information versus misinformation on health issues shared on Twitter/X.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6913/11817542/e05c3a07aabe/12911_2025_2895_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6913/11817542/5ba27495e972/12911_2025_2895_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6913/11817542/0583f67c6c39/12911_2025_2895_Fig2_HTML.jpg
