• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于新颖文本和语言上下文特征的推特谣言识别

Rumour identification on Twitter as a function of novel textual and language-context features.

作者信息

Ali Ghulam, Malik Muhammad Shahid Iqbal

机构信息

Department of Computer Science, COMSATS University Islamabad, Attock Campus, Islamabad, Pakistan.

Department of Computer Science, Capital University of Science and Technology, Kahuta Road, Sihala, 44000 Islamabad, Pakistan.

出版信息

Multimed Tools Appl. 2023;82(5):7017-7038. doi: 10.1007/s11042-022-13595-4. Epub 2022 Aug 12.

DOI:10.1007/s11042-022-13595-4
PMID:35974894
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9371961/
Abstract

Social microblogs are one of the popular platforms for information spreading. However, with several advantages, these platforms are being used for spreading rumours. At present, the majority of existing approaches identify rumours at the topic level instead of at the tweet/post level. Moreover, prior studies used the sentiment and linguistic features for rumours identification without considering discrete positive and negative emotions and effective part-of-speech features in content-based approaches. Similarly, the majority of prior studies used content-based approaches for feature generation, and recent context-based approaches were not explored. To cope with these challenges, a robust framework for rumour detection at the tweet level is designed in this paper. The model used word2vec embeddings and bidirectional encoder representations from transformers method (BERT) from context-based and discrete emotions, linguistic, and metadata characteristics from content-based approaches. According to our knowledge, we are the first ones who used these features for rumour identification at the tweet/post level. The framework is tested on four real-life twitter microblog datasets. The results show that the detection model is capable of detecting 97%, 86%, 85%, and 80% of rumours on four datasets respectively. In addition, the proposed framework outperformed the three latest state-of-the-art baselines. BERT model presented the best performance among context-based approaches, and linguistic features are best performing among content-based approaches as a stand-alone model. Moreover, the utilization of two-step feature selection further improves the detection model performance.

摘要

社交微博是信息传播的热门平台之一。然而,尽管有诸多优点,这些平台却被用于传播谣言。目前,大多数现有方法是在话题层面而非推文/帖子层面识别谣言。此外,先前的研究在基于内容的方法中使用情感和语言特征进行谣言识别时,没有考虑离散的积极和消极情绪以及有效的词性特征。同样,大多数先前的研究使用基于内容的方法进行特征生成,而未探索最近基于上下文的方法。为应对这些挑战,本文设计了一个用于在推文层面进行谣言检测的强大框架。该模型使用了基于上下文的离散情绪、语言和基于内容的方法中的元数据特征的词向量嵌入和来自变换器方法(BERT)的双向编码器表示。据我们所知,我们是首批在推文/帖子层面使用这些特征进行谣言识别的人。该框架在四个真实的推特微博数据集上进行了测试。结果表明,该检测模型在四个数据集上分别能够检测出97%、86%、85%和80%的谣言。此外,所提出的框架优于三个最新的最先进基线。在基于上下文的方法中,BERT模型表现最佳,而在基于内容的方法中,语言特征作为独立模型表现最佳。此外,两步特征选择的运用进一步提高了检测模型的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/b2cd35e6b7e8/11042_2022_13595_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/aebdb394a672/11042_2022_13595_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/72faa7bf6671/11042_2022_13595_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/51ae118053e7/11042_2022_13595_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/bfc09f1f332d/11042_2022_13595_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/4525430b96c0/11042_2022_13595_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/ceb08869bd99/11042_2022_13595_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/cd7f8f50fee2/11042_2022_13595_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/4d7d0c8436dc/11042_2022_13595_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/b2cd35e6b7e8/11042_2022_13595_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/aebdb394a672/11042_2022_13595_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/72faa7bf6671/11042_2022_13595_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/51ae118053e7/11042_2022_13595_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/bfc09f1f332d/11042_2022_13595_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/4525430b96c0/11042_2022_13595_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/ceb08869bd99/11042_2022_13595_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/cd7f8f50fee2/11042_2022_13595_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/4d7d0c8436dc/11042_2022_13595_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35a1/9371961/b2cd35e6b7e8/11042_2022_13595_Fig9_HTML.jpg

相似文献

1
Rumour identification on Twitter as a function of novel textual and language-context features.基于新颖文本和语言上下文特征的推特谣言识别
Multimed Tools Appl. 2023;82(5):7017-7038. doi: 10.1007/s11042-022-13595-4. Epub 2022 Aug 12.
2
Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测
Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.
3
Rumour detection using deep learning and filter-wrapper feature selection in benchmark twitter dataset.在基准推特数据集上使用深度学习和过滤包装特征选择进行谣言检测。
Multimed Tools Appl. 2022;81(24):34615-34632. doi: 10.1007/s11042-021-11340-x. Epub 2021 Aug 20.
4
How to detect propaganda from social media? Exploitation of semantic and fine-tuned language models.如何从社交媒体中检测宣传内容?语义和微调语言模型的应用
PeerJ Comput Sci. 2023 Feb 20;9:e1248. doi: 10.7717/peerj-cs.1248. eCollection 2023.
5
A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques.基于深度学习技术的 COVID-19 谣言数据集的自然语言处理 (NLP) 评估。
Comput Intell Neurosci. 2022 Sep 14;2022:6561622. doi: 10.1155/2022/6561622. eCollection 2022.
6
: a hybrid deep neural model with mixed fusion for rumour detection in social data streams.一种用于社交数据流谣言检测的具有混合融合的混合深度神经模型。
Neural Comput Appl. 2022;34(18):15129-15140. doi: 10.1007/s00521-021-06743-8. Epub 2022 Jan 8.
7
Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads.通过查看对话线程来分析人们在社交媒体中如何传播和对待谣言。
PLoS One. 2016 Mar 4;11(3):e0150989. doi: 10.1371/journal.pone.0150989. eCollection 2016.
8
Text embedding techniques for efficient clustering of twitter data.用于高效聚类推特数据的文本嵌入技术。
Evol Intell. 2023 Feb 7:1-11. doi: 10.1007/s12065-023-00825-3.
9
Stacked DeBERT: All attention in incomplete data for text classification.堆叠型 DeBERTa:文本分类中针对不完整数据的全注意力
Neural Netw. 2021 Apr;136:87-96. doi: 10.1016/j.neunet.2020.12.018. Epub 2020 Dec 25.
10
The Moderation of Human Characteristics in the Control Mechanisms of Rumours in Social Media: The Case of Food Rumours in China.社交媒体谣言控制机制中人类特征的调节作用:以中国食品谣言为例
Front Psychol. 2022 Jan 17;12:782313. doi: 10.3389/fpsyg.2021.782313. eCollection 2021.

引用本文的文献

1
Categorization of tweets for damages: infrastructure and human damage assessment using fine-tuned BERT model.用于损害分类的推文:使用微调BERT模型进行基础设施和人员损害评估
PeerJ Comput Sci. 2024 Feb 16;10:e1859. doi: 10.7717/peerj-cs.1859. eCollection 2024.