Debunking multi-lingual social media posts using deep learning.

Authors

Kotiyal Bina, Pathak Heman, Singh Nipur

Affiliation

Department of Computer Science, Gurukula Kangri (Deemed to be University), Haridwar, Uttarakhand, India.

Publication information

Int J Inf Technol. 2023 Jun 4:1-13. doi: 10.1007/s41870-023-01288-6.

DOI: 10.1007/s41870-023-01288-6

PMID: 37360313

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10239612/

Abstract

Fake news on social media has become a growing concern because of its potential to shape public opinion. The proposed Debunking Multi-Lingual Social Media Posts using Deep Learning (DSMPD) approach offers a promising way to detect fake news. The approach first builds a dataset of English and Hindi social media posts using web scraping and Natural Language Processing (NLP) techniques. This dataset is then used to train, test, and validate a deep learning-based model that extracts various features, including Embeddings from Language Models (ELMo), word and n-gram counts, Term Frequency-Inverse Document Frequency (TF-IDF), sentiment, polarity, and Named Entity Recognition (NER). Based on these features, the model classifies news items into five categories: real, could be real, could be fabricated, fabricated, or dangerously fabricated. To evaluate the classifiers, the researchers used two datasets comprising over 45,000 articles. Machine learning (ML) algorithms and a deep learning (DL) model are compared to choose the best option for classification and prediction.
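
The abstract does not describe how the features are computed, so the following is only a minimal sketch of two of the listed features, n-gram counts and TF-IDF, written in plain Python. The function names (`ngrams`, `tfidf`), the whitespace tokenization, the unsmoothed `log(N / df)` weighting, and the toy posts are all assumptions made for illustration and are not taken from the paper. The other features (ELMo embeddings, sentiment, polarity, NER) depend on trained models and are left out.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the contiguous n-grams (as tuples) of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tfidf(docs):
    """Compute TF-IDF weights for a list of tokenized documents.

    TF is the term count divided by the document length; IDF is
    log(N / df), where df is the number of documents containing the term.
    This unsmoothed variant is one common choice, assumed here.
    """
    n_docs = len(docs)
    # Document frequency: count each term once per document.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Toy posts, invented for illustration only.
posts = [
    "vaccine causes magnetism claims viral post",
    "health ministry publishes vaccine schedule",
    "viral post claims free phones for all",
]
tokenized = [post.split() for post in posts]          # naive whitespace tokens
bigram_counts = [Counter(ngrams(t, 2)) for t in tokenized]
weights = tfidf(tokenized)
```

In this sketch a term that appears in only one post (such as "magnetism") receives a higher weight than one shared across posts (such as "vaccine"), which is the property that makes TF-IDF useful as a classifier input. A real pipeline for Hindi text would need a language-aware tokenizer rather than `str.split`.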

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05a3/10239612/d38a38fbfaff/41870_2023_1288_Fig1_HTML.jpg
