• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

IFND:一个用于假新闻检测的基准数据集。

IFND: a benchmark dataset for fake news detection.

作者信息

Sharma Dilip Kumar, Garg Sonal

机构信息

GLA University, Mathura, India.

出版信息

Complex Intell Systems. 2023;9(3):2843-2863. doi: 10.1007/s40747-021-00552-1. Epub 2021 Oct 16.

DOI:10.1007/s40747-021-00552-1
PMID:34777983
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8520332/
Abstract

Spotting fake news is a critical problem nowadays. Social media are responsible for propagating fake news. Fake news propagated over digital platforms generates confusion as well as induce biased perspectives in people. Detection of misinformation over the digital platform is essential to mitigate its adverse impact. Many approaches have been implemented in recent years. Despite the productive work, fake news identification poses many challenges due to the lack of a comprehensive publicly available benchmark dataset. There is no large-scale dataset that consists of Indian news only. So, this paper presents IFND (Indian fake news dataset) dataset. The dataset consists of both text and images. The majority of the content in the dataset is about events from the year 2013 to the year 2021. Dataset content is scrapped using the Parsehub tool. To increase the size of the fake news in the dataset, an intelligent augmentation algorithm is used. An intelligent augmentation algorithm generates meaningful fake news statements. The latent Dirichlet allocation (LDA) technique is employed for topic modelling to assign the categories to news statements. Various machine learning and deep-learning classifiers are implemented on text and image modality to observe the proposed IFND dataset's performance. A multi-modal approach is also proposed, which considers both textual and visual features for fake news detection. The proposed IFND dataset achieved satisfactory results. This study affirms that the accessibility of such a huge dataset can actuate research in this laborious exploration issue and lead to better prediction models.

摘要

如今,识别虚假新闻是一个关键问题。社交媒体对虚假新闻的传播负有责任。在数字平台上传播的虚假新闻会造成混乱,并在人们中引发偏见。在数字平台上检测错误信息对于减轻其不利影响至关重要。近年来已经实施了许多方法。尽管取得了丰硕的成果,但由于缺乏全面的公开基准数据集,虚假新闻识别仍面临许多挑战。没有仅包含印度新闻的大规模数据集。因此,本文提出了IFND(印度虚假新闻数据集)数据集。该数据集包含文本和图像。数据集中的大部分内容是关于2013年至2021年的事件。数据集内容使用Parsehub工具进行抓取。为了增加数据集中虚假新闻的数量,使用了一种智能增强算法。智能增强算法生成有意义的虚假新闻陈述。潜在狄利克雷分配(LDA)技术用于主题建模,为新闻陈述分配类别。在文本和图像模态上实现了各种机器学习和深度学习分类器,以观察所提出的IFND数据集的性能。还提出了一种多模态方法,该方法考虑文本和视觉特征来进行虚假新闻检测。所提出的IFND数据集取得了令人满意的结果。这项研究证实,如此庞大数据集的可获取性可以推动在这个艰巨的探索问题上的研究,并导致更好的预测模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/8c98eaefe6b5/40747_2021_552_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/4281cf163223/40747_2021_552_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/d9cebc838ecd/40747_2021_552_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/8af71b85e16c/40747_2021_552_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/54cf40f0b32b/40747_2021_552_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/271de0474451/40747_2021_552_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/c23813867728/40747_2021_552_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/c8eda1eb9b84/40747_2021_552_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/1d29fb311d16/40747_2021_552_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/24893504e754/40747_2021_552_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/8c98eaefe6b5/40747_2021_552_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/4281cf163223/40747_2021_552_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/d9cebc838ecd/40747_2021_552_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/8af71b85e16c/40747_2021_552_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/54cf40f0b32b/40747_2021_552_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/271de0474451/40747_2021_552_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/c23813867728/40747_2021_552_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/c8eda1eb9b84/40747_2021_552_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/1d29fb311d16/40747_2021_552_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/24893504e754/40747_2021_552_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f24/8520332/8c98eaefe6b5/40747_2021_552_Fig10_HTML.jpg

相似文献

1
IFND: a benchmark dataset for fake news detection.IFND:一个用于假新闻检测的基准数据集。
Complex Intell Systems. 2023;9(3):2843-2863. doi: 10.1007/s40747-021-00552-1. Epub 2021 Oct 16.
2
Deep Ensemble Fake News Detection Model Using Sequential Deep Learning Technique.基于序列深度学习技术的深度集成假新闻检测模型。
Sensors (Basel). 2022 Sep 15;22(18):6970. doi: 10.3390/s22186970.
3
Ensemble Techniques for Robust Fake News Detection: Integrating Transformers, Natural Language Processing, and Machine Learning.用于稳健假新闻检测的集成技术:整合Transformer、自然语言处理和机器学习
Sensors (Basel). 2024 Sep 19;24(18):6062. doi: 10.3390/s24186062.
4
Predicting image credibility in fake news over social media using multi-modal approach.使用多模态方法预测社交媒体上假新闻中的图像可信度。
Neural Comput Appl. 2022;34(24):21503-21517. doi: 10.1007/s00521-021-06086-4. Epub 2021 May 24.
5
CoAID-DEEP: An Optimized Intelligent Framework for Automated Detecting COVID-19 Misleading Information on Twitter.CoAID-DEEP:用于自动检测推特上新冠病毒误导性信息的优化智能框架
IEEE Access. 2021 Feb 9;9:27840-27867. doi: 10.1109/ACCESS.2021.3058066. eCollection 2021.
6
Arabic Fake News Detection Based on Textual Analysis.基于文本分析的阿拉伯语假新闻检测
Arab J Sci Eng. 2022;47(8):10453-10469. doi: 10.1007/s13369-021-06449-y. Epub 2022 Feb 11.
7
Dissecting the infodemic: An in-depth analysis of COVID-19 misinformation detection on X (formerly Twitter) utilizing machine learning and deep learning techniques.剖析信息疫情:利用机器学习和深度学习技术对X(原推特)上新冠疫情错误信息检测的深入分析。
Heliyon. 2024 Sep 12;10(18):e37760. doi: 10.1016/j.heliyon.2024.e37760. eCollection 2024 Sep 30.
8
A systematic literature review and existing challenges toward fake news detection models.关于假新闻检测模型的系统文献综述及现存挑战。
Soc Netw Anal Min. 2022;12(1):168. doi: 10.1007/s13278-022-00995-5. Epub 2022 Nov 14.
9
EchoFakeD: improving fake news detection in social media with an efficient deep neural network.回声假新闻检测(EchoFakeD):利用高效深度神经网络改进社交媒体中的假新闻检测
Neural Comput Appl. 2021;33(14):8597-8613. doi: 10.1007/s00521-020-05611-1. Epub 2021 Jan 2.
10
Dataset for multimodal fake news detection and verification tasks.用于多模态假新闻检测与验证任务的数据集。
Data Brief. 2024 Apr 16;54:110440. doi: 10.1016/j.dib.2024.110440. eCollection 2024 Jun.

引用本文的文献

1
GBERT: A hybrid deep learning model based on GPT-BERT for fake news detection.GBERT:一种基于GPT-BERT的用于虚假新闻检测的混合深度学习模型。
Heliyon. 2024 Aug 6;10(16):e35865. doi: 10.1016/j.heliyon.2024.e35865. eCollection 2024 Aug 30.
2
Multimodal analysis of disinformation and misinformation.虚假信息与错误信息的多模态分析
R Soc Open Sci. 2023 Dec 20;10(12):230964. doi: 10.1098/rsos.230964. eCollection 2023 Dec.
3
A systematic literature review and existing challenges toward fake news detection models.关于假新闻检测模型的系统文献综述及现存挑战。

本文引用的文献

1
An automatic approach based on CNN architecture to detect Covid-19 disease from chest X-ray images.一种基于卷积神经网络(CNN)架构的自动方法,用于从胸部X光图像中检测新冠肺炎。
Appl Intell (Dordr). 2021;51(5):2864-2889. doi: 10.1007/s10489-020-02010-w. Epub 2020 Nov 27.
2
A novel self-learning semi-supervised deep learning network to detect fake news on social media.一种用于检测社交媒体上虚假新闻的新型自学习半监督深度学习网络。
Multimed Tools Appl. 2022;81(14):19341-19349. doi: 10.1007/s11042-021-11065-x. Epub 2021 Jun 2.
3
Predicting image credibility in fake news over social media using multi-modal approach.
Soc Netw Anal Min. 2022;12(1):168. doi: 10.1007/s13278-022-00995-5. Epub 2022 Nov 14.
4
Evaluating the effectiveness of publishers' features in fake news detection on social media.评估出版商的特征在社交媒体假新闻检测中的有效性。
Multimed Tools Appl. 2023;82(2):2913-2939. doi: 10.1007/s11042-022-12668-8. Epub 2022 Apr 11.
使用多模态方法预测社交媒体上假新闻中的图像可信度。
Neural Comput Appl. 2022;34(24):21503-21517. doi: 10.1007/s00521-021-06086-4. Epub 2021 May 24.
4
FakeBERT: Fake news detection in social media with a BERT-based deep learning approach.FakeBERT:基于BERT的深度学习方法用于社交媒体中的假新闻检测
Multimed Tools Appl. 2021;80(8):11765-11788. doi: 10.1007/s11042-020-10183-2. Epub 2021 Jan 7.
5
FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media.假新闻网:一个具有新闻内容、社交背景和时空信息的数据资源库,用于研究社交媒体上的假新闻。
Big Data. 2020 Jun;8(3):171-188. doi: 10.1089/big.2020.0062.