• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

防止在道德假新闻检测中进行特征分析。

Preventing profiling for ethical fake news detection.

作者信息

Allein Liesbeth, Moens Marie-Francine, Perrotta Domenico

机构信息

European Commission, Joint Research Centre (JRC), Italy.

Department of Computer Science, KU Leuven, Belgium.

出版信息

Inf Process Manag. 2023 Mar;60(2):None. doi: 10.1016/j.ipm.2022.103206.

DOI:10.1016/j.ipm.2022.103206
PMID:36874352
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9950332/
Abstract

A news article's online audience provides useful insights about the article's identity. However, fake news classifiers using such information risk relying on profiling. In response to the rising demand for ethical AI, we present a profiling-avoiding algorithm that leverages Twitter users during model optimisation while excluding them when an article's veracity is evaluated. For this, we take inspiration from the social sciences and introduce two objective functions that maximise correlation between the article and its spreaders, and among those spreaders. We applied our profiling-avoiding algorithm to three popular neural classifiers and obtained results on fake news data discussing a variety of news topics. The positive impact on prediction performance demonstrates the soundness of the proposed objective functions to integrate social context in text-based classifiers. Moreover, statistical visualisation and dimension reduction techniques show that the user-inspired classifiers better discriminate between unseen fake and true news in their latent spaces. Our study serves as a stepping stone to resolve the underexplored issue of profiling-dependent decision-making in user-informed fake news detection.

摘要

一篇新闻文章的在线受众能提供有关该文章特征的有用见解。然而,使用此类信息的假新闻分类器有依赖用户画像的风险。为响应人们对符合伦理的人工智能日益增长的需求,我们提出一种避免用户画像的算法,该算法在模型优化期间利用推特用户,但在评估文章真实性时将他们排除在外。为此,我们从社会科学中汲取灵感,引入两个目标函数,使文章与其传播者之间以及这些传播者之间的相关性最大化。我们将我们的避免用户画像算法应用于三种流行的神经分类器,并在讨论各种新闻主题的假新闻数据上获得了结果。对预测性能的积极影响证明了所提出的目标函数在基于文本的分类器中整合社会背景的合理性。此外,统计可视化和降维技术表明,受用户启发的分类器在其潜在空间中能更好地区分未见过的假新闻和真实新闻。我们的研究是解决在用户信息辅助的假新闻检测中尚未充分探索的依赖用户画像进行决策这一问题的垫脚石。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/94c702000e9a/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/e0a0e4abd62c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/a5533aa73b7d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/3720eece189d/fx1001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/4c99afacc91f/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/a8823d45565b/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/d797ae630a34/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/2e1b15feaac6/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/22dde0517377/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/24d89f3357aa/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/b207c065b62f/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/94c702000e9a/gr10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/e0a0e4abd62c/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/a5533aa73b7d/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/3720eece189d/fx1001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/4c99afacc91f/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/a8823d45565b/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/d797ae630a34/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/2e1b15feaac6/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/22dde0517377/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/24d89f3357aa/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/b207c065b62f/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac7c/9950332/94c702000e9a/gr10.jpg

相似文献

1
Preventing profiling for ethical fake news detection.防止在道德假新闻检测中进行特征分析。
Inf Process Manag. 2023 Mar;60(2):None. doi: 10.1016/j.ipm.2022.103206.
2
Deep Ensemble Fake News Detection Model Using Sequential Deep Learning Technique.基于序列深度学习技术的深度集成假新闻检测模型。
Sensors (Basel). 2022 Sep 15;22(18):6970. doi: 10.3390/s22186970.
3
Detection of Turkish Fake News in Twitter with Machine Learning Algorithms.使用机器学习算法在推特上检测土耳其假新闻
Arab J Sci Eng. 2022;47(2):2359-2379. doi: 10.1007/s13369-021-06223-0. Epub 2021 Oct 1.
4
A veracity dissemination consistency-based few-shot fake news detection framework by synergizing adversarial and contrastive self-supervised learning.一种基于真实性传播一致性的少样本假新闻检测框架,通过协同对抗性和对比性自监督学习实现。
Sci Rep. 2024 Aug 22;14(1):19470. doi: 10.1038/s41598-024-70039-9.
5
Dissecting the infodemic: An in-depth analysis of COVID-19 misinformation detection on X (formerly Twitter) utilizing machine learning and deep learning techniques.剖析信息疫情:利用机器学习和深度学习技术对X(原推特)上新冠疫情错误信息检测的深入分析。
Heliyon. 2024 Sep 12;10(18):e37760. doi: 10.1016/j.heliyon.2024.e37760. eCollection 2024 Sep 30.
6
A network-based positive and unlabeled learning approach for fake news detection.一种基于网络的用于虚假新闻检测的正例与无标签学习方法。
Mach Learn. 2022;111(10):3549-3592. doi: 10.1007/s10994-021-06111-6. Epub 2021 Nov 18.
7
Modeling the spread of fake news on Twitter.在 Twitter 上模拟假新闻的传播。
PLoS One. 2021 Apr 22;16(4):e0250419. doi: 10.1371/journal.pone.0250419. eCollection 2021.
8
GBERT: A hybrid deep learning model based on GPT-BERT for fake news detection.GBERT:一种基于GPT-BERT的用于虚假新闻检测的混合深度学习模型。
Heliyon. 2024 Aug 6;10(16):e35865. doi: 10.1016/j.heliyon.2024.e35865. eCollection 2024 Aug 30.
9
IFND: a benchmark dataset for fake news detection.IFND:一个用于假新闻检测的基准数据集。
Complex Intell Systems. 2023;9(3):2843-2863. doi: 10.1007/s40747-021-00552-1. Epub 2021 Oct 16.
10
Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches.基于深度学习方法的 COVID-19 假新闻文本分类检测。
Comput Math Methods Med. 2021 Nov 15;2021:5514220. doi: 10.1155/2021/5514220. eCollection 2021.

本文引用的文献

1
Five sources of bias in natural language processing.自然语言处理中偏见的五个来源。
Lang Linguist Compass. 2021 Aug;15(8):e12432. doi: 10.1111/lnc3.12432. Epub 2021 Aug 20.
2
Bias and comparison framework for abusive language datasets.辱骂性语言数据集的偏差与比较框架
AI Ethics. 2022;2(1):79-101. doi: 10.1007/s43681-021-00081-0. Epub 2021 Jul 19.
3
Semiautomatic robust regression clustering of international trade data.国际贸易数据的半自动稳健回归聚类
Stat Methods Appt. 2021;30(3):863-894. doi: 10.1007/s10260-021-00569-3. Epub 2021 Jun 11.
4
Classification aware neural topic model for COVID-19 disinformation categorisation.用于 COVID-19 虚假信息分类的分类感知神经主题模型。
PLoS One. 2021 Feb 18;16(2):e0247086. doi: 10.1371/journal.pone.0247086. eCollection 2021.
5
FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media.假新闻网:一个具有新闻内容、社交背景和时空信息的数据资源库,用于研究社交媒体上的假新闻。
Big Data. 2020 Jun;8(3):171-188. doi: 10.1089/big.2020.0062.
6
Rationale-Augmented Convolutional Neural Networks for Text Classification.用于文本分类的基于原理增强的卷积神经网络。
Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:795-804. doi: 10.18653/v1/d16-1076.
7
Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads.通过查看对话线程来分析人们在社交媒体中如何传播和对待谣言。
PLoS One. 2016 Mar 4;11(3):e0150989. doi: 10.1371/journal.pone.0150989. eCollection 2016.
8
Multidimensional scaling, tree-fitting, and clustering.多维标度、树拟合和聚类。
Science. 1980 Oct 24;210(4468):390-8. doi: 10.1126/science.210.4468.390.