• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关于深度学习模型在视频深度伪造检测中的泛化

On the Generalization of Deep Learning Models in Video Deepfake Detection.

作者信息

Coccomini Davide Alessandro, Caldelli Roberto, Falchi Fabrizio, Gennaro Claudio

机构信息

Istituto di Scienza e Tecnologie dell'Informazione, 56124 Pisa, Italy.

National Inter-University Consortium for Telecommunications (CNIT), 50134 Florence, Italy.

出版信息

J Imaging. 2023 Apr 29;9(5):89. doi: 10.3390/jimaging9050089.

DOI:10.3390/jimaging9050089
PMID:37233308
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10218961/
Abstract

The increasing use of deep learning techniques to manipulate images and videos, commonly referred to as "deepfakes", is making it more challenging to differentiate between real and fake content, while various deepfake detection systems have been developed, they often struggle to detect deepfakes in real-world situations. In particular, these methods are often unable to effectively distinguish images or videos when these are modified using novel techniques which have not been used in the training set. In this study, we carry out an analysis of different deep learning architectures in an attempt to understand which is more capable of better generalizing the concept of deepfake. According to our results, it appears that Convolutional Neural Networks (CNNs) seem to be more capable of storing specific anomalies and thus excel in cases of datasets with a limited number of elements and manipulation methodologies. The Vision Transformer, conversely, is more effective when trained with more varied datasets, achieving more outstanding generalization capabilities than the other methods analysed. Finally, the Swin Transformer appears to be a good alternative for using an attention-based method in a more limited data regime and performs very well in cross-dataset scenarios. All the analysed architectures seem to have a different way to look at deepfakes, but since in a real-world environment the generalization capability is essential, based on the experiments carried out, the attention-based architectures seem to provide superior performances.

摘要

深度学习技术越来越多地被用于处理图像和视频,通常被称为“深度伪造”,这使得区分真实内容和虚假内容变得更具挑战性。虽然已经开发了各种深度伪造检测系统,但它们在现实世界的情况下往往难以检测出深度伪造。特别是,当使用训练集中未使用的新技术修改图像或视频时,这些方法通常无法有效地区分。在本研究中,我们对不同的深度学习架构进行了分析,试图了解哪种架构更能更好地概括深度伪造的概念。根据我们的结果,卷积神经网络(CNN)似乎更能够存储特定异常,因此在元素数量有限和操作方法的数据集情况下表现出色。相反,视觉Transformer在使用更多样化的数据集进行训练时更有效,比其他分析方法具有更出色的泛化能力。最后,Swin Transformer似乎是在更有限的数据环境中使用基于注意力的方法的一个很好的选择,并且在跨数据集场景中表现非常出色。所有分析的架构似乎都有不同的方式看待深度伪造,但由于在现实世界环境中泛化能力至关重要,基于所进行的实验,基于注意力的架构似乎提供了卓越的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/9074b4180e19/jimaging-09-00089-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/6d8a72c9e358/jimaging-09-00089-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/a27ed0ca4c3b/jimaging-09-00089-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/d9fd44d2c068/jimaging-09-00089-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/0b163c9b72e9/jimaging-09-00089-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/216fb27af3e8/jimaging-09-00089-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/9074b4180e19/jimaging-09-00089-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/6d8a72c9e358/jimaging-09-00089-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/a27ed0ca4c3b/jimaging-09-00089-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/d9fd44d2c068/jimaging-09-00089-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/0b163c9b72e9/jimaging-09-00089-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/216fb27af3e8/jimaging-09-00089-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f36/10218961/9074b4180e19/jimaging-09-00089-g006.jpg

相似文献

1
On the Generalization of Deep Learning Models in Video Deepfake Detection.关于深度学习模型在视频深度伪造检测中的泛化
J Imaging. 2023 Apr 29;9(5):89. doi: 10.3390/jimaging9050089.
2
A Review of Image Processing Techniques for Deepfakes.深度伪造的图像处理技术综述。
Sensors (Basel). 2022 Jun 16;22(12):4556. doi: 10.3390/s22124556.
3
Deepfake attack prevention using steganography GANs.使用隐写术生成对抗网络防止深度伪造攻击
PeerJ Comput Sci. 2022 Oct 20;8:e1125. doi: 10.7717/peerj-cs.1125. eCollection 2022.
4
The Face Deepfake Detection Challenge.面部深度伪造检测挑战赛。
J Imaging. 2022 Sep 28;8(10):263. doi: 10.3390/jimaging8100263.
5
A Robust Approach to Multimodal Deepfake Detection.一种用于多模态深度伪造检测的稳健方法。
J Imaging. 2023 Jun 19;9(6):122. doi: 10.3390/jimaging9060122.
6
An efficient deepfake video detection using robust deep learning.使用稳健深度学习的高效深度伪造视频检测
Heliyon. 2024 Feb 9;10(5):e25757. doi: 10.1016/j.heliyon.2024.e25757. eCollection 2024 Mar 15.
7
Audio deepfakes: A survey.音频深度伪造:一项调查。
Front Big Data. 2023 Jan 9;5:1001063. doi: 10.3389/fdata.2022.1001063. eCollection 2022.
8
Comparative Analysis of Deepfake Image Detection Method Using Convolutional Neural Network.基于卷积神经网络的深度伪造图像检测方法的比较分析。
Comput Intell Neurosci. 2021 Dec 16;2021:3111676. doi: 10.1155/2021/3111676. eCollection 2021.
9
Deepfake forensics analysis: An explainable hierarchical ensemble of weakly supervised models.深度伪造取证分析:一种弱监督模型的可解释分层集成
Forensic Sci Int Synerg. 2022 Jan 27;4:100217. doi: 10.1016/j.fsisyn.2022.100217. eCollection 2022.
10
Deepfakes Generation and Detection: A Short Survey.深度伪造的生成与检测:简要综述
J Imaging. 2023 Jan 13;9(1):18. doi: 10.3390/jimaging9010018.

引用本文的文献

1
TSFF-Net: A deep fake video detection model based on two-stream feature domain fusion.TSFF-Net:一种基于双流特征域融合的深度伪造视频检测模型。
PLoS One. 2024 Dec 13;19(12):e0311366. doi: 10.1371/journal.pone.0311366. eCollection 2024.
2
Detecting images generated by diffusers.检测扩散器生成的图像。
PeerJ Comput Sci. 2024 Jul 10;10:e2127. doi: 10.7717/peerj-cs.2127. eCollection 2024.
3
Harmonizing Image Forgery Detection & Localization: Fusion of Complementary Approaches.协调图像伪造检测与定位:互补方法的融合

本文引用的文献

1
The Face Deepfake Detection Challenge.面部深度伪造检测挑战赛。
J Imaging. 2022 Sep 28;8(10):263. doi: 10.3390/jimaging8100263.
2
TweepFake: About detecting deepfake tweets.TweepFake:关于检测深度伪造推文。
PLoS One. 2021 May 13;16(5):e0251415. doi: 10.1371/journal.pone.0251415. eCollection 2021.
J Imaging. 2023 Dec 25;10(1):4. doi: 10.3390/jimaging10010004.