Bai Ningning, Wang Xiaofeng, Han Ruidong, Hou Jianpeng, Wang Qin, Pang Shanmin
Department of Mathematics, Xi'an University of Technology, Xi'an 710048, China.
School of Computer Science and Engineering, Xi'an University of Technology, Xi'an 710048, China.
Neural Netw. 2025 Feb;182:106909. doi: 10.1016/j.neunet.2024.106909. Epub 2024 Nov 17.
The continuous advancement of face forgery techniques has caused a series of trust crises, posing a significant menace to information security and personal privacy. In response, deep learning is being employed to develop effective detection methods to identify deepfake images and videos. Currently, most detection methods generally achieve satisfactory performance in intra-domain detection. However, their effectiveness remains uncertain when detecting unknown forgeries and cross-domain scenarios. From a causal learning perspective, our analysis reveals that the failure of detectors in cross-domain detection primarily arises from spurious correlations between irrelevant features and category labels (real and fake). In this study, we propose the Feature Independence Constrainer (FIC) to alleviate spurious correlations and enhance the generalization for detection method. Specifically, FIC maps features to a Reproducing Kernel Hilbert Space and calculates the covariance matrix among the kernel-mapped features. Dynamic weight parameters are incorporated into the covariance matrix for iterative optimization. Consequently, statistical correlations between features are eliminated, ensuring independence between each pair of features and mitigating the impact of spurious correlations. Additionally, we introduce fine-grained high-frequency components to force the detection model to learn comprehensive forgery-related artifacts while avoiding spurious correlations between irrelevant features and labels. Furthermore, a Feature Alignment Module (FAM) is designed to explore higher-order dependencies between the spatial and frequency domains, allowing for the extraction of richer potential forgery traces. Quantitative and qualitative experiments demonstrate that the proposed method achieves competitive performance across multiple face forgery benchmark datasets, outperforming state-of-the-art methods.
面部伪造技术的不断发展引发了一系列信任危机,对信息安全和个人隐私构成了重大威胁。作为回应,人们正在利用深度学习来开发有效的检测方法,以识别深度伪造的图像和视频。目前,大多数检测方法在域内检测中通常能取得令人满意的性能。然而,在检测未知伪造和跨域场景时,它们的有效性仍然不确定。从因果学习的角度来看,我们的分析表明,检测器在跨域检测中的失败主要源于无关特征与类别标签(真实和伪造)之间的虚假相关性。在本研究中,我们提出了特征独立性约束器(FIC),以减轻虚假相关性并增强检测方法的泛化能力。具体而言,FIC将特征映射到再生核希尔伯特空间,并计算核映射特征之间的协方差矩阵。动态权重参数被纳入协方差矩阵进行迭代优化。因此,特征之间的统计相关性被消除,确保每对特征之间的独立性,并减轻虚假相关性的影响。此外,我们引入了细粒度高频分量,以迫使检测模型学习与伪造相关的全面伪像,同时避免无关特征与标签之间的虚假相关性。此外,还设计了一个特征对齐模块(FAM)来探索空间和频率域之间的高阶依赖性,从而能够提取更丰富的潜在伪造痕迹。定量和定性实验表明,所提出的方法在多个面部伪造基准数据集上取得了有竞争力的性能,优于现有方法。