Multi-label classification for image tamper detection based on Swin-T segmentation network in the spatial domain.

Authors

Li Li, Zhang Kejia, Lu Jianfeng, Zhang Shanqing

Affiliations

School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, Zhejiang, China.

Shangyu Institute of Science and Engineering, Hangzhou Dianzi University, Shaoxing, Zhejiang, China.

Publication information

PeerJ Comput Sci. 2025 Apr 8;11:e2775. doi: 10.7717/peerj-cs.2775. eCollection 2025.

DOI:10.7717/peerj-cs.2775

PMID:40567786

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12190506/

Abstract

Most deep learning methods for image forgery detection fail to detect and localize tampering operations accurately, and they support only a single tampering type. Our method introduces three key innovations: (1) a spatial perception module that combines the spatial rich model (SRM) with constrained convolution, enabling focused detection of tampering traces while suppressing interference from image content; (2) a hierarchical feature learning architecture that integrates Swin Transformer with UperNet for effective multi-scale tampering pattern recognition; and (3) a comprehensive optimization strategy, including auxiliary supervision, self-supervised learning, and hard example mining, which significantly improves model convergence and detection accuracy. Experiments are performed on two established datasets, MixTamper and DocTamper, with 19,600 and 170,000 images, respectively. The results show that the proposed model improves the intersection over union (IoU) by 13% compared with leading algorithms. Additionally, it can accurately detect multiple tampering types in a single image.
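To make the reported metric concrete, the following is a minimal sketch of how a per-class IoU could be computed for a segmentation-style tamper map. It is not taken from the paper: the function name `per_class_iou`, the label scheme (0 = untampered, 1 and 2 = two hypothetical tampering types), and the assumption that each pixel carries exactly one label are all illustrative.

```python
def per_class_iou(pred, target, num_classes):
    """Return {class_id: IoU} for every tampering class present in pred or target.

    pred and target are 2D lists of integer labels with the same shape.
    Label 0 is assumed to mean "untampered" and is skipped (an assumption
    made for this sketch, not a detail confirmed by the abstract).
    """
    ious = {}
    for c in range(1, num_classes):
        inter = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred = p == c
                in_target = t == c
                inter += in_pred and in_target  # pixel in both masks
                union += in_pred or in_target   # pixel in either mask
        if union:  # skip classes absent from both masks
            ious[c] = inter / union
    return ious


# Hypothetical 3x4 label maps containing two tampering types in one image.
pred = [
    [0, 1, 1, 0],
    [0, 1, 2, 2],
    [0, 0, 2, 2],
]
target = [
    [0, 1, 1, 0],
    [0, 1, 1, 2],
    [0, 0, 2, 2],
]

ious = per_class_iou(pred, target, num_classes=3)
mean_iou = sum(ious.values()) / len(ious)
print(ious, mean_iou)
```

Averaging the per-class values gives one common form of mean IoU. The abstract does not state whether the 13% gain is an absolute difference in IoU or a relative improvement, so the sketch does not try to reproduce that figure.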

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b02/12190506/0c3c716b2e6d/peerj-cs-11-2775-g001.jpg

Similar articles

A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.

Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.

SFTA-Net: a self-supervised approach to detect copy-move and splicing forgery to leverage triplet loss, auxiliary loss, and spatial attention.

PeerJ Comput Sci. 2025 Apr 16;11:e2803. doi: 10.7717/peerj-cs.2803. eCollection 2025.

TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.

Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.

SODU2-NET: a novel deep learning-based approach for salient object detection utilizing U-NET.

PeerJ Comput Sci. 2025 May 19;11:e2623. doi: 10.7717/peerj-cs.2623. eCollection 2025.

Semi-Supervised Learning Allows for Improved Segmentation With Reduced Annotations of Brain Metastases Using Multicenter MRI Data.

J Magn Reson Imaging. 2025 Jun;61(6):2469-2479. doi: 10.1002/jmri.29686. Epub 2025 Jan 10.

A review: Lightweight architecture model in deep learning approach for lung disease identification.

Comput Biol Med. 2025 Aug;194:110425. doi: 10.1016/j.compbiomed.2025.110425. Epub 2025 Jun 14.

Skin-CAD: Explainable deep learning classification of skin cancer from dermoscopic images by feature selection of dual high-level CNNs features and transfer learning.

Comput Biol Med. 2024 Aug;178:108798. doi: 10.1016/j.compbiomed.2024.108798. Epub 2024 Jun 25.

FL-W3S: Cross-domain federated learning for weakly supervised semantic segmentation of white blood cells.

Int J Med Inform. 2025 Mar;195:105806. doi: 10.1016/j.ijmedinf.2025.105806. Epub 2025 Jan 23.

DAC-Net: A light-weight U-shaped network based efficient convolution and attention for thyroid nodule segmentation.

Comput Biol Med. 2024 Sep;180:108972. doi: 10.1016/j.compbiomed.2024.108972. Epub 2024 Aug 9.
