用于伪装目标检测的特征分割与聚合网络

Features Split and Aggregation Network for Camouflaged Object Detection.

作者信息

Zhang Zejin, Wang Tao, Wang Jian, Sun Yao

机构信息

HDU-ITMO Joint Institute, Hangzhou Dianzi University, Hangzhou 310018, China.

School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China.

出版信息

J Imaging. 2024 Jan 18;10(1):0. doi: 10.3390/jimaging10010024.

DOI:10.3390/jimaging10010024

PMID:38249009

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11154448/

Abstract

Higher standards have been proposed for detection systems since camouflaged objects are not distinct enough, making it possible to ignore the difference between their background and foreground. In this paper, we present a new framework for Camouflaged Object Detection (COD) named FSANet, which consists mainly of three operations: spatial detail mining (SDM), cross-scale feature combination (CFC), and hierarchical feature aggregation decoder (HFAD). The framework simulates the three-stage detection process of the human visual mechanism when observing a camouflaged scene. Specifically, we have extracted five feature layers using the backbone and divided them into two parts with the second layer as the boundary. The SDM module simulates the human cursory inspection of the camouflaged objects to gather spatial details (such as edge, texture, etc.) and fuses the features to create a cursory impression. The CFC module is used to observe high-level features from various viewing angles and extracts the same features by thoroughly filtering features of various levels. We also design side-join multiplication in the CFC module to avoid detail distortion and use feature element-wise multiplication to filter out noise. Finally, we construct an HFAD module to deeply mine effective features from these two stages, direct the fusion of low-level features using high-level semantic knowledge, and improve the camouflage map using hierarchical cascade technology. Compared to the nineteen deep-learning-based methods in terms of seven widely used metrics, our proposed framework has clear advantages on four public COD datasets, demonstrating the effectiveness and superiority of our model.

摘要

由于伪装物体不够清晰，难以区分其背景和前景，因此对检测系统提出了更高的标准。在本文中，我们提出了一种名为FSANet的伪装目标检测（COD）新框架，该框架主要由三个操作组成：空间细节挖掘（SDM）、跨尺度特征组合（CFC）和分层特征聚合解码器（HFAD）。该框架模拟了人类视觉机制在观察伪装场景时的三阶段检测过程。具体来说，我们使用主干网络提取了五个特征层，并以第二层为边界将它们分为两部分。SDM模块模拟人类对伪装物体的粗略检查，以收集空间细节（如边缘、纹理等）并融合特征以形成粗略印象。CFC模块用于从各个视角观察高级特征，并通过彻底过滤各级特征来提取相同的特征。我们还在CFC模块中设计了侧连接乘法以避免细节失真，并使用特征逐元素乘法来滤除噪声。最后，我们构建了一个HFAD模块，从这两个阶段中深度挖掘有效特征，使用高级语义知识指导低级特征的融合，并使用分层级联技术改进伪装地图。在七个广泛使用的指标方面，与十九种基于深度学习的方法相比，我们提出的框架在四个公共COD数据集上具有明显优势，证明了我们模型的有效性和优越性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/417c/11154448/9109c2e783e0/jimaging-10-00024-g001.jpg

相似文献

Features Split and Aggregation Network for Camouflaged Object Detection.

J Imaging. 2024 Jan 18;10(1):0. doi: 10.3390/jimaging10010024.

Edge-Guided Camouflaged Object Detection via Multi-Level Feature Integration.

Sensors (Basel). 2023 Jun 21;23(13):5789. doi: 10.3390/s23135789.

Feature Aggregation and Propagation Network for Camouflaged Object Detection.

IEEE Trans Image Process. 2022;31:7036-7047. doi: 10.1109/TIP.2022.3217695. Epub 2022 Nov 14.

MAGNet: A Camouflaged Object Detection Network Simulating the Observation Effect of a Magnifier.

Entropy (Basel). 2022 Dec 9;24(12):1804. doi: 10.3390/e24121804.

Guided multi-scale refinement network for camouflaged object detection.

Multimed Tools Appl. 2023;82(4):5785-5801. doi: 10.1007/s11042-022-13274-4. Epub 2022 Jul 30.

Hierarchical Graph Interaction Transformer With Dynamic Token Clustering for Camouflaged Object Detection.

IEEE Trans Image Process. 2024;33:5936-5948. doi: 10.1109/TIP.2024.3475219. Epub 2024 Oct 18.

Camouflaged Object Segmentation Based on Matching-Recognition-Refinement Network.

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):15993-16007. doi: 10.1109/TNNLS.2023.3291595. Epub 2024 Oct 29.

FindNet: Can You Find Me? Boundary-and-Texture Enhancement Network for Camouflaged Object Detection.

IEEE Trans Image Process. 2022;31:6396-6411. doi: 10.1109/TIP.2022.3189828.

Collaborative Camouflaged Object Detection: A Large-Scale Dataset and Benchmark.

IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):18470-18484. doi: 10.1109/TNNLS.2023.3317091. Epub 2024 Dec 2.

Discriminative context-aware network for camouflaged object detection.

Front Artif Intell. 2024 Mar 27;7:1347898. doi: 10.3389/frai.2024.1347898. eCollection 2024.

本文引用的文献

MAGNet: A Camouflaged Object Detection Network Simulating the Observation Effect of a Magnifier.

Entropy (Basel). 2022 Dec 9;24(12):1804. doi: 10.3390/e24121804.

Feature Aggregation and Propagation Network for Camouflaged Object Detection.

IEEE Trans Image Process. 2022;31:7036-7047. doi: 10.1109/TIP.2022.3217695. Epub 2022 Nov 14.

Salient Object Detection via Integrity Learning.

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3738-3752. doi: 10.1109/TPAMI.2022.3179526. Epub 2023 Feb 3.

Concealed Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6024-6042. doi: 10.1109/TPAMI.2021.3085766. Epub 2022 Sep 14.

Res2Net: A New Multi-Scale Backbone Architecture.

IEEE Trans Pattern Anal Mach Intell. 2021 Feb;43(2):652-662. doi: 10.1109/TPAMI.2019.2938758. Epub 2021 Jan 8.

How camouflage works.

Philos Trans R Soc Lond B Biol Sci. 2017 Jul 5;372(1724). doi: 10.1098/rstb.2016.0341.

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.

Context-aware saliency detection.

IEEE Trans Pattern Anal Mach Intell. 2012 Oct;34(10):1915-26. doi: 10.1109/TPAMI.2011.272.

A computational model that recovers the 3D shape of an object from a single 2D retinal representation.

Vision Res. 2009 May;49(9):979-91. doi: 10.1016/j.visres.2008.05.013. Epub 2008 Jul 14.

When does the visual system use viewpoint-invariant representations during recognition?

Brain Res Cogn Brain Res. 2003 May;16(3):399-415. doi: 10.1016/s0926-6410(03)00054-5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于伪装目标检测的特征分割与聚合网络

Features Split and Aggregation Network for Camouflaged Object Detection.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献