IEEE Trans Image Process. 2024;33:4824-4839. doi: 10.1109/TIP.2024.3449574. Epub 2024 Sep 5.
Camouflaged objects often blend in with their surroundings, making the perception of a camouflaged object a more complex procedure. However, most neural-network-based methods that simulate the visual information processing pathway of creatures only roughly define the general process, which deficiently reproduces the process of identifying camouflaged objects. How to make modeled neural networks perceive camouflaged objects as effectively as creatures is a significant topic that deserves further consideration. After meticulous analysis of biological visual information processing, we propose an end-to-end prudent and comprehensive neural network, termed IdeNet, to model the critical information processing. Specifically, IdeNet divides the entire perception process into five stages: information collection, information augmentation, information filtering, information localization, and information correction and object identification. In addition, we design tailored visual information processing mechanisms for each stage, including the information augmentation module (IAM), the information filtering module (IFM), the information localization module (ILM), and the information correction module (ICM), to model the critical visual information processing and establish the inextricable association of biological behavior and visual information processing. The extensive experiments show that IdeNet outperforms state-of-the-art methods in all benchmarks, demonstrating the effectiveness of the five-stage partitioning of visual information processing pathway and the tailored visual information processing mechanisms for camouflaged object detection. Our code is publicly available at: https://github.com/whyandbecause/IdeNet.
伪装目标通常与周围环境融为一体,使得对伪装目标的感知成为一个更加复杂的过程。然而,大多数基于神经网络的方法只是粗略地定义了生物视觉信息处理途径的一般过程,无法充分再现识别伪装目标的过程。如何使建模神经网络像生物一样有效地感知伪装目标,是一个值得进一步考虑的重要课题。在对生物视觉信息处理进行细致分析之后,我们提出了一个端到端的谨慎而全面的神经网络 IdeNet,用于模拟关键的信息处理过程。具体来说,IdeNet 将整个感知过程分为五个阶段:信息采集、信息增强、信息过滤、信息定位和信息修正与目标识别。此外,我们为每个阶段设计了专门的视觉信息处理机制,包括信息增强模块(IAM)、信息过滤模块(IFM)、信息定位模块(ILM)和信息修正模块(ICM),以模拟关键的视觉信息处理过程,并建立生物行为与视觉信息处理之间的不可分割的联系。广泛的实验表明,在所有基准测试中,IdeNet 都优于最先进的方法,这证明了视觉信息处理途径的五阶段划分和针对伪装目标检测的专门视觉信息处理机制的有效性。我们的代码可在以下网址获取:https://github.com/whyandbecause/IdeNet。