用于图像分类的决策融合网络

Decision Fusion Networks for Image Classification.

作者信息

Tang Keke, Ma Yuexin, Miao Dingruibo, Song Peng, Gu Zhaoquan, Tian Zhihong, Wang Wenping

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):3890-3903. doi: 10.1109/TNNLS.2022.3196129. Epub 2025 Feb 28.

DOI:10.1109/TNNLS.2022.3196129

Abstract

Convolutional neural networks, in which each layer receives features from the previous layer(s) and then aggregates/abstracts higher level features from them, are widely adopted for image classification. To avoid information loss during feature aggregation/abstraction and fully utilize lower layer features, we propose a novel decision fusion module (DFM) for making an intermediate decision based on the features in the current layer and then fuse its results with the original features before passing them to the next layers. This decision is devised to determine an auxiliary category corresponding to the category at a higher hierarchical level, which can, thus, serve as category-coherent guidance for later layers. Therefore, by stacking a collection of DFMs into a classification network, the generated decision fusion network is explicitly formulated to progressively aggregate/abstract more discriminative features guided by these decisions and then refine the decisions based on the newly generated features in a layer-by-layer manner. Comprehensive results on four benchmarks validate that the proposed DFM can bring significant improvements for various common classification networks at a minimal additional computational cost and are superior to the state-of-the-art decision fusion-based methods. In addition, we demonstrate the generalization ability of the DFM to object detection and semantic segmentation.

摘要

卷积神经网络在图像分类中被广泛采用，其中每一层都从前一层接收特征，然后从这些特征中聚合/提取更高层次的特征。为了避免在特征聚合/提取过程中信息丢失并充分利用低层特征，我们提出了一种新颖的决策融合模块（DFM），用于基于当前层的特征做出中间决策，然后在将结果传递到下一层之前，将其与原始特征进行融合。此决策旨在确定与更高层次类别相对应的辅助类别，从而可以作为后续层的类别连贯指导。因此，通过将一系列DFM堆叠到分类网络中，生成的决策融合网络被明确设计为在这些决策的指导下逐步聚合/提取更具判别力的特征，然后以逐层方式基于新生成的特征细化决策。在四个基准上的综合结果验证了所提出的DFM能够以最小的额外计算成本为各种常见分类网络带来显著改进，并且优于基于决策融合的现有方法。此外，我们展示了DFM在目标检测和语义分割方面的泛化能力。