Jiang Peng-Tao, Zhang Chang-Bin, Hou Qibin, Cheng Ming-Ming, Wei Yunchao
IEEE Trans Image Process. 2021;30:5875-5888. doi: 10.1109/TIP.2021.3089943. Epub 2021 Jun 28.
Class activation maps are generated from the final convolutional layer of a CNN and highlight the discriminative object regions for a class of interest. These discovered regions have been widely used in weakly-supervised tasks. However, because the final convolutional layer has a small spatial resolution, such class activation maps can only locate coarse regions of the target objects, which limits the performance of weakly-supervised tasks that need pixel-accurate object locations. We therefore aim to extract finer-grained object localization information from class activation maps so that target objects can be located more accurately. In this paper, by rethinking the relationships between feature maps and their corresponding gradients, we propose a simple yet effective method, called LayerCAM, that can produce reliable class activation maps for different layers of a CNN. This property enables us to collect object localization information from coarse (rough spatial localization) to fine (precise fine-grained details) levels. We further integrate these maps into a single high-quality class activation map in which object-related pixels are better highlighted. To evaluate the quality of the class activation maps produced by LayerCAM, we apply them to weakly-supervised object localization and semantic segmentation. Experiments demonstrate that the class activation maps generated by our method are more effective and reliable than those produced by existing attention methods. The code will be made publicly available.
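The core idea the abstract describes, weighting each spatial location of a layer's feature maps by the positive part of its gradient and then summing over channels, can be sketched as follows. This is a minimal NumPy sketch based on our reading of the abstract, not the authors' released code; the function name `layercam` and the `(channels, height, width)` array layout are assumptions:

```python
import numpy as np

def layercam(activations: np.ndarray, gradients: np.ndarray) -> np.ndarray:
    """Sketch of a LayerCAM-style map from one conv layer.

    activations, gradients: arrays of shape (C, H, W), e.g. captured
    with forward/backward hooks for the score of the target class.
    Returns a (H, W) map normalized to [0, 1].
    """
    weights = np.maximum(gradients, 0)           # element-wise ReLU on gradients
    cam = (weights * activations).sum(axis=0)    # weight each location, sum over channels
    cam = np.maximum(cam, 0)                     # keep only positive class evidence
    if cam.max() > 0:
        cam = cam / cam.max()                    # rescale to [0, 1] for visualization
    return cam
```

Because the weighting is per-location rather than a single global weight per channel (as in the original CAM/Grad-CAM formulation), the same routine can be applied to shallow layers, whose higher spatial resolution supplies the fine-grained detail the paper then fuses with coarser deep-layer maps.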