用于弱监督单目标定位和语义分割的基于注意力的随机失活层

Attention-Based Dropout Layer for Weakly Supervised Single Object Localization and Semantic Segmentation.

作者信息

Choe Junsuk, Lee Seungho, Shim Hyunjung

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4256-4271. doi: 10.1109/TPAMI.2020.2999099. Epub 2021 Nov 3.

DOI:10.1109/TPAMI.2020.2999099

Abstract

Both weakly supervised single object localization and semantic segmentation techniques learn an object's location using only image-level labels. However, these techniques are limited to cover only the most discriminative part of the object and not the entire object. To address this problem, we propose an attention-based dropout layer, which utilizes the attention mechanism to locate the entire object efficiently. To achieve this, we devise two key components, 1) hiding the most discriminative part from the model to capture the entire object, and 2) highlighting the informative region to improve the classification power of the model. These allow the classifier to be maintained with a reasonable accuracy while the entire object is covered. Through extensive experiments, we demonstrate that the proposed method effectively improves the weakly supervised single object localization accuracy, thereby achieving a new state-of-the-art localization accuracy on the CUB-200-2011 and a comparable accuracy existing state-of-the-arts on the ImageNet-1k. The proposed method is also effective in improving the weakly supervised semantic segmentation performance on the Pascal VOC and MS COCO. Furthermore, the proposed method is more efficient than existing techniques in terms of parameter and computation overheads. Additionally, the proposed method can be easily applied in various backbone networks.

摘要

弱监督单目标定位和语义分割技术都仅使用图像级标签来学习目标的位置。然而，这些技术仅限于覆盖目标最具判别力的部分，而非整个目标。为解决此问题，我们提出了一种基于注意力的随机失活层，其利用注意力机制来高效定位整个目标。为此，我们设计了两个关键组件：1）向模型隐藏最具判别力的部分以捕获整个目标，以及2）突出显示信息区域以提高模型的分类能力。这使得在覆盖整个目标的同时，分类器能够以合理的准确率得以维持。通过大量实验，我们证明所提出的方法有效提高了弱监督单目标定位的准确率，从而在CUB - 200 - 2011数据集上实现了新的最优定位准确率，在ImageNet - 1k数据集上达到了与现有最优方法相当的准确率。所提出的方法在提高Pascal VOC和MS COCO数据集上的弱监督语义分割性能方面也很有效。此外，所提出的方法在参数和计算开销方面比现有技术更高效。另外，所提出的方法可以很容易地应用于各种骨干网络。

相似文献

Attention-Based Dropout Layer for Weakly Supervised Single Object Localization and Semantic Segmentation.用于弱监督单目标定位和语义分割的基于注意力的随机失活层

IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4256-4271. doi: 10.1109/TPAMI.2020.2999099. Epub 2021 Nov 3.

Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization.用于弱监督语义分割和目标定位的抗对抗性操纵归因

IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1618-1634. doi: 10.1109/TPAMI.2022.3166916. Epub 2024 Feb 6.

Online Attention Accumulation for Weakly Supervised Semantic Segmentation.用于弱监督语义分割的在线注意力积累

IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7062-7077. doi: 10.1109/TPAMI.2021.3092573. Epub 2022 Sep 14.

Spatial Structure Constraints for Weakly Supervised Semantic Segmentation.弱监督语义分割的空间结构约束

IEEE Trans Image Process. 2024;33:1136-1148. doi: 10.1109/TIP.2024.3359041. Epub 2024 Feb 6.

Group-Wise Learning for Weakly Supervised Semantic Segmentation.基于群体学习的弱监督语义分割。

IEEE Trans Image Process. 2022;31:799-811. doi: 10.1109/TIP.2021.3132834. Epub 2022 Jan 4.

Affinity Attention Graph Neural Network for Weakly Supervised Semantic Segmentation.基于亲和注意力图神经网络的弱监督语义分割。

IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):8082-8096. doi: 10.1109/TPAMI.2021.3083269. Epub 2022 Oct 4.

TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.TS-CAM：用于弱监督目标定位的令牌语义耦合注意力图

IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):9109-9121. doi: 10.1109/TNNLS.2022.3218471. Epub 2024 Jul 8.

Enhanced Spatial Feature Learning for Weakly Supervised Object Detection.用于弱监督目标检测的增强空间特征学习

IEEE Trans Neural Netw Learn Syst. 2022 Jun 8;PP. doi: 10.1109/TNNLS.2022.3178180.

Auxiliary Tasks Enhanced Dual-Affinity Learning for Weakly Supervised Semantic Segmentation.辅助任务增强双亲和学习用于弱监督语义分割

IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):5082-5096. doi: 10.1109/TNNLS.2024.3373566. Epub 2025 Feb 28.

Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning.超越单图像进行弱监督语义分割学习

IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1635-1649. doi: 10.1109/TPAMI.2022.3168530. Epub 2024 Feb 6.

引用本文的文献

Anomaly-guided weakly supervised lesion segmentation on retinal OCT images.基于视网膜光学相干断层扫描（OCT）图像的异常引导弱监督病变分割

Med Image Anal. 2024 May;94:103139. doi: 10.1016/j.media.2024.103139. Epub 2024 Mar 12.

Module of Axis-based Nexus Attention for weakly supervised object localization.用于弱监督目标定位的基于轴的关系注意力模块。

Sci Rep. 2023 Oct 30;13(1):18588. doi: 10.1038/s41598-023-45796-8.

A Cross-Domain Weakly Supervised Diabetic Retinopathy Lesion Identification Method Based on Multiple Instance Learning and Domain Adaptation.一种基于多实例学习和域适应的跨域弱监督糖尿病视网膜病变病变识别方法。

Bioengineering (Basel). 2023 Sep 20;10(9):1100. doi: 10.3390/bioengineering10091100.

Automated bone marrow cell classification through dual attention gates dense neural networks.基于双注意力门控密集神经网络的自动骨髓细胞分类。

J Cancer Res Clin Oncol. 2023 Dec;149(19):16971-16981. doi: 10.1007/s00432-023-05384-9. Epub 2023 Sep 23.

Diagnosis of Polypoidal Choroidal Vasculopathy From Fluorescein Angiography Using Deep Learning.基于深度学习的荧光素血管造影在息肉样脉络膜血管病变诊断中的应用。

Transl Vis Sci Technol. 2022 Feb 1;11(2):6. doi: 10.1167/tvst.11.2.6.

SSD-EMB: An Improved SSD Using Enhanced Feature Map Block for Object Detection.SSD-EMB：一种利用增强特征图块的 SSD 目标检测改进方法。

Sensors (Basel). 2021 Apr 17;21(8):2842. doi: 10.3390/s21082842.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于弱监督单目标定位和语义分割的基于注意力的随机失活层

Attention-Based Dropout Layer for Weakly Supervised Single Object Localization and Semantic Segmentation.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献