Proposals Generation for Weakly Supervised Object Detection in Artwork Images.

Author information

Milani Federico, Pinciroli Vago Nicolò Oreste, Fraternali Piero

Affiliation

Department of Electronics Information and Bioengineering, Politecnico di Milano, 20133 Milano, Italy.

Publication information

J Imaging. 2022 Aug 6;8(8):215. doi: 10.3390/jimaging8080215.

DOI:10.3390/jimaging8080215

PMID:36005458

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9410216/

Abstract

Object Detection requires many precise annotations, which are available for natural images but not for many non-natural data sets such as artwork data sets. One solution is to use Weakly Supervised Object Detection (WSOD) techniques, which learn accurate object localization from image-level labels. Studies have demonstrated that state-of-the-art end-to-end architectures may not be suitable for domains in which images or classes differ substantially from those used to pre-train networks. This paper presents a novel two-stage Weakly Supervised Object Detection approach for obtaining accurate bounding boxes on non-natural data sets. The proposed method exploits existing classification knowledge to generate pseudo-ground truth bounding boxes from Class Activation Maps (CAMs). The automatically generated annotations are used to train a robust Faster R-CNN object detector. Quantitative and qualitative analyses show that bounding boxes generated from CAMs can compensate for the lack of manually annotated ground truth (GT) and that an object detector trained with such pseudo-GT surpasses end-to-end WSOD state-of-the-art methods on ArtDL 2.0 (≈41.5% mAP) and IconArt (≈17% mAP), two artwork data sets. The proposed solution is a step towards the computer-aided study of non-natural images and opens the way to more advanced tasks, e.g., automatic artwork image captioning for digital archive applications.
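
The first stage described above, turning a Class Activation Map into pseudo-ground truth boxes, can be illustrated with a minimal sketch. The abstract does not specify how the boxes are extracted, so everything here is an assumption made for illustration: the hypothetical function name `cam_to_boxes`, the relative threshold of 0.5, and the use of 4-connected regions. It is not the authors' procedure.

```python
from collections import deque


def cam_to_boxes(cam, rel_threshold=0.5):
    """Return one (x_min, y_min, x_max, y_max) box per connected region of
    the CAM whose activation is at least rel_threshold * max(cam).

    `cam` is a 2D list of non-negative floats (rows of equal length).
    The relative threshold and 4-connectivity are illustrative choices,
    not values taken from the paper.
    """
    height, width = len(cam), len(cam[0])
    peak = max(max(row) for row in cam)
    if peak <= 0:
        return []  # no activation at all: nothing to localize
    cutoff = rel_threshold * peak
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y0 in range(height):
        for x0 in range(width):
            if seen[y0][x0] or cam[y0][x0] < cutoff:
                continue
            # Breadth-first flood fill over one above-threshold region.
            queue = deque([(y0, x0)])
            seen[y0][x0] = True
            x_min = x_max = x0
            y_min = y_max = y0
            while queue:
                y, x = queue.popleft()
                x_min, x_max = min(x_min, x), max(x_max, x)
                y_min, y_max = min(y_min, y), max(y_max, y)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and cam[ny][nx] >= cutoff):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((x_min, y_min, x_max, y_max))
    return boxes


# Toy 4x5 activation map with two separate hot regions.
toy_cam = [
    [0.0, 0.1, 0.0, 0.0, 0.0],
    [0.2, 0.9, 0.8, 0.0, 0.0],
    [0.0, 0.7, 1.0, 0.0, 0.6],
    [0.0, 0.0, 0.0, 0.0, 0.7],
]
pseudo_boxes = cam_to_boxes(toy_cam, rel_threshold=0.5)
print(pseudo_boxes)
```

The coordinates are in CAM grid cells, so in practice they would need to be rescaled to the image resolution before being written out as training annotations for a detector such as Faster R-CNN.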


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c250/9410216/9616bb594230/jimaging-08-00215-g001.jpg
