STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation.

Publication Information

IEEE Trans Pattern Anal Mach Intell. 2017 Nov;39(11):2314-2320. doi: 10.1109/TPAMI.2016.2636150. Epub 2016 Dec 6.

Abstract

Recently, significant progress has been made on semantic object segmentation thanks to the development of deep convolutional neural networks (DCNNs). Training such a DCNN usually relies on a large number of images with pixel-level segmentation masks, and annotating these images is very costly in terms of both money and human effort. In this paper, we propose a simple-to-complex (STC) framework in which only image-level annotations are used to learn DCNNs for semantic segmentation. Specifically, we first train an initial segmentation network, called Initial-DCNN, with the saliency maps of simple images (i.e., those with a single category of major object(s) and a clean background). These saliency maps can be obtained automatically by existing bottom-up salient object detection techniques, which require no supervision. Then, a better network, called Enhanced-DCNN, is learned under supervision from the segmentation masks of simple images predicted by the Initial-DCNN, together with the image-level annotations. Finally, pixel-level segmentation masks of complex images (two or more categories of objects with cluttered backgrounds), inferred using the Enhanced-DCNN and image-level annotations, serve as the supervision for learning the Powerful-DCNN for semantic segmentation. Our method uses 40K simple images from Flickr.com and 10K complex images from PASCAL VOC for stepwise boosting of the segmentation network. Extensive experiments on the PASCAL VOC 2012 segmentation benchmark demonstrate the superiority of the proposed STC framework over other state-of-the-art methods.
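The abstract describes a three-stage bootstrapping pipeline. The sketch below restates that flow in Python purely as an illustration; it is not the authors' implementation. All names (mask_from_saliency, restrict_to_labels, train_segmentation_net) and the saliency-thresholding heuristic are assumptions, and each train_segmentation_net call stands in for ordinary supervised training of a segmentation DCNN with per-pixel labels.

```python
# Hypothetical sketch of the three-stage STC training flow described in the
# abstract. Images are assumed to carry only image-level annotations
# (.label for single-category simple images, .labels for complex images)
# plus, for simple images, an unsupervised bottom-up saliency map.

def mask_from_saliency(saliency_map, image_label, threshold=0.5):
    """Turn an unsupervised saliency map into a pseudo mask: salient
    pixels take the image-level class id, the rest become background (0).
    The fixed threshold is a simplifying assumption."""
    return [[image_label if s > threshold else 0 for s in row]
            for row in saliency_map]

def restrict_to_labels(predicted_mask, image_labels):
    """Suppress predicted classes that contradict the image-level tags,
    so pseudo masks contain only categories known to be present."""
    allowed = set(image_labels) | {0}  # 0 = background
    return [[c if c in allowed else 0 for c in row]
            for row in predicted_mask]

def train_stc(simple_images, complex_images, train_segmentation_net):
    # Stage 1: Initial-DCNN, trained on saliency-derived masks of simple
    # images (single major object category, clean background).
    stage1_masks = [mask_from_saliency(img.saliency, img.label)
                    for img in simple_images]
    initial_dcnn = train_segmentation_net(simple_images, stage1_masks)

    # Stage 2: Enhanced-DCNN, supervised by Initial-DCNN predictions on
    # the same simple images, filtered by their image-level annotations.
    stage2_masks = [restrict_to_labels(initial_dcnn(img), [img.label])
                    for img in simple_images]
    enhanced_dcnn = train_segmentation_net(simple_images, stage2_masks)

    # Stage 3: Powerful-DCNN, trained on Enhanced-DCNN masks for complex
    # images (multiple categories, cluttered background), again
    # constrained by the image-level tags.
    stage3_masks = [restrict_to_labels(enhanced_dcnn(img), img.labels)
                    for img in complex_images]
    return train_segmentation_net(complex_images, stage3_masks)
```

The point the abstract emphasizes is that supervision is bootstrapped: each stage trains on pseudo masks produced by the previous, weaker network, moving from simple to complex imagery without ever using pixel-level ground truth.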
