IEEE Trans Image Process. 2020;29:225-236. doi: 10.1109/TIP.2019.2926748. Epub 2019 Jul 12.
Deep neural network-based semantic segmentation generally requires large-scale, costly annotations for training to achieve good performance. To avoid the pixel-wise segmentation annotations required by most methods, some researchers have recently attempted to use object-level labels (e.g., bounding boxes) or image-level labels (e.g., image categories). In this paper, we propose a novel recursive coarse-to-fine semantic segmentation framework based only on image-level category labels. For each image, an initial coarse mask is first generated by a convolutional neural network-based unsupervised foreground segmentation model and is then enhanced by a graph model. The enhanced coarse mask is fed to a fully convolutional neural network for recursive refinement. Unlike existing image-level label-based semantic segmentation methods, which require labels for all categories present in images containing multiple types of objects, our framework needs only one label per image and can still handle images containing multi-category objects. Trained only on ImageNet, our framework achieves performance on the PASCAL VOC dataset comparable to other state-of-the-art image-level label-based semantic segmentation methods. Furthermore, our framework can be easily extended to the foreground object segmentation task, achieving performance comparable to state-of-the-art supervised methods on the Internet object dataset.
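The recursive coarse-to-fine pipeline described in the abstract (unsupervised coarse mask → graph-model enhancement → recursive refinement) can be sketched as follows. This is a minimal illustrative toy, not the paper's method: `coarse_mask`, `graph_enhance`, and `recursive_refine` are hypothetical stand-ins that replace the CNN foreground model, the graph model, and the fully convolutional network with simple intensity thresholding, neighborhood smoothing, and a nearest-mean reassignment loop.

```python
import numpy as np

def coarse_mask(image, thresh=0.5):
    # Stand-in for the CNN-based unsupervised foreground model:
    # a plain intensity threshold yields the initial coarse mask.
    return (image > thresh).astype(float)

def graph_enhance(mask):
    # Stand-in for the graph-model enhancement: 3x3 neighborhood
    # averaging followed by re-thresholding smooths the mask.
    padded = np.pad(mask, 1, mode="edge")
    smoothed = sum(
        padded[1 + dy : padded.shape[0] - 1 + dy,
               1 + dx : padded.shape[1] - 1 + dx]
        for dy in (-1, 0, 1) for dx in (-1, 0, 1)
    ) / 9.0
    return (smoothed > 0.5).astype(float)

def recursive_refine(image, n_iters=3):
    # Coarse-to-fine loop: each round, the (stand-in) refinement
    # model reassigns pixels to the nearer of the current
    # foreground/background mean intensities, then re-enhances.
    mask = graph_enhance(coarse_mask(image))
    for _ in range(n_iters):
        fg_mean = image[mask == 1].mean() if mask.any() else 1.0
        bg_mean = image[mask == 0].mean() if (mask == 0).any() else 0.0
        mask = graph_enhance(
            (np.abs(image - fg_mean) < np.abs(image - bg_mean)).astype(float)
        )
    return mask

# Toy image: dark background with a bright foreground square.
rng = np.random.default_rng(0)
img = rng.uniform(0.0, 0.4, (16, 16))
img[4:12, 4:12] = rng.uniform(0.6, 1.0, (8, 8))
result = recursive_refine(img)
```

On this synthetic image the loop recovers the bright square as foreground; in the paper, each refinement round would instead retrain and re-apply the fully convolutional network on the previous round's mask.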