• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于图像语义分割的新型上采样与上下文卷积

A Novel Upsampling and Context Convolution for Image Semantic Segmentation.

作者信息

Sediqi Khwaja Monib, Lee Hyo Jong

机构信息

Division of Computer Science and Engineering, CAIIT, Jeonbuk National University, Jeonju 54896, Korea.

出版信息

Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.

DOI:10.3390/s21062170
PMID:33804591
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8003770/
Abstract

Semantic segmentation, which refers to pixel-wise classification of an image, is a fundamental topic in computer vision owing to its growing importance in the robot vision and autonomous driving sectors. It provides rich information about objects in the scene such as object boundary, category, and location. Recent methods for semantic segmentation often employ an encoder-decoder structure using deep convolutional neural networks. The encoder part extracts features of the image using several filters and pooling operations, whereas the decoder part gradually recovers the low-resolution feature maps of the encoder into a full input resolution feature map for pixel-wise prediction. However, the encoder-decoder variants for semantic segmentation suffer from severe spatial information loss, caused by pooling operations or stepwise convolutions, and does not consider the context in the scene. In this paper, we propose a novel dense upsampling convolution method based on a guided filter to effectively preserve the spatial information of the image in the network. We further propose a novel local context convolution method that not only covers larger-scale objects in the scene but covers them densely for precise object boundary delineation. Theoretical analyses and experimental results on several benchmark datasets verify the effectiveness of our method. Qualitatively, our approach delineates object boundaries at a level of accuracy that is beyond the current excellent methods. Quantitatively, we report a new record of 82.86% and 81.62% of pixel accuracy on ADE20K and Pascal-Context benchmark datasets, respectively. In comparison with the state-of-the-art methods, the proposed method offers promising improvements.

摘要

语义分割是指对图像进行逐像素分类,由于其在机器人视觉和自动驾驶领域的重要性日益增加,它已成为计算机视觉中的一个基本课题。它提供了有关场景中物体的丰富信息,如物体边界、类别和位置。最近的语义分割方法通常采用基于深度卷积神经网络的编码器-解码器结构。编码器部分使用多个滤波器和池化操作来提取图像特征,而解码器部分则将编码器的低分辨率特征图逐步恢复为全输入分辨率特征图,用于逐像素预测。然而,用于语义分割的编码器-解码器变体存在严重的空间信息损失问题,这是由池化操作或逐步卷积导致的,并且没有考虑场景中的上下文信息。在本文中,我们提出了一种基于引导滤波器的新型密集上采样卷积方法,以有效地在网络中保留图像的空间信息。我们还提出了一种新型的局部上下文卷积方法,该方法不仅可以覆盖场景中更大尺度的物体,而且可以密集地覆盖它们,以精确勾勒物体边界。在几个基准数据集上的理论分析和实验结果验证了我们方法的有效性。定性地说,我们的方法在精度水平上能够勾勒物体边界,超越了当前的优秀方法。定量地说,我们分别在ADE20K和Pascal-Context基准数据集上报告了新的像素准确率记录,分别为82.86%和81.62%。与现有最先进的方法相比,所提出的方法有显著的改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/ae38c96f9003/sensors-21-02170-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/86cd5de0d298/sensors-21-02170-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/f6bf9e58999f/sensors-21-02170-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/85e8f7cd2d35/sensors-21-02170-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/7a80057a3571/sensors-21-02170-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/374df74120ca/sensors-21-02170-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/3088bf764177/sensors-21-02170-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/ae38c96f9003/sensors-21-02170-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/86cd5de0d298/sensors-21-02170-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/f6bf9e58999f/sensors-21-02170-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/85e8f7cd2d35/sensors-21-02170-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/7a80057a3571/sensors-21-02170-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/374df74120ca/sensors-21-02170-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/3088bf764177/sensors-21-02170-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/061f/8003770/ae38c96f9003/sensors-21-02170-g007.jpg

相似文献

1
A Novel Upsampling and Context Convolution for Image Semantic Segmentation.一种用于图像语义分割的新型上采样与上下文卷积
Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.
2
A Holistically-Guided Decoder for Deep Representation Learning With Applications to Semantic Segmentation and Object Detection.一种用于深度表示学习的整体引导解码器及其在语义分割和目标检测中的应用
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11390-11406. doi: 10.1109/TPAMI.2021.3114342. Epub 2023 Sep 5.
3
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
4
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
5
DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation.DTS-Net:用于快速准确语义对象分割的深度到空间网络
Sensors (Basel). 2022 Jan 3;22(1):337. doi: 10.3390/s22010337.
6
Boundary-aware context neural network for medical image segmentation.边界感知上下文神经网络在医学图像分割中的应用。
Med Image Anal. 2022 May;78:102395. doi: 10.1016/j.media.2022.102395. Epub 2022 Feb 14.
7
Feature-guided attention network for medical image segmentation.基于特征引导的注意力网络的医学图像分割。
Med Phys. 2023 Aug;50(8):4871-4886. doi: 10.1002/mp.16253. Epub 2023 Feb 16.
8
High-Resolution Encoder-Decoder Networks for Low-Contrast Medical Image Segmentation.用于低对比度医学图像分割的高分辨率编码器-解码器网络
IEEE Trans Image Process. 2019 Jun 19. doi: 10.1109/TIP.2019.2919937.
9
Attention Guided Global Enhancement and Local Refinement Network for Semantic Segmentation.用于语义分割的注意力引导全局增强与局部细化网络
IEEE Trans Image Process. 2022;31:3211-3223. doi: 10.1109/TIP.2022.3166673. Epub 2022 Apr 22.
10
Semantic segmentation method of underwater images based on encoder-decoder architecture.基于编解码器架构的水下图像语义分割方法。
PLoS One. 2022 Aug 25;17(8):e0272666. doi: 10.1371/journal.pone.0272666. eCollection 2022.

引用本文的文献

1
Detecting and localizing cervical lesions in colposcopic images with deep semantic feature mining.通过深度语义特征挖掘检测和定位阴道镜图像中的宫颈病变。
Front Oncol. 2024 Nov 22;14:1423782. doi: 10.3389/fonc.2024.1423782. eCollection 2024.
2
A new online detection method of tobacco impurities for tobacco robot.一种用于烟草机器人的烟草杂质在线检测新方法。
Front Neurorobot. 2024 Jun 7;18:1422960. doi: 10.3389/fnbot.2024.1422960. eCollection 2024.
3
Lane and Road Marker Semantic Video Segmentation Using Mask Cropping and Optical Flow Estimation.

本文引用的文献

1
Scene Segmentation With Dual Relation-Aware Attention Network.基于双重关系感知注意力网络的场景分割。
IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2547-2560. doi: 10.1109/TNNLS.2020.3006524. Epub 2021 Jun 2.
2
Semantic Segmentation with Context Encoding and Multi-Path Decoding.基于上下文编码和多路径解码的语义分割
IEEE Trans Image Process. 2020 Jan 9. doi: 10.1109/TIP.2019.2962685.
3
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
基于掩模裁剪和光流估计的车道和路牌语义视频分割。
Sensors (Basel). 2021 Oct 28;21(21):7156. doi: 10.3390/s21217156.
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
4
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
5
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.
6
Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation.多尺度组合分组进行图像分割和目标提议生成。
IEEE Trans Pattern Anal Mach Intell. 2017 Jan;39(1):128-140. doi: 10.1109/TPAMI.2016.2537320. Epub 2016 Mar 2.
7
Learning hierarchical features for scene labeling.学习用于场景标注的层次特征。
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1915-29. doi: 10.1109/TPAMI.2012.231.
8
Guided image filtering.引导图像滤波。
IEEE Trans Pattern Anal Mach Intell. 2013 Jun;35(6):1397-409. doi: 10.1109/TPAMI.2012.213.