• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

语义分割利用同时深度估计。

Semantic Segmentation Leveraging Simultaneous Depth Estimation.

机构信息

School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China.

Peng Cheng Laboratory, Shenzhen 518055, China.

出版信息

Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.

DOI:10.3390/s21030690
PMID:33498358
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7864030/
Abstract

Semantic segmentation is one of the most widely studied problems in computer vision communities, which makes a great contribution to a variety of applications. A lot of learning-based approaches, such as Convolutional Neural Network (CNN), have made a vast contribution to this problem. While rich context information of the input images can be learned from multi-scale receptive fields by convolutions with deep layers, traditional CNNs have great difficulty in learning the geometrical relationship and distribution of objects in the RGB image due to the lack of depth information, which may lead to an inferior segmentation quality. To solve this problem, we propose a method that improves segmentation quality with depth estimation on RGB images. Specifically, we estimate depth information on RGB images via a depth estimation network, and then feed the depth map into the CNN which is able to guide the semantic segmentation. Furthermore, in order to parse the depth map and RGB images simultaneously, we construct a multi-branch encoder-decoder network and fuse the RGB and depth features step by step. Extensive experimental evaluation on four baseline networks demonstrates that our proposed method can enhance the segmentation quality considerably and obtain better performance compared to other segmentation networks.

摘要

语义分割是计算机视觉领域研究最广泛的问题之一,它为各种应用做出了巨大贡献。许多基于学习的方法,如卷积神经网络 (CNN),对这个问题做出了巨大的贡献。虽然通过深层卷积可以从多尺度感受野中学习到输入图像的丰富上下文信息,但由于缺乏深度信息,传统的 CNN 在学习 RGB 图像中物体的几何关系和分布方面存在很大困难,这可能导致分割质量下降。为了解决这个问题,我们提出了一种利用 RGB 图像深度估计来提高分割质量的方法。具体来说,我们通过深度估计网络对 RGB 图像进行深度估计,然后将深度图输入到能够指导语义分割的 CNN 中。此外,为了同时解析深度图和 RGB 图像,我们构建了一个多分支编解码器网络,并逐步融合 RGB 和深度特征。在四个基线网络上的广泛实验评估表明,我们提出的方法可以显著提高分割质量,并获得比其他分割网络更好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/5e75c706e955/sensors-21-00690-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/3c50ab34b3e3/sensors-21-00690-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/47dcd66d53b9/sensors-21-00690-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/c371550fef29/sensors-21-00690-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/10314ff226ae/sensors-21-00690-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/e7bf45c71eeb/sensors-21-00690-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/c1e5e97d75d6/sensors-21-00690-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/ca04e57fbaf7/sensors-21-00690-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/5e75c706e955/sensors-21-00690-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/3c50ab34b3e3/sensors-21-00690-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/47dcd66d53b9/sensors-21-00690-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/c371550fef29/sensors-21-00690-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/10314ff226ae/sensors-21-00690-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/e7bf45c71eeb/sensors-21-00690-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/c1e5e97d75d6/sensors-21-00690-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/ca04e57fbaf7/sensors-21-00690-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a137/7864030/5e75c706e955/sensors-21-00690-g008.jpg

相似文献

1
Semantic Segmentation Leveraging Simultaneous Depth Estimation.语义分割利用同时深度估计。
Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.
2
SCN: Switchable Context Network for Semantic Segmentation of RGB-D Images.SCN:用于 RGB-D 图像语义分割的可切换上下文网络。
IEEE Trans Cybern. 2020 Mar;50(3):1120-1131. doi: 10.1109/TCYB.2018.2885062. Epub 2018 Dec 20.
3
Zig-Zag Network for Semantic Segmentation of RGB-D Images.Zig-Zag 网络用于 RGB-D 图像的语义分割。
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2642-2655. doi: 10.1109/TPAMI.2019.2923513. Epub 2019 Jun 18.
4
Collaborative Deconvolutional Neural Networks for Joint Depth Estimation and Semantic Segmentation.用于联合深度估计和语义分割的协作式反卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5655-5666. doi: 10.1109/TNNLS.2017.2787781. Epub 2018 Mar 20.
5
Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.利用单目图像的深度进行目标检测和语义分割。
IEEE Trans Image Process. 2017 Feb;26(2):836-846. doi: 10.1109/TIP.2016.2621673. Epub 2016 Oct 26.
6
Multi-Receptive-Field CNN for Semantic Segmentation of Medical Images.多感受野卷积神经网络在医学图像语义分割中的应用
IEEE J Biomed Health Inform. 2020 Nov;24(11):3215-3225. doi: 10.1109/JBHI.2020.3016306. Epub 2020 Nov 4.
7
Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network.使用混合卷积神经网络从单张RGB图像进行深度估计和语义分割
Sensors (Basel). 2019 Apr 15;19(8):1795. doi: 10.3390/s19081795.
8
Swin-MFA: A Multi-Modal Fusion Attention Network Based on Swin-Transformer for Low-Light Image Human Segmentation.Swin-MFA:一种基于 Swin-Transformer 的多模态融合注意力网络,用于低光照图像人体分割。
Sensors (Basel). 2022 Aug 19;22(16):6229. doi: 10.3390/s22166229.
9
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
10
Simultaneous Semantic Segmentation and Depth Completion with Constraint of Boundary.基于边界约束的语义分割和深度补全的同步实现
Sensors (Basel). 2020 Jan 23;20(3):635. doi: 10.3390/s20030635.

引用本文的文献

1
Improving Depth Estimation by Embedding Semantic Segmentation: A Hybrid CNN Model.通过嵌入语义分割来提高深度估计:一种混合 CNN 模型。
Sensors (Basel). 2022 Feb 21;22(4):1669. doi: 10.3390/s22041669.
2
New End-to-End Strategy Based on DeepLabv3+ Semantic Segmentation for Human Head Detection.基于 DeepLabv3+语义分割的全新端到端人体头部检测策略。
Sensors (Basel). 2021 Aug 30;21(17):5848. doi: 10.3390/s21175848.

本文引用的文献

1
Deep Ordinal Regression Network for Monocular Depth Estimation.用于单目深度估计的深度序数回归网络
Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2018 Jun;2018:2002-2011. doi: 10.1109/CVPR.2018.00214. Epub 2018 Dec 17.
2
Stacked Deconvolutional Network for Semantic Segmentation.用于语义分割的堆叠去卷积网络
IEEE Trans Image Process. 2019 Jan 25. doi: 10.1109/TIP.2019.2895460.
3
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
4
Automatic Skin Lesion Segmentation Using Deep Fully Convolutional Networks With Jaccard Distance.基于交并比的深度全卷积网络自动皮肤病变分割。
IEEE Trans Med Imaging. 2017 Sep;36(9):1876-1886. doi: 10.1109/TMI.2017.2695227. Epub 2017 Apr 18.
5
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
6
On-line retrainable neural networks: improving the performance of neural networks in image analysis problems.在线可再训练神经网络:提升神经网络在图像分析问题中的性能
IEEE Trans Neural Netw. 2000;11(1):137-55. doi: 10.1109/72.822517.
7
A semantic approach to segmentation of overlapping objects.一种用于重叠物体分割的语义方法。
Methods Inf Med. 2004;43(4):343-53.