• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用混合卷积神经网络从单张RGB图像进行深度估计和语义分割

Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network.

作者信息

Lin Xiao, Sánchez-Escobedo Dalila, Casas Josep R, Pardàs Montse

机构信息

Visual Interactions and Communication Technologies (Vicomtech), 20009 Donostia/San Sebastián, Spain.

Image Processing Group, TSC Department, Technical University of Catalonia (UPC), 08034 Barcelona, Spain.

出版信息

Sensors (Basel). 2019 Apr 15;19(8):1795. doi: 10.3390/s19081795.

DOI:10.3390/s19081795
PMID:30991663
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6514714/
Abstract

Semantic segmentation and depth estimation are two important tasks in computer vision, and many methods have been developed to tackle them. Commonly these two tasks are addressed independently, but recently the idea of merging these two problems into a sole framework has been studied under the assumption that integrating two highly correlated tasks may benefit each other to improve the estimation accuracy. In this paper, depth estimation and semantic segmentation are jointly addressed using a single RGB input image under a unified convolutional neural network. We analyze two different architectures to evaluate which features are more relevant when shared by the two tasks and which features should be kept separated to achieve a mutual improvement. Likewise, our approaches are evaluated under two different scenarios designed to review our results versus single-task and multi-task methods. Qualitative and quantitative experiments demonstrate that the performance of our methodology outperforms the state of the art on single-task approaches, while obtaining competitive results compared with other multi-task methods.

摘要

语义分割和深度估计是计算机视觉中的两项重要任务,并且已经开发了许多方法来处理它们。通常这两项任务是独立处理的,但最近在整合两个高度相关的任务可能会相互受益以提高估计精度的假设下,将这两个问题合并到一个单独框架中的想法得到了研究。在本文中,深度估计和语义分割在统一的卷积神经网络下使用单个RGB输入图像进行联合处理。我们分析了两种不同的架构,以评估哪些特征在由这两项任务共享时更相关,以及哪些特征应该保持分离以实现相互改进。同样,我们的方法在两种不同的场景下进行评估,旨在将我们的结果与单任务和多任务方法进行比较。定性和定量实验表明,我们方法的性能优于单任务方法的现有技术水平,同时与其他多任务方法相比获得了有竞争力的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/d1cbd3e0a17c/sensors-19-01795-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/161fed99d435/sensors-19-01795-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/0e3578f4f2ba/sensors-19-01795-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/beb8368b7731/sensors-19-01795-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/f6d6f9cd0a17/sensors-19-01795-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/ba6b8a7c0789/sensors-19-01795-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/193c2bc82bf3/sensors-19-01795-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/f2fec355d320/sensors-19-01795-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/3b54da71ae6f/sensors-19-01795-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/d1cbd3e0a17c/sensors-19-01795-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/161fed99d435/sensors-19-01795-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/0e3578f4f2ba/sensors-19-01795-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/beb8368b7731/sensors-19-01795-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/f6d6f9cd0a17/sensors-19-01795-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/ba6b8a7c0789/sensors-19-01795-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/193c2bc82bf3/sensors-19-01795-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/f2fec355d320/sensors-19-01795-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/3b54da71ae6f/sensors-19-01795-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c14/6514714/d1cbd3e0a17c/sensors-19-01795-g009.jpg

相似文献

1
Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network.使用混合卷积神经网络从单张RGB图像进行深度估计和语义分割
Sensors (Basel). 2019 Apr 15;19(8):1795. doi: 10.3390/s19081795.
2
Collaborative Deconvolutional Neural Networks for Joint Depth Estimation and Semantic Segmentation.用于联合深度估计和语义分割的协作式反卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5655-5666. doi: 10.1109/TNNLS.2017.2787781. Epub 2018 Mar 20.
3
Semantic Segmentation Leveraging Simultaneous Depth Estimation.语义分割利用同时深度估计。
Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.
4
DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation.DTS-Net:用于快速准确语义对象分割的深度到空间网络
Sensors (Basel). 2022 Jan 3;22(1):337. doi: 10.3390/s22010337.
5
Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.利用单目图像的深度进行目标检测和语义分割。
IEEE Trans Image Process. 2017 Feb;26(2):836-846. doi: 10.1109/TIP.2016.2621673. Epub 2016 Oct 26.
6
Improving Depth Estimation by Embedding Semantic Segmentation: A Hybrid CNN Model.通过嵌入语义分割来提高深度估计:一种混合 CNN 模型。
Sensors (Basel). 2022 Feb 21;22(4):1669. doi: 10.3390/s22041669.
7
SCN: Switchable Context Network for Semantic Segmentation of RGB-D Images.SCN:用于 RGB-D 图像语义分割的可切换上下文网络。
IEEE Trans Cybern. 2020 Mar;50(3):1120-1131. doi: 10.1109/TCYB.2018.2885062. Epub 2018 Dec 20.
8
Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image.基于单张图像的联合深度估计和语义分割的潜在 3D 体素。
Sensors (Basel). 2020 Oct 12;20(20):5765. doi: 10.3390/s20205765.
9
Semantic Segmentation and Depth Estimation Based on Residual Attention Mechanism.基于残差注意力机制的语义分割与深度估计
Sensors (Basel). 2023 Aug 28;23(17):7466. doi: 10.3390/s23177466.
10
RT-ViT: Real-Time Monocular Depth Estimation Using Lightweight Vision Transformers.RT-ViT:基于轻量级视觉Transformer 的实时单目深度估计。
Sensors (Basel). 2022 May 19;22(10):3849. doi: 10.3390/s22103849.

引用本文的文献

1
Unification of Road Scene Segmentation Strategies Using Multistream Data and Latent Space Attention.使用多流数据和潜在空间注意力统一道路场景分割策略
Sensors (Basel). 2023 Aug 23;23(17):7355. doi: 10.3390/s23177355.
2
Monocular Depth Estimation: Lightweight Convolutional and Matrix Capsule Feature-Fusion Network.单目深度估计:轻量级卷积和矩阵胶囊特征融合网络。
Sensors (Basel). 2022 Aug 23;22(17):6344. doi: 10.3390/s22176344.
3
Recent Advanced Deep Learning Architectures for Retinal Fluid Segmentation on Optical Coherence Tomography Images.

本文引用的文献

1
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
2
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
3
Graph-based segmentation for RGB-D data using 3-D geometry enhanced superpixels.
最近的光学相干断层扫描图像视网膜液分割的深度学习架构。
Sensors (Basel). 2022 Apr 15;22(8):3055. doi: 10.3390/s22083055.
4
Improving Depth Estimation by Embedding Semantic Segmentation: A Hybrid CNN Model.通过嵌入语义分割来提高深度估计:一种混合 CNN 模型。
Sensors (Basel). 2022 Feb 21;22(4):1669. doi: 10.3390/s22041669.
5
DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation.DTS-Net:用于快速准确语义对象分割的深度到空间网络
Sensors (Basel). 2022 Jan 3;22(1):337. doi: 10.3390/s22010337.
6
Monocular Depth Estimation with Joint Attention Feature Distillation and Wavelet-Based Loss Function.基于联合注意特征提取和基于小波的损失函数的单目深度估计。
Sensors (Basel). 2020 Dec 24;21(1):54. doi: 10.3390/s21010054.
7
Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image.基于单张图像的联合深度估计和语义分割的潜在 3D 体素。
Sensors (Basel). 2020 Oct 12;20(20):5765. doi: 10.3390/s20205765.
8
Multi-Scale Spatio-Temporal Feature Extraction and Depth Estimation from Sequences by Ordinal Classification.基于序分类的序列多尺度时空特征提取与深度估计。
Sensors (Basel). 2020 Apr 1;20(7):1979. doi: 10.3390/s20071979.
9
SemanticDepth: Fusing Semantic Segmentation and Monocular Depth Estimation for Enabling Autonomous Driving in Roads without Lane Lines.语义深度:融合语义分割与单目深度估计以实现无车道线道路上的自动驾驶
Sensors (Basel). 2019 Jul 22;19(14):3224. doi: 10.3390/s19143224.
基于图的 RGB-D 数据分割,使用增强 3-D 几何信息的超像素。
IEEE Trans Cybern. 2015 May;45(5):913-26. doi: 10.1109/TCYB.2014.2340032. Epub 2014 Jul 29.
4
Watershed cuts: thinnings, shortest path forests, and topological watersheds.流域切割:稀疏处理、最短路径森林和拓扑流域。
IEEE Trans Pattern Anal Mach Intell. 2010 May;32(5):925-39. doi: 10.1109/TPAMI.2009.71.