• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于联合深度估计和语义分割的协作式反卷积神经网络

Collaborative Deconvolutional Neural Networks for Joint Depth Estimation and Semantic Segmentation.

作者信息

Liu Jing, Wang Yuhang, Li Yong, Fu Jun, Li Jiangyun, Lu Hanqing

出版信息

IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5655-5666. doi: 10.1109/TNNLS.2017.2787781. Epub 2018 Mar 20.

DOI:10.1109/TNNLS.2017.2787781
PMID:29994159
Abstract

Semantic segmentation and single-view depth estimation are two fundamental problems in computer vision. They exploit the semantic and geometric properties of images, respectively, and are thus complementary in scene understanding. In this paper, we propose a collaborative deconvolutional neural network (C-DCNN) to jointly model these two problems for mutual promotion. The C-DCNN consists of two DCNNs, of which each is for one task. The DCNNs provide a finer resolution reconstruction method and are pretrained with hierarchical supervision. The feature maps from these two DCNNs are integrated via a pointwise bilinear layer, which fuses the semantic and depth information and produces higher order features. Then, the integrated features are fed into two sibling classification layers to simultaneously learn for semantic segmentation and depth estimation. In this way, we combine the semantic and depth features in a unified deep network and jointly train them to benefit each other. Specifically, during network training, we process depth estimation as a classification problem where a soft mapping strategy is proposed to map the continuous depth values into discrete probability distributions and the cross entropy loss is used. Besides, a fully connected conditional random field is also used as postprocessing to further improve the performance of semantic segmentation, where the proximity relations of pixels on position, intensity, and depth are jointly considered. We evaluate our approach on two challenging benchmarks: NYU Depth V2 and SUN RGB-D. It is demonstrated that our approach effectively utilizes these two kinds of information and achieves state-of-the-art results on both the semantic segmentation and depth estimation tasks.

摘要

语义分割和单视图深度估计是计算机视觉中的两个基本问题。它们分别利用图像的语义和几何属性,因此在场景理解中是互补的。在本文中,我们提出了一种协作反卷积神经网络(C-DCNN)来联合建模这两个问题以实现相互促进。C-DCNN由两个DCNN组成,每个DCNN用于一个任务。这些DCNN提供了一种更精细分辨率的重建方法,并通过分层监督进行预训练。来自这两个DCNN的特征图通过逐点双线性层进行整合,该层融合语义和深度信息并产生高阶特征。然后,将整合后的特征输入到两个并列的分类层中,以同时学习语义分割和深度估计。通过这种方式,我们在一个统一的深度网络中结合语义和深度特征,并联合训练它们以相互受益。具体来说,在网络训练期间,我们将深度估计作为一个分类问题来处理,其中提出了一种软映射策略将连续的深度值映射到离散的概率分布,并使用交叉熵损失。此外,还使用了一个全连接条件随机场作为后处理来进一步提高语义分割的性能,其中同时考虑了像素在位置、强度和深度上的邻近关系。我们在两个具有挑战性的基准数据集上评估我们的方法:NYU Depth V2和SUN RGB-D。结果表明我们的方法有效地利用了这两种信息,并在语义分割和深度估计任务上都取得了领先的成果。

相似文献

1
Collaborative Deconvolutional Neural Networks for Joint Depth Estimation and Semantic Segmentation.用于联合深度估计和语义分割的协作式反卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5655-5666. doi: 10.1109/TNNLS.2017.2787781. Epub 2018 Mar 20.
2
Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.利用单目图像的深度进行目标检测和语义分割。
IEEE Trans Image Process. 2017 Feb;26(2):836-846. doi: 10.1109/TIP.2016.2621673. Epub 2016 Oct 26.
3
Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network.使用混合卷积神经网络从单张RGB图像进行深度估计和语义分割
Sensors (Basel). 2019 Apr 15;19(8):1795. doi: 10.3390/s19081795.
4
Semantic Segmentation Leveraging Simultaneous Depth Estimation.语义分割利用同时深度估计。
Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.
5
STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation.STC:一种用于弱监督语义分割的从简单到复杂的框架。
IEEE Trans Pattern Anal Mach Intell. 2017 Nov;39(11):2314-2320. doi: 10.1109/TPAMI.2016.2636150. Epub 2016 Dec 6.
6
Deep Monocular Depth Estimation Based on Content and Contextual Features.基于内容和上下文特征的深度单目深度估计。
Sensors (Basel). 2023 Mar 8;23(6):2919. doi: 10.3390/s23062919.
7
Semantic Segmentation and Depth Estimation Based on Residual Attention Mechanism.基于残差注意力机制的语义分割与深度估计
Sensors (Basel). 2023 Aug 28;23(17):7466. doi: 10.3390/s23177466.
8
DTS-Net: Depth-to-Space Networks for Fast and Accurate Semantic Object Segmentation.DTS-Net:用于快速准确语义对象分割的深度到空间网络
Sensors (Basel). 2022 Jan 3;22(1):337. doi: 10.3390/s22010337.
9
Joint Task-Recursive Learning for RGB-D Scene Understanding.用于RGB-D场景理解的联合任务递归学习
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2608-2623. doi: 10.1109/TPAMI.2019.2926728. Epub 2019 Jul 10.
10
Discriminative Training of Deep Fully Connected Continuous CRFs With Task-Specific Loss.基于任务特定损失的深度全连接连续条件随机场的判别式训练
IEEE Trans Image Process. 2017 May;26(5):2127-2136. doi: 10.1109/TIP.2017.2675166. Epub 2017 Feb 24.

引用本文的文献

1
DDNet: Depth Dominant Network for Semantic Segmentation of RGB-D Images.DDNet:用于RGB-D图像语义分割的深度主导网络
Sensors (Basel). 2024 Oct 28;24(21):6914. doi: 10.3390/s24216914.
2
STRAINet: Spatially Varying sTochastic Residual AdversarIal Networks for MRI Pelvic Organ Segmentation.STRAINet:用于 MRI 盆腔器官分割的空间变化随机残差对抗网络。
IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1552-1564. doi: 10.1109/TNNLS.2018.2870182. Epub 2018 Oct 9.