• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于边界约束的语义分割和深度补全的同步实现

Simultaneous Semantic Segmentation and Depth Completion with Constraint of Boundary.

机构信息

College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China.

Zhejiang Provincial Key Laboratory of Information Processing, Communication and Networking, Zhejiang University, Hangzhou 310000, China.

出版信息

Sensors (Basel). 2020 Jan 23;20(3):635. doi: 10.3390/s20030635.

DOI:10.3390/s20030635
PMID:31979249
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7038358/
Abstract

As the core task of scene understanding, semantic segmentation and depth completion play a vital role in lots of applications such as robot navigation, AR/VR and autonomous driving. They are responsible for parsing scenes from the angle of semantics and geometry, respectively. While great progress has been made in both tasks through deep learning technologies, few works have been done on building a joint model by deeply exploring the inner relationship of the above tasks. In this paper, semantic segmentation and depth completion are jointly considered under a multi-task learning framework. By sharing a common encoder part and introducing boundary features as inner constraints in the decoder part, the two tasks can properly share the required information from each other. An extra boundary detection sub-task is responsible for providing the boundary features and constructing cross-task joint loss functions for network training. The entire network is implemented end-to-end and evaluated with both RGB and sparse depth input. Experiments conducted on synthesized and real scene datasets show that our proposed multi-task CNN model can effectively improve the performance of every single task.

摘要

作为场景理解的核心任务,语义分割和深度完成在机器人导航、AR/VR 和自动驾驶等许多应用中起着至关重要的作用。它们分别负责从语义和几何的角度解析场景。虽然深度学习技术在这两个任务上都取得了很大的进展,但很少有工作深入探讨上述任务的内在关系,从而构建联合模型。在本文中,我们在多任务学习框架下联合考虑语义分割和深度完成。通过共享一个公共的编码器部分,并在解码器部分引入边界特征作为内部约束,两个任务可以从彼此那里适当地共享所需的信息。一个额外的边界检测子任务负责提供边界特征,并为网络训练构建跨任务联合损失函数。整个网络是端到端实现的,使用 RGB 和稀疏深度输入进行评估。在合成和真实场景数据集上的实验表明,我们提出的多任务 CNN 模型可以有效地提高每个单独任务的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/16b228330f19/sensors-20-00635-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/79922cfb7152/sensors-20-00635-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/2c19a6bbc12e/sensors-20-00635-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/ccf193456fee/sensors-20-00635-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/66e6541e91b7/sensors-20-00635-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/16b228330f19/sensors-20-00635-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/79922cfb7152/sensors-20-00635-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/2c19a6bbc12e/sensors-20-00635-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/ccf193456fee/sensors-20-00635-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/66e6541e91b7/sensors-20-00635-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8bb9/7038358/16b228330f19/sensors-20-00635-g005.jpg

相似文献

1
Simultaneous Semantic Segmentation and Depth Completion with Constraint of Boundary.基于边界约束的语义分割和深度补全的同步实现
Sensors (Basel). 2020 Jan 23;20(3):635. doi: 10.3390/s20030635.
2
Semantic Segmentation Leveraging Simultaneous Depth Estimation.语义分割利用同时深度估计。
Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.
3
Multitask GANs for Semantic Segmentation and Depth Completion With Cycle Consistency.具有循环一致性的用于语义分割和深度补全的多任务生成对抗网络
IEEE Trans Neural Netw Learn Syst. 2021 Dec;32(12):5404-5415. doi: 10.1109/TNNLS.2021.3072883. Epub 2021 Nov 30.
4
BASeg: Boundary aware semantic segmentation for autonomous driving.BASeg:用于自动驾驶的边界感知语义分割。
Neural Netw. 2023 Jan;157:460-470. doi: 10.1016/j.neunet.2022.10.034. Epub 2022 Nov 9.
5
Collaborative Deconvolutional Neural Networks for Joint Depth Estimation and Semantic Segmentation.用于联合深度估计和语义分割的协作式反卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5655-5666. doi: 10.1109/TNNLS.2017.2787781. Epub 2018 Mar 20.
6
MResTNet: A Multi-Resolution Transformer Framework with CNN Extensions for Semantic Segmentation.MResTNet:一种带有卷积神经网络扩展的多分辨率Transformer框架用于语义分割
J Imaging. 2024 May 21;10(6):125. doi: 10.3390/jimaging10060125.
7
SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches.SFA-MDEN:基于语义特征辅助的双通道单目深度估计网络。
Sensors (Basel). 2021 Aug 13;21(16):5476. doi: 10.3390/s21165476.
8
GMNet: Graded-Feature Multilabel-Learning Network for RGB-Thermal Urban Scene Semantic Segmentation.GMNet:用于RGB-热红外城市场景语义分割的分级特征多标签学习网络
IEEE Trans Image Process. 2021;30:7790-7802. doi: 10.1109/TIP.2021.3109518. Epub 2021 Sep 14.
9
A Novel Upsampling and Context Convolution for Image Semantic Segmentation.一种用于图像语义分割的新型上采样与上下文卷积
Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.
10
Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network.使用混合卷积神经网络从单张RGB图像进行深度估计和语义分割
Sensors (Basel). 2019 Apr 15;19(8):1795. doi: 10.3390/s19081795.

引用本文的文献

1
GAC-Net: A Geometric-Attention Fusion Network for Sparse Depth Completion from LiDAR and Image.GAC网络:一种用于从激光雷达和图像进行稀疏深度补全的几何注意力融合网络。
Sensors (Basel). 2025 Sep 4;25(17):5495. doi: 10.3390/s25175495.
2
PLIN: A Network for Pseudo-LiDAR Point Cloud Interpolation.PLIN:用于伪激光雷达点云插值的网络。
Sensors (Basel). 2020 Mar 12;20(6):1573. doi: 10.3390/s20061573.

本文引用的文献

1
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
2
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
3
Image quality assessment: from error visibility to structural similarity.
图像质量评估:从误差可见性到结构相似性。
IEEE Trans Image Process. 2004 Apr;13(4):600-12. doi: 10.1109/tip.2003.819861.