• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用单目图像的深度进行目标检测和语义分割。

Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.

出版信息

IEEE Trans Image Process. 2017 Feb;26(2):836-846. doi: 10.1109/TIP.2016.2621673. Epub 2016 Oct 26.

DOI:10.1109/TIP.2016.2621673
PMID:28113765
Abstract

Augmenting RGB data with measured depth has been shown to improve the performance of a range of tasks in computer vision, including object detection and semantic segmentation. Although depth sensors such as the Microsoft Kinect have facilitated easy acquisition of such depth information, the vast majority of images used in vision tasks do not contain depth information. In this paper, we show that augmenting RGB images with estimated depth can also improve the accuracy of both object detection and semantic segmentation. Specifically, we first exploit the recent success of depth estimation from monocular images and learn a deep depth estimation model. Then, we learn deep depth features from the estimated depth and combine with RGB features for object detection and semantic segmentation. In addition, we propose an RGB-D semantic segmentation method, which applies a multi-task training scheme: semantic label prediction and depth value regression. We test our methods on several data sets and demonstrate that incorporating information from estimated depth improves the performance of object detection and semantic segmentation remarkably.

摘要

事实证明,用测量得到的深度信息增强RGB数据可以提高计算机视觉中一系列任务的性能,包括目标检测和语义分割。尽管像微软Kinect这样的深度传感器使得此类深度信息的获取变得容易,但视觉任务中使用的绝大多数图像并不包含深度信息。在本文中,我们表明用估计深度增强RGB图像也可以提高目标检测和语义分割的准确率。具体而言,我们首先利用单目图像深度估计的最新成果,学习一个深度深度估计模型。然后,我们从估计深度中学习深度特征,并与RGB特征相结合用于目标检测和语义分割。此外,我们提出了一种RGB-D语义分割方法,该方法应用了一种多任务训练方案:语义标签预测和深度值回归。我们在多个数据集上测试了我们的方法,并证明合并来自估计深度的信息能显著提高目标检测和语义分割的性能。

相似文献

1
Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.利用单目图像的深度进行目标检测和语义分割。
IEEE Trans Image Process. 2017 Feb;26(2):836-846. doi: 10.1109/TIP.2016.2621673. Epub 2016 Oct 26.
2
Collaborative Deconvolutional Neural Networks for Joint Depth Estimation and Semantic Segmentation.用于联合深度估计和语义分割的协作式反卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2018 Nov;29(11):5655-5666. doi: 10.1109/TNNLS.2017.2787781. Epub 2018 Mar 20.
3
Semantic Segmentation Leveraging Simultaneous Depth Estimation.语义分割利用同时深度估计。
Sensors (Basel). 2021 Jan 20;21(3):690. doi: 10.3390/s21030690.
4
Deep Monocular Depth Estimation Based on Content and Contextual Features.基于内容和上下文特征的深度单目深度估计。
Sensors (Basel). 2023 Mar 8;23(6):2919. doi: 10.3390/s23062919.
5
SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches.SFA-MDEN:基于语义特征辅助的双通道单目深度估计网络。
Sensors (Basel). 2021 Aug 13;21(16):5476. doi: 10.3390/s21165476.
6
Depth Estimation and Semantic Segmentation from a Single RGB Image Using a Hybrid Convolutional Neural Network.使用混合卷积神经网络从单张RGB图像进行深度估计和语义分割
Sensors (Basel). 2019 Apr 15;19(8):1795. doi: 10.3390/s19081795.
7
SCN: Switchable Context Network for Semantic Segmentation of RGB-D Images.SCN:用于 RGB-D 图像语义分割的可切换上下文网络。
IEEE Trans Cybern. 2020 Mar;50(3):1120-1131. doi: 10.1109/TCYB.2018.2885062. Epub 2018 Dec 20.
8
RGB-D-Based Pose Estimation of Workpieces with Semantic Segmentation and Point Cloud Registration.基于RGB-D的工件姿态估计:语义分割与点云配准
Sensors (Basel). 2019 Apr 19;19(8):1873. doi: 10.3390/s19081873.
9
MonoDCN: Monocular 3D object detection based on dynamic convolution.MonoDCN:基于动态卷积的单目三维目标检测。
PLoS One. 2022 Oct 4;17(10):e0275438. doi: 10.1371/journal.pone.0275438. eCollection 2022.
10
Convolutional Neural Networks-Based Object Detection Algorithm by Jointing Semantic Segmentation for Images.基于卷积神经网络的图像语义分割联合的目标检测算法。
Sensors (Basel). 2020 Sep 7;20(18):5080. doi: 10.3390/s20185080.

引用本文的文献

1
Effect of Depth Band Replacement on Red, Green and Blue Image for Deep Learning Weed Detection.深度波段替换对用于深度学习杂草检测的红、绿、蓝图像的影响
Sensors (Basel). 2024 Dec 30;25(1):161. doi: 10.3390/s25010161.
2
A Residual Network and FPGA Based Real-Time Depth Map Enhancement System.一种基于残差网络和现场可编程门阵列的实时深度图增强系统。
Entropy (Basel). 2021 Apr 28;23(5):546. doi: 10.3390/e23050546.
3
Reconstruction of 3D Object Shape Using Hybrid Modular Neural Network Architecture Trained on 3D Models from Dataset.基于 数据集上的 3D 模型训练的混合模块化神经网络架构重建 3D 对象形状。
Sensors (Basel). 2019 Mar 31;19(7):1553. doi: 10.3390/s19071553.