Booster: A Benchmark for Depth From Images of Specular and Transparent Surfaces.

Authors

Pierluigi Zama Ramirez, Alex Costanzino, Fabio Tosi, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Luigi Di Stefano

Publication information

IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):85-102. doi: 10.1109/TPAMI.2023.3323858. Epub 2023 Dec 5.

DOI: 10.1109/TPAMI.2023.3323858
PMID: 37819829

Abstract

Depth estimation from images now yields outstanding results, both in in-domain accuracy and in generalization. However, we identify two main challenges that remain open in this field: dealing with non-Lambertian materials and effectively processing high-resolution images. To address them, we propose a novel dataset that includes accurate and dense ground-truth labels at high resolution, featuring scenes containing several specular and transparent surfaces. Our acquisition pipeline leverages a novel deep space-time stereo framework, enabling easy and accurate labeling with sub-pixel precision. The dataset is composed of 606 samples collected in 85 different scenes. Each sample includes both a high-resolution pair (12 Mpx) and an unbalanced stereo pair (Left: 12 Mpx, Right: 1.1 Mpx), typical of modern mobile devices that mount sensors with different resolutions. Additionally, we provide manually annotated material segmentation masks and 15K unlabeled samples. The dataset is split into a train set and two test sets, the latter devoted to the evaluation of stereo and monocular depth estimation networks. Our experiments highlight the open challenges and future research directions in this field.

Similar articles

1
Booster: A Benchmark for Depth From Images of Specular and Transparent Surfaces.
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):85-102. doi: 10.1109/TPAMI.2023.3323858. Epub 2023 Dec 5.
2
TranSpec3D: A Novel Measurement Principle to Generate A Non-Synthetic Data Set of Transparent and Specular Surfaces without Object Preparation.
Sensors (Basel). 2023 Oct 18;23(20):8567. doi: 10.3390/s23208567.
3
Photometric Stereo-Based Depth Map Reconstruction for Monocular Capsule Endoscopy.
Sensors (Basel). 2020 Sep 21;20(18):5403. doi: 10.3390/s20185403.
4
MaskMitosis: a deep learning framework for fully supervised, weakly supervised, and unsupervised mitosis detection in histopathology images.
Med Biol Eng Comput. 2020 Jul;58(7):1603-1623. doi: 10.1007/s11517-020-02175-z. Epub 2020 May 22.
5
AmodalAppleSize_RGB-D dataset: RGB-D images of apple trees annotated with modal and amodal segmentation masks for fruit detection, visibility and size estimation.
Data Brief. 2023 Dec 30;52:110000. doi: 10.1016/j.dib.2023.110000. eCollection 2024 Feb.
6
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching.
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14301-14320. doi: 10.1109/TPAMI.2023.3300976. Epub 2023 Nov 3.
7
A New Parallel Intelligence Based Light Field Dataset for Depth Refinement and Scene Flow Estimation.
Sensors (Basel). 2022 Dec 4;22(23):9483. doi: 10.3390/s22239483.
8
Unsupervised Domain Adaptation for Depth Prediction from Images.
IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2396-2409. doi: 10.1109/TPAMI.2019.2940948. Epub 2019 Sep 12.
9
A Benchmark Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo.
IEEE Trans Pattern Anal Mach Intell. 2019 Feb;41(2):271-284. doi: 10.1109/TPAMI.2018.2799222. Epub 2018 Feb 5.
10
A multi-camera dataset for depth estimation in an indoor scenario.
Data Brief. 2019 Oct 7;27:104619. doi: 10.1016/j.dib.2019.104619. eCollection 2019 Dec.

Cited by

1
TranSpec3D: A Novel Measurement Principle to Generate A Non-Synthetic Data Set of Transparent and Specular Surfaces without Object Preparation.
Sensors (Basel). 2023 Oct 18;23(20):8567. doi: 10.3390/s23208567.
2
Triangle-Mesh-Rasterization-Projection (TMRP): An Algorithm to Project a Point Cloud onto a Consistent, Dense and Accurate 2D Raster Image.
Sensors (Basel). 2023 Aug 8;23(16):7030. doi: 10.3390/s23167030.