• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深入探究基于不确定性的伪标签用于稳健立体匹配

Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching.

作者信息

Shen Zhelun, Song Xibin, Dai Yuchao, Zhou Dingfu, Rao Zhibo, Zhang Liangjun

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14301-14320. doi: 10.1109/TPAMI.2023.3300976. Epub 2023 Nov 3.

DOI:10.1109/TPAMI.2023.3300976
PMID:37590113
Abstract

Due to the domain differences and unbalanced disparity distribution across multiple datasets, current stereo matching approaches are commonly limited to a specific dataset and generalize poorly to others. Such domain shift issue is usually addressed by substantial adaptation on costly target-domain ground-truth data, which cannot be easily obtained in practical settings. In this paper, we propose to dig into uncertainty estimation for robust stereo matching. Specifically, to balance the disparity distribution, we employ a pixel-level uncertainty estimation to adaptively adjust the next stage disparity searching space, in this way driving the network progressively prune out the space of unlikely correspondences. Then, to solve the limited ground truth data, an uncertainty-based pseudo-label is proposed to adapt the pre-trained model to the new domain, where pixel-level and area-level uncertainty estimation are proposed to filter out the high-uncertainty pixels of predicted disparity maps and generate sparse while reliable pseudo-labels to align the domain gap. Experimentally, our method shows strong cross-domain, adapt, and joint generalization and obtains 1st place on the stereo task of Robust Vision Challenge 2020. Additionally, our uncertainty-based pseudo-labels can be extended to train monocular depth estimation networks in an unsupervised way and even achieves comparable performance with the supervised methods.

摘要

由于多个数据集之间存在领域差异和不平衡的视差分布,当前的立体匹配方法通常局限于特定数据集,对其他数据集的泛化能力较差。这种领域转移问题通常通过对昂贵的目标领域真实数据进行大量适配来解决,而在实际场景中难以轻松获取这些数据。在本文中,我们建议深入研究用于鲁棒立体匹配的不确定性估计。具体而言,为了平衡视差分布,我们采用像素级不确定性估计来自适应调整下一阶段的视差搜索空间,以此驱动网络逐步剔除不太可能的对应关系空间。然后,为了解决真实数据有限的问题,我们提出了基于不确定性的伪标签,以使预训练模型适应新领域,其中提出了像素级和区域级不确定性估计,以滤除预测视差图中的高不确定性像素,并生成稀疏而可靠的伪标签来弥合领域差距。实验表明,我们的方法具有强大的跨领域、自适应和联合泛化能力,并在2020年鲁棒视觉挑战赛的立体任务中获得第一名。此外,我们基于不确定性的伪标签可以扩展到以无监督方式训练单目深度估计网络,甚至能达到与监督方法相当的性能。

相似文献

1
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching.深入探究基于不确定性的伪标签用于稳健立体匹配
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14301-14320. doi: 10.1109/TPAMI.2023.3300976. Epub 2023 Nov 3.
2
A Disparity Refinement Framework for Learning-based Stereo Matching Methods in Cross-domain Setting for Laparoscopic Images.一种用于腹腔镜图像跨域设置中基于学习的立体匹配方法的视差细化框架。
Proc SPIE Int Soc Opt Eng. 2023 Feb;12466. doi: 10.1117/12.2654804. Epub 2023 Apr 3.
3
Self-Supervised Multiscale Adversarial Regression Network for Stereo Disparity Estimation.用于立体视差估计的自监督多尺度对抗回归网络
IEEE Trans Cybern. 2021 Oct;51(10):4770-4783. doi: 10.1109/TCYB.2020.2999492. Epub 2021 Oct 12.
4
Disparity refinement framework for learning-based stereo matching methods in cross-domain setting for laparoscopic images.用于腹腔镜图像跨域设置中基于学习的立体匹配方法的视差细化框架。
J Med Imaging (Bellingham). 2023 Jul;10(4):045001. doi: 10.1117/1.JMI.10.4.045001. Epub 2023 Jul 14.
5
Stereo Matching Using Multi-Level Cost Volume and Multi-Scale Feature Constancy.基于多级代价体和多尺度特征一致性的立体匹配
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):300-315. doi: 10.1109/TPAMI.2019.2928550. Epub 2020 Dec 4.
6
Details preserved unsupervised depth estimation by fusing traditional stereo knowledge from laparoscopic images.通过融合来自腹腔镜图像的传统立体视觉知识来保留无监督深度估计的细节。
Healthc Technol Lett. 2019 Nov 13;6(6):154-158. doi: 10.1049/htl.2019.0063. eCollection 2019 Dec.
7
SENSE: Self-Evolving Learning for Self-Supervised Monocular Depth Estimation.SENSE:用于自监督单目深度估计的自进化学习
IEEE Trans Image Process. 2024;33:439-450. doi: 10.1109/TIP.2023.3338053. Epub 2023 Dec 29.
8
Uncertainty-Aware Active Domain Adaptive Salient Object Detection.不确定性感知主动域自适应显著目标检测
IEEE Trans Image Process. 2024;33:5510-5524. doi: 10.1109/TIP.2024.3413598. Epub 2024 Oct 4.
9
S-CUDA: Self-cleansing unsupervised domain adaptation for medical image segmentation.S-CUDA:用于医学图像分割的自清洁无监督域适应
Med Image Anal. 2021 Dec;74:102214. doi: 10.1016/j.media.2021.102214. Epub 2021 Aug 12.
10
An application of stereo matching algorithm based on transfer learning on robots in multiple scenes.基于迁移学习的立体匹配算法在多场景机器人中的应用。
Sci Rep. 2023 Aug 6;13(1):12739. doi: 10.1038/s41598-023-39964-z.

引用本文的文献

1
Monocular Depth Estimation via Self-Supervised Self-Distillation.通过自监督自蒸馏进行单目深度估计
Sensors (Basel). 2024 Jun 24;24(13):4090. doi: 10.3390/s24134090.
2
An efficient and accurate multi-level cascaded recurrent network for stereo matching.一种用于立体匹配的高效且准确的多级级联循环网络。
Sci Rep. 2024 Apr 8;14(1):8148. doi: 10.1038/s41598-024-57321-6.
3
TranSpec3D: A Novel Measurement Principle to Generate A Non-Synthetic Data Set of Transparent and Specular Surfaces without Object Preparation.TranSpec3D:一种无需对物体进行预处理即可生成透明和镜面表面非合成数据集的新型测量原理。
Sensors (Basel). 2023 Oct 18;23(20):8567. doi: 10.3390/s23208567.