Semi-Supervised Adversarial Monocular Depth Estimation.

Authors

Ji Rongrong, Li Ke, Wang Yan, Sun Xiaoshuai, Guo Feng, Guo Xiaowei, Wu Yongjian, Huang Feiyue, Luo Jiebo

Publication information

IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2410-2422. doi: 10.1109/TPAMI.2019.2936024. Epub 2019 Aug 20.

DOI: 10.1109/TPAMI.2019.2936024
PMID: 31442969

Abstract

In this paper, we address the problem of monocular depth estimation when only a limited number of training image-depth pairs are available. To achieve high regression accuracy, state-of-the-art estimation methods rely on CNNs trained with a large number of image-depth pairs, which are prohibitively costly or even infeasible to acquire. To break the curse of such expensive data collection, we propose a semi-supervised adversarial learning framework that uses only a small number of image-depth pairs, in conjunction with a large number of easily available monocular images, to achieve high performance. In particular, we use one generator to regress the depth and two discriminators to evaluate the predicted depth: one inspects the image-depth pair, while the other inspects the depth channel alone. These two discriminators provide their feedback to the generator as a loss, so that it generates more realistic and accurate depth predictions. Experiments show that the proposed approach can (1) improve most state-of-the-art models on the NYUD v2 dataset by effectively leveraging additional unlabeled data sources; (2) reach state-of-the-art accuracy when the training set is small, e.g., on the Make3D dataset; (3) adapt well to an unseen new dataset (Make3D in our case) after training on an annotated dataset (KITTI in our case).
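
The generator objective described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the loss forms, so the L1 regression term, the non-saturating adversarial term `-log D(.)`, the weights `w_pair` and `w_depth`, and the two toy discriminators are all assumptions made here for illustration. Depth maps and images are flattened to plain Python lists so the example runs without any dependencies.

```python
import math

def l1_loss(pred, target):
    """Mean absolute error between two equal-length depth vectors."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)

def adversarial_term(score, eps=1e-8):
    """Assumed non-saturating generator loss -log D(.) for a score in (0, 1]."""
    return -math.log(max(score, eps))

def generator_loss(pred_depth, image, pair_disc, depth_disc,
                   gt_depth=None, w_pair=0.1, w_depth=0.1):
    """Combine feedback from both discriminators, plus a supervised term
    when a ground-truth depth is available (labeled sample).

    w_pair and w_depth are hypothetical weights, not values from the paper.
    """
    # Discriminator 1 inspects the (image, depth) pair.
    loss = w_pair * adversarial_term(pair_disc(image, pred_depth))
    # Discriminator 2 inspects the depth channel alone.
    loss += w_depth * adversarial_term(depth_disc(pred_depth))
    # Only labeled image-depth pairs contribute a regression term.
    if gt_depth is not None:
        loss += l1_loss(pred_depth, gt_depth)
    return loss

# Toy stand-ins for trained discriminators (purely illustrative).
def toy_pair_disc(image, depth):
    gap = sum(abs(i - d) for i, d in zip(image, depth)) / len(depth)
    return 1.0 / (1.0 + gap)

def toy_depth_disc(depth):
    rough = sum(abs(a - b) for a, b in zip(depth, depth[1:])) / (len(depth) - 1)
    return 1.0 / (1.0 + rough)

image = [0.2, 0.4, 0.6]
pred = [1.0, 2.0, 3.0]
unlabeled = generator_loss(pred, image, toy_pair_disc, toy_depth_disc)
labeled = generator_loss(pred, image, toy_pair_disc, toy_depth_disc,
                         gt_depth=[1.0, 2.0, 4.0])
print(f"unlabeled: {unlabeled:.4f}, labeled: {labeled:.4f}")
```

In this sketch an unlabeled image still produces a training signal through the two adversarial terms, which is how a large pool of monocular images without depth annotations could contribute to training alongside a small labeled set.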
