• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

终身单目深度估计:多领域单目度量深度估计的终身学习

Lifelong-MonoDepth: Lifelong Learning for Multidomain Monocular Metric Depth Estimation.

作者信息

Hu Junjie, Fan Chenyou, Zhou Liguang, Gao Qing, Liu Honghai, Lam Tin Lun

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):796-806. doi: 10.1109/TNNLS.2023.3323487. Epub 2025 Jan 7.

DOI:10.1109/TNNLS.2023.3323487
PMID:37874732
Abstract

With the rapid advancements in autonomous driving and robot navigation, there is a growing demand for lifelong learning (LL) models capable of estimating metric (absolute) depth. LL approaches potentially offer significant cost savings in terms of model training, data storage, and collection. However, the quality of RGB images and depth maps is sensor-dependent, and depth maps in the real world exhibit domain-specific characteristics, leading to variations in depth ranges. These challenges limit existing methods to LL scenarios with small domain gaps and relative depth map estimation. To facilitate lifelong metric depth learning, we identify three crucial technical challenges that require attention: 1) developing a model capable of addressing the depth scale variation through scale-aware depth learning; 2) devising an effective learning strategy to handle significant domain gaps; and 3) creating an automated solution for domain-aware depth inference in practical applications. Based on the aforementioned considerations, in this article, we present 1) a lightweight multihead framework that effectively tackles the depth scale imbalance; 2) an uncertainty-aware LL solution that adeptly handles significant domain gaps; and 3) an online domain-specific predictor selection method for real-time inference. Through extensive numerical studies, we show that the proposed method can achieve good efficiency, stability, and plasticity, leading the benchmarks by 8%-15%. The code is available at https://github.com/FreeformRobotics/Lifelong-MonoDepth.

摘要

随着自动驾驶和机器人导航技术的飞速发展,对能够估计度量(绝对)深度的终身学习(LL)模型的需求日益增长。LL方法在模型训练、数据存储和收集方面可能会显著节省成本。然而,RGB图像和深度图的质量取决于传感器,并且现实世界中的深度图具有特定领域的特征,导致深度范围存在差异。这些挑战将现有方法限制在具有小领域差距和相对深度图估计的LL场景中。为了促进终身度量深度学习,我们确定了三个需要关注的关键技术挑战:1)开发一种能够通过尺度感知深度学习解决深度尺度变化的模型;2)设计一种有效的学习策略来处理显著的领域差距;3)为实际应用中的领域感知深度推理创建一种自动化解决方案。基于上述考虑,在本文中,我们提出了1)一种有效地解决深度尺度不平衡问题的轻量级多头框架;2)一种能够巧妙处理显著领域差距的不确定性感知LL解决方案;3)一种用于实时推理的在线特定领域预测器选择方法。通过广泛的数值研究,我们表明所提出的方法能够实现良好的效率、稳定性和可塑性,比基准领先8%-15%。代码可在https://github.com/FreeformRobotics/Lifelong-MonoDepth获取。

相似文献

1
Lifelong-MonoDepth: Lifelong Learning for Multidomain Monocular Metric Depth Estimation.终身单目深度估计:多领域单目度量深度估计的终身学习
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):796-806. doi: 10.1109/TNNLS.2023.3323487. Epub 2025 Jan 7.
2
G2-MonoDepth: A General Framework of Generalized Depth Inference From Monocular RGB+X Data.G2-MonoDepth:一种从单目RGB+X数据进行广义深度推断的通用框架。
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3753-3771. doi: 10.1109/TPAMI.2023.3346466. Epub 2024 Apr 3.
3
Self-supervised Monocular Depth Estimation with 3D Displacement Module for Laparoscopic Images.用于腹腔镜图像的基于3D位移模块的自监督单目深度估计
IEEE Trans Med Robot Bionics. 2022 May;4(2):331-334. doi: 10.1109/TMRB.2022.3170206.
4
Monocular Absolute Depth Estimation from Motion for Small Unmanned Aerial Vehicles by Geometry-Based Scale Recovery.基于几何尺度恢复的小型无人机运动单目绝对深度估计
Sensors (Basel). 2024 Jul 13;24(14):4541. doi: 10.3390/s24144541.
5
Lightweight and Energy-Aware Monocular Depth Estimation Models for IoT Embedded Devices: Challenges and Performances in Terrestrial and Underwater Scenarios.面向物联网嵌入式设备的轻量级和节能单目深度估计模型:在地面和水下场景中的挑战和性能。
Sensors (Basel). 2023 Feb 16;23(4):2223. doi: 10.3390/s23042223.
6
Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-Shot Metric Depth and Surface Normal Estimation.Metric3D v2:一种用于零样本度量深度和表面法线估计的通用单目几何基础模型。
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10579-10596. doi: 10.1109/TPAMI.2024.3444912. Epub 2024 Nov 6.
7
Occlusion-Aware Unsupervised Learning of Depth From 4-D Light Fields.基于遮挡感知的4D光场深度无监督学习
IEEE Trans Image Process. 2022;31:2216-2228. doi: 10.1109/TIP.2022.3154288. Epub 2022 Mar 8.
8
Advanced Monocular Outdoor Pose Estimation in Autonomous Systems: Leveraging Optical Flow, Depth Estimation, and Semantic Segmentation with Dynamic Object Removal.自主系统中的高级单目户外姿态估计:利用光流、深度估计和语义分割去除动态物体
Sensors (Basel). 2024 Dec 17;24(24):8040. doi: 10.3390/s24248040.
9
Learning Then, Learning Now, and Every Second in Between: Lifelong Learning With a Simulated Humanoid Robot.彼时之学,此时之学,以及其间的每一秒:借助模拟人形机器人进行终身学习。
Front Neurorobot. 2021 Jul 1;15:669534. doi: 10.3389/fnbot.2021.669534. eCollection 2021.
10
MonoAux: Fully Exploiting Auxiliary Information and Uncertainty for Monocular 3D Object Detection.单目辅助(MonoAux):充分利用辅助信息和不确定性进行单目3D目标检测
Cyborg Bionic Syst. 2024 Mar 27;5:0097. doi: 10.34133/cbsystems.0097. eCollection 2024.

引用本文的文献

1
A simple monocular depth estimation network for balancing complexity and accuracy.一种用于平衡复杂度和准确性的简单单目深度估计网络。
Sci Rep. 2025 Apr 15;15(1):12860. doi: 10.1038/s41598-025-97568-1.