• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习和双目立体视觉的图像深度估计协同研究综述。

On the Synergies Between Machine Learning and Binocular Stereo for Depth Estimation From Images: A Survey.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5314-5334. doi: 10.1109/TPAMI.2021.3070917. Epub 2022 Aug 4.

DOI:10.1109/TPAMI.2021.3070917
PMID:33819150
Abstract

Stereo matching is one of the longest-standing problems in computer vision with close to 40 years of studies and research. Throughout the years the paradigm has shifted from local, pixel-level decision to various forms of discrete and continuous optimization to data-driven, learning-based methods. Recently, the rise of machine learning and the rapid proliferation of deep learning enhanced stereo matching with new exciting trends and applications unthinkable until a few years ago. Interestingly, the relationship between these two worlds is two-way. While machine, and especially deep, learning advanced the state-of-the-art in stereo matching, stereo itself enabled new ground-breaking methodologies such as self-supervised monocular depth estimation based on deep networks. In this paper, we review recent research in the field of learning-based depth estimation from single and binocular images highlighting the synergies, the successes achieved so far and the open challenges the community is going to face in the immediate future.

摘要

立体匹配是计算机视觉中存在时间最长的问题之一,已有近 40 年的研究历史。多年来,该范例已经从局部的、基于像素的决策转变为各种形式的离散和连续优化,再到基于数据的、基于学习的方法。最近,机器学习的兴起和深度学习的快速普及为立体匹配带来了新的令人兴奋的趋势和应用,这些在几年前是难以想象的。有趣的是,这两个领域之间的关系是双向的。虽然机器,特别是深度学习,推动了立体匹配的技术发展,但立体本身也为新的突破性方法提供了可能,例如基于深度网络的自我监督单目深度估计。在本文中,我们回顾了基于单目和双目图像的学习深度估计领域的最新研究,重点介绍了协同作用、迄今为止取得的成功以及社区在不久的将来将要面临的开放挑战。

相似文献

1
On the Synergies Between Machine Learning and Binocular Stereo for Depth Estimation From Images: A Survey.基于机器学习和双目立体视觉的图像深度估计协同研究综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5314-5334. doi: 10.1109/TPAMI.2021.3070917. Epub 2022 Aug 4.
2
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation.基于立体视觉的深度估计深度学习技术研究综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):1738-1764. doi: 10.1109/TPAMI.2020.3032602. Epub 2022 Mar 4.
3
Depth Estimation for Light-Field Images Using Stereo Matching and Convolutional Neural Networks.基于立体匹配和卷积神经网络的光场图像深度估计
Sensors (Basel). 2020 Oct 30;20(21):6188. doi: 10.3390/s20216188.
4
A Fast Stereo Matching Network with Multi-Cross Attention.一种具有多交叉注意力的快速立体匹配网络。
Sensors (Basel). 2021 Sep 8;21(18):6016. doi: 10.3390/s21186016.
5
On the Confidence of Stereo Matching in a Deep-Learning Era: A Quantitative Evaluation.深度学习时代立体匹配的置信度:定量评估
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5293-5313. doi: 10.1109/TPAMI.2021.3069706. Epub 2022 Aug 4.
6
Monocular Depth Estimation with Augmented Ordinal Depth Relationships.基于增强序数深度关系的单目深度估计
IEEE Trans Image Process. 2018 Oct 24. doi: 10.1109/TIP.2018.2877944.
7
Deep Learning-Based Monocular Depth Estimation Methods-A State-of-the-Art Review.基于深度学习的单目深度估计方法——最新综述。
Sensors (Basel). 2020 Apr 16;20(8):2272. doi: 10.3390/s20082272.
8
Dense Depth Estimation from Stereo Endoscopy Videos Using Unsupervised Optical Flow Methods.使用无监督光流方法从立体内窥镜视频中进行密集深度估计
Med Image Underst Anal. 2021 Jul;12722:337-349. doi: 10.1007/978-3-030-80432-9_26. Epub 2021 Jul 6.
9
Unsupervised Stereo Matching with Surface Normal Assistance for Indoor Depth Estimation.基于表面法线辅助的无监督立体匹配用于室内深度估计
Sensors (Basel). 2023 Dec 15;23(24):9850. doi: 10.3390/s23249850.
10
Depth Estimation from Light Field Geometry Using Convolutional Neural Networks.基于卷积神经网络的光场几何深度估计
Sensors (Basel). 2021 Sep 10;21(18):6061. doi: 10.3390/s21186061.

引用本文的文献

1
Glare suppressed 3D mapping system based on linear polarization filtering and binocular vision fusion.基于线性偏振滤波和双目视觉融合的眩光抑制3D映射系统。
Sci Rep. 2025 Jul 1;15(1):20998. doi: 10.1038/s41598-025-06887-w.
2
Vision-Based Collision Warning Systems with Deep Learning: A Systematic Review.基于深度学习的视觉碰撞预警系统:系统综述
J Imaging. 2025 Feb 17;11(2):64. doi: 10.3390/jimaging11020064.
3
Reliable Disparity Estimation Using Multiocular Vision with Adjustable Baseline.使用具有可调基线的多目视觉进行可靠的视差估计。
Sensors (Basel). 2024 Dec 24;25(1):21. doi: 10.3390/s25010021.
4
Enhancing Situational Awareness of Helicopter Pilots in Unmanned Aerial Vehicle-Congested Environments Using an Airborne Visual Artificial Intelligence Approach.在无人机密集环境中使用机载视觉人工智能方法增强直升机飞行员的态势感知能力
Sensors (Basel). 2024 Dec 4;24(23):7762. doi: 10.3390/s24237762.
5
Point Cloud Densification Algorithm for Multiple Cameras and Lidars Data Fusion.用于多相机和激光雷达数据融合的点云致密化算法
Sensors (Basel). 2024 Sep 5;24(17):5786. doi: 10.3390/s24175786.
6
Nanowatt all-optical 3D perception for mobile robotics.用于移动机器人的纳瓦全光三维感知
Sci Adv. 2024 Jul 5;10(27):eadn2031. doi: 10.1126/sciadv.adn2031.
7
A Binocular Stereo-Imaging-Perception System with a Wide Field-of-View and Infrared- and Visible Light-Dual-Band Fusion.一种具有宽视场以及红外与可见光双波段融合功能的双目立体成像感知系统。
Sensors (Basel). 2024 Jan 21;24(2):0. doi: 10.3390/s24020676.
8
TranSpec3D: A Novel Measurement Principle to Generate A Non-Synthetic Data Set of Transparent and Specular Surfaces without Object Preparation.TranSpec3D:一种无需对物体进行预处理即可生成透明和镜面表面非合成数据集的新型测量原理。
Sensors (Basel). 2023 Oct 18;23(20):8567. doi: 10.3390/s23208567.
9
Application of three-dimensional reconstruction technology in dentistry: a narrative review.三维重建技术在牙科中的应用:叙述性综述。
BMC Oral Health. 2023 Sep 4;23(1):630. doi: 10.1186/s12903-023-03142-4.
10
Triangle-Mesh-Rasterization-Projection (TMRP): An Algorithm to Project a Point Cloud onto a Consistent, Dense and Accurate 2D Raster Image.三角形网格光栅化投影(TMRP):一种将点云投影到一致、密集且精确的二维光栅图像上的算法。
Sensors (Basel). 2023 Aug 8;23(16):7030. doi: 10.3390/s23167030.