Favaro P, Soatto S
IEEE Trans Pattern Anal Mach Intell. 2005 Mar;27(3):406-417. doi: 10.1109/TPAMI.2005.43.
We introduce a novel approach to shape from defocus, i.e., the problem of inferring the three-dimensional (3D) geometry of a scene from a collection of defocused images. Typically, in shape from defocus, the task of extracting geometry also requires deblurring the given images. A common approach to bypass this task relies on approximating the scene locally by a plane parallel to the image (the so-called equifocal assumption). We show that this approximation is indeed not necessary, as one can estimate 3D geometry while avoiding deblurring without strong assumptions on the scene. Solving the problem of shape from defocus requires modeling how light interacts with the optics before reaching the imaging surface. This interaction is described by the so-called point spread function (PSF). When the form of the PSF is known, we propose an optimal method to infer 3D geometry from defocused images that involves computing orthogonal operators which are regularized via functional singular value decomposition. When the form of the PSF is unknown, we propose a simple and efficient method that first learns a set of projection operators from blurred images and then uses these operators to estimate the 3D geometry of the scene from novel blurred images. Our experiments on both real and synthetic images show that the performance of the algorithm is relatively insensitive to the form of the PSF. Our general approach is to minimize the Euclidean norm of the difference between the estimated images and the observed images. The method is geometric in that we reduce the minimization to performing projections onto linear subspaces, by using inner product structures on both infinite- and finite-dimensional Hilbert spaces. Both proposed algorithms involve only simple matrix-vector multiplications which can be implemented in real time.
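The known-PSF method described above can be illustrated with a minimal sketch. The idea: for each candidate depth, the PSF determines a linear operator mapping the (unknown) scene radiance to the defocused observations; the depth whose operator's range best explains the observations, measured by the norm of the projection onto the orthogonal complement, is selected. The sketch below uses a toy 1D scene, a Gaussian PSF whose blur width depends on depth, and a truncated SVD as a stand-in for the paper's regularized functional SVD. The lens model `blur_of_depth`, the grid of candidate depths, and all function names are hypothetical illustrations, not the authors' implementation.

```python
import numpy as np

def gaussian_blur_matrix(n, sigma):
    # Discrete n x n Gaussian convolution matrix (row-normalized PSF).
    if sigma < 1e-6:
        return np.eye(n)
    x = np.arange(n)
    K = np.exp(-0.5 * ((x[:, None] - x[None, :]) / sigma) ** 2)
    return K / K.sum(axis=1, keepdims=True)

def residual_energy(y, H, rcond=1e-3):
    # Project y onto the orthogonal complement of range(H) and return
    # the squared residual norm. The SVD is truncated (a stand-in for
    # the regularized functional SVD); after the SVD, only
    # matrix-vector products are needed.
    U, S, _ = np.linalg.svd(H, full_matrices=False)
    r = int(np.sum(S > rcond * S[0]))   # regularization by truncation
    Ur = U[:, :r]
    residual = y - Ur @ (Ur.T @ y)
    return float(residual @ residual)

def estimate_depth(images, blur_of_depth, depths):
    # Hypothesis test over candidate depths: the true depth yields the
    # subspace that best explains all defocused observations at once.
    n = images[0].size
    y = np.concatenate(images)
    costs = [
        residual_energy(
            y, np.vstack([gaussian_blur_matrix(n, sig) for sig in blur_of_depth(s)])
        )
        for s in depths
    ]
    return depths[int(np.argmin(costs))]

# Toy 1D scene observed under two focus settings.
rng = np.random.default_rng(0)
n = 64
radiance = rng.standard_normal(n)
# Hypothetical lens model: blur widths of the two settings vs. depth.
blur_of_depth = lambda s: (0.5 * s, 2.0 / (s + 0.5))
true_depth = 1.5
imgs = [gaussian_blur_matrix(n, sig) @ radiance for sig in blur_of_depth(true_depth)]
depths = np.linspace(0.5, 3.0, 26)
print(round(float(estimate_depth(imgs, blur_of_depth, depths)), 2))
```

Note that the scene radiance is never estimated: the residual is computed directly from the observations, which is how the approach avoids deblurring.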