He Renzhi, Hong Hualin, Cheng Zhou, Liu Fei
IEEE Trans Pattern Anal Mach Intell. 2025 Sep;47(9):8268-8279. doi: 10.1109/TPAMI.2025.3576638.
The light field camera has significantly advanced conventional imaging methods and microscopy over the past decades, providing high-dimensional information in 2D images and enabling a variety of applications. However, inherent shortcomings persist, mainly due to the complex optical setup and the trade-off between resolution. In this work, we propose a Neural Defocus Light Field (NDLF) rendering method, which constructs the light field without a micro-lens array but achieves the same resolution as the original image. The basic unit of NDLF is the 3D point spread function (3D-PSF), which extends the 2D-PSF by incorporating the focus depth axis. NDLF can directly solve the distribution of PSFs in 3D space, enabling direct manipulation of the PSF in 3D and enhancing our understanding of the defocus process. NDLF achieves the focused images rendering by redefining the focus images as slices of the NDLF, which are superpositions of cross-sections of the 3D-PSFs. NDLF modulates the 3D-PSFs using three multilayer perceptron modules, corresponding to three Gaussian-based models from coarse to fine. NDLF is trained on 20 highresolution (1024 × 1024) images at different focus depths, enabling it to render focused images at any given focus depth. The structural similarity index between the predicted and measured focused images is 0.9794. Moreover, we developed a hardware system to collect the high resolution focused images with corresponding focus depth, and depth maps. NDLF achieves high-resolution light field imaging with a single-lens camera and also resolves the distribution of 3D-PSFs in 3D space, paving the way for novel lightfield synthesis techniques and deeper insights into defocus blur.
在过去几十年中,光场相机显著改进了传统成像方法和显微镜技术,在二维图像中提供高维信息并实现了多种应用。然而,其固有缺点仍然存在,主要是由于复杂的光学设置以及分辨率之间的权衡。在这项工作中,我们提出了一种神经散焦光场(NDLF)渲染方法,该方法无需微透镜阵列即可构建光场,但能实现与原始图像相同的分辨率。NDLF的基本单元是三维点扩散函数(3D-PSF),它通过纳入焦深轴扩展了二维PSF。NDLF可以直接解决PSF在三维空间中的分布问题,从而能够在三维中直接操纵PSF并增强我们对散焦过程的理解。NDLF通过将聚焦图像重新定义为NDLF的切片来实现聚焦图像渲染,这些切片是3D-PSF横截面的叠加。NDLF使用三个多层感知器模块调制3D-PSF,对应于从粗到细的三个基于高斯的模型。NDLF在20张不同焦深的高分辨率(1024×1024)图像上进行训练,使其能够在任何给定焦深下渲染聚焦图像。预测的聚焦图像与测量的聚焦图像之间的结构相似性指数为0.9794。此外,我们开发了一个硬件系统来收集具有相应焦深和深度图的高分辨率聚焦图像。NDLF通过单镜头相机实现了高分辨率光场成像,还解决了3D-PSF在三维空间中的分布问题,为新型光场合成技术和对散焦模糊的更深入理解铺平了道路。