Nguyen Andrew-Hieu, Wang Zhaoyang
Neuroimaging Research Branch, National Institute on Drug Abuse, National Institutes of Health, Baltimore, MD 21224, USA.
Department of Mechanical Engineering, School of Engineering, The Catholic University of America, Washington, DC 20064, USA.
Sensors (Basel). 2024 May 20;24(10):3246. doi: 10.3390/s24103246.
The field of computer vision has been focusing on achieving accurate three-dimensional (3D) object representations from a single two-dimensional (2D) image through deep artificial neural networks. Recent advancements in 3D shape reconstruction techniques that combine structured light and deep learning show promise in acquiring high-quality geometric information about object surfaces. This paper introduces a new single-shot 3D shape reconstruction method that uses a nonlinear fringe transformation approach through both supervised and unsupervised learning networks. In this method, a deep learning network learns to convert a grayscale fringe input into multiple phase-shifted fringe outputs with different frequencies, which act as an intermediate result for the subsequent 3D reconstruction process using the structured-light fringe projection profilometry technique. Experiments have been conducted to validate the practicality and robustness of the proposed technique. The experimental results demonstrate that the unsupervised learning approach using a deep convolutional generative adversarial network (DCGAN) is superior to the supervised learning approach using UNet in image-to-image generation. The proposed technique's ability to accurately reconstruct 3D shapes of objects using only a single fringe image opens up vast opportunities for its application across diverse real-world scenarios.
计算机视觉领域一直致力于通过深度人工神经网络从单个二维(2D)图像中实现精确的三维(3D)物体表示。结合结构化光和深度学习的3D形状重建技术的最新进展显示出在获取有关物体表面的高质量几何信息方面的潜力。本文介绍了一种新的单镜头3D形状重建方法,该方法通过监督和无监督学习网络使用非线性条纹变换方法。在这种方法中,深度学习网络学习将灰度条纹输入转换为具有不同频率的多个相移条纹输出,这些输出用作后续使用结构化光条纹投影轮廓测量技术进行3D重建过程的中间结果。已经进行了实验以验证所提出技术的实用性和鲁棒性。实验结果表明,在图像到图像生成中,使用深度卷积生成对抗网络(DCGAN)的无监督学习方法优于使用UNet的监督学习方法。所提出的技术仅使用单个条纹图像就能准确重建物体3D形状的能力为其在各种现实世界场景中的应用开辟了广阔的机会。