Shi Liang, Li Beichen, Matusik Wojciech
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar St, Cambridge, MA, 02139, USA.
Light Sci Appl. 2022 Aug 3;11(1):247. doi: 10.1038/s41377-022-00894-6.
Computer-generated holography (CGH) provides volumetric control of coherent wavefront and is fundamental to applications such as volumetric 3D displays, lithography, neural photostimulation, and optical/acoustic trapping. Recently, deep learning-based methods emerged as promising computational paradigms for CGH synthesis that overcome the quality-runtime tradeoff in conventional simulation/optimization-based methods. Yet, the quality of the predicted hologram is intrinsically bounded by the dataset's quality. Here we introduce a new hologram dataset, MIT-CGH-4K-V2, that uses a layered depth image as a data-efficient volumetric 3D input and a two-stage supervised+unsupervised training protocol for direct synthesis of high-quality 3D phase-only holograms. The proposed system also corrects vision aberration, allowing customization for end-users. We experimentally show photorealistic 3D holographic projections and discuss relevant spatial light modulator calibration procedures. Our method runs in real-time on a consumer GPU and 5 FPS on an iPhone 13 Pro, promising drastically enhanced performance for the applications above.
计算机生成全息术(CGH)可实现对相干波前的体控制,是诸如体三维显示、光刻、神经光刺激以及光学/声学捕获等应用的基础。最近,基于深度学习的方法作为CGH合成的有前景的计算范式出现,克服了传统基于模拟/优化方法中的质量-运行时间权衡。然而,预测全息图的质量本质上受数据集质量的限制。在此,我们引入一个新的全息图数据集MIT-CGH-4K-V2,它使用分层深度图像作为数据高效的体三维输入,并采用两阶段监督+无监督训练协议直接合成高质量的仅相位三维全息图。所提出的系统还校正视觉像差,允许为终端用户定制。我们通过实验展示了逼真的三维全息投影,并讨论了相关的空间光调制器校准程序。我们的方法在消费级GPU上实时运行,在iPhone 13 Pro上为每秒5帧,有望显著提升上述应用的性能。