Wani Pranav, Usmani Kashif, Krishnan Gokul, Javidi Bahram
Opt Express. 2024 Feb 26;32(5):7495-7512. doi: 10.1364/OE.517312.
Integral imaging has proven useful for three-dimensional (3D) object visualization in adverse environmental conditions such as partial occlusion and low light. This paper considers the problem of 3D object tracking. Two-dimensional (2D) object tracking within a scene is an active research area. Several recent algorithms use object detection methods to obtain 2D bounding boxes around objects of interest in each frame. Then, one bounding box can be selected out of many for each object of interest using motion prediction algorithms. Many of these algorithms rely on images obtained using traditional 2D imaging systems. A growing literature demonstrates the advantage of using 3D integral imaging instead of traditional 2D imaging for object detection and visualization in adverse environmental conditions. Integral imaging's depth sectioning ability has also proven beneficial for object detection and visualization. Integral imaging captures an object's depth in addition to its 2D spatial position in each frame. A recent study uses integral imaging for the 3D reconstruction of the scene for object classification and utilizes the mutual information between the object's bounding box in this 3D reconstructed scene and the 2D central perspective to achieve passive depth estimation. We build over this method by using Bayesian optimization to track the object's depth in as few 3D reconstructions as possible. We study the performance of our approach on laboratory scenes with occluded objects moving in 3D and show that the proposed approach outperforms 2D object tracking. In our experimental setup, mutual information-based depth estimation with Bayesian optimization achieves depth tracking with as few as two 3D reconstructions per frame which corresponds to the theoretical minimum number of 3D reconstructions required for depth estimation. To the best of our knowledge, this is the first report on 3D object tracking using the proposed approach.
积分成像已被证明在诸如部分遮挡和低光照等恶劣环境条件下对三维(3D)物体可视化很有用。本文考虑三维物体跟踪问题。场景内的二维(2D)物体跟踪是一个活跃的研究领域。最近的几种算法使用物体检测方法在每一帧中获取感兴趣物体周围的二维边界框。然后,可使用运动预测算法从多个边界框中为每个感兴趣物体选择一个。这些算法中的许多都依赖于使用传统二维成像系统获得的图像。越来越多的文献表明,在恶劣环境条件下进行物体检测和可视化时,使用三维积分成像而非传统二维成像具有优势。积分成像的深度切片能力也已被证明对物体检测和可视化有益。积分成像除了在每一帧中捕获物体的二维空间位置外,还能捕获其深度。最近的一项研究将积分成像用于场景的三维重建以进行物体分类,并利用此三维重建场景中物体边界框与二维中心视角之间的互信息来实现被动深度估计。我们在此方法的基础上,通过使用贝叶斯优化在尽可能少的三维重建中跟踪物体的深度。我们研究了我们的方法在实验室场景中的性能,场景中有被遮挡的物体在三维空间中移动,结果表明所提出的方法优于二维物体跟踪。在我们的实验设置中,基于互信息的深度估计与贝叶斯优化相结合,每帧只需进行两次三维重建就能实现深度跟踪,这与深度估计所需的理论最小三维重建次数相对应。据我们所知,这是关于使用所提出方法进行三维物体跟踪的首次报告。