Ma Xinzhu, Ouyang Wanli, Simonelli Andrea, Ricci Elisa
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3537-3556. doi: 10.1109/TPAMI.2023.3346386. Epub 2024 Apr 3.
3D object detection from images, one of the fundamental and challenging problems in autonomous driving, has received increasing attention from both industry and academia in recent years. Benefiting from the rapid development of deep learning technologies, image-based 3D detection has achieved remarkable progress. Particularly, more than 200 works have studied this problem from 2015 to 2021, encompassing a broad spectrum of theories, algorithms, and applications. However, to date no recent survey exists to collect and organize this knowledge. In this paper, we fill this gap in the literature and provide the first comprehensive survey of this novel and continuously growing research field, summarizing the most commonly used pipelines for image-based 3D detection and deeply analyzing each of their components. Additionally, we also propose two new taxonomies to organize the state-of-the-art methods into different categories, with the intent of providing a more systematic review of existing methods and facilitating fair comparisons with future works. In retrospect of what has been achieved so far, we also analyze the current challenges in the field and discuss future directions for image-based 3D detection research.
从图像中进行3D目标检测是自动驾驶领域的基本且具有挑战性的问题之一,近年来受到了工业界和学术界越来越多的关注。受益于深度学习技术的快速发展,基于图像的3D检测取得了显著进展。特别是,从2015年到2021年,有200多篇论文研究了这个问题,涵盖了广泛的理论、算法和应用。然而,到目前为止,还没有近期的综述来收集和整理这些知识。在本文中,我们填补了文献中的这一空白,首次对这个新颖且不断发展的研究领域进行了全面综述,总结了基于图像的3D检测中最常用的流程,并深入分析了其每个组件。此外,我们还提出了两种新的分类法,将最先进的方法组织成不同的类别,旨在对现有方法进行更系统的综述,并便于与未来的工作进行公平比较。回顾迄今为止所取得的成果,我们还分析了该领域当前面临的挑战,并讨论了基于图像的3D检测研究的未来方向。