Department of Electronic and Electrical Engineering, Brunel University London, Kingston Ln, Uxbridge UB8 3PH, UK.
Centre for Data Science, Coventry University, Priory Road, Coventry CV1 5FB, UK.
Sensors (Basel). 2021 Oct 31;21(21):7267. doi: 10.3390/s21217267.
Autonomous Vehicles (AVs) have the potential to solve many traffic problems, such as accidents, congestion and pollution. However, there are still challenges to overcome, for instance, AVs need to accurately perceive their environment to safely navigate in busy urban scenarios. The aim of this paper is to review recent articles on computer vision techniques that can be used to build an AV perception system. AV perception systems need to accurately detect non-static objects and predict their behaviour, as well as to detect static objects and recognise the information they are providing. This paper, in particular, focuses on the computer vision techniques used to detect pedestrians and vehicles. There have been many papers and reviews on pedestrians and vehicles detection so far. However, most of the past papers only reviewed pedestrian or vehicle detection separately. This review aims to present an overview of the AV systems in general, and then review and investigate several detection computer vision techniques for pedestrians and vehicles. The review concludes that both traditional and Deep Learning (DL) techniques have been used for pedestrian and vehicle detection; however, DL techniques have shown the best results. Although good detection results have been achieved for pedestrians and vehicles, the current algorithms still struggle to detect small, occluded, and truncated objects. In addition, there is limited research on how to improve detection performance in difficult light and weather conditions. Most of the algorithms have been tested on well-recognised datasets such as Caltech and KITTI; however, these datasets have their own limitations. Therefore, this paper recommends that future works should be implemented on more new challenging datasets, such as PIE and BDD100K.
自动驾驶汽车 (AVs) 有潜力解决许多交通问题,例如事故、拥堵和污染。然而,仍然存在需要克服的挑战,例如 AVs 需要准确感知其环境,以便在繁忙的城市场景中安全行驶。本文的目的是回顾最近关于计算机视觉技术的文章,这些技术可用于构建 AV 感知系统。AV 感知系统需要准确地检测非静态物体并预测它们的行为,以及检测静态物体并识别它们提供的信息。本文特别关注用于检测行人和车辆的计算机视觉技术。迄今为止,已经有许多关于行人和车辆检测的论文和综述。然而,过去的大多数论文仅分别审查了行人或车辆检测。本综述旨在概述一般的 AV 系统,然后回顾和研究几种用于行人和车辆检测的计算机视觉技术。综述得出的结论是,传统技术和深度学习 (DL) 技术都已用于行人检测和车辆检测;然而,DL 技术已经显示出了最佳的结果。尽管已经实现了对行人和车辆的良好检测结果,但当前的算法仍然难以检测小、遮挡和截断的物体。此外,在困难的光照和天气条件下如何提高检测性能的研究还很有限。大多数算法都在 Caltech 和 KITTI 等知名数据集上进行了测试;然而,这些数据集本身也存在局限性。因此,本文建议未来的工作应在更多新的具有挑战性的数据集上实施,例如 PIE 和 BDD100K。