Arab Academy for Science, Technology and Maritime Transport, Alexandria 1029, Egypt.
School of Engineering, University of Central Lancashire, Preston PR1 2HE, UK.
Sensors (Basel). 2022 Feb 21;22(4):1663. doi: 10.3390/s22041663.
Object detection is one of the primary tasks of autonomous vehicles (AVs); it precedes object tracking, trajectory estimation, and collision avoidance. Vulnerable road objects (e.g., pedestrians and cyclists) pose a greater challenge to reliable object detection because their behavior changes continuously. Most commercially available AVs, and most research on them, depend on expensive sensors, which hinders further research on AV operation. In this paper, therefore, we focus on using a lower-cost single-beam LiDAR together with a monocular camera to detect multiple vulnerable objects in 3D in real driving scenarios while maintaining real-time performance. This research also addresses problems that arise during object detection, such as complex interactions between objects that cause occlusion and truncation, and dynamic changes in the perspective and scale of bounding boxes. The video-processing module is built on a deep-learning detector (YOLOv3), while the LiDAR measurements are pre-processed and grouped into clusters. The system outputs object classification and localization in the form of bounding boxes accompanied by a third, depth dimension acquired by the LiDAR. Tests show that the system can efficiently detect the 3D location of vulnerable objects in real time.
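The camera and LiDAR fusion summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the LiDAR delivers a planar scan of (bearing, range) pairs, the LiDAR and camera share the same origin and heading, the camera is an ideal pinhole with a known horizontal field of view, and the detector (YOLOv3 in the paper) has already produced 2D boxes. The range-jump clustering, the function names, and every threshold are hypothetical choices made for illustration.

```python
import math
from statistics import mean, median


def preprocess(scan, max_range=30.0):
    """Drop invalid returns and sort the rest by bearing.

    scan: iterable of (bearing_deg, range_m); bearing is positive to the right.
    """
    return sorted((a, r) for a, r in scan if 0.0 < r <= max_range)


def cluster(points, max_jump=0.5, min_points=2):
    """Group bearing-sorted points; a range jump above max_jump starts a new cluster."""
    clusters = []
    for a, r in points:
        if clusters and abs(r - clusters[-1][-1][1]) <= max_jump:
            clusters[-1].append((a, r))
        else:
            clusters.append([(a, r)])
    # Very small clusters are treated as noise (assumed threshold).
    return [c for c in clusters if len(c) >= min_points]


def bearing_to_column(bearing_deg, image_width, hfov_deg):
    """Project a bearing onto an image column with an ideal pinhole model."""
    fx = (image_width / 2) / math.tan(math.radians(hfov_deg / 2))
    return image_width / 2 + fx * math.tan(math.radians(bearing_deg))


def attach_depth(boxes, clusters, image_width=640, hfov_deg=60.0):
    """Give each 2D box the range of the nearest cluster that projects inside it.

    boxes: list of (label, x_min, y_min, x_max, y_max) in pixels.
    Returns a list of (label, box, depth_m); depth_m is None without a match.
    """
    projected = [
        (bearing_to_column(mean(a for a, _ in c), image_width, hfov_deg),
         median(r for _, r in c))
        for c in clusters
    ]
    results = []
    for label, x_min, y_min, x_max, y_max in boxes:
        depths = [d for col, d in projected if x_min <= col <= x_max]
        results.append((label, (x_min, y_min, x_max, y_max),
                        min(depths) if depths else None))
    return results
```

Taking the nearest matching cluster is one simple way to favor the foreground object when occlusion places several clusters inside the same box. A real system would also need a calibrated extrinsic transform between the two sensors.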