State Key Laboratory of Power Transmission Equipment and System Security and New Technology, Chongqing University, Chongqing 400044, China.
TrunkTech Co., Ltd., No. 3, Danling street, ZhongGuan Town, HaiDian District, Beijing 100089, China.
Sensors (Basel). 2018 Oct 6;18(10):3337. doi: 10.3390/s18103337.
LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain, including a slow inference speed and low orientation estimation performance. We therefore investigate an improved sparse convolution method for such networks, which significantly increases the speed of both training and inference. We also introduce a new form of angle loss regression to improve the orientation estimation performance and a new data augmentation approach that can enhance the convergence speed and performance. The proposed network produces state-of-the-art results on the KITTI 3D object detection benchmarks while maintaining a fast inference speed.
基于 LiDAR 或 RGB-D 的目标检测在许多应用中都有使用,从自动驾驶到机器人视觉等。基于体素的 3D 卷积网络已经被用于一段时间,以增强在处理点云 LiDAR 数据时的信息保留。然而,仍然存在问题,包括推理速度慢和方向估计性能低。因此,我们研究了一种改进的稀疏卷积方法,该方法可显著提高网络的训练和推理速度。我们还引入了一种新的角度损失回归形式,以提高方向估计性能,以及一种新的数据增强方法,可以提高收敛速度和性能。所提出的网络在 KITTI 3D 目标检测基准测试中取得了最先进的结果,同时保持了快速的推理速度。