Zhao Huaqi, Peng Xiang, Wang Su, Li Jun-Bao, Pan Jeng-Shyang, Su Xiaoguang, Liu Xiaomin
The Heilongjiang Provincial Key Laboratory of Autonomous Intelligence and Information Processing, School of Information and Electronic Technology, Jiamusi University, Jiamusi, China.
Harbin Institute of Technology, Harbin, China.
Front Neurorobot. 2024 May 1;18:1342126. doi: 10.3389/fnbot.2024.1342126. eCollection 2024.
Object detection is a core technology of the perception module in unmanned driving, widely used to detect vehicles, pedestrians, traffic signs, and other objects. However, existing object detection methods still face three challenges in complex unmanned driving scenarios: poor performance on multi-scale objects, inadequate accuracy on small objects, and false positives and missed detections in densely occluded environments. This study therefore proposes an improved Transformer-based object detection method for unmanned driving that addresses these challenges. First, a multi-scale Transformer feature extraction method integrated with channel attention is used to strengthen the network's ability to extract features at different scales. Second, a training method that incorporates Query Denoising with Gaussian decay is used to improve the network's ability to learn representations of small objects. Third, a hybrid matching method combining the Optimal Transport and Hungarian algorithms is used to match predictions to ground truth, providing the network with more informative positive sample features. Experiments on datasets including KITTI show that the proposed method achieves a mean Average Precision (mAP) 3% higher than that of existing methods.
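The one-to-one matching between predictions and ground truth mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the cost function, nor how the Optimal Transport and Hungarian steps are combined, so everything here is an assumption: the cost is taken to be `1 - IoU`, the names `iou` and `match_one_to_one` are hypothetical, and the minimum-cost assignment is found by exhaustive search rather than by the Hungarian algorithm itself. For a handful of boxes the exhaustive search returns the same optimal assignment the Hungarian algorithm would, but it does not scale.

```python
from itertools import permutations


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_one_to_one(preds, gts):
    """Return (pred_index, gt_index) pairs minimising the total 1 - IoU cost.

    Assumed setup, not the paper's: each ground-truth box receives exactly
    one prediction, and there are at least as many predictions as
    ground-truth boxes. Brute force, so only suitable for tiny inputs.
    """
    if len(preds) < len(gts):
        raise ValueError("need at least as many predictions as ground truths")
    best_cost, best_pairs = float("inf"), []
    # perm[g] is the prediction index assigned to ground-truth box g.
    for perm in permutations(range(len(preds)), len(gts)):
        cost = sum(1.0 - iou(preds[p], gts[g]) for g, p in enumerate(perm))
        if cost < best_cost:
            best_cost = cost
            best_pairs = [(p, g) for g, p in enumerate(perm)]
    return best_pairs


if __name__ == "__main__":
    predictions = [(0, 0, 10, 10), (20, 20, 30, 30), (100, 100, 110, 110)]
    ground_truths = [(21, 21, 31, 31), (1, 1, 11, 11)]
    print(match_one_to_one(predictions, ground_truths))
```

In this sketch, predictions left without a ground-truth partner would be treated as background. A hybrid scheme such as the one described would additionally supply extra positive samples, which this example does not attempt to model.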