Department of Electrical and Electronic Engineering, Faculty of Engineering, Imperial College London, London SW7 2AZ, UK.
Department of Computer Science, University College London, London WC1E 6EA, UK.
Sensors (Basel). 2022 May 7;22(9):3568. doi: 10.3390/s22093568.
Pedestrian detection is a challenging task, mainly owing to the numerous appearances of human bodies. Modern detectors extract representative features via the deep neural network; however, they usually require a large training set and high-performance GPUs. For these cases, we propose a novel human detection approach that integrates a pretrained face detector based on multitask cascaded convolutional neural networks and a traditional pedestrian detector based on aggregate channel features via a score combination module. The proposed detector is a promising approach that can be used to handle pedestrian detection with limited datasets and computational resources. The proposed detector is investigated comprehensively in terms of parameter choices to optimize its performance. The robustness of the proposed detector in terms of the training set, test set, and threshold is observed via tests and cross dataset validations on various pedestrian datasets, including the INRIA, part of the ETHZ, and the Caltech and Citypersons datasets. Experiments have proved that this integrated detector yields a significant increase in recall and a decrease in the log average miss rate compared with sole use of the traditional pedestrian detector. At the same time, the proposed method achieves a comparable performance to FRCNN on the INRIA test set compared with sole use of the Aggregated Channel Features detector.
行人检测是一项具有挑战性的任务,主要是因为人体有很多不同的外观。现代探测器通过深度神经网络提取有代表性的特征;然而,它们通常需要一个大型的训练集和高性能的 GPU。针对这些情况,我们提出了一种新颖的行人检测方法,该方法通过一个基于多任务级联卷积神经网络的预训练人脸探测器和一个基于聚合通道特征的传统行人探测器,通过一个分数组合模块进行集成。所提出的探测器是一种很有前途的方法,可以用于处理数据集和计算资源有限的行人检测问题。通过在各种行人数据集(包括 INRIA、ETHZ 的一部分以及 Caltech 和 Citypersons 数据集)上进行训练集、测试集和阈值的测试和跨数据集验证,对所提出的探测器的参数选择进行了全面的研究,以优化其性能。实验证明,与单独使用传统行人探测器相比,这种集成探测器在召回率上有显著提高,在对数平均漏检率上有显著降低。同时,与单独使用 Aggregated Channel Features 探测器相比,该方法在 INRIA 测试集上的性能与 FRCNN 相当。