Li Jing, Zhang Fangbing, Wei Lisong, Yang Tao, Lu Zhaoyang
School of Telecommunications Engineering, Xidian University, Xi'an 710071, China.
School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China.
Sensors (Basel). 2017 Oct 16;17(10):2354. doi: 10.3390/s17102354.
Pedestrian detection is among the most frequently-used preprocessing tasks in many surveillance application fields, from low-level people counting to high-level scene understanding. Even though many approaches perform well in the daytime with sufficient illumination, pedestrian detection at night is still a critical and challenging problem for video surveillance systems. To respond to this need, in this paper, we provide an affordable solution with a near-infrared stereo network camera, as well as a novel three-dimensional foreground pedestrian detection model. Specifically, instead of using an expensive thermal camera, we build a near-infrared stereo vision system with two calibrated network cameras and near-infrared lamps. The core of the system is a novel voxel surface model, which is able to estimate the dynamic changes of three-dimensional geometric information of the surveillance scene and to segment and locate foreground pedestrians in real time. A free update policy for unknown points is designed for model updating, and the extracted shadow of the pedestrian is adopted to remove foreground false alarms. To evaluate the performance of the proposed model, the system is deployed in several nighttime surveillance scenes. Experimental results demonstrate that our method is capable of nighttime pedestrian segmentation and detection in real time under heavy occlusion. In addition, the qualitative and quantitative comparison results show that our work outperforms classical background subtraction approaches and a recent RGB-D method, as well as achieving comparable performance with the state-of-the-art deep learning pedestrian detection method even with a much lower hardware cost.
行人检测是许多监控应用领域中最常用的预处理任务之一,涵盖从低级的人数统计到高级的场景理解。尽管许多方法在白天光照充足时表现良好,但夜间行人检测对于视频监控系统而言仍然是一个关键且具有挑战性的问题。为满足这一需求,在本文中,我们提供了一种经济实惠的解决方案,即使用近红外立体网络摄像头以及一种新颖的三维前景行人检测模型。具体而言,我们并非使用昂贵的热成像摄像头,而是构建了一个由两个校准后的网络摄像头和近红外灯组成的近红外立体视觉系统。该系统的核心是一种新颖的体素表面模型,它能够估计监控场景三维几何信息的动态变化,并实时分割和定位前景行人。针对模型更新设计了一种针对未知点的免费更新策略,并且采用提取的行人阴影来消除前景误报。为评估所提模型的性能,该系统被部署在多个夜间监控场景中。实验结果表明,我们的方法能够在严重遮挡的情况下实时进行夜间行人分割和检测。此外,定性和定量的比较结果表明,我们的工作优于经典的背景减法方法和一种近期的RGB-D方法,并且即使在硬件成本低得多的情况下,也能与最先进的深度学习行人检测方法取得相当的性能。