基于场景的行人检测在静态视频监控中的应用。

Scene-specific pedestrian detection for static video surveillance.

机构信息

The Chinese University of Hong Kong, Hong Kong.

the Chinese University of Hong Kong, Hong Kong.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2014 Feb;36(2):361-74. doi: 10.1109/TPAMI.2013.124.

DOI:10.1109/TPAMI.2013.124

PMID:24356355

Abstract

The performance of a generic pedestrian detector may drop significantly when it is applied to a specific scene due to the mismatch between the source training set and samples from the target scene. We propose a new approach of automatically transferring a generic pedestrian detector to a scene-specific detector in static video surveillance without manually labeling samples from the target scene. The proposed transfer learning framework consists of four steps. 1) Through exploring the indegrees from target samples to source samples on a visual affinity graph, the source samples are weighted to match the distribution of target samples. 2) It explores a set of context cues to automatically select samples from the target scene, predicts their labels, and computes confidence scores to guide transfer learning. 3) The confidence scores propagate among target samples according to their underlying visual structures. 4) Target samples with higher confidence scores have larger influence on training scene-specific detectors. All these considerations are formulated under a single objective function called confidence-encoded SVM, which avoids hard thresholding on confidence scores. During test, only the appearance-based detector is used without context cues. The effectiveness is demonstrated through experiments on two video surveillance data sets. Compared with a generic detector, it improves the detection rates by 48 and 36 percent at one false positive per image (FPPI) on the two data sets, respectively. The training process converges after one or two iterations on the data sets in experiments.

摘要

由于源训练集与目标场景样本之间的不匹配，通用行人检测器在应用于特定场景时性能可能会显著下降。我们提出了一种新的方法，可以在静态视频监控中无需手动标记目标场景样本的情况下，将通用行人检测器自动转换为特定于场景的检测器。所提出的迁移学习框架由四个步骤组成。1）通过在视觉相似性图上探索目标样本到源样本的入度，对源样本进行加权以匹配目标样本的分布。2）它探索了一组上下文线索，自动从目标场景中选择样本，预测它们的标签，并计算置信度得分以指导迁移学习。3）置信度得分根据目标样本的潜在视觉结构在目标样本之间传播。4）置信度得分较高的目标样本对训练特定于场景的检测器的影响更大。所有这些考虑都在一个称为置信编码 SVM 的单一目标函数下进行了公式化，该函数避免了对置信度得分进行硬阈值处理。在测试期间，仅使用基于外观的检测器，而无需上下文线索。通过在两个视频监控数据集上的实验证明了其有效性。与通用检测器相比，它分别在两个数据集上以每幅图像一个误报（FPPI）提高了 48%和 36%的检测率。在实验中，该数据集在经过一到两次迭代后，训练过程就会收敛。

相似文献

Scene-specific pedestrian detection for static video surveillance.基于场景的行人检测在静态视频监控中的应用。

IEEE Trans Pattern Anal Mach Intell. 2014 Feb;36(2):361-74. doi: 10.1109/TPAMI.2013.124.

Monotonicity and error type differentiability in performance measures for target detection and tracking in video.视频中目标检测和跟踪的性能度量中的单调性和误差类型可区分性。

IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2553-60. doi: 10.1109/TPAMI.2013.70.

An online learning approach to occlusion boundary detection.基于在线学习的遮挡边界检测方法。

IEEE Trans Image Process. 2012 Jan;21(1):252-61. doi: 10.1109/TIP.2011.2162420. Epub 2011 Jul 22.

Pedestrian detection via classification on Riemannian manifolds.基于黎曼流形分类的行人检测

IEEE Trans Pattern Anal Mach Intell. 2008 Oct;30(10):1713-27. doi: 10.1109/TPAMI.2008.75.

Constraint integration for efficient multiview pose estimation with self-occlusions.用于具有自遮挡的高效多视图姿态估计的约束集成

IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):493-506. doi: 10.1109/TPAMI.2007.1173.

A practical algorithm for learning scene information from monocular video.一种从单目视频中学习场景信息的实用算法。

Opt Express. 2008 Feb 4;16(3):1448-59. doi: 10.1364/oe.16.001448.

Adaptive online performance evaluation of video trackers.视频跟踪器的自适应在线性能评估。

IEEE Trans Image Process. 2012 May;21(5):2812-23. doi: 10.1109/TIP.2011.2182520. Epub 2012 Jan 2.

A discriminative learning framework with pairwise constraints for video object classification.一种用于视频对象分类的带有成对约束的判别式学习框架。

IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):578-93. doi: 10.1109/TPAMI.2006.65.

Coupled kernel embedding for low resolution face image recognition.基于核嵌入的低分辨率人脸图像识别。

IEEE Trans Image Process. 2012 Aug;21(8):3770-83. doi: 10.1109/TIP.2012.2192285. Epub 2012 Apr 3.

Effective gaussian mixture learning for video background subtraction.用于视频背景减除的有效高斯混合学习

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):827-32. doi: 10.1109/TPAMI.2005.102.

引用本文的文献

INSANet: INtra-INter Spectral Attention Network for Effective Feature Fusion of Multispectral Pedestrian Detection.INSANet：用于多光谱行人检测有效特征融合的内部-外部光谱注意力网络

Sensors (Basel). 2024 Feb 10;24(4):1168. doi: 10.3390/s24041168.

An Unsupervised Transfer Learning Framework for Visible-Thermal Pedestrian Detection.基于无监督迁移学习的可见光-热行人检测框架。

Sensors (Basel). 2022 Jun 10;22(12):4416. doi: 10.3390/s22124416.

Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information .基于最大互信息的视频序列粗到精自适应人像检测

Sensors (Basel). 2018 Dec 20;19(1):4. doi: 10.3390/s19010004.

Enhancing Multi-Camera People Detection by Online Automatic Parametrization Using Detection Transfer and Self-Correlation Maximization .利用检测迁移和自相关最大化的在线自动参数化来增强多摄像机人像检测。

Sensors (Basel). 2018 Dec 11;18(12):4385. doi: 10.3390/s18124385.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于场景的行人检测在静态视频监控中的应用。

Scene-specific pedestrian detection for static video surveillance.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献