Suppr超能文献

计数具有低水平特征的人和贝叶斯回归。

Counting people with low-level features and Bayesian regression.

机构信息

Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong.

出版信息

IEEE Trans Image Process. 2012 Apr;21(4):2160-77. doi: 10.1109/TIP.2011.2172800. Epub 2011 Oct 19.

Abstract

An approach to the problem of estimating the size of inhomogeneous crowds, which are composed of pedestrians that travel in different directions, without using explicit object segmentation or tracking is proposed. Instead, the crowd is segmented into components of homogeneous motion, using the mixture of dynamic-texture motion model. A set of holistic low-level features is extracted from each segmented region, and a function that maps features into estimates of the number of people per segment is learned with Bayesian regression. Two Bayesian regression models are examined. The first is a combination of Gaussian process regression with a compound kernel, which accounts for both the global and local trends of the count mapping but is limited by the real-valued outputs that do not match the discrete counts. We address this limitation with a second model, which is based on a Bayesian treatment of Poisson regression that introduces a prior distribution on the linear weights of the model. Since exact inference is analytically intractable, a closed-form approximation is derived that is computationally efficient and kernelizable, enabling the representation of nonlinear functions. An approximate marginal likelihood is also derived for kernel hyperparameter learning. The two regression-based crowd counting methods are evaluated on a large pedestrian data set, containing very distinct camera views, pedestrian traffic, and outliers, such as bikes or skateboarders. Experimental results show that regression-based counts are accurate regardless of the crowd size, outperforming the count estimates produced by state-of-the-art pedestrian detectors. Results on 2 h of video demonstrate the efficiency and robustness of the regression-based crowd size estimation over long periods of time.

摘要

提出了一种无需显式对象分割或跟踪即可估计由朝不同方向行进的行人组成的不均匀人群大小的方法。相反,使用动态纹理运动模型的混合将人群分割成同质运动的分量。从每个分割区域提取一组整体的低级特征,并使用贝叶斯回归学习将特征映射到每个段的人数估计的函数。检查了两种贝叶斯回归模型。第一个是具有复合核的高斯过程回归的组合,该模型既考虑了计数映射的全局趋势,也考虑了局部趋势,但受到与离散计数不匹配的实值输出的限制。我们通过第二个模型解决了此限制,该模型基于泊松回归的贝叶斯处理,为模型的线性权重引入了先验分布。由于精确推断在分析上是难以处理的,因此推导出了一种计算效率高且可核化的闭式近似,从而能够表示非线性函数。还推导出了用于核超参数学习的近似边际似然。基于回归的人群计数方法在包含非常不同的摄像机视图、行人和行人交通以及自行车或滑板等异常值的大型行人数据集上进行了评估。实验结果表明,基于回归的计数无论人群大小如何都是准确的,优于最先进的行人探测器生成的计数估计。2 小时的视频结果证明了基于回归的人群大小估计在长时间内的效率和鲁棒性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验