Locating and Counting Heads in Crowds With a Depth Prior.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9056-9072. doi: 10.1109/TPAMI.2021.3124956. Epub 2022 Nov 7.

DOI:10.1109/TPAMI.2021.3124956

Abstract

To simultaneously estimate the number of heads and locate them with bounding boxes, we adopt detection-based crowd counting on RGB-D data and design a dual-path guided detection network (DPDNet). Specifically, to improve the performance of detection-based approaches on dense or tiny heads, we propose a density map guided detection module, which uses the density map to improve head/non-head classification in the detection network, where the density indicates the probability that a pixel belongs to a head. We also introduce a depth-adaptive kernel that accounts for the variance in head sizes to generate a high-fidelity density map for more robust density map regression. To prevent dense heads from being filtered out during post-processing, we use this density map in the post-processing of head detection and propose a density map guided NMS strategy. Meanwhile, to improve the detection of small heads, we propose a depth-guided detection module that generates a dynamic dilated convolution to extract features of heads at different scales, and we further design a depth-aware anchor for better initialization of anchor sizes in the detection framework. We then train DPDNet with bounding boxes whose sizes are generated from depth. Because existing RGB-D datasets are too small for evaluating data-driven approaches, we collect two large-scale RGB-D crowd counting datasets, one synthetic and one real-world. Since depth values at long-distance positions cannot be obtained in the real-world dataset, we further propose a depth completion method based on meta learning, which uses the synthetic depth data to complete the depth values at long-distance positions. Extensive experiments on our two proposed RGB-D datasets and the MICC RGB-D counting dataset show that our method achieves the best performance for RGB-D crowd counting and localization. Furthermore, our method can be easily extended to RGB image based crowd counting, where it achieves comparable or even better performance on RGB datasets for both head counting and localization.
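As a rough illustration of the depth-adaptive kernel idea mentioned in the abstract, the sketch below builds a ground-truth density map in which each head is spread by a Gaussian whose width shrinks with depth. This is not the paper's implementation: the function name `depth_adaptive_density_map`, the inverse-depth rule `sigma = focal_scale / depth`, the parameters `focal_scale` and `min_sigma`, and the fallback for missing depth are all assumptions made for the example.

```python
import math


def depth_adaptive_density_map(heads, depth, height, width,
                               focal_scale=8.0, min_sigma=0.5):
    """Build a density map with one normalized Gaussian per head.

    heads: list of (row, col) head centers (hypothetical annotation format).
    depth: 2D list of depth values, same size as the image.
    Assumption: apparent head size scales as 1/depth, so
    sigma = focal_scale / depth, clamped below by min_sigma.
    """
    density = [[0.0] * width for _ in range(height)]
    for r, c in heads:
        d = depth[r][c]
        # Assumed fallback: a missing (non-positive) depth gets the smallest kernel.
        sigma = max(min_sigma, focal_scale / d) if d > 0 else min_sigma
        radius = int(math.ceil(3 * sigma))
        r0, r1 = max(0, r - radius), min(height - 1, r + radius)
        c0, c1 = max(0, c - radius), min(width - 1, c + radius)
        weights = {}
        for y in range(r0, r1 + 1):
            for x in range(c0, c1 + 1):
                dist2 = (y - r) ** 2 + (x - c) ** 2
                weights[(y, x)] = math.exp(-dist2 / (2 * sigma ** 2))
        total = sum(weights.values())
        for (y, x), w in weights.items():
            # Normalize so that every head contributes exactly 1 to the count.
            density[y][x] += w / total
    return density


# Toy scene: the top half is near the camera (2 m), the bottom half is far (8 m).
depth = [[2.0 if row < 16 else 8.0 for _ in range(32)] for row in range(32)]
density = depth_adaptive_density_map([(5, 5), (20, 20)], depth, 32, 32)
estimated_count = sum(sum(row) for row in density)
```

Because each kernel is normalized after clipping to the image, the map sums to the number of annotated heads, while a distant head produces a narrower, sharper peak than a nearby one.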

Similar articles

Locating and Counting Heads in Crowds With a Depth Prior.

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9056-9072. doi: 10.1109/TPAMI.2021.3124956. Epub 2022 Nov 7.

Oriented feature pyramid network for small and dense wheat heads detection and counting.

Sci Rep. 2024 Apr 6;14(1):8106. doi: 10.1038/s41598-024-58638-y.

An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting.

Sensors (Basel). 2023 Sep 11;23(18):7805. doi: 10.3390/s23187805.

Locate, Size, and Count: Accurately Resolving People in Dense Crowds via Detection.

IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2739-2751. doi: 10.1109/TPAMI.2020.2974830. Epub 2021 Jul 1.

JHU-CROWD++: Large-Scale Crowd Counting Dataset and A Benchmark Method.

IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2594-2609. doi: 10.1109/TPAMI.2020.3035969. Epub 2022 Apr 1.

Kernel-Based Density Map Generation for Dense Object Counting.

IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1357-1370. doi: 10.1109/TPAMI.2020.3022878. Epub 2022 Feb 3.

Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.

IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.

Enhancement of Local Crowd Location and Count: Multiscale Counting Guided by Head RGB-Mask.

Comput Intell Neurosci. 2022 Aug 24;2022:5708807. doi: 10.1155/2022/5708807. eCollection 2022.

Multi-Task Foreground-Aware Network with Depth Completion for Enhanced RGB-D Fusion Object Detection Based on Transformer.

Sensors (Basel). 2024 Apr 8;24(7):2374. doi: 10.3390/s24072374.

RGB-Guided Depth Map Recovery by Two-Stage Coarse-to-Fine Dense CRF Models.

IEEE Trans Image Process. 2023;32:1315-1328. doi: 10.1109/TIP.2023.3242144. Epub 2023 Feb 23.

PMID:34735337
