解耦的两阶段人群计数及超越。

Decoupled Two-Stage Crowd Counting and Beyond.

出版信息

IEEE Trans Image Process. 2021;30:2862-2875. doi: 10.1109/TIP.2021.3055631. Epub 2021 Feb 12.

DOI:10.1109/TIP.2021.3055631

Abstract

One of appealing approaches to counting dense objects, such as crowd, is density map estimation. Density maps, however, present ambiguous appearance cues in congested scenes, rendering infeasibility in identifying individuals and difficulties in diagnosing errors. Inspired by an observation that counting can be interpreted as a two-stage process, i.e., identifying possible object regions and counting exact object numbers, we introduce a probabilistic intermediate representation termed the probability map that depicts the probability of each pixel being an object. This representation allows us to decouple counting into probability map regression (PMR) and count map regression (CMR). We therefore propose a novel decoupled two-stage counting (D2C) framework that sequentially regresses the probability map and learns a counter conditioned on the probability map. Given the probability map and the count map, a peak point detection algorithm is derived to localize each object with a point under the guidance of local counts. An advantage of D2C is that the counter can be learned reliably with additional synthesized probability maps. This addresses important data deficiency and sample imbalanced problems in counting. Our framework also enables easy diagnoses and analyses of error patterns. For instance, we find that, the counter per se is sufficiently accurate, while the bottleneck appears to be PMR. We further instantiate a network D2CNet in our framework and report state-of-the-art counting and localization performance across 6 crowd counting benchmarks. Since the probability map is a representation independent of visual appearance, D2CNet also exhibits remarkable cross-dataset transferability. Code and pretrained models are made available at: https://git.io/d2cnet.

摘要

一种用于计数密集目标（如人群）的吸引人的方法是密度图估计。然而，在拥挤的场景中，密度图呈现出模糊的外观线索，使得识别个体变得不可行，并且难以诊断错误。受计数可以解释为两个阶段的过程的观察启发，即识别可能的对象区域和计数确切的对象数量，我们引入了一种称为概率图的概率中间表示，该图描绘了每个像素成为对象的概率。这种表示允许我们将计数解耦为概率图回归（PMR）和计数图回归（CMR）。因此，我们提出了一种新颖的解耦两阶段计数（D2C）框架，该框架依次回归概率图，并根据概率图学习计数器。给定概率图和计数图，我们推导了一个峰值点检测算法，该算法在局部计数的指导下，用一个点来定位每个对象。D2C 的一个优点是，可以通过额外的合成概率图可靠地学习计数器。这解决了计数中重要的数据不足和样本不平衡问题。我们的框架还可以方便地诊断和分析错误模式。例如，我们发现，计数器本身已经足够准确，而瓶颈似乎是 PMR。我们进一步在我们的框架中实例化了一个网络 D2CNet，并在 6 个人群计数基准上报告了最先进的计数和定位性能。由于概率图是一种与视觉外观无关的表示，D2CNet 还表现出显著的跨数据集可转移性。代码和预训练模型可在以下网址获得：https://git.io/d2cnet。

相似文献

Decoupled Two-Stage Crowd Counting and Beyond.解耦的两阶段人群计数及超越。

IEEE Trans Image Process. 2021;30:2862-2875. doi: 10.1109/TIP.2021.3055631. Epub 2021 Feb 12.

Kernel-Based Density Map Generation for Dense Object Counting.用于密集目标计数的基于核的密度图生成

IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1357-1370. doi: 10.1109/TPAMI.2020.3022878. Epub 2022 Feb 3.

Counting Crowd by Weighing Counts: A Sequential Decision-Making Perspective.通过权衡计数来计算人群：一种顺序决策视角。

IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5141-5154. doi: 10.1109/TNNLS.2022.3202652. Epub 2024 Apr 4.

A Self-Training Approach for Point-Supervised Object Detection and Counting in Crowds.一种用于人群中点监督目标检测与计数的自训练方法。

IEEE Trans Image Process. 2021;30:2876-2887. doi: 10.1109/TIP.2021.3055632. Epub 2021 Feb 12.

Learning Discriminative Features for Crowd Counting.

IEEE Trans Image Process. 2024;33:3749-3764. doi: 10.1109/TIP.2024.3408609. Epub 2024 Jun 13.

Redesigning Multi-Scale Neural Network for Crowd Counting.重新设计用于人群计数的多尺度神经网络。

IEEE Trans Image Process. 2023;32:3664-3678. doi: 10.1109/TIP.2023.3289290. Epub 2023 Jul 4.

Counting dense object of multiple types based on feature enhancement.基于特征增强的多类型密集目标计数

Front Neurorobot. 2024 May 16;18:1383943. doi: 10.3389/fnbot.2024.1383943. eCollection 2024.

Adversarial Learning for Multiscale Crowd Counting Under Complex Scenes.对抗学习在复杂场景下的多尺度人群计数中的应用。

IEEE Trans Cybern. 2021 Nov;51(11):5423-5432. doi: 10.1109/TCYB.2019.2956091. Epub 2021 Nov 9.

Locating and Counting Heads in Crowds With a Depth Prior.基于深度先验的人群中人头定位与计数。

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9056-9072. doi: 10.1109/TPAMI.2021.3124956. Epub 2022 Nov 7.

Tracking-by-Counting: Using Network Flows on Crowd Density Maps for Tracking Multiple Targets.基于计数的跟踪：利用人群密度图上的网络流跟踪多个目标。

IEEE Trans Image Process. 2021;30:1439-1452. doi: 10.1109/TIP.2020.3044219. Epub 2020 Dec 29.

引用本文的文献

MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network.MRSNet：基于多分辨率尺度特征融合的通用密度计数网络。

Sensors (Basel). 2024 Sep 14;24(18):5974. doi: 10.3390/s24185974.

解耦的两阶段人群计数及超越。

Decoupled Two-Stage Crowd Counting and Beyond.

出版信息

IEEE Trans Image Process. 2021;30:2862-2875. doi: 10.1109/TIP.2021.3055631. Epub 2021 Feb 12.

DOI:10.1109/TIP.2021.3055631

PMID:33539296

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

解耦的两阶段人群计数及超越。

Decoupled Two-Stage Crowd Counting and Beyond.

出版信息

相似文献

引用本文的文献

解耦的两阶段人群计数及超越。

Decoupled Two-Stage Crowd Counting and Beyond.

出版信息

相似文献

引用本文的文献