用于增强人群计数的深度秩一致金字塔模型。

Deep Rank-Consistent Pyramid Model for Enhanced Crowd Counting.

作者信息

Gao Jiaqi, Huang Zhizhong, Lei Yiming, Shan Hongming, Wang James Z, Wang Fei-Yue, Zhang Junping

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):299-312. doi: 10.1109/TNNLS.2023.3336774. Epub 2025 Jan 7.

DOI:10.1109/TNNLS.2023.3336774

Abstract

Most conventional crowd counting methods utilize a fully-supervised learning framework to establish a mapping between scene images and crowd density maps. They usually rely on a large quantity of costly and time-intensive pixel-level annotations for training supervision. One way to mitigate the intensive labeling effort and improve counting accuracy is to leverage large amounts of unlabeled images. This is attributed to the inherent self-structural information and rank consistency within a single image, offering additional qualitative relation supervision during training. Contrary to earlier methods that utilized the rank relations at the original image level, we explore such rank-consistency relation within the latent feature spaces. This approach enables the incorporation of numerous pyramid partial orders, strengthening the model representation capability. A notable advantage is that it can also increase the utilization ratio of unlabeled samples. Specifically, we propose a Deep Rank-consist Ent pyrAmid Model (DREAM), which makes full use of rank consistency across coarse-to-fine pyramid features in latent spaces for enhanced crowd counting with massive unlabeled images. In addition, we have collected a new unlabeled crowd counting dataset, FUDAN-UCC, comprising 4000 images for training purposes. Extensive experiments on four benchmark datasets, namely UCF-QNRF, ShanghaiTech PartA and PartB, and UCF-CC-50, show the effectiveness of our method compared with previous semi-supervised methods. The codes are available at https://github.com/bridgeqiqi/DREAM.

摘要

大多数传统的人群计数方法利用全监督学习框架来建立场景图像与人群密度图之间的映射。它们通常依赖大量昂贵且耗时的像素级注释进行训练监督。减轻密集标注工作并提高计数准确性的一种方法是利用大量未标注图像。这归因于单个图像中固有的自结构信息和秩一致性，在训练期间提供额外的定性关系监督。与早期在原始图像级别利用秩关系的方法不同，我们在潜在特征空间中探索这种秩一致性关系。这种方法能够纳入大量金字塔偏序，增强模型表示能力。一个显著优点是它还可以提高未标注样本的利用率。具体来说，我们提出了一种深度秩一致熵金字塔模型（DREAM），它充分利用潜在空间中从粗到细的金字塔特征之间的秩一致性，通过大量未标注图像增强人群计数。此外，我们收集了一个新的未标注人群计数数据集FUDAN-UCC，包含4000张用于训练的图像。在四个基准数据集，即UCF-QNRF、上海科技大学A部分和B部分以及UCF-CC-50上进行的大量实验表明，与之前的半监督方法相比，我们的方法是有效的。代码可在https://github.com/bridgeqiqi/DREAM获取。

相似文献

Deep Rank-Consistent Pyramid Model for Enhanced Crowd Counting.

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):299-312. doi: 10.1109/TNNLS.2023.3336774. Epub 2025 Jan 7.

Scale-Aware Crowd Counting Network With Annotation Error Modeling.

IEEE Trans Image Process. 2025;34:2750-2764. doi: 10.1109/TIP.2025.3555116. Epub 2025 May 9.

Hybrid Perturbation Strategy for Semi-Supervised Crowd Counting.

IEEE Trans Image Process. 2024;33:1227-1240. doi: 10.1109/TIP.2024.3361730. Epub 2024 Feb 13.

COMAL: compositional multi-scale feature enhanced learning for crowd counting.

Multimed Tools Appl. 2022;81(15):20541-20560. doi: 10.1007/s11042-022-12249-9. Epub 2022 Mar 11.

Reducing Spatial Labeling Redundancy for Active Semi-Supervised Crowd Counting.

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):9248-9255. doi: 10.1109/TPAMI.2022.3232712. Epub 2023 Jun 5.

Multidimensional Measure Matching for Crowd Counting.

IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9112-9126. doi: 10.1109/TNNLS.2024.3435854. Epub 2025 May 2.

An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting.

Sensors (Basel). 2023 Sep 11;23(18):7805. doi: 10.3390/s23187805.

DMPNet: densely connected multi-scale pyramid networks for crowd counting.

PeerJ Comput Sci. 2022 Mar 18;8:e902. doi: 10.7717/peerj-cs.902. eCollection 2022.

Consistency-Aware Anchor Pyramid Network for Crowd Localization.

IEEE Trans Pattern Anal Mach Intell. 2024 Apr 29;PP. doi: 10.1109/TPAMI.2024.3392013.

Multi-Task Credible Pseudo-Label Learning for Semi-Supervised Crowd Counting.

IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):10394-10406. doi: 10.1109/TNNLS.2023.3241211. Epub 2024 Aug 5.

引用本文的文献

MTSC-Net: A Semi-Supervised Counting Network for Estimating the Number of Slash pine New Shoots.

Plant Phenomics. 2024 Aug 28;6:0228. doi: 10.34133/plantphenomics.0228. eCollection 2024.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于增强人群计数的深度秩一致金字塔模型。

Deep Rank-Consistent Pyramid Model for Enhanced Crowd Counting.

作者信息

Gao Jiaqi, Huang Zhizhong, Lei Yiming, Shan Hongming, Wang James Z, Wang Fei-Yue, Zhang Junping

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):299-312. doi: 10.1109/TNNLS.2023.3336774. Epub 2025 Jan 7.

DOI:10.1109/TNNLS.2023.3336774

PMID:38090870

Abstract

摘要

用于增强人群计数的深度秩一致金字塔模型。

Deep Rank-Consistent Pyramid Model for Enhanced Crowd Counting.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于增强人群计数的深度秩一致金字塔模型。

Deep Rank-Consistent Pyramid Model for Enhanced Crowd Counting.

作者信息

出版信息

相似文献

引用本文的文献