• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自适应学习进行透视失真校正的人群计数。

Counting Crowds with Perspective Distortion Correction via Adaptive Learning.

机构信息

School of Computer Science and Technology, East China Normal University, Shanghai 200062, China.

出版信息

Sensors (Basel). 2020 Jul 6;20(13):3781. doi: 10.3390/s20133781.

DOI:10.3390/s20133781
PMID:32640552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7374275/
Abstract

The goal of crowd counting is to estimate the number of people in the image. Presently, use regression to count people number became a mainstream method. It is worth noting that, with the development of convolutional neural networks (CNN), methods that are based on CNN have become a research hotspot. It is a more interesting topic that how to locate the site of the person in the image than simply predicting the number of people in the image. The perspective transformation present is still a challenge, because perspective distortion will cause differences in the size of the crowd in the image. To devote perspective distortion and locate the site of the person more accuracy, we design a novel framework named Adaptive Learning Network (CAL). We use the VGG as the backbone. After each pooling layer is output, we collect the 1/2, 1/4, 1/8, and 1/16 features of the original image and combine them with the weights learned by an adaptive learning branch. The object of our adaptive learning branch is each image in the datasets. By combining the output features of different sizes of each image, the challenge of drastic changes in the size of the image crowd due to perspective transformation is reduced. We conducted experiments on four population counting data sets (i.e., ShanghaiTech Part A, ShanghaiTech Part B, UCF_CC_50 and UCF-QNRF), and the results show that our model has a good performance.

摘要

人群计数的目标是估计图像中的人数。目前,使用回归来计数已成为主流方法。值得注意的是,随着卷积神经网络(CNN)的发展,基于 CNN 的方法已成为研究热点。如何在图像中定位人的位置比简单地预测图像中的人数更有趣。目前的透视变换仍然是一个挑战,因为透视失真会导致图像中人群大小的差异。为了更准确地进行透视失真和定位人的位置,我们设计了一个名为自适应学习网络(CAL)的新框架。我们使用 VGG 作为骨干。在输出每个池化层之后,我们收集原始图像的 1/2、1/4、1/8 和 1/16 特征,并将它们与自适应学习分支学习到的权重结合起来。我们自适应学习分支的目标是数据集内的每张图像。通过结合每张图像不同大小的输出特征,减少了由于透视变换导致图像人群大小急剧变化的挑战。我们在四个人群计数数据集(即 ShanghaiTech Part A、ShanghaiTech Part B、UCF_CC_50 和 UCF-QNRF)上进行了实验,结果表明我们的模型具有良好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/e86225b86575/sensors-20-03781-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/05341ce3f1eb/sensors-20-03781-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/55446d895314/sensors-20-03781-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/0cca1a74a45b/sensors-20-03781-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/97000e7bafac/sensors-20-03781-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/77bc474975cb/sensors-20-03781-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/3d003aaa5973/sensors-20-03781-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/e86225b86575/sensors-20-03781-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/05341ce3f1eb/sensors-20-03781-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/55446d895314/sensors-20-03781-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/0cca1a74a45b/sensors-20-03781-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/97000e7bafac/sensors-20-03781-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/77bc474975cb/sensors-20-03781-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/3d003aaa5973/sensors-20-03781-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2629/7374275/e86225b86575/sensors-20-03781-g007.jpg

相似文献

1
Counting Crowds with Perspective Distortion Correction via Adaptive Learning.利用自适应学习进行透视失真校正的人群计数。
Sensors (Basel). 2020 Jul 6;20(13):3781. doi: 10.3390/s20133781.
2
SPCANet: congested crowd counting strip pooling combined attention network.SPCANet:拥堵人群计数的条带池化联合注意力网络。
PeerJ Comput Sci. 2024 Sep 18;10:e2273. doi: 10.7717/peerj-cs.2273. eCollection 2024.
3
An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting.一种基于深度信息的自适应多尺度人群计数网络
Sensors (Basel). 2023 Sep 11;23(18):7805. doi: 10.3390/s23187805.
4
Context-Aware Multi-Scale Aggregation Network for Congested Crowd Counting.上下文感知多尺度聚合网络用于拥挤人群计数。
Sensors (Basel). 2022 Apr 22;22(9):3233. doi: 10.3390/s22093233.
5
COMAL: compositional multi-scale feature enhanced learning for crowd counting.COMAL:用于人群计数的组合多尺度特征增强学习
Multimed Tools Appl. 2022;81(15):20541-20560. doi: 10.1007/s11042-022-12249-9. Epub 2022 Mar 11.
6
MH-MetroNet-A Multi-Head CNN for Passenger-Crowd Attendance Estimation.MH-MetroNet——一种用于乘客人群出勤估计的多头卷积神经网络
J Imaging. 2020 Jul 2;6(7):62. doi: 10.3390/jimaging6070062.
7
Crowd Counting Based on Multiscale Spatial Guided Perception Aggregation Network.基于多尺度空间引导感知聚合网络的人群计数
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17465-17478. doi: 10.1109/TNNLS.2023.3304348. Epub 2024 Dec 2.
8
HADF-Crowd: A Hierarchical Attention-Based Dense Feature Extraction Network for Single-Image Crowd Counting.基于分层注意力的密集特征提取网络用于单幅图像人群计数(HADF-Crowd)。
Sensors (Basel). 2021 May 17;21(10):3483. doi: 10.3390/s21103483.
9
PaDNet: Pan-Density Crowd Counting.PaDNet:全景密度人群计数
IEEE Trans Image Process. 2019 Nov 12. doi: 10.1109/TIP.2019.2952083.
10
A Dilated Convolutional Neural Network for Cross-Layers of Contextual Information for Congested Crowd Counting.一种用于拥挤人群计数的跨层上下文信息的扩张卷积神经网络。
Sensors (Basel). 2024 Mar 12;24(6):1816. doi: 10.3390/s24061816.

引用本文的文献

1
Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting.基于元知识和多任务学习的多场景自适应人群计数。
Sensors (Basel). 2022 Apr 26;22(9):3320. doi: 10.3390/s22093320.

本文引用的文献

1
Locate, Size, and Count: Accurately Resolving People in Dense Crowds via Detection.定位、大小和计数:通过检测准确解析密集人群中的人员。
IEEE Trans Pattern Anal Mach Intell. 2021 Aug;43(8):2739-2751. doi: 10.1109/TPAMI.2020.2974830. Epub 2021 Jul 1.
2
Nonlinear Regression via Deep Negative Correlation Learning.通过深度负相关学习进行非线性回归
IEEE Trans Pattern Anal Mach Intell. 2021 Mar;43(3):982-998. doi: 10.1109/TPAMI.2019.2943860. Epub 2021 Feb 4.
3
HA-CCN: Hierarchical Attention-based Crowd Counting Network.HA-CCN:基于分层注意力的人群计数网络。
IEEE Trans Image Process. 2019 Jul 19. doi: 10.1109/TIP.2019.2928634.
4
Design and Analysis of a Lightweight Context Fusion CNN Scheme for Crowd Counting.用于人群计数的轻量级上下文融合卷积神经网络方案的设计与分析
Sensors (Basel). 2019 Apr 29;19(9):2013. doi: 10.3390/s19092013.
5
Smart Camera Aware Crowd Counting via Multiple Task Fractional Stride Deep Learning.基于多任务分数步长深度学习的智能相机人群计数。
Sensors (Basel). 2019 Mar 18;19(6):1346. doi: 10.3390/s19061346.
6
Exploiting Unlabeled Data in CNNs by Self-Supervised Learning to Rank.通过自监督学习排序在卷积神经网络中利用未标记数据。
IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1862-1878. doi: 10.1109/TPAMI.2019.2899857. Epub 2019 Feb 15.
7
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.
8
Anomaly detection and localization in crowded scenes.拥挤场景中的异常检测和定位。
IEEE Trans Pattern Anal Mach Intell. 2014 Jan;36(1):18-32. doi: 10.1109/TPAMI.2013.111.
9
Monocular pedestrian detection: survey and experiments.单目行人检测:综述与实验
IEEE Trans Pattern Anal Mach Intell. 2009 Dec;31(12):2179-95. doi: 10.1109/TPAMI.2008.260.
10
Pedestrian detection via classification on Riemannian manifolds.基于黎曼流形分类的行人检测
IEEE Trans Pattern Anal Mach Intell. 2008 Oct;30(10):1713-27. doi: 10.1109/TPAMI.2008.75.