• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于语义场景分割的直升机视频人群计数。

Crowd Counting with Semantic Scene Segmentation in Helicopter Footage.

机构信息

Department of Civil Engineering, The University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 1538505, Japan.

Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro, Tokyo 1538505, Japan.

出版信息

Sensors (Basel). 2020 Aug 27;20(17):4855. doi: 10.3390/s20174855.

DOI:10.3390/s20174855
PMID:32867289
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7506704/
Abstract

Continually improving crowd counting neural networks have been developed in recent years. The accuracy of these networks has reached such high levels that further improvement is becoming very difficult. However, this high accuracy lacks deeper semantic information, such as social roles (e.g., student, company worker, or police officer) or location-based roles (e.g., pedestrian, tenant, or construction worker). Some of these can be learned from the same set of features as the human nature of an entity, whereas others require wider contextual information from the human surroundings. The primary end-goal of developing recognition software is to involve them in autonomous decision-making systems. Therefore, it must be foolproof, which is, it must have good semantic understanding of the input. In this study, we focus on counting pedestrians in helicopter footage and introduce a dataset created from helicopter videos for this purpose. We use semantic segmentation to extract the required additional contextual information from the surroundings of an entity. We demonstrate that it is possible to increase the pedestrian counting accuracy in this manner. Furthermore, we show that crowd counting and semantic segmentation can be simultaneously achieved, with comparable or even improved accuracy, by using the same crowd counting neural network for both tasks through hard parameter sharing. The presented method is generic and it can be applied to arbitrary crowd density estimation methods. A link to the dataset is available at the end of the paper.

摘要

近年来,不断改进的人群计数神经网络已经被开发出来。这些网络的准确性已经达到了非常高的水平,进一步提高变得非常困难。然而,这种高精度缺乏更深层次的语义信息,例如社会角色(例如,学生、公司员工或警察)或基于位置的角色(例如,行人、租户或建筑工人)。其中一些可以从与实体的自然属性相同的特征集中学习到,而其他的则需要来自人类环境的更广泛的上下文信息。开发识别软件的主要最终目标是将其应用于自主决策系统。因此,它必须是万无一失的,也就是说,它必须对输入有很好的语义理解。在这项研究中,我们专注于在直升机镜头中计算行人数量,并为此目的引入了一个从直升机视频创建的数据集。我们使用语义分割从实体的周围环境中提取所需的额外上下文信息。我们证明,通过使用相同的人群计数神经网络同时完成人群计数和语义分割,并且通过硬参数共享来实现类似甚至更好的准确性,这种方法是可行的。所提出的方法是通用的,可以应用于任意人群密度估计方法。数据集的链接在论文的结尾处提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d5a/7506704/c76048bfd60f/sensors-20-04855-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d5a/7506704/c76048bfd60f/sensors-20-04855-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d5a/7506704/c76048bfd60f/sensors-20-04855-g001.jpg

相似文献

1
Crowd Counting with Semantic Scene Segmentation in Helicopter Footage.基于语义场景分割的直升机视频人群计数。
Sensors (Basel). 2020 Aug 27;20(17):4855. doi: 10.3390/s20174855.
2
Body Structure Aware Deep Crowd Counting.人体结构感知的深度学习人群计数。
IEEE Trans Image Process. 2018 Mar;27(3):1049-1059. doi: 10.1109/TIP.2017.2740160. Epub 2017 Aug 14.
3
Fine-Grained Crowd Counting.细粒度人群计数。
IEEE Trans Image Process. 2021;30:2114-2126. doi: 10.1109/TIP.2021.3049938. Epub 2021 Jan 25.
4
Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network.使用语义分割网络的快速R-CNN用于稳健行人检测
Front Neurorobot. 2018 Oct 5;12:64. doi: 10.3389/fnbot.2018.00064. eCollection 2018.
5
New End-to-End Strategy Based on DeepLabv3+ Semantic Segmentation for Human Head Detection.基于 DeepLabv3+语义分割的全新端到端人体头部检测策略。
Sensors (Basel). 2021 Aug 30;21(17):5848. doi: 10.3390/s21175848.
6
COMAL: compositional multi-scale feature enhanced learning for crowd counting.COMAL:用于人群计数的组合多尺度特征增强学习
Multimed Tools Appl. 2022;81(15):20541-20560. doi: 10.1007/s11042-022-12249-9. Epub 2022 Mar 11.
7
Congested Crowd Counting via Adaptive Multi-Scale Context Learning.基于自适应多尺度上下文学习的拥挤人群计数。
Sensors (Basel). 2021 May 29;21(11):3777. doi: 10.3390/s21113777.
8
An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting.一种基于深度信息的自适应多尺度人群计数网络
Sensors (Basel). 2023 Sep 11;23(18):7805. doi: 10.3390/s23187805.
9
Redesigned Skip-Network for Crowd Counting with Dilated Convolution and Backward Connection.用于人群计数的重新设计的跳跃网络:带空洞卷积和反向连接
J Imaging. 2020 May 2;6(5):28. doi: 10.3390/jimaging6050028.
10
ACSL: Adaptive correlation-driven sparsity learning for deep neural network compression.ACSL:用于深度神经网络压缩的自适应相关驱动稀疏学习。
Neural Netw. 2021 Dec;144:465-477. doi: 10.1016/j.neunet.2021.09.012. Epub 2021 Sep 16.

引用本文的文献

1
Context-Aware Multi-Scale Aggregation Network for Congested Crowd Counting.上下文感知多尺度聚合网络用于拥挤人群计数。
Sensors (Basel). 2022 Apr 22;22(9):3233. doi: 10.3390/s22093233.
2
Congested Crowd Counting via Adaptive Multi-Scale Context Learning.基于自适应多尺度上下文学习的拥挤人群计数。
Sensors (Basel). 2021 May 29;21(11):3777. doi: 10.3390/s21113777.