

Attention Based CNN-ConvLSTM for Pedestrian Attribute Recognition.

Affiliations

School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China.

School of Information Technology, Shanghai Jianqiao University, Shanghai 201306, China.

Publication information

Sensors (Basel). 2020 Feb 3;20(3):811. doi: 10.3390/s20030811.

DOI: 10.3390/s20030811
PMID: 32028568
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7038686/
Abstract

Pedestrian attribute recognition plays an important role in video surveillance and has become an active topic in computer vision research. The task is challenging because of variations in viewpoint, illumination, resolution and occlusion. Existing pedestrian attribute recognition methods perform unsatisfactorily because they ignore the correlation between pedestrian attributes and spatial information. To address this, this paper treats the task as a spatiotemporal, sequential, multi-label image classification problem and proposes an attention-based neural network (CNN-CAtt-ConvLSTM) consisting of a convolutional neural network (CNN), channel attention (CAtt) and convolutional long short-term memory (ConvLSTM). First, a pre-trained CNN and CAtt extract the salient and correlated visual features of pedestrian attributes. Then, ConvLSTM further extracts spatial information and correlations from pedestrian attributes. Finally, pedestrian attributes are predicted in sequences optimized according to attribute image area size and importance. Extensive experiments on two common pedestrian attribute datasets, the PEdesTrian Attribute (PETA) dataset and the Richly Annotated Pedestrian (RAP) dataset, show higher performance than other state-of-the-art (SOTA) methods, which supports the effectiveness of the proposed method.
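The pipeline summarized in the abstract (channel attention over CNN features, followed by ConvLSTM steps that emit attributes in sequence) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the squeeze-and-excitation style gating, the 3x3 kernel size, the tensor sizes, the random weights, and the per-step `readout` classifier are all assumptions chosen for illustration, and a random array stands in for the pre-trained CNN feature map.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Squeeze-and-excitation style gating (assumed form of CAtt).
    feat: (C, H, W); w1: (C//r, C); w2: (C, C//r)."""
    squeezed = feat.mean(axis=(1, 2))          # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)    # bottleneck with ReLU
    weights = sigmoid(w2 @ hidden)             # per-channel gate in (0, 1)
    return feat * weights[:, None, None], weights

def conv3x3(x, kernel):
    """Zero-padded 'same' 3x3 cross-correlation.
    x: (Cin, H, W); kernel: (Cout, Cin, 3, 3)."""
    _, h, w = x.shape
    padded = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros((kernel.shape[0], h, w))
    for dy in range(3):
        for dx in range(3):
            out += np.einsum("oc,chw->ohw", kernel[:, :, dy, dx],
                             padded[:, dy:dy + h, dx:dx + w])
    return out

def convlstm_step(x, h_prev, c_prev, kernel, bias):
    """One ConvLSTM step: the gates come from a convolution over [x, h_prev],
    so the hidden and cell states keep their spatial layout."""
    stacked = np.concatenate([x, h_prev], axis=0)
    gates = conv3x3(stacked, kernel) + bias[:, None, None]
    i, f, o, g = np.split(gates, 4, axis=0)    # input, forget, output, candidate
    c = sigmoid(f) * c_prev + sigmoid(i) * np.tanh(g)
    h = sigmoid(o) * np.tanh(c)
    return h, c

# Hypothetical sizes: 8 feature channels, 6x4 map, 5 hidden channels, 3 steps.
rng = np.random.default_rng(0)
C, H, W, HID, STEPS = 8, 6, 4, 5, 3
feat = rng.normal(size=(C, H, W))              # stand-in for CNN features
w1 = rng.normal(size=(C // 2, C)) * 0.1
w2 = rng.normal(size=(C, C // 2)) * 0.1
attended, weights = channel_attention(feat, w1, w2)

kernel = rng.normal(size=(4 * HID, C + HID, 3, 3)) * 0.1
bias = np.zeros(4 * HID)
readout = rng.normal(size=(STEPS, HID)) * 0.1  # one assumed classifier per step
h = np.zeros((HID, H, W))
c = np.zeros((HID, H, W))
probs = []
for t in range(STEPS):                         # one attribute per step
    h, c = convlstm_step(attended, h, c, kernel, bias)
    pooled = h.mean(axis=(1, 2))
    probs.append(float(sigmoid(readout[t] @ pooled)))
print(probs)
```

In this sketch the same attended feature map is fed at every step and the order of the steps is fixed. The abstract states that the prediction order is optimized by attribute image area size and importance; that ordering is not modeled here.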


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e05d/7038686/09a8ff0f175e/sensors-20-00811-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e05d/7038686/e38928de20cf/sensors-20-00811-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e05d/7038686/c3f62b7d7e09/sensors-20-00811-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e05d/7038686/c8cbd07061be/sensors-20-00811-g004.jpg


