宿主-寄生虫：基于图 LSTM-in-LSTM 的群体活动识别。

Host-Parasite: Graph LSTM-in-LSTM for Group Activity Recognition.

出版信息

IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):663-674. doi: 10.1109/TNNLS.2020.2978942. Epub 2021 Feb 4.

DOI:10.1109/TNNLS.2020.2978942

Abstract

This article aims to tackle the problem of group activity recognition in the multiple-person scene. To model the group activity with multiple persons, most long short-term memory (LSTM)-based methods first learn the person-level action representations by several LSTMs and then integrate all the person-level action representations into the following LSTM to learn the group-level activity representation. This type of solution is a two-stage strategy, which neglects the "host-parasite" relationship between the group-level activity ("host") and person-level actions ("parasite") in spatiotemporal space. To this end, we propose a novel graph LSTM-in-LSTM (GLIL) for group activity recognition by modeling the person-level actions and the group-level activity simultaneously. GLIL is a "host-parasite" architecture, which can be seen as several person LSTMs (P-LSTMs) in the local view or a graph LSTM (G-LSTM) in the global view. Specifically, P-LSTMs model the person-level actions based on the interactions among persons. Meanwhile, G-LSTM models the group-level activity, where the person-level motion information in multiple P-LSTMs is selectively integrated and stored into G-LSTM based on their contributions to the inference of the group activity class. Furthermore, to use the person-level temporal features instead of the person-level static features as the input of GLIL, we introduce a residual LSTM with the residual connection to learn the person-level residual features, consisting of temporal features and static features. Experimental results on two public data sets illustrate the effectiveness of the proposed GLIL compared with state-of-the-art methods.

摘要

本文旨在解决多人场景中的群体活动识别问题。为了对多人的群体活动进行建模，大多数基于长短期记忆网络（LSTM）的方法首先通过几个 LSTM 学习人员级别的动作表示，然后将所有人员级别的动作表示集成到后续的 LSTM 中，以学习群体级别的活动表示。这种方法是一种两阶段策略，忽略了群体活动（“宿主”）和人员级别动作（“寄生虫”）在时空空间中的“宿主-寄生虫”关系。为此，我们提出了一种新颖的图 LSTM-in-LSTM（GLIL），通过同时对人员级别动作和群体级别活动进行建模来进行群体活动识别。GLIL 是一种“宿主-寄生虫”架构，可以在局部视图中视为几个人员 LSTM（P-LSTM），也可以在全局视图中视为图 LSTM（G-LSTM）。具体来说，P-LSTM 基于人员之间的相互作用来对人员级别动作进行建模。同时，G-LSTM 对群体级别活动进行建模，其中，基于人员对群体活动类推断的贡献，从多个 P-LSTM 中选择性地整合和存储人员级别的运动信息到 G-LSTM 中。此外，为了使用人员级别的时间特征而不是人员级别的静态特征作为 GLIL 的输入，我们引入了具有残差连接的残差 LSTM 来学习人员级别的残差特征，残差特征由时间特征和静态特征组成。在两个公共数据集上的实验结果表明，与最先进的方法相比，所提出的 GLIL 是有效的。

相似文献

Host-Parasite: Graph LSTM-in-LSTM for Group Activity Recognition.宿主-寄生虫：基于图 LSTM-in-LSTM 的群体活动识别。

IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):663-674. doi: 10.1109/TNNLS.2020.2978942. Epub 2021 Feb 4.

Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition.用于人类交互识别的层次长短时并发记忆

IEEE Trans Pattern Anal Mach Intell. 2021 Mar;43(3):1110-1118. doi: 10.1109/TPAMI.2019.2942030. Epub 2021 Feb 4.

Coherence Constrained Graph LSTM for Group Activity Recognition.基于连贯性约束图 LSTM 的群组活动识别

IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):636-647. doi: 10.1109/TPAMI.2019.2928540. Epub 2022 Jan 7.

Social-Aware Pedestrian Trajectory Prediction via States Refinement LSTM.基于状态精炼 LSTM 的社交感知行人轨迹预测。

IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2742-2759. doi: 10.1109/TPAMI.2020.3038217. Epub 2022 Apr 1.

End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis.端到端使用深度神经网络进行多模态临床抑郁症识别：比较分析。

Comput Methods Programs Biomed. 2021 Nov;211:106433. doi: 10.1016/j.cmpb.2021.106433. Epub 2021 Sep 28.

ECG-based cardiac arrhythmias detection through ensemble learning and fusion of deep spatial-temporal and long-range dependency features.基于 ECG 的心脏心律失常检测，通过深度时空和长距离依赖特征的集成学习和融合。

Artif Intell Med. 2024 Apr;150:102818. doi: 10.1016/j.artmed.2024.102818. Epub 2024 Feb 24.

Deep belief improved bidirectional LSTM for multivariate time series forecasting.用于多变量时间序列预测的深度信念改进双向长短期记忆网络

Math Biosci Eng. 2023 Aug 17;20(9):16596-16627. doi: 10.3934/mbe.2023739.

Continuous Joint Kinematics Prediction Using GAT-LSTM Framework Based on Muscle Synergy and Sparse sEMG.基于肌肉协同和稀疏表面肌电图的GAT-LSTM框架连续关节运动学预测

IEEE Trans Neural Syst Rehabil Eng. 2025;33:1763-1773. doi: 10.1109/TNSRE.2025.3565305. Epub 2025 May 8.

Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks.基于骨架的全局上下文感知注意力 LSTM 网络的人体动作识别。

IEEE Trans Image Process. 2018 Apr;27(4):1586-1599. doi: 10.1109/TIP.2017.2785279.

Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video.从第一人称视频中进行动作预测的滚动-展开 LSTM。

IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4021-4036. doi: 10.1109/TPAMI.2020.2992889. Epub 2021 Oct 1.

引用本文的文献

Content oriented 3D-CNN sequence learning architecture for academic activities recognition using a realistic CAD dataset.用于使用真实CAD数据集进行学术活动识别的面向内容的3D-CNN序列学习架构。

Sci Rep. 2025 Jul 12;15(1):25250. doi: 10.1038/s41598-025-07620-3.

The application of suitable sports games for junior high school students based on deep learning and artificial intelligence.基于深度学习和人工智能的适合初中生的体育游戏应用

Sci Rep. 2025 May 16;15(1):17056. doi: 10.1038/s41598-025-01941-z.

ERABiLNet: enhanced residual attention with bidirectional long short-term memory.ERABiLNet：具有双向长短时记忆的增强型残差注意力网络。

Sci Rep. 2024 Sep 4;14(1):20622. doi: 10.1038/s41598-024-71299-1.

Implementing heuristic-based multiscale depth-wise separable adaptive temporal convolutional network for ambient air quality prediction using real time data.使用实时数据实现基于启发式的多尺度深度可分离自适应时间卷积网络用于环境空气质量预测。

Sci Rep. 2024 Aug 8;14(1):18437. doi: 10.1038/s41598-024-68793-x.

Clinical human activity recognition based on a wearable patch of combined tri-axial ACC and ECG sensors.基于可穿戴式三轴加速度计和心电图传感器组合贴片的临床人体活动识别

Digit Health. 2024 Jan 4;10:20552076231223804. doi: 10.1177/20552076231223804. eCollection 2024 Jan-Dec.

Contrastive self-supervised representation learning without negative samples for multimodal human action recognition.用于多模态人类动作识别的无负样本对比自监督表征学习

Front Neurosci. 2023 Jul 5;17:1225312. doi: 10.3389/fnins.2023.1225312. eCollection 2023.

SenseFi: A library and benchmark on deep-learning-empowered WiFi human sensing.SenseFi：一个基于深度学习的WiFi人体感知库及基准测试

Patterns (N Y). 2023 Feb 28;4(3):100703. doi: 10.1016/j.patter.2023.100703. eCollection 2023 Mar 10.

Multi-scale and attention enhanced graph convolution network for skeleton-based violence action recognition.用于基于骨架的暴力行为识别的多尺度注意力增强图卷积网络。

Front Neurorobot. 2022 Dec 15;16:1091361. doi: 10.3389/fnbot.2022.1091361. eCollection 2022.

Human Activity Recognition: Review, Taxonomy and Open Challenges.人体活动识别：综述、分类与开放挑战。

Sensors (Basel). 2022 Aug 27;22(17):6463. doi: 10.3390/s22176463.

Multi-Perspective Representation to Part-Based Graph for Group Activity Recognition.基于图的多视角表示的部分群组活动识别。

Sensors (Basel). 2022 Jul 24;22(15):5521. doi: 10.3390/s22155521.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

宿主-寄生虫：基于图 LSTM-in-LSTM 的群体活动识别。

Host-Parasite: Graph LSTM-in-LSTM for Group Activity Recognition.

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献