• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 SkeletonCLIP++的人体运动识别技术研究:加权视频特征融合与增强对比样本判别。

Advancing Human Motion Recognition with SkeletonCLIP++: Weighted Video Feature Integration and Enhanced Contrastive Sample Discrimination.

机构信息

Department of Control Science and Engineering, Harbin Institute of Technology, Harbin 150001, China.

出版信息

Sensors (Basel). 2024 Feb 11;24(4):1189. doi: 10.3390/s24041189.

DOI:10.3390/s24041189
PMID:38400347
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10892604/
Abstract

This paper introduces 'SkeletonCLIP++', an extension of our prior work in human action recognition, emphasizing the use of semantic information beyond traditional label-based methods. The innovation, 'Weighted Frame Integration' (WFI), shifts video feature computation from averaging to a weighted frame approach, enabling a more nuanced representation of human movements in line with semantic relevance. Another key development, 'Contrastive Sample Identification' (CSI), introduces a novel discriminative task within the model. This task involves identifying the most similar negative sample among positive ones, enhancing the model's ability to distinguish between closely related actions. Incorporating the 'BERT Text Encoder Integration' (BTEI) leverages the pre-trained BERT model as our text encoder to refine the model's performance. Empirical evaluations on HMDB-51, UCF-101, and NTU RGB+D 60 datasets illustrate positive improvements, especially in smaller datasets. 'SkeletonCLIP++' thus offers a refined approach to human action recognition, ensuring semantic integrity and detailed differentiation in video data analysis.

摘要

本文介绍了“SkeletonCLIP++”,这是我们之前在人体动作识别工作的扩展,强调使用超越传统基于标签方法的语义信息。创新的“加权帧集成”(WFI)将视频特征计算从平均化转变为加权帧方法,使人体运动的表示更加细致入微,符合语义相关性。另一个关键的发展是“对比样本识别”(CSI),在模型中引入了一个新的判别任务。该任务涉及在正样本中识别最相似的负样本,从而提高模型区分密切相关动作的能力。结合“BERT 文本编码器集成”(BTEI),利用预训练的 BERT 模型作为我们的文本编码器,以提高模型的性能。在 HMDB-51、UCF-101 和 NTU RGB+D 60 数据集上的实证评估表明了积极的改进,特别是在较小的数据集上。因此,“SkeletonCLIP++”提供了一种更精细的人体动作识别方法,确保在视频数据分析中保持语义完整性和详细的区分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/a9db4d1393a9/sensors-24-01189-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/243614927ccd/sensors-24-01189-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/bdc334696a05/sensors-24-01189-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/ceab0791f48c/sensors-24-01189-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/1461d3ff68e7/sensors-24-01189-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/8b91da1fa49e/sensors-24-01189-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/a9db4d1393a9/sensors-24-01189-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/243614927ccd/sensors-24-01189-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/bdc334696a05/sensors-24-01189-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/ceab0791f48c/sensors-24-01189-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/1461d3ff68e7/sensors-24-01189-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/8b91da1fa49e/sensors-24-01189-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d169/10892604/a9db4d1393a9/sensors-24-01189-g006.jpg

相似文献

1
Advancing Human Motion Recognition with SkeletonCLIP++: Weighted Video Feature Integration and Enhanced Contrastive Sample Discrimination.基于 SkeletonCLIP++的人体运动识别技术研究:加权视频特征融合与增强对比样本判别。
Sensors (Basel). 2024 Feb 11;24(4):1189. doi: 10.3390/s24041189.
2
Multi-Granularity Anchor-Contrastive Representation Learning for Semi-Supervised Skeleton-Based Action Recognition.多粒度锚点对比学习在半监督骨架动作识别中的应用
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7559-7576. doi: 10.1109/TPAMI.2022.3222871. Epub 2023 May 5.
3
Contrastive Positive Sample Propagation Along the Audio-Visual Event Line.对比正样本沿着视听事件线的传播。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7239-7257. doi: 10.1109/TPAMI.2022.3223688. Epub 2023 May 5.
4
Improving few-shot relation extraction through semantics-guided learning.通过语义引导学习提高小样本关系抽取。
Neural Netw. 2024 Jan;169:453-461. doi: 10.1016/j.neunet.2023.10.053. Epub 2023 Nov 3.
5
Local structure-aware graph contrastive representation learning.基于局部结构感知的图对比表示学习。
Neural Netw. 2024 Apr;172:106083. doi: 10.1016/j.neunet.2023.12.037. Epub 2023 Dec 27.
6
sCL-ST: Supervised Contrastive Learning With Semantic Transformations for Multiple Lead ECG Arrhythmia Classification.sCL-ST:基于语义转换的监督对比学习在多导联 ECG 心律失常分类中的应用。
IEEE J Biomed Health Inform. 2023 Jun;27(6):2818-2828. doi: 10.1109/JBHI.2023.3246241. Epub 2023 Jun 5.
7
Construction and Research on Chinese Semantic Mapping Based on Linguistic Features and Sparse Self-Learning Neural Networks.基于语言特征和稀疏自学习神经网络的中文语义映射构建与研究。
Comput Intell Neurosci. 2022 Jun 20;2022:2315802. doi: 10.1155/2022/2315802. eCollection 2022.
8
Application of Entity-BERT model based on neuroscience and brain-like cognition in electronic medical record entity recognition.基于神经科学和类脑认知的实体BERT模型在电子病历实体识别中的应用
Front Neurosci. 2023 Sep 20;17:1259652. doi: 10.3389/fnins.2023.1259652. eCollection 2023.
9
Video Content Analysis of Human Sports under Engineering Management Incorporating High-Level Semantic Recognition Models.工程管理中融入高层语义识别模型的人类运动视频内容分析。
Comput Intell Neurosci. 2022 Jan 12;2022:6761857. doi: 10.1155/2022/6761857. eCollection 2022.
10
Self-Supervised 3-D Semantic Representation Learning for Vision-and-Language Navigation.用于视觉与语言导航的自监督3D语义表征学习
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6738-6751. doi: 10.1109/TNNLS.2024.3395633. Epub 2025 Apr 4.

本文引用的文献

1
ActionCLIP: Adapting Language-Image Pretrained Models for Video Action Recognition.ActionCLIP:将语言-图像预训练模型应用于视频动作识别
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):625-637. doi: 10.1109/TNNLS.2023.3331841. Epub 2025 Jan 7.
2
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates.基于时空长短期记忆网络及信任门控的骨骼动作识别
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):3007-3021. doi: 10.1109/TPAMI.2017.2771306. Epub 2017 Nov 9.
3
Skeleton-Based Human Action Recognition With Global Context-Aware Attention LSTM Networks.
基于骨架的全局上下文感知注意力 LSTM 网络的人体动作识别。
IEEE Trans Image Process. 2018 Apr;27(4):1586-1599. doi: 10.1109/TIP.2017.2785279.