• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

层次聚类多任务学习用于联合人体动作分组和识别。

Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2017 Jan;39(1):102-114. doi: 10.1109/TPAMI.2016.2537337. Epub 2016 Mar 2.

DOI:10.1109/TPAMI.2016.2537337
PMID:26955018
Abstract

This paper proposes a hierarchical clustering multi-task learning (HC-MTL) method for joint human action grouping and recognition. Specifically, we formulate the objective function into the group-wise least square loss regularized by low rank and sparsity with respect to two latent variables, model parameters and grouping information, for joint optimization. To handle this non-convex optimization, we decompose it into two sub-tasks, multi-task learning and task relatedness discovery. First, we convert this non-convex objective function into the convex formulation by fixing the latent grouping information. This new objective function focuses on multi-task learning by strengthening the shared-action relationship and action-specific feature learning. Second, we leverage the learned model parameters for the task relatedness measure and clustering. In this way, HC-MTL can attain both optimal action models and group discovery by alternating iteratively. The proposed method is validated on three kinds of challenging datasets, including six realistic action datasets (Hollywood2, YouTube, UCF Sports, UCF50, HMDB51 & UCF101), two constrained datasets (KTH & TJU), and two multi-view datasets (MV-TJU & IXMAS). The extensive experimental results show that: 1) HC-MTL can produce competing performances to the state of the arts for action recognition and grouping; 2) HC-MTL can overcome the difficulty in heuristic action grouping simply based on human knowledge; 3) HC-MTL can avoid the possible inconsistency between the subjective action grouping depending on human knowledge and objective action grouping based on the feature subspace distributions of multiple actions. Comparison with the popular clustered multi-task learning further reveals that the discovered latent relatedness by HC-MTL aids inducing the group-wise multi-task learning and boosts the performance. To the best of our knowledge, ours is the first work that breaks the assumption that all actions are either independent for individual learning or correlated for joint modeling and proposes HC-MTL for automated, joint action grouping and modeling.

摘要

本文提出了一种层次聚类多任务学习(HC-MTL)方法,用于联合人体动作分组和识别。具体来说,我们将目标函数形式化为两个潜在变量(模型参数和分组信息)的低秩和稀疏正则化的组间最小二乘损失,进行联合优化。为了处理这个非凸优化问题,我们将其分解为两个子任务,多任务学习和任务相关性发现。首先,我们通过固定潜在的分组信息,将这个非凸目标函数转化为凸函数。这个新的目标函数通过加强共享动作关系和动作特定特征学习,专注于多任务学习。其次,我们利用学习到的模型参数进行任务相关性度量和聚类。通过这种方式,HC-MTL 可以通过交替迭代来获得最佳的动作模型和分组发现。该方法在三个具有挑战性的数据集上进行了验证,包括六个现实的动作数据集(好莱坞 2 数据集、YouTube 数据集、UCF Sports 数据集、UCF50 数据集、HMDB51 数据集和 UCF101 数据集)、两个受限数据集(KTH 数据集和 TJU 数据集)和两个多视图数据集(MV-TJU 数据集和 IXMAS 数据集)。广泛的实验结果表明:1)HC-MTL 可以在动作识别和分组方面达到最先进的水平;2)HC-MTL 可以克服基于人类知识的启发式动作分组的困难;3)HC-MTL 可以避免基于多动作特征子空间分布的主观动作分组和基于客观动作分组之间可能的不一致性。与流行的聚类多任务学习的比较进一步表明,HC-MTL 发现的潜在相关性有助于诱导分组的多任务学习,并提高性能。据我们所知,这是第一个打破所有动作要么独立于个体学习,要么相关于联合建模的假设,并提出 HC-MTL 用于自动、联合动作分组和建模的工作。

相似文献

1
Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition.层次聚类多任务学习用于联合人体动作分组和识别。
IEEE Trans Pattern Anal Mach Intell. 2017 Jan;39(1):102-114. doi: 10.1109/TPAMI.2016.2537337. Epub 2016 Mar 2.
2
Multipe/single-view human action recognition via part-induced multitask structural learning.基于部件诱导多任务结构学习的多/单视图人体动作识别。
IEEE Trans Cybern. 2015 Jun;45(6):1194-208. doi: 10.1109/TCYB.2014.2347057. Epub 2014 Aug 27.
3
Multi-Task Convolutional Neural Network for Pose-Invariant Face Recognition.多任务卷积神经网络的姿态不变人脸识别。
IEEE Trans Image Process. 2018 Feb;27(2):964-975. doi: 10.1109/TIP.2017.2765830.
4
Multi-Grained Random Fields for Mitosis Identification in Time-Lapse Phase Contrast Microscopy Image Sequences.多时相相差显微图像序列中核分裂识别的多粒度随机场。
IEEE Trans Med Imaging. 2017 Aug;36(8):1699-1710. doi: 10.1109/TMI.2017.2686705. Epub 2017 Mar 23.
5
Multi-label zero-shot human action recognition via joint latent ranking embedding.基于联合潜在排序嵌入的多标签零镜头人体动作识别。
Neural Netw. 2020 Feb;122:1-23. doi: 10.1016/j.neunet.2019.09.029. Epub 2019 Oct 21.
6
Bayesian multi-task learning for decoding multi-subject neuroimaging data.用于解码多主体神经成像数据的贝叶斯多任务学习
Neuroimage. 2014 May 15;92(100):298-311. doi: 10.1016/j.neuroimage.2014.02.008. Epub 2014 Feb 13.
7
Clustered Multi-Task Learning Via Alternating Structure Optimization.通过交替结构优化实现聚类多任务学习
Adv Neural Inf Process Syst. 2011;2011:702-710.
8
On Better Exploring and Exploiting Task Relationships in Multitask Learning: Joint Model and Feature Learning.更好地探索和利用多任务学习中的任务关系:联合模型和特征学习
IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1975-1985. doi: 10.1109/TNNLS.2017.2690683. Epub 2017 Apr 17.
9
Jointly Learning Multiple Sequential Dynamics for Human Action Recognition.联合学习多种序列动态以进行人类动作识别
PLoS One. 2015 Jul 6;10(7):e0130884. doi: 10.1371/journal.pone.0130884. eCollection 2015.
10
Generalized fused group lasso regularized multi-task feature learning for predicting cognitive outcomes in Alzheimers disease.用于预测阿尔茨海默病认知结局的广义融合组套索正则化多任务特征学习。
Comput Methods Programs Biomed. 2018 Aug;162:19-45. doi: 10.1016/j.cmpb.2018.04.028. Epub 2018 May 3.

引用本文的文献

1
Automating multi-task learning on optical neural networks with weight sharing and physical rotation.通过权重共享和物理旋转实现光学神经网络多任务学习的自动化。
Sci Rep. 2025 Apr 25;15(1):14419. doi: 10.1038/s41598-025-97262-2.
2
Attention-based bidirectional-long short-term memory for abnormal human activity detection.基于注意力机制的双向长短时记忆网络在异常人体活动检测中的应用
Sci Rep. 2023 Sep 2;13(1):14442. doi: 10.1038/s41598-023-41231-0.
3
Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework.
基于级联双注意力卷积神经网络和双向门控循环单元框架的人类活动识别
J Imaging. 2023 Jun 26;9(7):130. doi: 10.3390/jimaging9070130.
4
Deep Learning Techniques for Vehicle Detection and Classification from Images/Videos: A Survey.基于图像/视频的车辆检测与分类的深度学习技术综述
Sensors (Basel). 2023 May 17;23(10):4832. doi: 10.3390/s23104832.
5
Efficient multi-task learning with adaptive temporal structure for progression prediction.用于进展预测的具有自适应时间结构的高效多任务学习。
Neural Comput Appl. 2023 May 10:1-16. doi: 10.1007/s00521-023-08461-9.
6
A 3DCNN-Based Knowledge Distillation Framework for Human Activity Recognition.一种基于3D卷积神经网络的人类活动识别知识蒸馏框架
J Imaging. 2023 Apr 14;9(4):82. doi: 10.3390/jimaging9040082.
7
Multi-task hourglass network for online automatic diagnosis of developmental dysplasia of the hip.用于髋关节发育不良在线自动诊断的多任务沙漏网络
World Wide Web. 2023;26(2):539-559. doi: 10.1007/s11280-022-01051-0. Epub 2022 May 4.
8
Vision Transformer and Deep Sequence Learning for Human Activity Recognition in Surveillance Videos.基于视觉Transformer 和深度学习序列模型的监控视频人体行为识别
Comput Intell Neurosci. 2022 Apr 4;2022:3454167. doi: 10.1155/2022/3454167. eCollection 2022.
9
Robust video content analysis schemes for human action recognition.用于人体动作识别的稳健视频内容分析方案。
Sci Prog. 2021 Apr-Jun;104(2):368504211005480. doi: 10.1177/00368504211005480.
10
Hierarchical Adaptive Multi-task Learning Framework for Patient Diagnoses and Diagnostic Category Classification.用于患者诊断和诊断类别分类的分层自适应多任务学习框架
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2019 Nov;2019. doi: 10.1109/bibm47256.2019.8983298. Epub 2020 Feb 6.