• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于动作识别中视图模糊的双推荐解缠网络

Dual-Recommendation Disentanglement Network for View Fuzz in Action Recognition.

作者信息

Liu Wenxuan, Zhong Xian, Zhou Zhuo, Jiang Kui, Wang Zheng, Lin Chia-Wen

出版信息

IEEE Trans Image Process. 2023;32:2719-2733. doi: 10.1109/TIP.2023.3273459. Epub 2023 May 16.

DOI:10.1109/TIP.2023.3273459
PMID:37163393
Abstract

Multi-view action recognition aims to identify action categories from given clues. Existing studies ignore the negative influences of fuzzy views between view and action in disentangling, commonly arising the mistaken recognition results. To this end, we regard the observed image as the composition of the view and action components, and give full play to the advantages of multiple views via the adaptive cooperative representation among these two components, forming a Dual-Recommendation Disentanglement Network (DRDN) for multi-view action recognition. Specifically, 1) For the action, we leverage a multi-level Specific Information Recommendation (SIR) to enhance the interaction among intricate activities and views. SIR offers a more comprehensive representation of activities, measuring the trade-off between global and local information. 2) For the view, we utilize a Pyramid Dynamic Recommendation (PDR) to learn a complete and detailed global representation by transferring features from different views. It is explicitly restricted to resist the fuzzy noise influence, focusing on positive knowledge from other views. Our DRDN aims for complete action and view representation, where PDR directly guides action to disentangle with view features and SIR considers mutual exclusivity of view and action clues. Extensive experiments have indicated that the multi-view action recognition method DRDN we proposed achieves state-of-the-art performance over powerful competitors on several standard benchmarks. The code will be available at https://github.com/51cloud/DRDN.

摘要

多视图动作识别旨在从给定线索中识别动作类别。现有研究在解缠过程中忽略了视图与动作之间模糊视图的负面影响,这通常会导致错误的识别结果。为此,我们将观察到的图像视为视图和动作成分的组合,并通过这两个成分之间的自适应协作表示充分发挥多视图的优势,形成用于多视图动作识别的双推荐解缠网络(DRDN)。具体来说,1)对于动作,我们利用多级特定信息推荐(SIR)来增强复杂活动和视图之间的交互。SIR提供了更全面的活动表示,衡量全局和局部信息之间的权衡。2)对于视图,我们利用金字塔动态推荐(PDR)通过从不同视图转移特征来学习完整而详细的全局表示。它被明确限制以抵抗模糊噪声的影响,专注于来自其他视图的积极知识。我们的DRDN旨在实现完整的动作和视图表示,其中PDR直接指导动作与视图特征解缠,而SIR考虑视图和动作线索的互斥性。大量实验表明,我们提出的多视图动作识别方法DRDN在几个标准基准上优于强大的竞争对手,取得了领先的性能。代码将在https://github.com/51cloud/DRDN上提供。

相似文献

1
Dual-Recommendation Disentanglement Network for View Fuzz in Action Recognition.用于动作识别中视图模糊的双推荐解缠网络
IEEE Trans Image Process. 2023;32:2719-2733. doi: 10.1109/TIP.2023.3273459. Epub 2023 May 16.
2
Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning.观点自适应表示解缠结网络的变字幕生成。
IEEE Trans Image Process. 2023;32:2620-2635. doi: 10.1109/TIP.2023.3268004. Epub 2023 May 5.
3
Specificity and Latent Correlation Learning for Action Recognition Using Synthetic Multi-View Data From Depth Maps.使用深度图的合成多视角数据进行动作识别的特异性和潜在关联学习。
IEEE Trans Image Process. 2017 Dec;26(12):5560-5574. doi: 10.1109/TIP.2017.2740122. Epub 2017 Aug 14.
4
Dual Contrastive Prediction for Incomplete Multi-View Representation Learning.用于不完整多视图表示学习的双对比预测
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4447-4461. doi: 10.1109/TPAMI.2022.3197238. Epub 2023 Mar 7.
5
Dual adaptive learning multi-task multi-view for graph network representation learning.基于双适应学习的多任务多视图图网络表示学习。
Neural Netw. 2023 May;162:297-308. doi: 10.1016/j.neunet.2023.02.026. Epub 2023 Feb 27.
6
Dual Contrast-Driven Deep Multi-View Clustering.双对比度驱动的深度多视图聚类
IEEE Trans Image Process. 2024;33:4753-4764. doi: 10.1109/TIP.2024.3444269. Epub 2024 Aug 30.
7
Discriminative Multi-View Dynamic Image Fusion for Cross-View 3-D Action Recognition.用于跨视图三维动作识别的判别式多视图动态图像融合
IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5332-5345. doi: 10.1109/TNNLS.2021.3070179. Epub 2022 Oct 5.
8
circRNA-binding protein site prediction based on multi-view deep learning, subspace learning and multi-view classifier.基于多视图深度学习、子空间学习和多视图分类器的 circRNA 结合蛋白位点预测。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab394.
9
MGRN: toward robust drug recommendation via multi-view gating retrieval network.MGRN:通过多视图门控检索网络实现稳健的药物推荐。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae572.
10
D-Net: Dual Disentanglement Network for Brain Tumor Segmentation With Missing Modalities.D-Net:用于具有模态缺失的脑肿瘤分割的双解缠网络。
IEEE Trans Med Imaging. 2022 Oct;41(10):2953-2964. doi: 10.1109/TMI.2022.3175478. Epub 2022 Sep 30.