• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对齐增强型交互式融合模型用于完整和不完整多模态手势识别。

Alignment-Enhanced Interactive Fusion Model for Complete and Incomplete Multimodal Hand Gesture Recognition.

出版信息

IEEE Trans Neural Syst Rehabil Eng. 2023;31:4661-4671. doi: 10.1109/TNSRE.2023.3335101. Epub 2023 Nov 30.

DOI:10.1109/TNSRE.2023.3335101
PMID:37983152
Abstract

Hand gesture recognition (HGR) based on surface electromyogram (sEMG) and Accelerometer (ACC) signals is increasingly attractive where fusion strategies are crucial for performance and remain challenging. Currently, neural network-based fusion methods have gained superior performance. Nevertheless, these methods typically fuse sEMG and ACC either in the early or late stages, overlooking the integration of entire cross-modal hierarchical information within each individual hidden layer, thus inducing inefficient inter-modal fusion. To this end, we propose a novel Alignment-Enhanced Interactive Fusion (AiFusion) model, which achieves effective fusion via a progressive hierarchical fusion strategy. Notably, AiFusion can flexibly perform both complete and incomplete multimodal HGR. Specifically, AiFusion contains two unimodal branches and a cascaded transformer-based multimodal fusion branch. The fusion branch is first designed to adequately characterize modality-interactive knowledge by adaptively capturing inter-modal similarity and fusing hierarchical features from all branches layer by layer. Then, the modality-interactive knowledge is aligned with that of unimodality using cross-modal supervised contrastive learning and online distillation from embedding and probability spaces respectively. These alignments further promote fusion quality and refine modality-specific representations. Finally, the recognition outcomes are set to be determined by available modalities, thus contributing to handling the incomplete multimodal HGR problem, which is frequently encountered in real-world scenarios. Experimental results on five public datasets demonstrate that AiFusion outperforms most state-of-the-art benchmarks in complete multimodal HGR. Impressively, it also surpasses the unimodal baselines in the challenging incomplete multimodal HGR. The proposed AiFusion provides a promising solution to realize effective and robust multimodal HGR-based interfaces.

摘要

基于表面肌电 (sEMG) 和加速度计 (ACC) 信号的手势识别 (HGR) 越来越受到关注,其中融合策略对于性能至关重要且具有挑战性。目前,基于神经网络的融合方法已经取得了优异的性能。然而,这些方法通常在早期或晚期融合 sEMG 和 ACC,忽略了在每个单独的隐藏层内整合整个跨模态层次信息,从而导致模态间融合效率低下。为此,我们提出了一种新颖的对齐增强交互融合 (AiFusion) 模型,该模型通过渐进式层次融合策略实现有效融合。值得注意的是,AiFusion 可以灵活地执行完全和不完全的多模态 HGR。具体来说,AiFusion 包含两个单模态分支和一个级联基于转换器的多模态融合分支。融合分支首先通过自适应地捕获模态间相似性并逐层融合来自所有分支的层次特征,来充分表征模态间交互知识。然后,使用跨模态监督对比学习和来自嵌入和概率空间的在线蒸馏,将模态间交互知识与单模态对齐。这些对齐进一步促进了融合质量和模态特定表示的细化。最后,识别结果由可用的模态确定,从而有助于处理现实场景中经常遇到的不完全多模态 HGR 问题。在五个公共数据集上的实验结果表明,AiFusion 在完全多模态 HGR 方面优于大多数最先进的基准,令人印象深刻的是,它在具有挑战性的不完全多模态 HGR 方面也超过了单模态基线。所提出的 AiFusion 为实现有效的、稳健的基于多模态 HGR 的接口提供了一种有前景的解决方案。

相似文献

1
Alignment-Enhanced Interactive Fusion Model for Complete and Incomplete Multimodal Hand Gesture Recognition.对齐增强型交互式融合模型用于完整和不完整多模态手势识别。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:4661-4671. doi: 10.1109/TNSRE.2023.3335101. Epub 2023 Nov 30.
2
Transferable non-invasive modal fusion-transformer (NIMFT) for end-to-end hand gesture recognition.可迁移的无创模态融合-Transformer(NIMFT)用于端到端手势识别。
J Neural Eng. 2024 Apr 9;21(2). doi: 10.1088/1741-2552/ad39a5.
3
ViT-HGR: Vision Transformer-based Hand Gesture Recognition from High Density Surface EMG Signals.基于视觉Transformer的高密度表面肌电信号手势识别
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:5115-5119. doi: 10.1109/EMBC48229.2022.9871489.
4
TraHGR: Transformer for Hand Gesture Recognition via Electromyography.TraHGR:基于肌电信号的手势识别转换器。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:4211-4224. doi: 10.1109/TNSRE.2023.3324252. Epub 2023 Oct 27.
5
FS-HGR: Few-Shot Learning for Hand Gesture Recognition via Electromyography.FS-HGR:基于肌电的少数样本手 gestures 识别的学习。
IEEE Trans Neural Syst Rehabil Eng. 2021;29:1004-1015. doi: 10.1109/TNSRE.2021.3077413. Epub 2021 Jun 8.
6
Uncertainty-Aware Graph Contrastive Fusion Network for multimodal physiological signal emotion recognition.用于多模态生理信号情感识别的不确定性感知图对比融合网络
Neural Netw. 2025 Jul;187:107363. doi: 10.1016/j.neunet.2025.107363. Epub 2025 Mar 14.
7
sEMG-Based Hand Gesture Recognition Using Binarized Neural Network.基于二值神经网络的表面肌电手势识别
Sensors (Basel). 2023 Jan 28;23(3):1436. doi: 10.3390/s23031436.
8
Surface-Electromyography-Based Gesture Recognition by Multi-View Deep Learning.基于多视图深度学习的表面肌电手势识别。
IEEE Trans Biomed Eng. 2019 Oct;66(10):2964-2973. doi: 10.1109/TBME.2019.2899222. Epub 2019 Feb 13.
9
A Real-Time Hand Gesture Recognition System for Low-Latency HMI via Transient HD-SEMG and In-Sensor Computing.一种通过瞬态高清表面肌电信号和传感器内计算实现低延迟人机交互的实时手势识别系统。
IEEE J Biomed Health Inform. 2024 Sep;28(9):5156-5167. doi: 10.1109/JBHI.2024.3417236. Epub 2024 Sep 5.
10
A Deep Q-Network based hand gesture recognition system for control of robotic platforms.基于深度 Q 网络的机器人平台手势控制系统。
Sci Rep. 2023 May 17;13(1):7956. doi: 10.1038/s41598-023-34540-x.