• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于雷达的人体活动识别的轻量级混合视觉Transformer 网络。

A lightweight hybrid vision transformer network for radar-based human activity recognition.

机构信息

School of Electronics and Communication Engineering, Guangzhou University, Guangzhou, 510006, China.

Key Laboratory of On-Chip Communication and Sensor Chip of Guangdong Higher Education Institutes, Guangzhou, 510006, China.

出版信息

Sci Rep. 2023 Oct 21;13(1):17996. doi: 10.1038/s41598-023-45149-5.

DOI:10.1038/s41598-023-45149-5
PMID:37865672
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10590397/
Abstract

Radar-based human activity recognition (HAR) offers a non-contact technique with privacy protection and lighting robustness for many advanced applications. Complex deep neural networks demonstrate significant performance advantages when classifying the radar micro-Doppler signals that have unique correspondences with human behavior. However, in embedded applications, the demand for lightweight and low latency poses challenges to the radar-based HAR network construction. In this paper, an efficient network based on a lightweight hybrid Vision Transformer (LH-ViT) is proposed to address the HAR accuracy and network lightweight simultaneously. This network combines the efficient convolution operations with the strength of the self-attention mechanism in ViT. Feature Pyramid architecture is applied for the multi-scale feature extraction for the micro-Doppler map. Feature enhancement is executed by the stacked Radar-ViT subsequently, in which the fold and unfold operations are added to lower the computational load of the attention mechanism. The convolution operator in the LH-ViT is replaced by the RES-SE block, an efficient structure that combines the residual learning framework with the Squeeze-and-Excitation network. Experiments based on two human activity datasets indicate our method's advantages in terms of expressiveness and computing efficiency over traditional methods.

摘要

基于雷达的人体活动识别 (HAR) 为许多高级应用提供了一种具有隐私保护和光照鲁棒性的非接触技术。复杂的深度神经网络在对与人类行为具有独特对应关系的雷达微多普勒信号进行分类时表现出显著的性能优势。然而,在嵌入式应用中,对轻量级和低延迟的需求对基于雷达的 HAR 网络构建提出了挑战。在本文中,提出了一种基于轻量级混合 Vision Transformer (LH-ViT) 的高效网络,以同时解决 HAR 准确性和网络轻量级化的问题。该网络结合了高效的卷积运算和 ViT 中自注意力机制的优势。特征金字塔架构用于微多普勒图的多尺度特征提取。通过堆叠的 Radar-ViT 进行特征增强,其中添加了折叠和展开操作以降低注意力机制的计算负荷。LH-ViT 中的卷积算子被 RES-SE 块取代,这是一种将残差学习框架与挤压激励网络相结合的高效结构。基于两个人体活动数据集的实验表明,与传统方法相比,我们的方法在表达能力和计算效率方面具有优势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/bb775acde8e9/41598_2023_45149_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/8a570d8d6f3e/41598_2023_45149_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/f0d6775e8fbd/41598_2023_45149_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/51e15a1f5a8c/41598_2023_45149_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/1f22f6aaea93/41598_2023_45149_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/cbd564c2c2e1/41598_2023_45149_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/bb775acde8e9/41598_2023_45149_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/8a570d8d6f3e/41598_2023_45149_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/f0d6775e8fbd/41598_2023_45149_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/51e15a1f5a8c/41598_2023_45149_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/1f22f6aaea93/41598_2023_45149_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/cbd564c2c2e1/41598_2023_45149_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0834/10590397/bb775acde8e9/41598_2023_45149_Fig6_HTML.jpg

相似文献

1
A lightweight hybrid vision transformer network for radar-based human activity recognition.一种基于雷达的人体活动识别的轻量级混合视觉Transformer 网络。
Sci Rep. 2023 Oct 21;13(1):17996. doi: 10.1038/s41598-023-45149-5.
2
Radar Human Activity Recognition with an Attention-Based Deep Learning Network.基于注意力深度学习网络的雷达人体活动识别。
Sensors (Basel). 2023 Mar 16;23(6):3185. doi: 10.3390/s23063185.
3
FMCW Radar Human Action Recognition Based on Asymmetric Convolutional Residual Blocks.基于非对称卷积残差块的 FMCW 雷达人体动作识别。
Sensors (Basel). 2024 Jul 15;24(14):4570. doi: 10.3390/s24144570.
4
Human Activity Recognition Method Based on FMCW Radar Sensor with Multi-Domain Feature Attention Fusion Network.基于 FMCW 雷达传感器与多域特征注意力融合网络的人体活动识别方法。
Sensors (Basel). 2023 May 26;23(11):5100. doi: 10.3390/s23115100.
5
Non-Contact Blood Pressure Estimation From Radar Signals by a Stacked Deformable Convolution Network.基于堆叠式可变形卷积网络的雷达信号无接触血压估计。
IEEE J Biomed Health Inform. 2024 Aug;28(8):4553-4564. doi: 10.1109/JBHI.2024.3400961. Epub 2024 Aug 6.
6
MF-Match: A Semi-Supervised Model for Human Action Recognition.MF-Match:一种用于人体动作识别的半监督模型。
Sensors (Basel). 2024 Jul 30;24(15):4940. doi: 10.3390/s24154940.
7
A Deep Learning-Based Approach for Cervical Cancer Classification Using 3D CNN and Vision Transformer.基于深度学习的 3D CNN 和 Vision Transformer 用于宫颈癌分类的方法。
J Imaging Inform Med. 2024 Feb;37(1):280-296. doi: 10.1007/s10278-023-00911-z. Epub 2024 Jan 10.
8
A Lightweight Hybrid Model with Location-Preserving ViT for Efficient Food Recognition.一种具有位置保持 ViT 的轻量级混合模型,用于高效的食物识别。
Nutrients. 2024 Jan 8;16(2):200. doi: 10.3390/nu16020200.
9
Lightweight high-precision SAR ship detection method based on YOLOv7-LDS.基于 YOLOv7-LDS 的轻量级高精度 SAR 船舶检测方法。
PLoS One. 2024 Feb 13;19(2):e0296992. doi: 10.1371/journal.pone.0296992. eCollection 2024.
10
Human Activity Recognition Based on Deep Learning and Micro-Doppler Radar Data.基于深度学习和微多普勒雷达数据的人类活动识别
Sensors (Basel). 2024 Apr 15;24(8):2530. doi: 10.3390/s24082530.

引用本文的文献

1
A Review and Tutorial on Machine Learning-Enabled Radar-Based Biomedical Monitoring.基于机器学习的雷达生物医学监测综述与教程
IEEE Open J Eng Med Biol. 2024 May 6;5:680-699. doi: 10.1109/OJEMB.2024.3397208. eCollection 2024.

本文引用的文献

1
On the Use of a Convolutional Block Attention Module in Deep Learning-Based Human Activity Recognition with Motion Sensors.基于运动传感器的深度学习人体活动识别中卷积块注意力模块的应用
Diagnostics (Basel). 2023 May 26;13(11):1861. doi: 10.3390/diagnostics13111861.
2
Radar Human Activity Recognition with an Attention-Based Deep Learning Network.基于注意力深度学习网络的雷达人体活动识别。
Sensors (Basel). 2023 Mar 16;23(6):3185. doi: 10.3390/s23063185.
3
Light-efficient channel attention in convolutional neural networks for tic recognition in the children with tic disorders.
用于抽动障碍儿童抽动识别的卷积神经网络中的轻量级高效通道注意力机制
Front Comput Neurosci. 2022 Nov 4;16:1047954. doi: 10.3389/fncom.2022.1047954. eCollection 2022.
4
Attention-Based LSTM for Non-Contact Sleep Stage Classification Using IR-UWB Radar.基于注意力机制的 LSTM 用于基于 IR-UWB 雷达的非接触式睡眠阶段分类。
IEEE J Biomed Health Inform. 2021 Oct;25(10):3844-3853. doi: 10.1109/JBHI.2021.3072644. Epub 2021 Oct 5.
5
Squeeze-and-Excitation Networks.挤压激励网络。
IEEE Trans Pattern Anal Mach Intell. 2020 Aug;42(8):2011-2023. doi: 10.1109/TPAMI.2019.2913372. Epub 2019 Apr 29.