• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

增强手部特征的改进型 3D-ResNet 手语识别算法。

Improved 3D-ResNet sign language recognition algorithm with enhanced hand features.

机构信息

College of Electronic and Information Engineering, Shandong University of Science and Technology, Qingdao, 266590, Shandong, China.

出版信息

Sci Rep. 2022 Oct 24;12(1):17812. doi: 10.1038/s41598-022-21636-z.

DOI:10.1038/s41598-022-21636-z
PMID:36280693
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9592594/
Abstract

In sign language video, the hand region is small, the resolution is low, the motion speed is fast, and there are cross occlusion and blur phenomena, which have a great impact on sign language recognition rate and speed, and are important factors restricting sign language recognition performance. To solve these problems, this paper proposes an improved 3D-ResNet sign language recognition algorithm with enhanced hand features, aiming to highlight the features of both hands, solve the problem of missing more effective information when relying only on global features, and improve the accuracy of sign language recognition. The proposed method has two improvements. Firstly, the algorithm detects the left and right hand regions based on the improved EfficientDet network, uses the improved Bi-FPN module and dual channel and spatial attention module are used to enhance the detection ability of the network for small targets like hand. Secondly, the improved residual module is used to improve the 3D-ResNet18 network to extract sign language features. The global, the left-hand and the right-hand image sequences are divided into three branches for feature extraction and fusion, so as to strengthen the attention to hand features, strengthen the representation ability of sign language features, and achieve the purpose of improving the accuracy of sign language recognition. In order to verify the performance of this algorithm, a series of experiments are carried out on CSL dataset. For example, in the experiments of hand detection algorithm and sign language recognition algorithm, the performance indicators such as Top-N, mAP, FLOPs and Parm are applied to find the optimal algorithm framework. The experimental results show that the Top1 recognition accuracy of this algorithm reaches 91.12%, which is more than 10% higher than that of C3D, P3D and 3D-ResNet basic networks. From the performance indicators of Top-N, mAP, FLOPs, Parm and so on, the performance of the algorithm in this paper is better than several algorithms in recent three years, such as I3D+BLSTM, B3D ResNet, AM-ResC3D+RCNN and so on. The results show that the hand detection network with enhanced hand features and three-dimensional convolutional neural network proposed in this paper can achieve higher accuracy of sign language recognition.

摘要

在手语视频中,手部区域较小,分辨率较低,运动速度较快,并且存在交叉遮挡和模糊现象,这对口译识别率和速度有很大影响,是限制口译识别性能的重要因素。为了解决这些问题,本文提出了一种改进的基于增强手部特征的 3D-ResNet 手语识别算法,旨在突出双手的特征,解决仅依赖全局特征时会丢失更多有效信息的问题,并提高手语识别的准确性。

所提出的方法有两个改进。首先,该算法基于改进的 EfficientDet 网络检测左右手部区域,使用改进的 Bi-FPN 模块和双通道和空间注意力模块来增强网络对手部等小目标的检测能力。其次,使用改进的残差模块改进 3D-ResNet18 网络以提取手语特征。将全局、左手和右手图像序列分为三个分支进行特征提取和融合,从而加强对手部特征的关注,增强手语特征的表示能力,达到提高手语识别准确性的目的。

为了验证该算法的性能,在 CSL 数据集上进行了一系列实验。例如,在手语检测算法和手语识别算法的实验中,应用 Top-N、mAP、FLOPs 和 Parm 等性能指标来寻找最佳算法框架。实验结果表明,该算法的 Top1 识别准确率达到 91.12%,比 C3D、P3D 和 3D-ResNet 基本网络高出 10%以上。从 Top-N、mAP、FLOPs、 Parm 等性能指标来看,本文算法的性能优于近三年的 I3D+BLSTM、B3D ResNet、AM-ResC3D+RCNN 等几种算法。结果表明,本文提出的增强手部特征的手语检测网络和三维卷积神经网络可以实现更高的手语识别准确率。

相似文献

1
Improved 3D-ResNet sign language recognition algorithm with enhanced hand features.增强手部特征的改进型 3D-ResNet 手语识别算法。
Sci Rep. 2022 Oct 24;12(1):17812. doi: 10.1038/s41598-022-21636-z.
2
Video-Based Sign Language Recognition via ResNet and LSTM Network.基于视频的手语识别:通过ResNet和LSTM网络实现
J Imaging. 2024 Jun 20;10(6):149. doi: 10.3390/jimaging10060149.
3
An Attention-Enhanced Multi-Scale and Dual Sign Language Recognition Network Based on a Graph Convolution Network.基于图卷积网络的注意力增强多尺度双通道手语识别网络。
Sensors (Basel). 2021 Feb 5;21(4):1120. doi: 10.3390/s21041120.
4
A Novel Efficient Convolutional Neural Algorithm for Multi-Category Aliasing Hardware Recognition.一种用于多类别混淆硬件识别的新型高效卷积神经网络算法。
Sensors (Basel). 2022 Jul 18;22(14):5358. doi: 10.3390/s22145358.
5
Research on rainy day traffic sign recognition algorithm based on PMRNet.基于PMRNet的雨天交通标志识别算法研究
Math Biosci Eng. 2023 May 18;20(7):12240-12262. doi: 10.3934/mbe.2023545.
6
Intelligent Malaysian Sign Language Translation System Using Convolutional-Based Attention Module with Residual Network.基于卷积注意力模块和残差网络的智能马来西亚手语翻译系统
Comput Intell Neurosci. 2021 Dec 10;2021:9023010. doi: 10.1155/2021/9023010. eCollection 2021.
7
IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography.IRDC-Net:一种基于表面肌电信号的手语识别的具有残差模块和扩张卷积的 Inception 网络。
Sensors (Basel). 2023 Jun 21;23(13):5775. doi: 10.3390/s23135775.
8
Dynamic Gesture Recognition Algorithm Based on 3D Convolutional Neural Network.基于三维卷积神经网络的动态手势识别算法。
Comput Intell Neurosci. 2021 Aug 16;2021:4828102. doi: 10.1155/2021/4828102. eCollection 2021.
9
An Improved Convolutional Neural Network-Based Scene Image Recognition Method.基于改进卷积神经网络的场景图像识别方法。
Comput Intell Neurosci. 2022 Jun 29;2022:3464984. doi: 10.1155/2022/3464984. eCollection 2022.
10
American Sign Language Words Recognition of Skeletal Videos Using Processed Video Driven Multi-Stacked Deep LSTM.基于处理视频驱动的多层堆叠深度 LSTM 的骨骼视频的美国手语词识别。
Sensors (Basel). 2022 Feb 11;22(4):1406. doi: 10.3390/s22041406.

引用本文的文献

1
Fusion of Multimodal Spatio-Temporal Features and 3D Deformable Convolution Based on Sign Language Recognition in Sensor Networks.基于传感器网络中手语识别的多模态时空特征融合与3D可变形卷积
Sensors (Basel). 2025 Jul 13;25(14):4378. doi: 10.3390/s25144378.
2
Evaluation of the invasiveness of pure ground-glass nodules based on dual-head ResNet technique.基于双头 ResNet 技术的纯磨玻璃结节侵袭性评估。
BMC Cancer. 2024 Sep 2;24(1):1080. doi: 10.1186/s12885-024-12823-4.

本文引用的文献

1
A Knitted Sensing Glove for Human Hand Postures Pattern Recognition.用于人手姿势模式识别的针织感应手套。
Sensors (Basel). 2021 Feb 15;21(4):1364. doi: 10.3390/s21041364.
2
Skeleton-based Chinese sign language recognition and generation for bidirectional communication between deaf and hearing people.基于骨架的中文手语识别与生成,实现聋听人群的双向交流。
Neural Netw. 2020 May;125:41-55. doi: 10.1016/j.neunet.2020.01.030. Epub 2020 Feb 6.
3
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.