Sign language recognition based on dual-path background erasure convolutional neural network.

Authors

Zhang Junming, Bu Xiaolong, Wang Yushuai, Dong Hao, Zhang Yu, Wu Haitao

Affiliations

School of Computer and Artificial Intelligence, Huanghuai University, Zhumadian, 463000, Henan Province, China.

Key Laboratory of Intelligent Lighting, Henan Province, Zhumadian, 463000, China.

Publication information

Sci Rep. 2024 May 18;14(1):11360. doi: 10.1038/s41598-024-62008-z.

DOI:10.1038/s41598-024-62008-z
PMID:38762676
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11102471/
Abstract

Sign language is an important means of expression for people with hearing and speech disabilities, so sign language recognition has long been an important research topic. However, many current sign language recognition systems require complex deep models and rely on expensive sensors, which limits their application scenarios. To address this issue, this study proposes a lightweight, computer-vision-based, dual-path background-erasing deep convolutional neural network (DPCNN) for sign language recognition. The DPCNN consists of two paths: one learns the overall features, while the other learns the background features. The background features are gradually subtracted from the overall features to obtain an effective representation of the hand features. These features are then flattened into a one-dimensional vector and passed through a fully connected layer with 128 output units. Finally, a fully connected layer with 24 output units serves as the output layer. On the ASL Finger Spelling dataset, the proposed method achieves an overall accuracy of 99.52% and a Macro-F1 score of 0.997. More importantly, the method can be deployed on small terminal devices, which broadens the application scenarios of sign language recognition. Experimental comparisons indicate that the proposed dual-path background erasure network generalizes better than the methods it was compared against.
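The data flow described in the abstract (two paths, stage-wise subtraction, flatten, a 128-unit layer, a 24-unit output) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The abstract does not give the number of stages, kernel sizes, channel counts, or input resolution, so the two stages, the single-channel 3x3 kernels, the 16x16 input, the random weights, and every function name below are assumptions chosen only to keep the example small and runnable.

```python
import math
import random


def conv3x3_relu(image, kernel):
    """Valid 3x3 convolution (cross-correlation) on one channel, then ReLU."""
    height, width = len(image), len(image[0])
    return [
        [
            max(0.0, sum(kernel[a][b] * image[i + a][j + b]
                         for a in range(3) for b in range(3)))
            for j in range(width - 2)
        ]
        for i in range(height - 2)
    ]


def subtract(overall, background):
    """Element-wise overall - background: the background-erasure step."""
    return [[o - b for o, b in zip(o_row, b_row)]
            for o_row, b_row in zip(overall, background)]


def dense(vector, weights, biases):
    """Fully connected layer without activation."""
    return [sum(w * v for w, v in zip(row, vector)) + bias
            for row, bias in zip(weights, biases)]


def softmax(logits):
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def random_kernel(rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(3)] for _ in range(3)]


def random_dense(rng, n_in, n_out):
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def make_params(rng, image_size=16, num_stages=2):
    # Assumption: each valid 3x3 convolution shrinks the side length by 2.
    side = image_size - 2 * num_stages
    return {
        "stages": [(random_kernel(rng), random_kernel(rng))
                   for _ in range(num_stages)],
        "fc128": random_dense(rng, side * side, 128),
        "fc24": random_dense(rng, 128, 24),
    }


def dpcnn_forward(image, params):
    overall, background = image, image
    for overall_kernel, background_kernel in params["stages"]:
        overall = conv3x3_relu(overall, overall_kernel)
        background = conv3x3_relu(background, background_kernel)
        # Assumed wiring: erase the background after every stage ("gradually").
        overall = subtract(overall, background)
    flat = [value for row in overall for value in row]          # flatten
    hidden = [max(0.0, h) for h in dense(flat, *params["fc128"])]  # 128 units
    return softmax(dense(hidden, *params["fc24"]))              # 24 classes


rng = random.Random(0)
params = make_params(rng)
image = [[rng.random() for _ in range(16)] for _ in range(16)]
probabilities = dpcnn_forward(image, params)
print(len(probabilities), round(sum(probabilities), 6))
```

The only parts taken from the abstract are the subtraction of background features from overall features and the 128-unit and 24-unit fully connected layers. A practical model would use multi-channel convolutions and trained weights, typically through a deep learning framework rather than nested lists.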

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/557dea04e0c5/41598_2024_62008_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/157f9f4ef107/41598_2024_62008_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/0e6548f4acb0/41598_2024_62008_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/db860d11254e/41598_2024_62008_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/4c808aafe0a7/41598_2024_62008_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/a59dc74d4bf3/41598_2024_62008_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/27ce0239827c/41598_2024_62008_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fc97/11102471/611206c290f4/41598_2024_62008_Fig8_HTML.jpg

Cited by

1
Recognizing American Sign Language gestures efficiently and accurately using a hybrid transformer model.
Sci Rep. 2025 Jun 23;15(1):20253. doi: 10.1038/s41598-025-06344-8.
2
Sign language recognition using modified deep learning network and hybrid optimization: a hybrid optimizer (HO) based optimized CNNSa-LSTM approach.
Sci Rep. 2024 Oct 30;14(1):26111. doi: 10.1038/s41598-024-76174-7.
