• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用自动编码器和自适应灰狼优化算法的改进型手语识别特征约简框架

Improved feature reduction framework for sign language recognition using autoencoders and adaptive Grey Wolf Optimization.

作者信息

Goel Rajeev, Bansal Sandhya, Gupta Kavita

机构信息

Government College, Naraingarh, India.

CSE Department, Maharishi Markandeshwar (Deemed to be) University, Mullana, India.

出版信息

Sci Rep. 2025 Jan 17;15(1):2300. doi: 10.1038/s41598-024-82785-x.

DOI:10.1038/s41598-024-82785-x
PMID:39824931
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11742416/
Abstract

Automatic Sign Language Recognition Systems (ASLR) offers smooth communication between hearing-impaired and normal-hearing individuals, enhancing educational opportunities for impaired. However, it struggles with "curse of dimensionality" due to excessive features resulting in prolonged training time and exhaustive computational demand. This paper proposes technique that integrates machine learning and swarm intelligence to effectively address this issue. The proposed technique, initially, extracts features using histrogram of gradient (HOG) approach and then reduces dimensions of extracted features using unsupervised autoencoder and subsequently refining the feature set with an improved GWO algorithm. A handcrafted artificial neural network serves as the classifier within this integrated framework, denoted as AEGWO-Net. Exhaustive experimentations were conducted on six different datasets namely ASL, ASL MNIST, ISL, ArSL, MNIST Digits, and IEEE-ISL containing gestures of different languages to demonstrate the performance of AEGWO-Net. The AEGWO-Net demonstrates superior performance improving accuracy and F1 score by 6% and 4% respectively compared to PCA-IGWO and KPCA-IGWO algorithms. Achieving high accuracy (98.40%), F1-score (96.59%), MCC (97.14%), and AUC (96.21%) indicates the robustness and generalizability of the AEGWO-Net method even with reduced dimensionality. Furthermore, a comparison between AEGWO-Net with other existing swarm intelligence techniques is also made to demonstrate its superiority.

摘要

自动手语识别系统(ASLR)为听力障碍者和听力正常者之间提供了顺畅的交流,增加了听力障碍者的教育机会。然而,由于特征过多,它面临着“维数灾难”,导致训练时间延长和计算需求过大。本文提出了一种将机器学习和群体智能相结合的技术来有效解决这个问题。所提出的技术首先使用梯度直方图(HOG)方法提取特征,然后使用无监督自动编码器降低提取特征的维度,随后用改进的灰狼优化算法(GWO)对特征集进行优化。一个手工制作的人工神经网络在这个集成框架中作为分类器,记为AEGWO-Net。在六个不同的数据集上进行了详尽的实验,这些数据集分别是ASL、ASL MNIST、ISL、ArSL、MNIST Digits和IEEE-ISL,包含不同语言的手势,以证明AEGWO-Net的性能。与PCA-IGWO和KPCA-IGWO算法相比,AEGWO-Net表现出卓越的性能,准确率和F1分数分别提高了6%和4%。即使在降维的情况下,AEGWO-Net方法也实现了高精度(98.40%)、F1分数(96.59%)、马修斯相关系数(MCC,97.14%)和曲线下面积(AUC,96.21%),表明了其鲁棒性和通用性。此外,还将AEGWO-Net与其他现有的群体智能技术进行了比较,以证明其优越性。

相似文献

1
Improved feature reduction framework for sign language recognition using autoencoders and adaptive Grey Wolf Optimization.使用自动编码器和自适应灰狼优化算法的改进型手语识别特征约简框架
Sci Rep. 2025 Jan 17;15(1):2300. doi: 10.1038/s41598-024-82785-x.
2
Innovative hand pose based sign language recognition using hybrid metaheuristic optimization algorithms with deep learning model for hearing impaired persons.基于创新手部姿势的手语识别:使用混合元启发式优化算法与深度学习模型助力听力障碍者
Sci Rep. 2025 Mar 18;15(1):9320. doi: 10.1038/s41598-025-93559-4.
3
Real-Time Arabic Sign Language Recognition Using a Hybrid Deep Learning Model.基于混合深度学习模型的实时阿拉伯手语识别
Sensors (Basel). 2024 Jun 6;24(11):3683. doi: 10.3390/s24113683.
4
A wrapper framework for feature selection and ELM weights optimization for FMG-based sign recognition.基于 FMG 的签名识别中特征选择和 ELM 权重优化的封装框架。
Comput Biol Med. 2024 Sep;179:108817. doi: 10.1016/j.compbiomed.2024.108817. Epub 2024 Jul 15.
5
Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study.无监督特征选择以识别冠心病患者队列机器学习中的重要国际疾病分类第十版(ICD - 10)和解剖治疗化学分类系统(ATC)编码:回顾性研究
JMIR Med Inform. 2024 Jul 26;12:e52896. doi: 10.2196/52896.
6
ASLDetect: Arabic sign language detection using ResNet and U-Net like component.ASLDetect:使用类似ResNet和U-Net组件的阿拉伯手语检测
Sci Rep. 2025 May 23;15(1):18012. doi: 10.1038/s41598-025-01588-w.
7
Gray wolf optimization-extreme learning machine approach for diabetic retinopathy detection.基于灰狼优化-极限学习机的糖尿病视网膜病变检测方法。
Front Public Health. 2022 Aug 1;10:925901. doi: 10.3389/fpubh.2022.925901. eCollection 2022.
8
Two-Stream Mixed Convolutional Neural Network for American Sign Language Recognition.双流混合卷积神经网络在美手势语识别中的应用。
Sensors (Basel). 2022 Aug 9;22(16):5959. doi: 10.3390/s22165959.
9
Toward a Vision-Based Intelligent System: A Stacked Encoded Deep Learning Framework for Sign Language Recognition.基于视觉的智能系统:用于手语识别的堆叠编码深度学习框架。
Sensors (Basel). 2023 Nov 9;23(22):9068. doi: 10.3390/s23229068.
10
Mechanomyography signals pattern recognition in hand movements using swarm intelligence algorithm optimized support vector machine based on acceleration sensors.基于加速度传感器的群智能算法优化支持向量机对手部运动的肌电信号模式识别
Med Eng Phys. 2024 Feb;124:104060. doi: 10.1016/j.medengphy.2023.104060. Epub 2023 Nov 3.

引用本文的文献

1
Sign language recognition based on dual-channel star-attention convolutional neural network.基于双通道星型注意力卷积神经网络的手语识别
Sci Rep. 2025 Jul 29;15(1):27685. doi: 10.1038/s41598-025-13625-9.

本文引用的文献

1
Machine learning models for classification and identification of significant attributes to detect type 2 diabetes.用于分类和识别重要属性以检测2型糖尿病的机器学习模型。
Health Inf Sci Syst. 2022 Feb 9;10(1):2. doi: 10.1007/s13755-021-00168-2. eCollection 2022 Dec.
2
Multiobjective Particle Swarm Optimization for Feature Selection With Fuzzy Cost.基于模糊代价的多目标粒子群优化特征选择方法
IEEE Trans Cybern. 2021 Feb;51(2):874-888. doi: 10.1109/TCYB.2020.3015756. Epub 2021 Jan 15.
3
Particle swarm optimization for feature selection in classification: a multi-objective approach.
粒子群优化在分类中的特征选择:一种多目标方法。
IEEE Trans Cybern. 2013 Dec;43(6):1656-71. doi: 10.1109/TSMCB.2012.2227469.