• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于手指拼写中唇音清晰度分类的无监督聚类与集成学习

Unsupervised Clustering and Ensemble Learning for Classifying Lip Articulation in Fingerspelling.

作者信息

Amangeldy Nurzada, Gazizova Nazerke, Milosz Marek, Kurmetbek Bekbolat, Nazyrova Aizhan, Kassymova Akmaral

机构信息

Faculty of Information Technologies, L.N. Gumilyov Eurasian National University, Astana 010008, Kazakhstan.

Department of Computer Science, Lublin University of Technology, 36B Nadbystrzycka Str., 20-618 Lublin, Poland.

出版信息

Sensors (Basel). 2025 Jun 13;25(12):3703. doi: 10.3390/s25123703.

DOI:10.3390/s25123703
PMID:40573592
Abstract

This paper presents a new methodology for analyzing lip articulation during fingerspelling aimed at extracting robust visual patterns that can overcome the inherent ambiguity and variability of lip shape. The proposed approach is based on unsupervised clustering of lip movement trajectories to identify consistent articulatory patterns across different time profiles. The methodology is not limited to using a single model. Still, it includes the exploration of varying cluster configurations and an assessment of their robustness, as well as a detailed analysis of the correspondence between individual alphabet letters and specific clusters. In contrast to direct classification based on raw visual features, this approach pre-tests clustered representations using a model-based assessment of their discriminative potential. This structured approach enhances the interpretability and robustness of the extracted features, highlighting the importance of lip dynamics as an auxiliary modality in multimodal sign language recognition. The obtained results demonstrate that trajectory clustering can serve as a practical method for generating features, providing more accurate and context-sensitive gesture interpretation.

摘要

本文提出了一种新的方法,用于分析手指拼写过程中的唇部发音,旨在提取能够克服唇部形状固有模糊性和变异性的稳健视觉模式。所提出的方法基于唇部运动轨迹的无监督聚类,以识别不同时间轮廓上一致的发音模式。该方法不限于使用单一模型,还包括探索不同的聚类配置及其稳健性评估,以及对单个字母与特定聚类之间对应关系的详细分析。与基于原始视觉特征的直接分类不同,该方法使用基于模型的判别潜力评估对聚类表示进行预测试。这种结构化方法增强了所提取特征的可解释性和稳健性,突出了唇部动态作为多模态手语识别辅助模态的重要性。所得结果表明,轨迹聚类可作为一种生成特征的实用方法,提供更准确且上下文敏感的手势解释。

相似文献

1
Unsupervised Clustering and Ensemble Learning for Classifying Lip Articulation in Fingerspelling.用于手指拼写中唇音清晰度分类的无监督聚类与集成学习
Sensors (Basel). 2025 Jun 13;25(12):3703. doi: 10.3390/s25123703.
2
Automated Image-Based Wound Area Assessment in Outpatient Clinics Using Computer-Aided Methods: A Development and Validation Study.使用计算机辅助方法在门诊诊所进行基于图像的伤口面积自动评估:一项开发与验证研究。
Medicina (Kaunas). 2025 Jun 17;61(6):1099. doi: 10.3390/medicina61061099.
3
Interventions targeted at women to encourage the uptake of cervical screening.针对女性的干预措施,以鼓励她们接受宫颈癌筛查。
Cochrane Database Syst Rev. 2021 Sep 6;9(9):CD002834. doi: 10.1002/14651858.CD002834.pub3.
4
Sign Language Recognition Using the Electromyographic Signal: A Systematic Literature Review.使用肌电图信号的手语识别:系统文献综述。
Sensors (Basel). 2023 Oct 9;23(19):8343. doi: 10.3390/s23198343.
5
A systematic review of speech, language and communication interventions for children with Down syndrome from 0 to 6 years.对0至6岁唐氏综合征儿童言语、语言和沟通干预措施的系统评价。
Int J Lang Commun Disord. 2022 Mar;57(2):441-463. doi: 10.1111/1460-6984.12699. Epub 2022 Feb 22.
6
Colorectal cancer detection with enhanced precision using a hybrid supervised and unsupervised learning approach.使用混合监督与无监督学习方法提高结直肠癌检测的精度
Sci Rep. 2025 Jan 25;15(1):3180. doi: 10.1038/s41598-025-86590-y.
7
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
8
Exploring engagement patterns within a mobile health intervention for women at risk of gestational diabetes.探索针对有妊娠期糖尿病风险的女性的移动健康干预中的参与模式。
Womens Health (Lond). 2025 Jan-Dec;21:17455057251327510. doi: 10.1177/17455057251327510. Epub 2025 Jun 5.
9
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
10
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.

本文引用的文献

1
Continuous Sign Language Recognition and Its Translation into Intonation-Colored Speech.连续手语识别及其语调色彩语音的翻译。
Sensors (Basel). 2023 Jul 13;23(14):6383. doi: 10.3390/s23146383.
2
Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices.基于移动设备传感器的视听语音和手势识别。
Sensors (Basel). 2023 Feb 17;23(4):2284. doi: 10.3390/s23042284.
3
Improving Speech Recognition Performance in Noisy Environments by Enhancing Lip Reading Accuracy.通过提高唇读准确率来提高噪声环境下的语音识别性能。
Sensors (Basel). 2023 Feb 11;23(4):2053. doi: 10.3390/s23042053.
4
SIFT-CNN: When Convolutional Neural Networks Meet Dense SIFT Descriptors for Image and Sequence Classification.SIFT-CNN:当卷积神经网络与密集SIFT描述符相遇用于图像和序列分类时。
J Imaging. 2022 Sep 21;8(10):256. doi: 10.3390/jimaging8100256.
5
Sign Language Recognition Method Based on Palm Definition Model and Multiple Classification.基于手掌定义模型和多分类的手语识别方法。
Sensors (Basel). 2022 Sep 1;22(17):6621. doi: 10.3390/s22176621.
6
Pushing the limits of remote RF sensing by reading lips under the face mask.通过戴口罩读唇实现远程射频感应的极限突破。
Nat Commun. 2022 Sep 7;13(1):5168. doi: 10.1038/s41467-022-32231-1.
7
[Development and evaluation of a deep learning algorithm for German word recognition from lip movements].[一种用于从唇动识别德语单词的深度学习算法的开发与评估]
HNO. 2022 Jun;70(6):456-465. doi: 10.1007/s00106-021-01143-9. Epub 2022 Jan 13.