• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于未来听力设备语音识别的人工智能智能口罩。

Artificial intelligence enabled smart mask for speech recognition for future hearing devices.

作者信息

Hameed Hira, Usman Muhammad, Kazim Jalil Ur Rehman, Assaleh Khaled, Arshad Kamran, Hussain Amir, Imran Muhammad, Abbasi Qammer H

机构信息

James Watt School of Engineering, University of Glasgow, Glasgow, G12 8QQ, UK.

University of Engineering & Technology, UETP, Peshawar, Pakistan.

出版信息

Sci Rep. 2024 Dec 3;14(1):30112. doi: 10.1038/s41598-024-81904-y.

DOI:10.1038/s41598-024-81904-y
PMID:39627338
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11614889/
Abstract

In recent years, Lip-reading has emerged as a significant research challenge. The aim is to recognise speech by analysing Lip movements. The majority of Lip-reading technologies are based on cameras and wearable devices. However, these technologies have well-known occlusion and ambient lighting limitations, privacy concerns as well as wearable device discomfort for subjects and disturb their daily routines. Furthermore, in the era of coronavirus (COVID-19), where face masks are the norm, vision-based and wearable-based technologies for hearing aids are ineffective. To address the fundamental limitations of camera-based and wearable-based systems, this paper proposes a Radio Frequency Identification (RFID)-based smart mask for a Lip-reading framework capable of reading Lips under face masks, enabling effective speech recognition and fostering conversational accessibility for individuals with hearing impairment. The system uses RFID technology to make Radio Frequency (RF) sensing-based Lip-reading possible. A smart RFID face mask is used to collect a dataset containing three different classes of vowels (A, E, I, O, U), Consonants (F, G, M, S), and words (Fish, Goat, Meal, Moon, Snake). The collected data are fed into well-known machine-learning models for classification. A high classification accuracy is achieved by individual classes and combined datasets. On the RFID combined dataset, the Random Forest model achieves a high classification accuracy of 80%.

摘要

近年来,唇读已成为一项重大的研究挑战。其目的是通过分析唇部动作来识别语音。大多数唇读技术基于摄像头和可穿戴设备。然而,这些技术存在众所周知的遮挡和环境光照限制、隐私问题,以及可穿戴设备给受试者带来的不适并干扰其日常生活。此外,在冠状病毒病(COVID-19)时代,戴口罩成为常态,基于视觉和可穿戴设备的助听器技术无效。为解决基于摄像头和可穿戴设备系统的根本局限性,本文提出一种基于射频识别(RFID)的智能口罩,用于唇读框架,该框架能够在口罩下读取唇部动作,实现有效的语音识别,并促进听力障碍者的对话便利性。该系统利用RFID技术使基于射频(RF)传感的唇读成为可能。一个智能RFID口罩用于收集包含三类不同元音(A、E、I、O、U)、辅音(F、G、M、S)和单词(Fish、Goat、Meal、Moon、Snake)的数据集。收集到的数据被输入到著名的机器学习模型中进行分类。单个类别和组合数据集均实现了较高的分类准确率。在RFID组合数据集上,随机森林模型实现了80%的高分类准确率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/8945a43b0e51/41598_2024_81904_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/fdd9d83a9c3a/41598_2024_81904_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/ccca51833027/41598_2024_81904_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/95b98e949853/41598_2024_81904_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/b99f49447bf1/41598_2024_81904_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/55db7fa6b83c/41598_2024_81904_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/6e6ac8c64a2c/41598_2024_81904_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/8945a43b0e51/41598_2024_81904_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/fdd9d83a9c3a/41598_2024_81904_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/ccca51833027/41598_2024_81904_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/95b98e949853/41598_2024_81904_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/b99f49447bf1/41598_2024_81904_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/55db7fa6b83c/41598_2024_81904_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/6e6ac8c64a2c/41598_2024_81904_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b359/11614889/8945a43b0e51/41598_2024_81904_Fig7_HTML.jpg

相似文献

1
Artificial intelligence enabled smart mask for speech recognition for future hearing devices.用于未来听力设备语音识别的人工智能智能口罩。
Sci Rep. 2024 Dec 3;14(1):30112. doi: 10.1038/s41598-024-81904-y.
2
Pushing the limits of remote RF sensing by reading lips under the face mask.通过戴口罩读唇实现远程射频感应的极限突破。
Nat Commun. 2022 Sep 7;13(1):5168. doi: 10.1038/s41467-022-32231-1.
3
Beyond Pathogen Filtration: Possibility of Smart Masks as Wearable Devices for Personal and Group Health and Safety Management.超越病原体过滤:智能口罩作为可穿戴设备用于个人和群体健康与安全管理的可能性。
JMIR Mhealth Uhealth. 2022 Jun 21;10(6):e38614. doi: 10.2196/38614.
4
Toward Realigning Automatic Speaker Verification in the Era of COVID-19.面向新冠疫情时代的自动说话人验证技术的再调整。
Sensors (Basel). 2022 Mar 30;22(7):2638. doi: 10.3390/s22072638.
5
A data-efficient and easy-to-use lip language interface based on wearable motion capture and speech movement reconstruction.基于可穿戴运动捕捉和语音运动重建的高效、易用的唇语接口。
Sci Adv. 2024 Jun 28;10(26):eado9576. doi: 10.1126/sciadv.ado9576. Epub 2024 Jun 26.
6
Face Masks Impact Auditory and Audiovisual Consonant Recognition in Children With and Without Hearing Loss.口罩对有听力损失和无听力损失儿童的听觉及视听辅音识别产生影响。
Front Psychol. 2022 May 13;13:874345. doi: 10.3389/fpsyg.2022.874345. eCollection 2022.
7
The impact of face masks on the communication of adults with hearing loss during COVID-19 in a clinical setting.新冠疫情期间临床环境中口罩对成年听力损失患者沟通的影响。
Int J Audiol. 2022 May;61(5):365-370. doi: 10.1080/14992027.2021.1952490. Epub 2021 Jul 28.
8
Communicating During COVID-19: The Effect of Transparent Masks for Speech Recognition in Noise.在 COVID-19 期间的交流:透明口罩对噪声环境下语音识别的影响。
Ear Hear. 2021 July/Aug;42(4):772-781. doi: 10.1097/AUD.0000000000001065.
9
The beneficial effect of transparent surgical masks on the communication of adults with hearing loss within clinical settings.透明外科口罩对临床环境中听力损失成年人沟通的有益影响。
Disabil Rehabil Assist Technol. 2025 Feb;20(2):381-387. doi: 10.1080/17483107.2024.2376171. Epub 2024 Jul 8.
10
Functionalized Face Masks as Smart Wearable Sensors for Multiple Sensing.功能化口罩作为用于多种传感的智能可穿戴传感器
ACS Sens. 2024 Sep 27;9(9):4520-4535. doi: 10.1021/acssensors.4c01705. Epub 2024 Sep 19.

本文引用的文献

1
Pushing the limits of remote RF sensing by reading lips under the face mask.通过戴口罩读唇实现远程射频感应的极限突破。
Nat Commun. 2022 Sep 7;13(1):5168. doi: 10.1038/s41467-022-32231-1.
2
Decoding lip language using triboelectric sensors with deep learning.使用带有深度学习的摩擦电传感器解码唇语。
Nat Commun. 2022 Mar 17;13(1):1401. doi: 10.1038/s41467-022-29083-0.