A Resource Efficient System for On-Smartwatch Audio Processing.

Author information

Ahmed Md Sabbir, Rahman Arafat, Wang Zhiyuan, Rucker Mark, Barnes Laura E

Affiliations

Department of Systems and Information Engineering, University of Virginia, VA, USA.

Publication information

Proc Annu Int Conf Mob Comput Netw. 2024 Nov;2024:1805-1807. doi: 10.1145/3636534.3698866. Epub 2024 Dec 4.

Abstract

While audio data shows promise in addressing various health challenges, there is a lack of research on on-device audio processing for smartwatches. Privacy concerns make storing raw audio and performing post-hoc analysis undesirable for many users. Additionally, current on-device audio processing systems for smartwatches are limited in their feature extraction capabilities, restricting their potential for understanding user behavior and health. We developed a real-time system for on-device audio processing on smartwatches, which takes an average of 1.78 minutes (SD = 0.07 min) to extract 22 spectral and rhythmic features from a 1-minute audio sample, using a small window size of 25 milliseconds. Using these extracted audio features on a public dataset, we developed and incorporated models into a watch to classify foreground and background speech in real-time. Our Random Forest-based model classifies speech with a balanced accuracy of 80.3%.
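To make the quantities in the abstract concrete, here is a minimal Python sketch, not the authors' implementation. It illustrates three ideas: splitting audio into 25-millisecond windows, computing one example spectral feature per window, and scoring a classifier with balanced accuracy (the mean of per-class recall). The abstract does not list the 22 features, the sample rate, or the window overlap, so the spectral centroid, the 16 kHz rate, and the non-overlapping windows used here are assumptions, and all function names are hypothetical. Under those assumptions, a 1-minute sample yields 60 / 0.025 = 2400 windows, and the reported 1.78 minutes of processing per 1 minute of audio corresponds to roughly 1.78 times the audio duration.

```python
import cmath
import math


def frame_signal(samples, sample_rate, window_ms=25.0):
    """Split samples into non-overlapping windows of window_ms milliseconds.

    Non-overlapping windows are an assumption; the abstract only gives the
    window size.
    """
    frame_len = int(round(sample_rate * window_ms / 1000.0))
    n_frames = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(n_frames)]


def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency (Hz) of one frame.

    Uses a naive DFT so the sketch needs only the standard library; a real
    system would use an FFT. This feature is an illustrative choice, not
    necessarily one of the 22 features in the paper.
    """
    n = len(frame)
    total_mag = 0.0
    weighted = 0.0
    for k in range(n // 2 + 1):
        coeff = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mag = abs(coeff)
        total_mag += mag
        weighted += mag * (k * sample_rate / n)
    return weighted / total_mag if total_mag > 0 else 0.0


def balanced_accuracy(y_true, y_pred):
    """Average the recall of each class that appears in y_true."""
    recalls = []
    for c in sorted(set(y_true)):
        idx = [i for i, y in enumerate(y_true) if y == c]
        correct = sum(1 for i in idx if y_pred[i] == c)
        recalls.append(correct / len(idx))
    return sum(recalls) / len(recalls)


if __name__ == "__main__":
    # One minute of silence at an assumed 16 kHz sample rate.
    one_minute = [0.0] * (16000 * 60)
    print(len(frame_signal(one_minute, 16000)))  # number of 25 ms windows

    # Toy labels: 1 = foreground speech, 0 = background speech (hypothetical data).
    print(balanced_accuracy([1, 1, 1, 1, 0, 0], [1, 1, 1, 0, 0, 1]))
```

Balanced accuracy is a natural metric when the two classes are unevenly represented, because each class contributes equally to the score regardless of how many samples it has.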

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76c8/12126283/0874b8b04109/nihms-2077961-f0001.jpg
