一种用于智能手表音频处理的资源高效系统。

A Resource Efficient System for On-Smartwatch Audio Processing.

作者信息

Ahmed Md Sabbir, Rahman Arafat, Wang Zhiyuan, Rucker Mark, Barnes Laura E

机构信息

Department of Systems and Information Engineering University of Virginia, VA, USA.

出版信息

Proc Annu Int Conf Mob Comput Netw. 2024 Nov;2024:1805-1807. doi: 10.1145/3636534.3698866. Epub 2024 Dec 4.

DOI:10.1145/3636534.3698866

PMID:40453550

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12126283/

Abstract

While audio data shows promise in addressing various health challenges, there is a lack of research on on-device audio processing for smartwatches. Privacy concerns make storing raw audio and performing post-hoc analysis undesirable for many users. Additionally, current on-device audio processing systems for smartwatches are limited in their feature extraction capabilities, restricting their potential for understanding user behavior and health. We developed a real-time system for on-device audio processing on smartwatches, which takes an average of 1.78 minutes (SD = 0.07 min) to extract 22 spectral and rhythmic features from a 1-minute audio sample, using a small window size of 25 milliseconds. Using these extracted audio features on a public dataset, we developed and incorporated models into a watch to classify foreground and background speech in real-time. Our Random Forest-based model classifies speech with a balanced accuracy of 80.3%.

摘要

虽然音频数据在应对各种健康挑战方面显示出前景，但针对智能手表的设备端音频处理的研究却很匮乏。隐私问题使得存储原始音频并进行事后分析对许多用户来说并不可取。此外，当前用于智能手表的设备端音频处理系统在特征提取能力方面存在局限，限制了它们理解用户行为和健康状况的潜力。我们开发了一种用于智能手表设备端音频处理的实时系统，该系统使用25毫秒的小窗口大小，从1分钟的音频样本中提取22个频谱和节奏特征平均需要1.78分钟（标准差 = 0.07分钟）。在一个公共数据集上使用这些提取的音频特征，我们开发了模型并将其集成到手表中，以实时分类前景语音和背景语音。我们基于随机森林的模型对语音进行分类的平衡准确率为80.3%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/76c8/12126283/0874b8b04109/nihms-2077961-f0001.jpg

相似文献

A Resource Efficient System for On-Smartwatch Audio Processing.一种用于智能手表音频处理的资源高效系统。

Proc Annu Int Conf Mob Comput Netw. 2024 Nov;2024:1805-1807. doi: 10.1145/3636534.3698866. Epub 2024 Dec 4.

Smartwatch User Interface Implementation Using CNN-Based Gesture Pattern Recognition.基于卷积神经网络的手势模式识别的智能手表用户界面实现。

Sensors (Basel). 2018 Sep 7;18(9):2997. doi: 10.3390/s18092997.

HornBase: An audio dataset of car horns in different scenarios and positions.HornBase：一个包含不同场景和位置汽车喇叭声的音频数据集。

Data Brief. 2024 Jul 14;55:110678. doi: 10.1016/j.dib.2024.110678. eCollection 2024 Aug.

Speech emotion recognition using machine learning techniques: Feature extraction and comparison of convolutional neural network and random forest.基于机器学习技术的语音情感识别：卷积神经网络和随机森林的特征提取与比较。

PLoS One. 2023 Nov 21;18(11):e0291500. doi: 10.1371/journal.pone.0291500. eCollection 2023.

Understanding Smartwatch Battery Utilization in the Wild.理解智能手表电池的实际使用情况。

Sensors (Basel). 2020 Jul 6;20(13):3784. doi: 10.3390/s20133784.

Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices.用于可穿戴设备环境音频中前景语音定位的深度多实例学习

EURASIP J Audio Speech Music Process. 2021;2021(1):7. doi: 10.1186/s13636-020-00194-0. Epub 2021 Feb 3.

iWash: A smartwatch handwashing quality assessment and reminder system with real-time feedback in the context of infectious disease.iWash：一种在传染病背景下具有实时反馈功能的智能手表洗手质量评估与提醒系统。

Smart Health (Amst). 2021 Mar;19:100171. doi: 10.1016/j.smhl.2020.100171. Epub 2020 Dec 13.

Forensic authentication method for audio recordings generated by Voice Recorder application on Samsung Galaxy Watch4 series.用于三星 Galaxy Watch4 系列上的 Voice Recorder 应用程序生成的录音的法医认证方法。

J Forensic Sci. 2023 Jan;68(1):139-153. doi: 10.1111/1556-4029.15158. Epub 2022 Oct 22.

Audio-visual multi-modality driven hybrid feature learning model for crowd analysis and classification.用于人群分析与分类的视听多模态驱动混合特征学习模型

Math Biosci Eng. 2023 May 25;20(7):12529-12561. doi: 10.3934/mbe.2023558.

Audio-based detection and evaluation of eating behavior using the smartwatch platform.使用智能手表平台基于音频的进食行为检测与评估。

Comput Biol Med. 2015 Oct 1;65:1-9. doi: 10.1016/j.compbiomed.2015.07.013. Epub 2015 Jul 26.

本文引用的文献

Systematic review and meta-analysis of performance of wearable artificial intelligence in detecting and predicting depression.可穿戴人工智能在检测和预测抑郁症方面性能的系统评价与荟萃分析

NPJ Digit Med. 2023 May 5;6(1):84. doi: 10.1038/s41746-023-00828-5.

Deep Multivariate Domain Translation for Device Invariant Pulmonary Patient Identification from Cough and Speech Sounds.从咳嗽和语音中进行设备不变的肺部患者识别的深度多元域转换。

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:4473-4478. doi: 10.1109/EMBC48229.2022.9871967.

The Impact of Wearable Technologies in Health Research: Scoping Review.可穿戴技术在健康研究中的影响：范围综述。

JMIR Mhealth Uhealth. 2022 Jan 25;10(1):e34384. doi: 10.2196/34384.

Estimating Respiratory Rate From Breath Audio Obtained Through Wearable Microphones.从可穿戴麦克风获得的呼吸音频估算呼吸率。

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:7310-7315. doi: 10.1109/EMBC46164.2021.9629661.

Machine learning algorithm validation with a limited sample size.机器学习算法在有限样本量下的验证。

PLoS One. 2019 Nov 7;14(11):e0224365. doi: 10.1371/journal.pone.0224365. eCollection 2019.

pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis.pyAudioAnalysis：一个用于音频信号分析的开源Python库。

PLoS One. 2015 Dec 11;10(12):e0144610. doi: 10.1371/journal.pone.0144610. eCollection 2015.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于智能手表音频处理的资源高效系统。

A Resource Efficient System for On-Smartwatch Audio Processing.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献