通过使用生成模型进行音频分析来适应语音混淆中的噪声以实现被动健康监测

Adapting to Noise in Speech Obfuscation by Audio Profiling Using Generative Models for Passive Health Monitoring.

作者信息

Vatanparvar Korosh, Nathan Viswam, Nemati Ebrahim, Rahman Md Mahbubur, Kuang Jilong

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:5700-5704. doi: 10.1109/EMBC44109.2020.9176156.

DOI:10.1109/EMBC44109.2020.9176156

PMID:33019269

Abstract

Passive health monitoring has been introduced as a solution for continuous diagnosis and tracking of subjects' condition with minimal effort. This is partially achieved by the technology of passive audio recording although it poses major audio privacy issues for subjects. Existing methods are limited to controlled recording environments and their prediction is significantly influenced by background noises. Meanwhile, they are too compute-intensive to be continuously running on smart phones. In this paper, we implement an efficient and robust audio privacy preserving method that profiles the background audio to focus only on audio activities detected during recording for performance improvement, and to adapt to the noise for more accurate speech segmentation. We analyze the performance of our method using audio data collected by a smart watch in lab noisy settings. Our obfuscation results show a low false positive rate of 20% with a 92% true positive rate by adapting to the recording noise level. We also reduced model memory footprint and execution time of the method on a smart phone by 75% and 62% to enable continuous speech obfuscation.

摘要

被动健康监测作为一种以最小努力持续诊断和跟踪受试者健康状况的解决方案被引入。被动音频记录技术在一定程度上实现了这一点，尽管它给受试者带来了重大的音频隐私问题。现有方法仅限于受控的录音环境，并且其预测受到背景噪声的显著影响。同时，它们计算量太大，无法在智能手机上持续运行。在本文中，我们实现了一种高效且强大的音频隐私保护方法，该方法对背景音频进行分析，以便仅关注录音期间检测到的音频活动以提高性能，并适应噪声以进行更准确的语音分割。我们使用智能手表在实验室嘈杂环境中收集的音频数据来分析我们方法的性能。我们的混淆结果显示，通过适应录音噪声水平，误报率低至20%，真阳性率为92%。我们还将该方法在智能手机上的模型内存占用和执行时间分别减少了75%和62%，以实现连续语音混淆。

相似文献

Adapting to Noise in Speech Obfuscation by Audio Profiling Using Generative Models for Passive Health Monitoring.通过使用生成模型进行音频分析来适应语音混淆中的噪声以实现被动健康监测

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:5700-5704. doi: 10.1109/EMBC44109.2020.9176156.

What Does Social Support Sound Like? Challenges and Opportunities for Using Passive Episodic Audio Collection to Assess the Social Environment.社交支持听起来像什么？利用被动式情境音频采集评估社会环境的挑战与机遇。

Front Public Health. 2021 Mar 29;9:633606. doi: 10.3389/fpubh.2021.633606. eCollection 2021.

Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications.用于基于语音交互应用的抗噪多模态视听语音识别系统。

Sensors (Basel). 2022 Oct 12;22(20):7738. doi: 10.3390/s22207738.

Enabling Real-Time On-Chip Audio Super Resolution for Bone-Conduction Microphones.实现骨传导麦克风的实时片上音频超分辨率

Sensors (Basel). 2022 Dec 20;23(1):35. doi: 10.3390/s23010035.

Audio-visual enhancement of speech in noise.

J Acoust Soc Am. 2001 Jun;109(6):3007-20. doi: 10.1121/1.1358887.

Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement.将子词发音与唇形相关联以实现嵌入感知的视听语音增强。

Neural Netw. 2021 Nov;143:171-182. doi: 10.1016/j.neunet.2021.06.003. Epub 2021 Jun 8.

Effects of aging on audio-visual speech integration.衰老对视听言语整合的影响。

J Acoust Soc Am. 2014 Oct;136(4):1918-31. doi: 10.1121/1.4894685.

A virtual speaker in noisy classroom conditions: supporting or disrupting children's listening comprehension?嘈杂课堂环境中的虚拟扬声器：支持还是干扰儿童的听力理解？

Logoped Phoniatr Vocol. 2019 Jul;44(2):79-86. doi: 10.1080/14015439.2018.1455894. Epub 2018 Apr 5.

Effect of importance sampling on robust segmentation of audio-cough events in noisy environments.重要性采样对噪声环境下音频咳嗽事件稳健分割的影响。

Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug;2016:3740-3744. doi: 10.1109/EMBC.2016.7591541.

Speech perception in noise: Impact of directional microphones in users of combined electric-acoustic stimulation.噪声环境下的言语感知：混合电-声刺激使用者的指向性麦克风的影响。

PLoS One. 2019 Mar 6;14(3):e0213251. doi: 10.1371/journal.pone.0213251. eCollection 2019.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过使用生成模型进行音频分析来适应语音混淆中的噪声以实现被动健康监测

Adapting to Noise in Speech Obfuscation by Audio Profiling Using Generative Models for Passive Health Monitoring.

作者信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献