Vatanparvar Korosh, Nathan Viswam, Nemati Ebrahim, Rahman Md Mahbubur, Kuang Jilong
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:5700-5704. doi: 10.1109/EMBC44109.2020.9176156.
Passive health monitoring has been introduced as a solution for continuous diagnosis and tracking of subjects' condition with minimal effort. This is partially achieved by the technology of passive audio recording although it poses major audio privacy issues for subjects. Existing methods are limited to controlled recording environments and their prediction is significantly influenced by background noises. Meanwhile, they are too compute-intensive to be continuously running on smart phones. In this paper, we implement an efficient and robust audio privacy preserving method that profiles the background audio to focus only on audio activities detected during recording for performance improvement, and to adapt to the noise for more accurate speech segmentation. We analyze the performance of our method using audio data collected by a smart watch in lab noisy settings. Our obfuscation results show a low false positive rate of 20% with a 92% true positive rate by adapting to the recording noise level. We also reduced model memory footprint and execution time of the method on a smart phone by 75% and 62% to enable continuous speech obfuscation.
被动健康监测作为一种以最小努力持续诊断和跟踪受试者健康状况的解决方案被引入。被动音频记录技术在一定程度上实现了这一点,尽管它给受试者带来了重大的音频隐私问题。现有方法仅限于受控的录音环境,并且其预测受到背景噪声的显著影响。同时,它们计算量太大,无法在智能手机上持续运行。在本文中,我们实现了一种高效且强大的音频隐私保护方法,该方法对背景音频进行分析,以便仅关注录音期间检测到的音频活动以提高性能,并适应噪声以进行更准确的语音分割。我们使用智能手表在实验室嘈杂环境中收集的音频数据来分析我们方法的性能。我们的混淆结果显示,通过适应录音噪声水平,误报率低至20%,真阳性率为92%。我们还将该方法在智能手机上的模型内存占用和执行时间分别减少了75%和62%,以实现连续语音混淆。