Despotovic Vladimir, Ismael Muhannad, Cornil Maël, Call Roderick Mc, Fagherazzi Guy
University of Luxembourg, Department of Computer Science, Esch-sur-Alzette, Luxembourg.
Luxembourg Institute of Science and Technology, IT for Innovation in Services Department, Esch-sur-Alzette, Luxembourg.
Comput Biol Med. 2021 Nov;138:104944. doi: 10.1016/j.compbiomed.2021.104944. Epub 2021 Oct 13.
COVID-19 heavily affects breathing and voice and causes symptoms that make patients' voices distinctive, creating recognizable audio signatures. Initial studies have already suggested the potential of using voice as a screening solution. In this article we present a dataset of voice, cough and breathing audio recordings collected from individuals infected by SARS-CoV-2 virus, as well as non-infected subjects via large scale crowdsourced campaign. We describe preliminary results for detection of COVID-19 from cough patterns using standard acoustic features sets, wavelet scattering features and deep audio embeddings extracted from low-level feature representations (VGGish and OpenL3). Our models achieve accuracy of 88.52%, sensitivity of 88.75% and specificity of 90.87%, confirming the applicability of audio signatures to identify COVID-19 symptoms. We furthermore provide an in-depth analysis of the most informative acoustic features and try to elucidate the mechanisms that alter the acoustic characteristics of coughs of people with COVID-19.
新冠病毒病对呼吸和嗓音有严重影响,并导致一些症状,使患者的嗓音具有独特性,从而产生可识别的音频特征。初步研究已表明利用嗓音作为筛查手段的潜力。在本文中,我们展示了一个通过大规模众包活动从感染严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的个体以及未感染个体收集的嗓音、咳嗽和呼吸音频记录数据集。我们描述了使用标准声学特征集、小波散射特征以及从低级特征表示(VGGish和OpenL3)中提取的深度音频嵌入从咳嗽模式检测新冠病毒病的初步结果。我们的模型准确率达到88.52%,灵敏度为88.75%,特异性为90.87%,证实了音频特征在识别新冠病毒病症状方面的适用性。我们还对最具信息量的声学特征进行了深入分析,并试图阐明改变新冠病毒病患者咳嗽声学特征的机制。