Suppr
超能文献

SARS-CoV-2 Detection From Voice.

Authors

Pinkas Gadi, Karny Yarden, Malachi Aviad, Barkai Galia, Bachar Gideon, Aharonson Vered

Affiliations

Afeka Center of Language Processing, Afeka Tel Aviv Academic College of Engineering, Tel Aviv-Yafo 6910717, Israel.

Pediatric Infectious Diseases Unit, Safra Children's Hospital, Sheba Medical Center and Sackler School of Medicine, Tel-Aviv University, Tel Aviv-Yafo 69978, Israel.

Publication information

IEEE Open J Eng Med Biol. 2020 Sep 24;1:268-274. doi: 10.1109/OJEMB.2020.3026468. eCollection 2020.

DOI:10.1109/OJEMB.2020.3026468

PMID:35402954

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8769003/

Abstract

Automated voice-based detection of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) could facilitate screening for COVID-19. A dataset of cellular phone recordings from 88 subjects was recently collected. The dataset included vocal utterances, speech and coughs that were self-recorded by the subjects in either hospitals or isolation sites. All subjects underwent nasopharyngeal swabbing at the time of recording and were labelled as SARS-CoV-2 positives or negative controls. The present study harnessed deep machine learning and speech processing to detect the SARS-CoV-2 positives. A three-stage architecture was implemented. A self-supervised attention-based transformer generated embeddings from the audio inputs. Recurrent neural networks were used to produce specialized sub-models for the SARS-CoV-2 classification. An ensemble stacking fused the predictions of the sub-models. Pre-training, bootstrapping and regularization techniques were used to prevent overfitting. A recall of 78% and a probability of false alarm (PFA) of 41% were measured on a test set of 57 recording sessions. A leave-one-speaker-out cross-validation on 292 recording sessions yielded a recall of 78% and a PFA of 30%. These preliminary results suggest that voice-based COVID-19 screening is feasible.
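
The two evaluation ideas named in the abstract, recall with probability of false alarm (PFA) and leave-one-speaker-out cross-validation, can be illustrated with a minimal Python sketch. This is not the authors' code. It assumes the standard definitions recall = TP / (TP + FN) and PFA = FP / (FP + TN), and the function names `recall_and_pfa` and `leave_one_speaker_out`, as well as the toy speaker IDs, are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def recall_and_pfa(y_true, y_pred):
    """Return (recall, PFA) for binary labels where 1 = positive, 0 = negative.

    Assumed standard definitions (not quoted from the paper):
      recall = TP / (TP + FN)
      PFA    = FP / (FP + TN)
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    pfa = fp / (fp + tn) if (fp + tn) else float("nan")
    return recall, pfa


def leave_one_speaker_out(speaker_ids):
    """Yield (held_out_speaker, train_indices, test_indices).

    All recording sessions of one speaker are held out together, so no
    speaker contributes sessions to both the training and the test side
    of a fold.
    """
    by_speaker = defaultdict(list)
    for index, speaker in enumerate(speaker_ids):
        by_speaker[speaker].append(index)
    for speaker, test_indices in by_speaker.items():
        train_indices = [
            i for i, other in enumerate(speaker_ids) if other != speaker
        ]
        yield speaker, train_indices, test_indices


if __name__ == "__main__":
    # Hypothetical toy data: one speaker ID per recording session.
    sessions = ["s1", "s1", "s2", "s3", "s3", "s4"]
    for speaker, train_idx, test_idx in leave_one_speaker_out(sessions):
        print(speaker, "train:", train_idx, "test:", test_idx)

    # Hypothetical labels and predictions, for illustration only.
    print(recall_and_pfa([1, 1, 0, 0], [1, 0, 1, 0]))
```

Grouping folds by speaker matters whenever a subject contributes several recording sessions, as in the 292-session evaluation described above: a plain per-session split could place the same voice on both sides of a fold.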

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/d6fe1008f45d/aharo1-3026468.jpg

Similar articles

Ensemble learning with speaker embeddings in multiple speech task stimuli for depression detection.

Front Neurosci. 2023 Mar 23;17:1141621. doi: 10.3389/fnins.2023.1141621. eCollection 2023.

Detection of COVID-19 from voice, cough and breathing patterns: Dataset and preliminary results.

Comput Biol Med. 2021 Nov;138:104944. doi: 10.1016/j.compbiomed.2021.104944. Epub 2021 Oct 13.

A large-scale and PCR-referenced vocal audio dataset for COVID-19.

Sci Data. 2024 Jun 27;11(1):700. doi: 10.1038/s41597-024-03492-w.

Semi-supervised audio-driven TV-news speaker diarization using deep neural embeddings.

J Acoust Soc Am. 2020 Dec;148(6):3751. doi: 10.1121/10.0002924.

Automatic Recognition, Segmentation, and Sex Assignment of Nocturnal Asthmatic Coughs and Cough Epochs in Smartphone Audio Recordings: Observational Field Study.

J Med Internet Res. 2020 Jul 14;22(7):e18082. doi: 10.2196/18082.

Deep Learning Approach to Parkinson's Disease Detection Using Voice Recordings and Convolutional Neural Network Dedicated to Image Classification.

Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:717-720. doi: 10.1109/EMBC.2019.8856972.

Self-Collected Oral Fluid and Nasal Swabs Demonstrate Comparable Sensitivity to Clinician Collected Nasopharyngeal Swabs for Coronavirus Disease 2019 Detection.

Clin Infect Dis. 2021 Nov 2;73(9):e3106-e3109. doi: 10.1093/cid/ciaa1589.

Use of Deep Neural Networks to Predict Obesity With Short Audio Recordings: Development and Usability Study.

JMIR AI. 2024 Jul 25;3:e54885. doi: 10.2196/54885.

Transfer learning-based ensemble support vector machine model for automated COVID-19 detection using lung computerized tomography scan data.

Med Biol Eng Comput. 2021 Apr;59(4):825-839. doi: 10.1007/s11517-020-02299-2. Epub 2021 Mar 18.

Cited by

Clinical Characteristics of Patients With Respiratory Infections After Nonpharmacological Interventions for COVID-19 in China Have Ended: Using Machine Learning Approaches to Support Pathogen Prediction at Admission.

Immun Inflamm Dis. 2025 Aug;13(8):e70237. doi: 10.1002/iid3.70237.

Vocal Feature Changes for Monitoring Parkinson's Disease Progression-A Systematic Review.

Brain Sci. 2025 Mar 19;15(3):320. doi: 10.3390/brainsci15030320.

Challenges issues and future recommendations of deep learning techniques for SARS-CoV-2 detection utilising X-ray and CT images: a comprehensive review.

PeerJ Comput Sci. 2024 Dec 24;10:e2517. doi: 10.7717/peerj-cs.2517. eCollection 2024.

Machine learning-based infection diagnostic and prognostic models in post-acute care settings: a systematic review.

J Am Med Inform Assoc. 2025 Jan 1;32(1):241-252. doi: 10.1093/jamia/ocae278.

Respiratory Diseases Diagnosis Using Audio Analysis and Artificial Intelligence: A Systematic Review.

Sensors (Basel). 2024 Feb 10;24(4):1173. doi: 10.3390/s24041173.

COVID-19 Detection From Respiratory Sounds With Hierarchical Spectrogram Transformers.

IEEE J Biomed Health Inform. 2024 Mar;28(3):1273-1284. doi: 10.1109/JBHI.2023.3339700. Epub 2024 Mar 6.

Graph data science and machine learning for the detection of COVID-19 infection from symptoms.

PeerJ Comput Sci. 2023 Apr 10;9:e1333. doi: 10.7717/peerj-cs.1333. eCollection 2023.

A summary of the ComParE COVID-19 challenges.

Front Digit Health. 2023 Mar 8;5:1058163. doi: 10.3389/fdgth.2023.1058163. eCollection 2023.

Development and Validation of a Respiratory-Responsive Vocal Biomarker-Based Tool for Generalizable Detection of Respiratory Impairment: Independent Case-Control Studies in Multiple Respiratory Conditions Including Asthma, Chronic Obstructive Pulmonary Disease, and COVID-19.

J Med Internet Res. 2023 Apr 14;25:e44410. doi: 10.2196/44410.

Dissociating COVID-19 from other respiratory infections based on acoustic, motor coordination, and phonemic patterns.

Sci Rep. 2023 Jan 28;13(1):1567. doi: 10.1038/s41598-023-27934-4.
