• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从声音中检测新型冠状病毒2型

SARS-CoV-2 Detection From Voice.

作者信息

Pinkas Gadi, Karny Yarden, Malachi Aviad, Barkai Galia, Bachar Gideon, Aharonson Vered

机构信息

Afeka Center of Language Processing, AfekaTel Aviv Academic College of Engineering Tel Aviv-Yafo 6910717 Israel.

Pediatric Infectious Diseases Unit, Safra Children's Hospital, Sheba Medical Center and Sackler School of MedicineTel-Aviv University Tel Aviv-Yafo 69978 Israel.

出版信息

IEEE Open J Eng Med Biol. 2020 Sep 24;1:268-274. doi: 10.1109/OJEMB.2020.3026468. eCollection 2020.

DOI:10.1109/OJEMB.2020.3026468
PMID:35402954
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8769003/
Abstract

Automated voice-based detection of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) could facilitate the screening for COVID19. A dataset of cellular phone recordings from 88 subjects was recently collected. The dataset included vocal utterances, speech and coughs that were self-recorded by the subjects in either hospitals or isolation sites. All subjects underwent nasopharyngeal swabbing at the time of recording and were labelled as SARS-CoV-2 positives or negative controls. The present study harnessed deep machine learning and speech processing to detect the SARS-CoV-2 positives. A three-stage architecture was implemented. A self-supervised attention-based transformer generated embeddings from the audio inputs. Recurrent neural networks were used to produce specialized sub-models for the SARS-CoV-2 classification. An ensemble stacking fused the predictions of the sub-models. Pre-training, bootstrapping and regularization techniques were used to prevent overfitting. A recall of 78% and a probability of false alarm (PFA) of 41% were measured on a test set of 57 recording sessions. A leave-one-speaker-out cross validation on 292 recording sessions yielded a recall of 78% and a PFA of 30%. These preliminary results imply a feasibility for COVID19 screening using voice.

摘要

基于自动语音的严重急性呼吸综合征冠状病毒2(SARS-CoV-2)检测有助于新冠病毒病(COVID-19)的筛查。最近收集了一个包含88名受试者手机录音的数据集。该数据集包括受试者在医院或隔离地点自行录制的发声、语音和咳嗽声。所有受试者在录音时均接受了鼻咽拭子检测,并被标记为SARS-CoV-2阳性或阴性对照。本研究利用深度机器学习和语音处理来检测SARS-CoV-2阳性。实施了一个三阶段架构。一个基于自监督注意力的变换器从音频输入中生成嵌入。循环神经网络用于生成用于SARS-CoV-2分类的专门子模型。一个集成堆叠融合了子模型的预测。使用预训练、自助法和正则化技术来防止过拟合。在一个包含57个录音会话的测试集上,召回率为78%,误报概率(PFA)为41%。在292个录音会话上进行的留一说话者交叉验证产生了78%的召回率和30%的PFA。这些初步结果表明使用语音进行COVID-19筛查具有可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/9840b38fd7ac/aharo5-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/d6fe1008f45d/aharo1-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/34f04290d319/aharo2-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/802b166846fd/aharo3-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/6ea71542c764/aharo4-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/9840b38fd7ac/aharo5-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/d6fe1008f45d/aharo1-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/34f04290d319/aharo2-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/802b166846fd/aharo3-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/6ea71542c764/aharo4-3026468.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/29a2/8977211/9840b38fd7ac/aharo5-3026468.jpg

相似文献

1
SARS-CoV-2 Detection From Voice.从声音中检测新型冠状病毒2型
IEEE Open J Eng Med Biol. 2020 Sep 24;1:268-274. doi: 10.1109/OJEMB.2020.3026468. eCollection 2020.
2
Ensemble learning with speaker embeddings in multiple speech task stimuli for depression detection.在用于抑郁症检测的多语音任务刺激中结合说话人嵌入的集成学习。
Front Neurosci. 2023 Mar 23;17:1141621. doi: 10.3389/fnins.2023.1141621. eCollection 2023.
3
Detection of COVID-19 from voice, cough and breathing patterns: Dataset and preliminary results.从声音、咳嗽和呼吸模式中检测新冠病毒:数据集与初步结果
Comput Biol Med. 2021 Nov;138:104944. doi: 10.1016/j.compbiomed.2021.104944. Epub 2021 Oct 13.
4
A large-scale and PCR-referenced vocal audio dataset for COVID-19.一个大规模的、基于 PCR 的 COVID-19 语音音频数据集。
Sci Data. 2024 Jun 27;11(1):700. doi: 10.1038/s41597-024-03492-w.
5
Semi-supervised audio-driven TV-news speaker diarization using deep neural embeddings.使用深度神经嵌入的半监督音频驱动电视新闻说话人分割
J Acoust Soc Am. 2020 Dec;148(6):3751. doi: 10.1121/10.0002924.
6
Automatic Recognition, Segmentation, and Sex Assignment of Nocturnal Asthmatic Coughs and Cough Epochs in Smartphone Audio Recordings: Observational Field Study.智能手机音频记录中夜间哮喘性咳嗽及咳嗽发作的自动识别、分割与性别判定:观察性现场研究
J Med Internet Res. 2020 Jul 14;22(7):e18082. doi: 10.2196/18082.
7
Deep Learning Approach to Parkinson's Disease Detection Using Voice Recordings and Convolutional Neural Network Dedicated to Image Classification.使用语音记录和专用于图像分类的卷积神经网络的深度学习方法进行帕金森病检测。
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:717-720. doi: 10.1109/EMBC.2019.8856972.
8
Self-Collected Oral Fluid and Nasal Swabs Demonstrate Comparable Sensitivity to Clinician Collected Nasopharyngeal Swabs for Coronavirus Disease 2019 Detection.自我采集的口腔液和鼻腔拭子与临床医生采集的鼻咽拭子在检测 2019 年冠状病毒病方面具有相当的敏感性。
Clin Infect Dis. 2021 Nov 2;73(9):e3106-e3109. doi: 10.1093/cid/ciaa1589.
9
Use of Deep Neural Networks to Predict Obesity With Short Audio Recordings: Development and Usability Study.使用深度神经网络通过短音频记录预测肥胖:开发与可用性研究。
JMIR AI. 2024 Jul 25;3:e54885. doi: 10.2196/54885.
10
Transfer learning-based ensemble support vector machine model for automated COVID-19 detection using lung computerized tomography scan data.基于迁移学习的集成支持向量机模型,用于使用肺部计算机断层扫描数据自动检测 COVID-19。
Med Biol Eng Comput. 2021 Apr;59(4):825-839. doi: 10.1007/s11517-020-02299-2. Epub 2021 Mar 18.

引用本文的文献

1
Clinical Characteristics of Patients With Respiratory Infections After Nonpharmacological Interventions for COVID-19 in China Have Ended: Using Machine Learning Approaches to Support Pathogen Prediction at Admission.中国新冠疫情非药物干预措施结束后呼吸道感染患者的临床特征:采用机器学习方法辅助入院时病原体预测
Immun Inflamm Dis. 2025 Aug;13(8):e70237. doi: 10.1002/iid3.70237.
2
Vocal Feature Changes for Monitoring Parkinson's Disease Progression-A Systematic Review.用于监测帕金森病进展的嗓音特征变化——一项系统综述
Brain Sci. 2025 Mar 19;15(3):320. doi: 10.3390/brainsci15030320.
3
Challenges issues and future recommendations of deep learning techniques for SARS-CoV-2 detection utilising X-ray and CT images: a comprehensive review.

本文引用的文献

1
Clinical and epidemiological characteristics of 1420 European patients with mild-to-moderate coronavirus disease 2019.1420 例轻症至中症 2019 年冠状病毒病欧洲患者的临床和流行病学特征。
J Intern Med. 2020 Sep;288(3):335-344. doi: 10.1111/joim.13089. Epub 2020 Jun 17.
2
Clinical characteristics of coronavirus disease 2019 (COVID-19) in China: A systematic review and meta-analysis.《中国 2019 年冠状病毒病(COVID-19)的临床特征:系统评价和荟萃分析》
J Infect. 2020 Jun;80(6):656-665. doi: 10.1016/j.jinf.2020.03.041. Epub 2020 Apr 10.
3
Automatic adventitious respiratory sound analysis: A systematic review.
利用X射线和CT图像进行SARS-CoV-2检测的深度学习技术面临的挑战、问题及未来建议:全面综述
PeerJ Comput Sci. 2024 Dec 24;10:e2517. doi: 10.7717/peerj-cs.2517. eCollection 2024.
4
Machine learning-based infection diagnostic and prognostic models in post-acute care settings: a systematic review.基于机器学习的急性后护理环境中的感染诊断和预后模型:一项系统综述
J Am Med Inform Assoc. 2025 Jan 1;32(1):241-252. doi: 10.1093/jamia/ocae278.
5
Respiratory Diseases Diagnosis Using Audio Analysis and Artificial Intelligence: A Systematic Review.使用音频分析和人工智能诊断呼吸疾病:系统评价。
Sensors (Basel). 2024 Feb 10;24(4):1173. doi: 10.3390/s24041173.
6
COVID-19 Detection From Respiratory Sounds With Hierarchical Spectrogram Transformers.基于分层频谱图变换的呼吸音 COVID-19 检测。
IEEE J Biomed Health Inform. 2024 Mar;28(3):1273-1284. doi: 10.1109/JBHI.2023.3339700. Epub 2024 Mar 6.
7
Graph data science and machine learning for the detection of COVID-19 infection from symptoms.用于从症状检测新冠病毒感染的图数据科学与机器学习
PeerJ Comput Sci. 2023 Apr 10;9:e1333. doi: 10.7717/peerj-cs.1333. eCollection 2023.
8
A summary of the ComParE COVID-19 challenges.ComParE新冠疫情挑战总结。
Front Digit Health. 2023 Mar 8;5:1058163. doi: 10.3389/fdgth.2023.1058163. eCollection 2023.
9
Development and Validation of a Respiratory-Responsive Vocal Biomarker-Based Tool for Generalizable Detection of Respiratory Impairment: Independent Case-Control Studies in Multiple Respiratory Conditions Including Asthma, Chronic Obstructive Pulmonary Disease, and COVID-19.基于呼吸响应性声音生物标志物的通用呼吸障碍检测工具的开发和验证:包括哮喘、慢性阻塞性肺疾病和 COVID-19 在内的多种呼吸系统疾病的独立病例对照研究。
J Med Internet Res. 2023 Apr 14;25:e44410. doi: 10.2196/44410.
10
Dissociating COVID-19 from other respiratory infections based on acoustic, motor coordination, and phonemic patterns.基于声学、运动协调和音位模式区分 COVID-19 与其他呼吸道感染。
Sci Rep. 2023 Jan 28;13(1):1567. doi: 10.1038/s41598-023-27934-4.
自动异常呼吸音分析:一项系统综述。
PLoS One. 2017 May 26;12(5):e0177926. doi: 10.1371/journal.pone.0177926. eCollection 2017.
4
Metabolic Mechanisms of Vocal Fatigue.嗓音疲劳的代谢机制
J Voice. 2017 May;31(3):378.e1-378.e11. doi: 10.1016/j.jvoice.2016.09.014. Epub 2016 Oct 21.
5
The S/Z ratio: a simple and reliable clinical method of evaluating laryngeal function in patients after intubation.S/Z 比值:评估插管后患者喉功能的一种简单可靠的临床方法。
J Crit Care. 2010 Sep;25(3):489-92. doi: 10.1016/j.jcrc.2009.11.009. Epub 2010 Feb 10.
6
Clinical analysis of 150 cases with the novel influenza A (H1N1) virus infection in Shanghai, China.中国上海 150 例新型甲型 H1N1 流感病毒感染的临床分析。
Biosci Trends. 2009 Aug;3(4):127-30.
7
Adaptive estimation of residue signal for voice pathology diagnosis.用于语音病理学诊断的残余信号自适应估计。
IEEE Trans Biomed Eng. 2000 Jan;47(1):96-104. doi: 10.1109/10.817624.
8
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.
9
The S/Z ratio as an indicator of laryngeal pathology.S/Z比值作为喉病理学的一个指标。
J Speech Hear Disord. 1981 May;46(2):147-9. doi: 10.1044/jshd.4602.147.