• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习和机器学习的语音分析用于新冠病毒检测:架构建议与比较

Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures.

作者信息

Costantini Giovanni, Dr Valerio Cesarini, Robotti Carlo, Benazzo Marco, Pietrantonio Filomena, Di Girolamo Stefano, Pisani Antonio, Canzi Pietro, Mauramati Simone, Bertino Giulia, Cassaniti Irene, Baldanti Fausto, Saggio Giovanni

机构信息

Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy.

Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy.

出版信息

Knowl Based Syst. 2022 Oct 11;253:109539. doi: 10.1016/j.knosys.2022.109539. Epub 2022 Jul 28.

DOI:10.1016/j.knosys.2022.109539
PMID:35915642
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9328841/
Abstract

Alongside the currently used nasal swab testing, the COVID-19 pandemic situation would gain noticeable advantages from low-cost tests that are available at any-time, anywhere, at a large-scale, and with real time answers. A novel approach for COVID-19 assessment is adopted here, discriminating negative subjects versus positive or recovered subjects. The scope is to identify potential discriminating features, highlight mid and short-term effects of COVID on the voice and compare two custom algorithms. A pool of 310 subjects took part in the study; recordings were collected in a low-noise, controlled setting employing three different vocal tasks. Binary classifications followed, using two different custom algorithms. The first was based on the coupling of boosting and bagging, with an AdaBoost classifier using Random Forest learners. A feature selection process was employed for the training, identifying a subset of features acting as clinically relevant biomarkers. The other approach was centered on two custom CNN architectures applied to mel-Spectrograms, with a custom knowledge-based data augmentation. Performances, evaluated on an independent test set, were comparable: Adaboost and CNN differentiated COVID-19 positive from negative with accuracies of 100% and 95% respectively, and recovered from negative individuals with accuracies of 86.1% and 75% respectively. This study highlights the possibility to identify COVID-19 positive subjects, foreseeing a tool for on-site screening, while also considering recovered subjects and the effects of COVID-19 on the voice. The two proposed novel architectures allow for the identification of biomarkers and demonstrate the ongoing relevance of traditional ML versus deep learning in speech analysis.

摘要

除了目前使用的鼻拭子检测外,新冠疫情形势将从低成本检测中获得显著优势,这种检测可以随时随地大规模进行,并能实时给出结果。本文采用了一种新的新冠评估方法,区分阴性受试者与阳性或康复受试者。其目的是识别潜在的区分特征,突出新冠对嗓音的中长期影响,并比较两种定制算法。310名受试者参与了这项研究;在低噪音、可控环境下收集了他们的录音,采用了三种不同的发声任务。随后进行二元分类,使用两种不同的定制算法。第一种基于提升和装袋的结合,使用基于随机森林学习器的AdaBoost分类器。在训练过程中采用了特征选择过程,识别出作为临床相关生物标志物的特征子集。另一种方法以应用于梅尔频谱图的两种定制卷积神经网络架构为中心,并采用基于知识的定制数据增强。在独立测试集上评估的性能相当:AdaBoost和卷积神经网络分别以100%和95%的准确率区分新冠阳性和阴性,分别以86.1%和75%的准确率区分康复者和阴性个体。这项研究突出了识别新冠阳性受试者的可能性,预见了一种现场筛查工具,同时也考虑了康复受试者以及新冠对嗓音的影响。所提出的两种新颖架构能够识别生物标志物,并证明了传统机器学习与深度学习在语音分析中的持续相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/61a5151cc592/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/75f5bfbd78da/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/2b01f224c2bc/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/ecd2026b9d39/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/93f8a0ad9250/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/02e159a8db9f/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/61a5151cc592/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/75f5bfbd78da/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/2b01f224c2bc/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/ecd2026b9d39/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/93f8a0ad9250/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/02e159a8db9f/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cdc3/9328841/61a5151cc592/gr6_lrg.jpg

相似文献

1
Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures.基于深度学习和机器学习的语音分析用于新冠病毒检测:架构建议与比较
Knowl Based Syst. 2022 Oct 11;253:109539. doi: 10.1016/j.knosys.2022.109539. Epub 2022 Jul 28.
2
Deep learning in automatic detection of dysphonia: Comparing acoustic features and developing a generalizable framework.深度学习在嗓音障碍自动检测中的应用:比较声学特征并开发一个可推广的框架。
Int J Lang Commun Disord. 2023 Mar;58(2):279-294. doi: 10.1111/1460-6984.12783. Epub 2022 Sep 18.
3
Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients.基于机器学习的语音评估用于检测 COVID-19 阳性和康复患者。
J Voice. 2024 May;38(3):796.e1-796.e13. doi: 10.1016/j.jvoice.2021.11.004. Epub 2021 Nov 26.
4
COPDVD: Automated classification of chronic obstructive pulmonary disease on a new collected and evaluated voice dataset.COPDVD:在新收集和评估的语音数据集上对慢性阻塞性肺疾病进行自动化分类。
Artif Intell Med. 2024 Oct;156:102953. doi: 10.1016/j.artmed.2024.102953. Epub 2024 Aug 15.
5
Detection of COVID-19 in smartphone-based breathing recordings: A pre-screening deep learning tool.基于智能手机的呼吸记录中 COVID-19 的检测:一种预筛选深度学习工具。
PLoS One. 2022 Jan 13;17(1):e0262448. doi: 10.1371/journal.pone.0262448. eCollection 2022.
6
COVID-19 cough classification using machine learning and global smartphone recordings.利用机器学习和全球智能手机记录对 COVID-19 咳嗽进行分类。
Comput Biol Med. 2021 Aug;135:104572. doi: 10.1016/j.compbiomed.2021.104572. Epub 2021 Jun 17.
7
Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy.卷积神经网络可高精度地对喉癌中的病理性声音变化进行分类。
J Clin Med. 2020 Oct 25;9(11):3415. doi: 10.3390/jcm9113415.
8
Particle Swarm Optimization-Based Extreme Learning Machine for COVID-19 Detection.基于粒子群优化的极限学习机用于新冠病毒检测
Cognit Comput. 2022 Oct 12:1-16. doi: 10.1007/s12559-022-10063-x.
9
Artificial Intelligence-Based Voice Assessment of Patients with Parkinson's Disease Off and On Treatment: Machine vs. Deep-Learning Comparison.基于人工智能的帕金森病患者治疗前后语音评估:机器与深度学习比较。
Sensors (Basel). 2023 Feb 18;23(4):2293. doi: 10.3390/s23042293.
10
Ensemble learning with speaker embeddings in multiple speech task stimuli for depression detection.在用于抑郁症检测的多语音任务刺激中结合说话人嵌入的集成学习。
Front Neurosci. 2023 Mar 23;17:1141621. doi: 10.3389/fnins.2023.1141621. eCollection 2023.

引用本文的文献

1
Detection of disease on nasal breath sound by new lightweight architecture: Using COVID-19 as an example.利用新型轻量级架构通过鼻腔呼吸声检测疾病:以新冠病毒病为例
Digit Health. 2025 May 28;11:20552076251339284. doi: 10.1177/20552076251339284. eCollection 2025 Jan-Dec.
2
Acoustic Analysis and Prediction of Type 2 Diabetes Mellitus Using Smartphone-Recorded Voice Segments.使用智能手机录制的语音片段对2型糖尿病进行声学分析和预测
Mayo Clin Proc Digit Health. 2023 Oct 17;1(4):534-544. doi: 10.1016/j.mcpdig.2023.08.005. eCollection 2023 Dec.
3
Acoustic and Clinical Data Analysis of Vocal Recordings: Pandemic Insights and Lessons.

本文引用的文献

1
SARS-CoV-2 Detection From Voice.从声音中检测新型冠状病毒2型
IEEE Open J Eng Med Biol. 2020 Sep 24;1:268-274. doi: 10.1109/OJEMB.2020.3026468. eCollection 2020.
2
Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients.基于机器学习的语音评估用于检测 COVID-19 阳性和康复患者。
J Voice. 2024 May;38(3):796.e1-796.e13. doi: 10.1016/j.jvoice.2021.11.004. Epub 2021 Nov 26.
3
COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings.仅使用咳嗽录音的COVID-19人工智能诊断
语音录音的声学和临床数据分析:疫情期间的见解与经验教训。
Diagnostics (Basel). 2024 Oct 12;14(20):2273. doi: 10.3390/diagnostics14202273.
4
Sound as a bell: a deep learning approach for health status classification through speech acoustic biomarkers.健康如钟:一种通过语音声学生物标志物进行健康状态分类的深度学习方法。
Chin Med. 2024 Jul 24;19(1):101. doi: 10.1186/s13020-024-00973-3.
5
Applications to augment patient care for Internal Medicine specialists: a position paper from the EFIM working group on telemedicine, innovative technologies & digital health.应用于提升内科专家患者护理水平的相关研究:来自 EFIM 远程医疗、创新技术与数字健康工作组的立场文件。
Front Public Health. 2024 Jun 28;12:1370555. doi: 10.3389/fpubh.2024.1370555. eCollection 2024.
6
Severity Classification Using Dynamic Time Warping-Based Voice Biomarkers for Patients With COVID-19: Feasibility Cross-Sectional Study.使用基于动态时间规整的语音生物标志物对新冠肺炎患者进行严重程度分类:可行性横断面研究
JMIR Biomed Eng. 2023 Nov 6;8:e50924. doi: 10.2196/50924. eCollection 2023.
7
Acoustic analysis in stuttering: a machine-learning study.口吃的声学分析:一项机器学习研究。
Front Neurol. 2023 Jun 30;14:1169707. doi: 10.3389/fneur.2023.1169707. eCollection 2023.
8
High-Level CNN and Machine Learning Methods for Speaker Recognition.基于深度学习的说话人识别方法。
Sensors (Basel). 2023 Mar 25;23(7):3461. doi: 10.3390/s23073461.
9
Acoustic Analysis of Speech for Screening for Suicide Risk: Machine Learning Classifiers for Between- and Within-Person Evaluation of Suicidality.言语声学分析用于自杀风险筛查:用于个体间和个体内评估自杀倾向的机器学习分类器。
J Med Internet Res. 2023 Mar 23;25:e45456. doi: 10.2196/45456.
10
Artificial Intelligence-Based Voice Assessment of Patients with Parkinson's Disease Off and On Treatment: Machine vs. Deep-Learning Comparison.基于人工智能的帕金森病患者治疗前后语音评估:机器与深度学习比较。
Sensors (Basel). 2023 Feb 18;23(4):2293. doi: 10.3390/s23042293.
IEEE Open J Eng Med Biol. 2020 Sep 29;1:275-281. doi: 10.1109/OJEMB.2020.3026928. eCollection 2020.
4
A novel data augmentation based on Gabor filter and convolutional deep learning for improving the classification of COVID-19 chest X-Ray images.一种基于Gabor滤波器和卷积深度学习的新型数据增强方法,用于改善COVID-19胸部X光图像的分类。
Biomed Signal Process Control. 2022 Feb;72:103326. doi: 10.1016/j.bspc.2021.103326. Epub 2021 Nov 9.
5
Detection of COVID-19 from voice, cough and breathing patterns: Dataset and preliminary results.从声音、咳嗽和呼吸模式中检测新冠病毒:数据集与初步结果
Comput Biol Med. 2021 Nov;138:104944. doi: 10.1016/j.compbiomed.2021.104944. Epub 2021 Oct 13.
6
Identifying individuals with recent COVID-19 through voice classification using deep learning.利用深度学习通过声音分类识别近期感染 COVID-19 的个体。
Sci Rep. 2021 Sep 27;11(1):19149. doi: 10.1038/s41598-021-98742-x.
7
The voice of COVID-19: Acoustic correlates of infection in sustained vowels.COVID-19 的声音:元音持续发音中的感染声学相关特征。
J Acoust Soc Am. 2021 Jun;149(6):4377. doi: 10.1121/10.0005194.
8
COVID-19 cough classification using machine learning and global smartphone recordings.利用机器学习和全球智能手机记录对 COVID-19 咳嗽进行分类。
Comput Biol Med. 2021 Aug;135:104572. doi: 10.1016/j.compbiomed.2021.104572. Epub 2021 Jun 17.
9
Fostering Voice Objective Analysis in Patients with Movement Disorders.促进运动障碍患者的语音客观分析
Mov Disord. 2021 Apr;36(4):1041. doi: 10.1002/mds.28537.
10
Artificial intelligence enabled preliminary diagnosis for COVID-19 from voice cues and questionnaires.人工智能通过语音提示和问卷进行 COVID-19 的初步诊断。
J Acoust Soc Am. 2021 Feb;149(2):1120. doi: 10.1121/10.0003434.