• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用分形维数和视觉Transformer以及基于咳嗽声音的Grad-CAM进行可解释的COVID-19检测

Explainable COVID-19 detection using fractal dimension and vision transformer with Grad-CAM on cough sounds.

作者信息

Sobahi Nebras, Atila Orhan, Deniz Erkan, Sengur Abdulkadir, Acharya U Rajendra

机构信息

King Abdulaziz University, Department of Electrical and Computer Engineering, Jeddah, Saudi Arabia.

Firat University, Technology Faculty, Electrical and Electronics Engineering Department, Elazig, Turkey.

出版信息

Biocybern Biomed Eng. 2022 Jul-Sep;42(3):1066-1080. doi: 10.1016/j.bbe.2022.08.005. Epub 2022 Sep 6.

DOI:10.1016/j.bbe.2022.08.005
PMID:36092540
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9444505/
Abstract

The polymerase chain reaction (PCR) test is not only time-intensive but also a contact method that puts healthcare personnel at risk. Thus, contactless and fast detection tests are more valuable. Cough sound is an important indicator of COVID-19, and in this paper, a novel explainable scheme is developed for cough sound-based COVID-19 detection. In the presented work, the cough sound is initially segmented into overlapping parts, and each segment is labeled as the input audio, which may contain other sounds. The deep Yet Another Mobile Network (YAMNet) model is considered in this work. After labeling, the segments labeled as cough are cropped and concatenated to reconstruct the pure cough sounds. Then, four fractal dimensions (FD) calculation methods are employed to acquire the FD coefficients on the cough sound with an overlapped sliding window that forms a matrix. The constructed matrixes are then used to form the fractal dimension images. Finally, a pretrained vision transformer (ViT) model is used to classify the constructed images into COVID-19, healthy and symptomatic classes. In this work, we demonstrate the performance of the ViT on cough sound-based COVID-19, and a visual explainability of the inner workings of the ViT model is shown. Three publically available cough sound datasets, namely COUGHVID, VIRUFY, and COSWARA, are used in this study. We have obtained 98.45%, 98.15%, and 97.59% accuracy for COUGHVID, VIRUFY, and COSWARA datasets, respectively. Our developed model obtained the highest performance compared to the state-of-the-art methods and is ready to be tested in real-world applications.

摘要

聚合酶链反应(PCR)检测不仅耗时,而且是一种会让医护人员面临风险的接触式检测方法。因此,非接触式快速检测更为重要。咳嗽声是新冠肺炎的一个重要指标,本文提出了一种基于咳嗽声的新冠肺炎检测新的可解释方案。在本研究中,首先将咳嗽声分割成重叠部分,每个片段作为输入音频进行标注,其中可能包含其他声音。本研究采用了深度“又一个移动网络”(YAMNet)模型。标注后,将标注为咳嗽的片段裁剪并拼接起来,以重建纯净的咳嗽声。然后,采用四种分形维数(FD)计算方法,通过重叠滑动窗口获取咳嗽声的FD系数,形成一个矩阵。然后,用构建的矩阵形成分形维数图像。最后,使用预训练的视觉Transformer(ViT)模型将构建的图像分类为新冠肺炎、健康和有症状类别。在本研究中,我们展示了ViT在基于咳嗽声的新冠肺炎检测中的性能,并展示了ViT模型内部工作原理的可视化可解释性。本研究使用了三个公开可用的咳嗽声数据集,即COUGHVID、VIRUFY和COSWARA。对于COUGHVID、VIRUFY和COSWARA数据集,我们分别获得了98.45%、98.15%和97.59%的准确率。与现有方法相比,我们开发的模型性能最高,并且已准备好在实际应用中进行测试。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/e501bcc06967/gr14_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/4e7f3667651d/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/8bdebf4de490/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/344729b37d44/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/2871062dd129/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/100f7d73bc13/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/0bc3de55db4c/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/0ee4a32df6e5/gr7_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/fe819b199ed8/gr8_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/0b51946686cc/gr9_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/7e7eae4e678b/gr10_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/494549055735/gr11_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/20f535c68826/gr12_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/cd252425301a/gr13_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/e501bcc06967/gr14_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/4e7f3667651d/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/8bdebf4de490/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/344729b37d44/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/2871062dd129/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/100f7d73bc13/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/0bc3de55db4c/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/0ee4a32df6e5/gr7_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/fe819b199ed8/gr8_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/0b51946686cc/gr9_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/7e7eae4e678b/gr10_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/494549055735/gr11_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/20f535c68826/gr12_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/cd252425301a/gr13_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a5e2/9444505/e501bcc06967/gr14_lrg.jpg

相似文献

1
Explainable COVID-19 detection using fractal dimension and vision transformer with Grad-CAM on cough sounds.使用分形维数和视觉Transformer以及基于咳嗽声音的Grad-CAM进行可解释的COVID-19检测
Biocybern Biomed Eng. 2022 Jul-Sep;42(3):1066-1080. doi: 10.1016/j.bbe.2022.08.005. Epub 2022 Sep 6.
2
QUCoughScope: An Intelligent Application to Detect COVID-19 Patients Using Cough and Breath Sounds.QUCoughScope:一款利用咳嗽声和呼吸声检测新冠肺炎患者的智能应用程序。
Diagnostics (Basel). 2022 Apr 7;12(4):920. doi: 10.3390/diagnostics12040920.
3
A semi-supervised algorithm for improving the consistency of crowdsourced datasets: The COVID-19 case study on respiratory disorder classification.一种用于提高众包数据集一致性的半监督算法:COVID-19 呼吸障碍分类案例研究。
Comput Methods Programs Biomed. 2023 Nov;241:107743. doi: 10.1016/j.cmpb.2023.107743. Epub 2023 Aug 9.
4
Machine learning for detecting COVID-19 from cough sounds: An ensemble-based MCDM method.基于集成的多准则决策方法的机器学习用于从咳嗽声音中检测 COVID-19。
Comput Biol Med. 2022 Jun;145:105405. doi: 10.1016/j.compbiomed.2022.105405. Epub 2022 Mar 17.
5
CovidCoughNet: A new method based on convolutional neural networks and deep feature extraction using pitch-shifting data augmentation for covid-19 detection from cough, breath, and voice signals.CovidCoughNet:一种新的方法,基于卷积神经网络和深度特征提取,使用移频数据增强从咳嗽、呼吸和语音信号中检测新冠病毒。
Comput Biol Med. 2023 Sep;163:107153. doi: 10.1016/j.compbiomed.2023.107153. Epub 2023 Jun 8.
6
A study of using cough sounds and deep neural networks for the early detection of Covid-19.一项关于利用咳嗽声音和深度神经网络进行新冠病毒病早期检测的研究。
Biomed Eng Adv. 2022 Jun;3:100025. doi: 10.1016/j.bea.2022.100025. Epub 2022 Jan 6.
7
Covid-19 recognition from cough sounds using lightweight separable-quadratic convolutional network.基于轻量级可分离二次卷积网络的咳嗽声新冠病毒识别
Biomed Signal Process Control. 2022 Feb;72:103333. doi: 10.1016/j.bspc.2021.103333. Epub 2021 Nov 15.
8
COVID-19 cough sound symptoms classification from scalogram image representation using deep learning models.基于深度学习模型从频谱图图像表征对新冠病毒咳嗽声音症状进行分类
Comput Biol Med. 2021 Dec;139:105020. doi: 10.1016/j.compbiomed.2021.105020. Epub 2021 Nov 10.
9
Novel audio characteristic-dependent feature extraction and data augmentation methods for cough-based respiratory disease classification.基于咳嗽的呼吸疾病分类的新型音频特征相关特征提取和数据增强方法。
Comput Biol Med. 2024 Sep;179:108843. doi: 10.1016/j.compbiomed.2024.108843. Epub 2024 Jul 18.
10
Deep learning based cough detection camera using enhanced features.基于深度学习的咳嗽检测摄像头,采用增强特征。
Expert Syst Appl. 2022 Nov 15;206:117811. doi: 10.1016/j.eswa.2022.117811. Epub 2022 Jun 9.

引用本文的文献

1
Evidence Generation for a Host-Response Biosignature of Respiratory Disease.呼吸系统疾病宿主反应生物标志物的证据生成
Viruses. 2025 Jul 2;17(7):943. doi: 10.3390/v17070943.
2
Automatic detection and prediction of COVID-19 in cough audio signals using coronavirus herd immunity optimizer algorithm.使用冠状病毒群体免疫优化算法对咳嗽音频信号中的新冠肺炎进行自动检测和预测。
Sci Rep. 2025 Jan 17;15(1):2271. doi: 10.1038/s41598-025-85140-w.
3
Laryngeal disease classification using voice data: Octave-band vs. mel-frequency filters.使用语音数据进行喉疾病分类:倍频程滤波器与梅尔频率滤波器
Heliyon. 2024 Nov 30;10(24):e40748. doi: 10.1016/j.heliyon.2024.e40748. eCollection 2024 Dec 30.
4
Diagnosis of atrial fibrillation based on AI-detected anomalies of ECG segments.基于人工智能检测到的心电图段异常诊断心房颤动。
Heliyon. 2023 Dec 12;10(1):e23597. doi: 10.1016/j.heliyon.2023.e23597. eCollection 2024 Jan 15.