Suppr超能文献

基于深度学习模型从频谱图图像表征对新冠病毒咳嗽声音症状进行分类

COVID-19 cough sound symptoms classification from scalogram image representation using deep learning models.

作者信息

Loey Mohamed, Mirjalili Seyedali

机构信息

Department of Computer Science, Faculty of Computers and Artificial Intelligence, Benha University, Benha, 13518, Egypt; Information Technology Program, New Cairo Technological University, New Cairo, Egypt.

Center for Artificial Intelligence Research and Optimization, Torrens University Australia, Fortitude Valley, Brisbane, QLD, 4006, Australia; Yonsei Frontier Lab, Yonsei University, Seoul, South Korea.

出版信息

Comput Biol Med. 2021 Dec;139:105020. doi: 10.1016/j.compbiomed.2021.105020. Epub 2021 Nov 10.

Abstract

Deep Learning shows promising performance in diverse fields and has become an emerging technology in Artificial Intelligence. Recent visual recognition is based on the ranking of photographs and the finding of artefacts in those images. The aim of this research is to classify the different cough sounds of COVID-19 artefacts in the signals of altered real-life environments. The introduced model takes into consideration two major steps. The first step is the transformation phase from sound to image that is optimized by the scalogram technique. The second step involves feature extraction and classification based on six deep transfer models (GoogleNet, ResNet18, ResNet50, ResNet101, MobileNetv2, and NasNetmobile). The dataset used contains 1457 (755 of COVID-19 and 702 of healthy) wave cough sounds. Although our recognition model performs the best, its accuracy only reaches 94.9% based on SGDM optimizer. The accuracy is promising enough for a wide set of labeled cough data to test the potential for generalization. The outcomes show that ResNet18 is the most stable model to classify the cough sounds from a limited dataset with a sensitivity of 94.44% and a specificity of 95.37%. Finally, a comparison of the research with a similar analysis is made. It is observed that the proposed model is more reliable and accurate than any current models. Cough research precision is promising enough to test the ability for extrapolation and generalization.

摘要

深度学习在多个领域展现出了良好的性能,已成为人工智能领域的一项新兴技术。近期的视觉识别基于照片的排序以及在这些图像中发现人工制品。本研究的目的是在真实环境改变的信号中对新冠病毒人工制品的不同咳嗽声音进行分类。所引入的模型考虑了两个主要步骤。第一步是通过小波变换技术优化的从声音到图像的转换阶段。第二步涉及基于六种深度迁移模型(谷歌网络、残差网络18、残差网络50、残差网络101、移动网络v2和纳斯网络移动版)进行特征提取和分类。所使用的数据集包含1457个(755个新冠病毒咳嗽声音和702个健康咳嗽声音)波形咳嗽声音。尽管我们的识别模型表现最佳,但基于SGDM优化器,其准确率仅达到94.9%。对于大量带标签的咳嗽数据而言,该准确率足以测试其泛化潜力。结果表明,残差网络18是从有限数据集中对咳嗽声音进行分类的最稳定模型,其灵敏度为94.44%,特异性为95.37%。最后,将本研究与类似分析进行了比较。结果发现,所提出的模型比任何当前模型都更可靠、更准确。咳嗽研究的精度足以测试其外推和泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84d1/8628520/fd2234b77168/gr1_lrg.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验