• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积神经网络的 ICBHI 数据库对肺音分类的声谱图、线谱图、梅尔频谱图和伽马频响图时频表示的比较研究。

A comparative study of the spectrogram, scalogram, melspectrogram and gammatonegram time-frequency representations for the classification of lung sounds using the ICBHI database based on CNNs.

机构信息

Electronics Department, University of Badji Mokhtar Annaba, Annaba, Algeria.

Faculty of Electronics and Computer Engineering, Universiti Teknikal Malaysia Melaka, Melaka, Malaysia.

出版信息

Biomed Tech (Berl). 2022 Aug 8;67(5):367-390. doi: 10.1515/bmt-2022-0180. Print 2022 Oct 26.

DOI:10.1515/bmt-2022-0180
PMID:35926850
Abstract

In lung sound classification using deep learning, many studies have considered the use of short-time Fourier transform (STFT) as the most commonly used 2D representation of the input data. Consequently, STFT has been widely used as an analytical tool, but other versions of the representation have also been developed. This study aims to evaluate and compare the performance of the spectrogram, scalogram, melspectrogram and gammatonegram representations, and provide comparative information to users regarding the suitability of these time-frequency (TF) techniques in lung sound classification. Lung sound signals used in this study were obtained from the ICBHI 2017 respiratory sound database. These lung sound recordings were converted into images of spectrogram, scalogram, melspectrogram and gammatonegram TF representations respectively. The four types of images were fed separately into the VGG16, ResNet-50 and AlexNet deep-learning architectures. Network performances were analyzed and compared based on accuracy, precision, recall and F1-score. The results of the analysis on the performance of the four representations using these three commonly used CNN deep-learning networks indicate that the generated gammatonegram and scalogram TF images coupled with ResNet-50 achieved maximum classification accuracies.

摘要

在使用深度学习进行肺部声音分类的研究中,许多研究都考虑使用短时傅里叶变换(STFT)作为输入数据最常用的 2D 表示方法。因此,STFT 已被广泛用作分析工具,但也已经开发出了其他版本的表示方法。本研究旨在评估和比较频谱图、线谱图、梅尔频谱图和伽马频图谱表示方法的性能,并为用户提供关于这些时频(TF)技术在肺部声音分类中的适用性的比较信息。本研究中使用的肺部声音信号来自于 ICBHI 2017 呼吸声音数据库。这些肺部声音记录分别转换为频谱图、线谱图、梅尔频谱图和伽马频图谱 TF 表示的图像。将这四种类型的图像分别输入到 VGG16、ResNet-50 和 AlexNet 深度学习架构中。根据准确率、精度、召回率和 F1 分数对网络性能进行分析和比较。对这三种常用的 CNN 深度学习网络对四种表示方法性能的分析结果表明,生成的伽马频图谱和线谱图 TF 图像与 ResNet-50 相结合可以达到最高的分类准确率。

相似文献

1
A comparative study of the spectrogram, scalogram, melspectrogram and gammatonegram time-frequency representations for the classification of lung sounds using the ICBHI database based on CNNs.基于卷积神经网络的 ICBHI 数据库对肺音分类的声谱图、线谱图、梅尔频谱图和伽马频响图时频表示的比较研究。
Biomed Tech (Berl). 2022 Aug 8;67(5):367-390. doi: 10.1515/bmt-2022-0180. Print 2022 Oct 26.
2
Classification of lung sounds using scalogram representation of sound segments and convolutional neural network.利用声音片段的小波尺度图表示和卷积神经网络对肺音进行分类。
J Med Eng Technol. 2022 May;46(4):270-279. doi: 10.1080/03091902.2022.2040624. Epub 2022 Feb 25.
3
Convolutional neural networks based efficient approach for classification of lung diseases.基于卷积神经网络的肺部疾病分类高效方法。
Health Inf Sci Syst. 2019 Dec 23;8(1):4. doi: 10.1007/s13755-019-0091-3. eCollection 2020 Dec.
4
LungAttn: advanced lung sound classification using attention mechanism with dual TQWT and triple STFT spectrogram.LungAttn:使用具有双 TQWT 和三重 STFT 频谱图的注意力机制进行高级肺部声音分类。
Physiol Meas. 2021 Oct 29;42(10). doi: 10.1088/1361-6579/ac27b9.
5
Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer.基于音频频谱图视觉Transformer的异常呼吸音识别
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341036.
6
A Lightweight CNN Model for Detecting Respiratory Diseases From Lung Auscultation Sounds Using EMD-CWT-Based Hybrid Scalogram.基于 EMD-CWT 基混合标度的肺听诊声音呼吸疾病检测的轻量化卷积神经网络模型
IEEE J Biomed Health Inform. 2021 Jul;25(7):2595-2603. doi: 10.1109/JBHI.2020.3048006. Epub 2021 Jul 27.
7
CNN-MoE Based Framework for Classification of Respiratory Anomalies and Lung Disease Detection.基于 CNN-MoE 的呼吸异常分类和肺病检测框架。
IEEE J Biomed Health Inform. 2021 Aug;25(8):2938-2947. doi: 10.1109/JBHI.2021.3064237. Epub 2021 Aug 5.
8
Crackle Detection In Lung Sounds Using Transfer Learning And Multi-Input Convolutional Neural Networks.使用迁移学习和多输入卷积神经网络进行肺部声音的爆裂音检测。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:80-83. doi: 10.1109/EMBC46164.2021.9630577.
9
Phonocardiogram transfer learning-based CatBoost model for diastolic dysfunction identification using multiple domain-specific deep feature fusion.基于心音图迁移学习的 CatBoost 模型,使用多个特定领域深度特征融合进行舒张功能障碍识别。
Comput Biol Med. 2023 Apr;156:106707. doi: 10.1016/j.compbiomed.2023.106707. Epub 2023 Feb 20.
10
Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function.基于混合 CNN-LSTM 网络和焦点损失函数的肺部声音自动分类。
Sensors (Basel). 2022 Feb 6;22(3):1232. doi: 10.3390/s22031232.

引用本文的文献

1
Lung disease recognition methods using audio-based analysis with machine learning.使用基于音频分析和机器学习的肺部疾病识别方法。
Heliyon. 2024 Feb 17;10(4):e26218. doi: 10.1016/j.heliyon.2024.e26218. eCollection 2024 Feb 29.
2
Digital Pulmonology Practice with Phonopulmography Leveraging Artificial Intelligence: Future Perspectives Using Dual Microwave Acoustic Sensing and Imaging.基于人工智能的声门启闭图数字肺脏学实践:利用双微波声敏与成像技术的未来展望。
Sensors (Basel). 2023 Jun 12;23(12):5514. doi: 10.3390/s23125514.