• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于音频输入的肺异常分类的深度集成神经网络与注意力机制

A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs.

机构信息

Department of Computer and Information Sciences, Faculty of Engineering and Environment, University of Northumbria, Newcastle upon Tyne NE1 8ST, UK.

Department of Computer Science, Royal Holloway University of London, Egham TW20 0EX, UK.

出版信息

Sensors (Basel). 2022 Jul 26;22(15):5566. doi: 10.3390/s22155566.

DOI:10.3390/s22155566
PMID:35898070
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9332569/
Abstract

Medical audio classification for lung abnormality diagnosis is a challenging problem owing to comparatively unstructured audio signals present in the respiratory sound clips. To tackle such challenges, we propose an ensemble model by incorporating diverse deep neural networks with attention mechanisms for undertaking lung abnormality and COVID-19 diagnosis using respiratory, speech, and coughing audio inputs. Specifically, four base deep networks are proposed, which include attention-based Convolutional Recurrent Neural Network (A-CRNN), attention-based bidirectional Long Short-Term Memory (A-BiLSTM), attention-based bidirectional Gated Recurrent Unit (A-BiGRU), as well as Convolutional Neural Network (CNN). A Particle Swarm Optimization (PSO) algorithm is used to optimize the training parameters of each network. An ensemble mechanism is used to integrate the outputs of these base networks by averaging the probability predictions of each class. Evaluated using respiratory ICBHI, Coswara breathing, speech, and cough datasets, as well as a combination of ICBHI and Coswara breathing databases, our ensemble model and base networks achieve ICBHI scores ranging from 0.920 to 0.9766. Most importantly, the empirical results indicate that a positive COVID-19 diagnosis can be distinguished to a high degree from other more common respiratory diseases using audio recordings, based on the combined ICBHI and Coswara breathing datasets.

摘要

用于肺部异常诊断的医学音频分类是一个具有挑战性的问题,因为呼吸声样本中存在相对非结构化的音频信号。为了应对这些挑战,我们提出了一种集成模型,该模型通过结合具有注意力机制的多种深度神经网络,使用呼吸、语音和咳嗽音频输入进行肺部异常和 COVID-19 诊断。具体来说,提出了四个基于注意力的基础深度网络,包括基于注意力的卷积递归神经网络 (A-CRNN)、基于注意力的双向长短时记忆网络 (A-BiLSTM)、基于注意力的双向门控循环单元 (A-BiGRU) 以及卷积神经网络 (CNN)。使用粒子群优化 (PSO) 算法优化每个网络的训练参数。通过平均每个类别的概率预测,使用集成机制来整合这些基础网络的输出。通过使用呼吸 ICBHI、Coswara 呼吸、语音和咳嗽数据集以及 ICBHI 和 Coswara 呼吸数据库的组合进行评估,我们的集成模型和基础网络在 ICBHI 评分方面的表现范围为 0.920 到 0.9766。最重要的是,根据 ICBHI 和 Coswara 呼吸数据库的组合,实证结果表明,使用音频记录可以高度区分 COVID-19 阳性诊断与其他更为常见的呼吸道疾病。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c2e/9332569/c98fd132686f/sensors-22-05566-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c2e/9332569/c98fd132686f/sensors-22-05566-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c2e/9332569/c98fd132686f/sensors-22-05566-g001.jpg

相似文献

1
A Deep Ensemble Neural Network with Attention Mechanisms for Lung Abnormality Classification Using Audio Inputs.基于音频输入的肺异常分类的深度集成神经网络与注意力机制
Sensors (Basel). 2022 Jul 26;22(15):5566. doi: 10.3390/s22155566.
2
Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database-HF_Lung_V1.在自主开发的开放获取肺部声音数据库 HF_Lung_V1 上,对八种递归神经网络变体进行呼吸相位和偶发声音检测的基准测试。
PLoS One. 2021 Jul 1;16(7):e0254134. doi: 10.1371/journal.pone.0254134. eCollection 2021.
3
Multimodal Gait Abnormality Recognition Using a Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BiLSTM) Network Based on Multi-Sensor Data Fusion.基于多传感器数据融合的卷积神经网络-双向长短时记忆网络(CNN-BiLSTM)的多模态步态异常识别
Sensors (Basel). 2023 Nov 10;23(22):9101. doi: 10.3390/s23229101.
4
Classification of lung sounds using scalogram representation of sound segments and convolutional neural network.利用声音片段的小波尺度图表示和卷积神经网络对肺音进行分类。
J Med Eng Technol. 2022 May;46(4):270-279. doi: 10.1080/03091902.2022.2040624. Epub 2022 Feb 25.
5
Audio-Based Drone Detection and Identification Using Deep Learning Techniques with Dataset Enhancement through Generative Adversarial Networks.基于音频的无人机检测与识别:深度学习技术与生成对抗网络增强数据集
Sensors (Basel). 2021 Jul 21;21(15):4953. doi: 10.3390/s21154953.
6
Pulmonary disease detection and classification in patient respiratory audio files using long short-term memory neural networks.使用长短期记忆神经网络对患者呼吸音频文件中的肺部疾病进行检测和分类。
Front Med (Lausanne). 2023 Nov 3;10:1269784. doi: 10.3389/fmed.2023.1269784. eCollection 2023.
7
FLANNEL (Focal Loss bAsed Neural Network EnsembLe) for COVID-19 detection.基于焦点损失的神经网络集成 (FLANNEL) 用于 COVID-19 检测。
J Am Med Inform Assoc. 2021 Mar 1;28(3):444-452. doi: 10.1093/jamia/ocaa280.
8
Two-Stream Attention Network for Pain Recognition from Video Sequences.基于双流注意力网络的视频序列疼痛识别
Sensors (Basel). 2020 Feb 4;20(3):839. doi: 10.3390/s20030839.
9
Polyphonic Sound Event Detection Using Temporal-Frequency Attention and Feature Space Attention.基于时频注意力和特征空间注意力的复音声音事件检测。
Sensors (Basel). 2022 Sep 9;22(18):6818. doi: 10.3390/s22186818.
10
Prediction of dissolved oxygen concentration in aquaculture based on attention mechanism and combined neural network.基于注意力机制和组合神经网络的水产养殖中溶解氧浓度预测
Math Biosci Eng. 2023 Jan;20(1):998-1017. doi: 10.3934/mbe.2023046. Epub 2022 Oct 21.

引用本文的文献

1
Fused Audio Instance and Representation for Respiratory Disease Detection.融合音频实例和表示进行呼吸疾病检测。
Sensors (Basel). 2024 Sep 24;24(19):6176. doi: 10.3390/s24196176.
2
Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers.结合耳蜗图和视觉Transformer的附加音分类
Sensors (Basel). 2024 Jan 21;24(2):682. doi: 10.3390/s24020682.
3
A Machine Learning Approach for Walking Classification in Elderly People with Gait Disorders.机器学习在步态障碍老年人行走分类中的应用。

本文引用的文献

1
Evolving Deep Architecture Generation with Residual Connections for Image Classification Using Particle Swarm Optimization.基于粒子群优化的残差连接在图像分类中用于深度架构生成的演化。
Sensors (Basel). 2021 Nov 28;21(23):7936. doi: 10.3390/s21237936.
2
COVID-19 cough classification using machine learning and global smartphone recordings.利用机器学习和全球智能手机记录对 COVID-19 咳嗽进行分类。
Comput Biol Med. 2021 Aug;135:104572. doi: 10.1016/j.compbiomed.2021.104572. Epub 2021 Jun 17.
3
A deep-learning based multimodal system for Covid-19 diagnosis using breathing sounds and chest X-ray images.
Sensors (Basel). 2023 Jan 6;23(2):679. doi: 10.3390/s23020679.
一种基于深度学习的多模态系统,用于利用呼吸声和胸部X光图像诊断新冠肺炎。
Appl Soft Comput. 2021 Sep;109:107522. doi: 10.1016/j.asoc.2021.107522. Epub 2021 May 26.
4
Clinical features of COVID-19 in elderly patients: A comparison with young and middle-aged patients.老年 COVID-19 患者的临床特征:与中青年患者的比较。
J Infect. 2020 Jun;80(6):e14-e18. doi: 10.1016/j.jinf.2020.03.005. Epub 2020 Mar 27.
5
Detecting Respiratory Pathologies Using Convolutional Neural Networks and Variational Autoencoders for Unbalancing Data.使用卷积神经网络和变分自动编码器检测呼吸病理学,以平衡不平衡数据。
Sensors (Basel). 2020 Feb 22;20(4):1214. doi: 10.3390/s20041214.
6
IoTNet: An Efficient and Accurate Convolutional Neural Network for IoT Devices.IoTNet:一种用于物联网设备的高效准确卷积神经网络。
Sensors (Basel). 2019 Dec 14;19(24):5541. doi: 10.3390/s19245541.
7
Classifying Heart Sounds Using Images of Motifs, MFCC and Temporal Features.基于图案图像、MFCC 和时频特征的心脏音分类。
J Med Syst. 2019 May 6;43(6):168. doi: 10.1007/s10916-019-1286-5.
8
Convolutional neural networks: an overview and application in radiology.卷积神经网络:概述及其在放射学中的应用。
Insights Imaging. 2018 Aug;9(4):611-629. doi: 10.1007/s13244-018-0639-9. Epub 2018 Jun 22.
9
A Micro-GA Embedded PSO Feature Selection Approach to Intelligent Facial Emotion Recognition.基于微 GA 嵌入 PSO 的特征选择方法在智能人脸表情识别中的应用。
IEEE Trans Cybern. 2017 Jun;47(6):1496-1509. doi: 10.1109/TCYB.2016.2549639. Epub 2016 Apr 21.
10
Understanding and using sensitivity, specificity and predictive values.理解并运用敏感度、特异度和预测值。
Indian J Ophthalmol. 2008 Jan-Feb;56(1):45-50. doi: 10.4103/0301-4738.37595.