Department of Electronics and Communication Engineering, Pondicherry Engineering College Puducherry, 605 014, India.
Artif Intell Med. 2020 Mar;103:101809. doi: 10.1016/j.artmed.2020.101809. Epub 2020 Jan 20.
Auscultation of the lung is a conventional technique used for diagnosing chronic obstructive pulmonary diseases (COPDs) and lower respiratory infections and disorders in patients. In most of the earlier works, wavelet transforms or spectrograms have been used to analyze the lung sounds. However, an accurate prediction model for respiratory disorders has not been developed so far. In this paper, a pre-trained optimized Alexnet Convolutional Neural Network (CNN) architecture is proposed for predicting respiratory disorders. The proposed approach models the segmented respiratory sound signal into Bump and Morse scalograms from several intrinsic mode functions (IMFs) using empirical mode decomposition (EMD) method. From the extracted intrinsic mode functions, the percentage energy calculated for each wavelet coefficient in the form of scalograms are computed. Subsequently, these scalograms are given as input to the pre-trained optimized CNN model for training and testing. Stochastic gradient descent with momentum (SGDM) and adaptive data momentum (ADAM) optimization algorithms were examined to check the prediction accuracy on the dataset comprising of four classes of lung sounds, normal, crackles (coarse and fine), wheezes (monophonic & polyphonic) and low-pitched wheezes (Rhonchi). On comparison to the baseline method of standard Bump and Morse wavelet transform approach which produced 79.04 % and 81.27 % validation accuracy, an improved accuracy of 83.78 % is achieved by the virtue of scalogram representation of various IMFs of EMD. Hence, the proposed approach achieves significant performance improvement in accuracy compared to the existing state-of- the-art techniques in literature.
听诊是一种用于诊断慢性阻塞性肺疾病(COPD)和下呼吸道感染以及患者下呼吸道疾病的常规技术。在早期的大多数研究中,已经使用小波变换或声谱图来分析肺部声音。然而,到目前为止,还没有开发出用于呼吸障碍的准确预测模型。在本文中,提出了一种经过预训练的优化 Alexnet 卷积神经网络(CNN)架构,用于预测呼吸障碍。该方法使用经验模态分解(EMD)方法将分段呼吸声信号建模为几个固有模态函数(IMF)的 Bump 和 Morse 声谱图。从提取的固有模态函数中,以声谱图的形式计算每个子波系数的百分比能量。然后,将这些声谱图作为输入提供给经过预训练的优化 CNN 模型进行训练和测试。检查了随机梯度下降带动量(SGDM)和自适应数据动量(ADAM)优化算法,以检查在包含四类肺部声音(正常、爆裂声(粗和细)、哮鸣音(单音和多音)和低调哮鸣音(罗音)的数据集上的预测准确性。与产生 79.04%和 81.27%验证准确性的标准 Bump 和 Morse 子波变换方法的基线方法相比,通过 EMD 的各种 IMF 的声谱图表示,可以实现 83.78%的提高精度。因此,与文献中的现有最新技术相比,该方法在准确性方面实现了显著的性能提升。