Zhang Weiwei, Li Xinyu, Liu Qiao, Zheng Xiangyang, Ge Yisu, Pan Xiaotian, Zhou Yu
Infectious Disease Department, The Third Affiliated Hospital of Wenzhou Medical University, Wenzhou, China.
China Telecom Corporation Limited Zhejiang Branch, Hangzhou, China.
Front Bioeng Biotechnol. 2025 Jul 2;13:1583416. doi: 10.3389/fbioe.2025.1583416. eCollection 2025.
In recent years, advancements in machine learning and electronic stethoscope technology have enabled high-precision recording and analysis of lung sounds, significantly enhancing pulmonary disease diagnosis.
This study presents a comprehensive approach to classify lung sounds into healthy and unhealthy categories using a dataset collected from 112 subjects, comprising 35 healthy individuals and 77 patients with various pulmonary conditions, such as asthma, heart failure, pneumonia, bronchitis, pleural effusion, lung fibrosis, and chronic obstructive pulmonary disease (COPD), grouped as unhealthy. The dataset was obtained using a 3M Littmann® Electronic Stethoscope Model 3,200, employing three types of filters (Bell, Diaphragm, and Extended) to capture sounds across different frequency ranges. We extracted five key audio features-Spectral Centroid, Power, Energy, Zero Crossing Rate, and Mel-Frequency Cepstral Coefficients (MFCCs)-from each recording to form a feature matrix. A Multi-Layer Perceptron (MLP) neural network was trained for binary classification.
The MLP neural network achieved accuracies of 98%, 100%, and 94% on the training, validation, and testing sets, respectively. This partitioning ensured the model's robustness and accuracy.
The high classification accuracy achieved by the MLP neural network suggests that this approach is a valuable decision-support tool for identifying healthy versus unhealthy lung sounds in clinical settings, facilitating early intervention while maintaining computational efficiency for offline implementation. The combination of detailed feature extraction and an optimized MLP neural network resulted in a reliable method for automated binary classification of lung sounds.
近年来,机器学习和电子听诊器技术的进步使得能够对肺部声音进行高精度记录和分析,显著提高了肺部疾病的诊断水平。
本研究提出了一种综合方法,使用从112名受试者收集的数据集将肺部声音分为健康和不健康两类,其中包括35名健康个体和77名患有各种肺部疾病的患者,如哮喘、心力衰竭、肺炎、支气管炎、胸腔积液、肺纤维化和慢性阻塞性肺疾病(COPD),这些患者被归类为不健康。该数据集使用3M Littmann® 3200型电子听诊器获得,采用三种类型的滤波器(钟形、膜片形和扩展形)来捕捉不同频率范围内的声音。我们从每个记录中提取了五个关键音频特征——谱质心、功率、能量、过零率和梅尔频率倒谱系数(MFCC),以形成一个特征矩阵。训练了一个多层感知器(MLP)神经网络用于二分类。
MLP神经网络在训练集、验证集和测试集上分别达到了98%、100%和94%的准确率。这种划分确保了模型的稳健性和准确性。
MLP神经网络实现的高分类准确率表明,这种方法是一种有价值的决策支持工具,可用于在临床环境中识别健康与不健康的肺部声音,有助于早期干预,同时保持离线实施的计算效率。详细的特征提取和优化的MLP神经网络的结合产生了一种可靠的肺部声音自动二分类方法。