Kim Yeonkyeong, Kim Kyu Bom, Leem Ah Young, Kim Kyuseok, Lee Su Hwan
Division of Pulmonology and Critical Care Medicine, Department of Internal Medicine, Yonsei University College of Medicine, 50-1, Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea.
2TS Corporation, 211, Hwarang-ro, Seongbuk-gu, Seoul 02772, Republic of Korea.
J Clin Med. 2025 Aug 1;14(15):5437. doi: 10.3390/jcm14155437.
: Identifying and classifying abnormal lung sounds is essential for diagnosing patients with respiratory disorders. In particular, the simultaneous recording of auscultation signals from multiple clinically relevant positions offers greater diagnostic potential compared to traditional single-channel measurements. This study aims to improve the accuracy of respiratory sound classification by leveraging multichannel signals and capturing positional characteristics from multiple sites in the same patient. : We evaluated the performance of respiratory sound classification using multichannel lung sound data with a deep learning model that combines a convolutional neural network (CNN) and long short-term memory (LSTM), based on mel-frequency cepstral coefficients (MFCCs). We analyzed the impact of the number and placement of channels on classification performance. : The results demonstrated that using four-channel recordings improved accuracy, sensitivity, specificity, precision, and F1-score by approximately 1.11, 1.15, 1.05, 1.08, and 1.13 times, respectively, compared to using three, two, or single-channel recordings. : This study confirms that multichannel data capture a richer set of features corresponding to various respiratory sound characteristics, leading to significantly improved classification performance. The proposed method holds promise for enhancing sound classification accuracy not only in clinical applications but also in broader domains such as speech and audio processing.
识别和分类异常肺音对于诊断呼吸系统疾病患者至关重要。特别是,与传统的单通道测量相比,从多个临床相关位置同时记录听诊信号具有更大的诊断潜力。本研究旨在通过利用多通道信号并捕捉同一患者多个部位的位置特征来提高呼吸音分类的准确性。:我们使用基于梅尔频率倒谱系数(MFCC)的深度学习模型,结合卷积神经网络(CNN)和长短期记忆(LSTM),对多通道肺音数据的呼吸音分类性能进行了评估。我们分析了通道数量和放置位置对分类性能的影响。:结果表明,与使用三通道、两通道或单通道记录相比,使用四通道记录分别将准确率、灵敏度、特异性、精确率和F1分数提高了约1.11、1.15、1.05、1.08和1.13倍。:本研究证实,多通道数据捕捉到了与各种呼吸音特征相对应的更丰富的特征集,从而显著提高了分类性能。所提出的方法不仅在临床应用中,而且在语音和音频处理等更广泛的领域中,都有望提高声音分类的准确性。