Shi Lukui, Zhang Yixuan, Zhang Jingye
IEEE J Biomed Health Inform. 2023 Jan;27(1):308-318. doi: 10.1109/JBHI.2022.3210996. Epub 2023 Jan 4.
Lung diseases are serious threats to human health and life, therefore, an accurate diagnosis of lung diseases is significant. The use of artificial intelligence to analyze lung sounds can aid in diagnosing lung diseases. Most of the existing lung sound recognition methods ignore the correlation between the time-domain and frequency-domain information of the lung sounds. Additionally, the spectrograms used in these models do not adequately capture the detailed features of the lung sounds. This paper proposes a model based on wavelet feature enhancement and time-frequency synchronous modeling, comprising a dual wavelet analysis module (DWAM), a cubic network, and an attention module. DWAM in the model performed a dual wavelet transformation on the spectrograms to extract the detailed features of the lung sounds. The cubic network comprised multiple cubic gated recursive units to capture the correlation of the time-frequency of the lung sounds using the time-frequency synchronous modeling. The attention module, which includes temporal and channel attention, was used to enhance the time-domain and channel dimension features. In the combined dataset and the International Conference on Biomedical and Health Informatics 2017 dataset, the suggested framework outperforms existing models by more than 1.36% and 4.28%, respectively.
肺部疾病对人类健康和生命构成严重威胁,因此,准确诊断肺部疾病意义重大。利用人工智能分析肺音有助于诊断肺部疾病。现有的大多数肺音识别方法都忽略了肺音时域和频域信息之间的相关性。此外,这些模型中使用的频谱图没有充分捕捉到肺音的详细特征。本文提出了一种基于小波特征增强和时频同步建模的模型,该模型包括双小波分析模块(DWAM)、立方网络和注意力模块。模型中的DWAM对频谱图进行双小波变换,以提取肺音的详细特征。立方网络由多个立方门控递归单元组成,通过时频同步建模来捕捉肺音时频的相关性。注意力模块包括时间注意力和通道注意力,用于增强时域和通道维度特征。在组合数据集和2017年生物医学与健康信息学国际会议数据集上,所提出的框架分别比现有模型高出1.36%和4.28%以上。