Department of Electrical and Computer Engineering, Concordia University, Canada.
Comput Methods Programs Biomed. 2024 May;248:108122. doi: 10.1016/j.cmpb.2024.108122. Epub 2024 Mar 15.
Most of the existing machine learning-based heart sound classification methods achieve limited accuracy. Since they primarily depend on single domain feature information and tend to focus equally on each part of the signal rather than employing a selective attention mechanism. In addition, they fail to exploit convolutional neural network (CNN) - based features with an effective fusion strategy.
In order to overcome these limitations, a novel multimodal attention convolutional neural network (MACNN) with a feature-level fusion strategy, in which Mel-cepstral domain as well as general frequency domain features are incorporated to increase the diversity of the features, is proposed in this paper. In the proposed method, DilationAttenNet is first utilized to construct attention-based CNN feature extractors and then these feature extractors are jointly optimized in MACNN at the feature-level. The attention mechanism aims to suppress irrelevant information and focus on crucial diverse features extracted from the CNN.
Extensive experiments are carried out to study the efficacy of the feature level fusion in comparison to that with early fusion. The results show that the proposed MACNN method significantly outperforms the state-of-the-art approaches in terms of accuracy and score for the two publicly available Github and Physionet datasets.
The findings of our experiments demonstrated the high performance for heart sound classification based on the proposed MACNN, and hence have potential clinical usefulness in the identification of heart diseases. This technique can assist cardiologists and researchers in the design and development of heart sound classification methods.
现有的基于机器学习的心脏声音分类方法大多精度有限。由于它们主要依赖于单一领域的特征信息,并且往往平等地关注信号的每个部分,而不是采用选择性注意机制。此外,它们未能利用基于卷积神经网络(CNN)的特征并采用有效的融合策略。
为了克服这些限制,本文提出了一种新的基于多模态注意卷积神经网络(MACNN)的方法,该方法具有特征级融合策略,其中包括梅尔倒谱域和一般频域特征,以增加特征的多样性。在该方法中,首先利用 DilationAttenNet 构建基于注意力的 CNN 特征提取器,然后在 MACNN 中在特征级别上联合优化这些特征提取器。注意力机制旨在抑制不相关信息并关注从 CNN 中提取的关键多样特征。
进行了广泛的实验以研究与早期融合相比,特征级融合的效果。结果表明,与现有的最先进的方法相比,所提出的 MACNN 方法在两个公开的 Github 和 Physionet 数据集的准确性和得分方面都有显著的提高。
实验结果表明,基于所提出的 MACNN 的心脏声音分类具有很高的性能,因此在心脏病识别方面具有潜在的临床应用价值。该技术可以帮助心脏病专家和研究人员设计和开发心脏声音分类方法。