School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, 430074, China.
Hanyang District Yongfeng Community Health Service Center, Wuhan, China.
Sci Rep. 2024 Oct 5;14(1):23196. doi: 10.1038/s41598-024-73237-7.
Heart sound auscultation plays a crucial role in the early diagnosis of cardiovascular diseases. In recent years, great achievements have been made in the automatic classification of heart sounds, but most methods are based on segmentation features and traditional classifiers and do not fully exploit existing deep networks. This paper proposes a cardiac audio classification method based on image expression of multidimensional features (CACIEMDF). First, a 102-dimensional feature vector is designed by combining the characteristics of heart sound data in the time domain, frequency domain and statistical domain. Based on the feature vector, a two-dimensional feature projection space is constructed by PCA dimensionality reduction and the convex hull algorithm, and 102 pairs of coordinate representations of the feature vector in the two-dimensional space are calculated. Each one-dimensional component of the feature vector corresponds to a pair of 2D coordinate representations. Finally, the one-dimensional feature component value and its divergence into categories are used to fill the three channels of a color image, and a Gaussian model is used to dye the image to enrich its content. The color image is sent to a deep network such as ResNet50 for classification. In this paper, three public heart sound datasets are fused, and experiments are conducted using the above methods. The results show that for the two-classification/five-classification task of heart sounds, the method in this paper can achieve a classification accuracy of 95.68%/94.53% when combined with the current deep network.
心音听诊在心脑血管疾病的早期诊断中起着至关重要的作用。近年来,心音自动分类取得了很大的成就,但大多数方法都是基于分段特征和传统分类器,没有充分利用现有的深度网络。本文提出了一种基于多维特征图像表达的心音分类方法(CACIEMDF)。首先,通过结合心音数据在时域、频域和统计域的特征,设计了一个 102 维特征向量。基于特征向量,通过 PCA 降维和凸壳算法构建二维特征投影空间,并计算特征向量在二维空间中的 102 对坐标表示。特征向量的每个一维分量对应二维空间中一对 2D 坐标表示。最后,使用一维特征分量值及其类别发散来填充彩色图像的三个通道,并使用高斯模型对图像进行染色,以丰富其内容。将彩色图像发送到 ResNet50 等深度网络进行分类。本文融合了三个公开的心音数据集,并使用上述方法进行了实验。结果表明,对于心音的两类/五类分类任务,本文方法与当前深度网络结合,可分别达到 95.68%/94.53%的分类准确率。