Hosseinzadeh Mehdi, Haider Amir, Malik Mazhar Hussain, Adeli Mohammad, Mzoughi Olfa, Gemeay Entesar, Mohammadi Mokhtar, Alinejad-Rokny Hamid, Khoshvaght Parisa, Porntaveetus Thantrira, Rahmani Amir Masoud
School of Computer Science, Duy Tan University, Da Nang, Vietnam.
Jadara University Research Center, Jadara University, Irbid, Jordan.
PLoS One. 2024 Dec 31;19(12):e0316645. doi: 10.1371/journal.pone.0316645. eCollection 2024.
This paper seeks to enhance the performance of Mel Frequency Cepstral Coefficients (MFCCs) for detecting abnormal heart sounds. Heart sounds are first pre-processed to remove noise and then segmented into S1, systole, S2, and diastole intervals, with thirteen MFCCs estimated from each segment, yielding 52 MFCCs per beat. Finally, MFCCs are used for heart sound classification. For that purpose, a single classifier and an innovative ensemble classifier strategy are presented and compared. In the single classifier strategy, the MFCCs from nine consecutive beats are averaged to classify heart sounds by a single classifier (either a support vector machine (SVM), the k nearest neighbors (kNN), or a decision tree (DT)). Conversely, the ensemble classifier strategy employs nine classifiers (either nine SVMs, nine kNN classifiers, or nine DTs) to individually assess beats as normal or abnormal, with the overall classification based on the majority vote. Both methods were tested on a publicly available phonocardiogram database. The heart sound classification accuracy was 91.95% for the SVM, 91.9% for the kNN, and 87.33% for the DT in the single classifier strategy. Also, the accuracy was 93.59% for the SVM, 91.84% for the kNN, and 92.22% for the DT in the ensemble classifier strategy. Overall, the results demonstrated that MFCCs were more effective than other features, including time, time-frequency, and statistical features, evaluated in similar studies. In addition, the ensemble classifier strategy improved the accuracies of the DT and the SVM by 4.89% and 1.64%, implying that the averaging of MFCCs across multiple phonocardiogram beats in the single classifier strategy degraded the important cues that are required for detecting the abnormal heart sounds, and therefore should be avoided.
本文旨在提高用于检测异常心音的梅尔频率倒谱系数(MFCCs)的性能。心音首先经过预处理以去除噪声,然后被分割为S1、收缩期、S2和舒张期间隔,从每个片段估计出13个MFCCs,每搏产生52个MFCCs。最后,MFCCs用于心音分类。为此,提出并比较了单一分类器和创新的集成分类器策略。在单一分类器策略中,将来自九个连续心搏的MFCCs进行平均,由单一分类器(支持向量机(SVM)、k近邻(kNN)或决策树(DT))对心音进行分类。相反,集成分类器策略采用九个分类器(九个SVM、九个kNN分类器或九个DT)分别评估心搏为正常或异常,总体分类基于多数投票。两种方法都在一个公开可用的心音图数据库上进行了测试。在单一分类器策略中,SVM的心音分类准确率为91.95%,kNN为91.9%,DT为87.33%。此外,在集成分类器策略中,SVM的准确率为93.59%,kNN为91.84%,DT为92.22%。总体而言,结果表明MFCCs比在类似研究中评估的其他特征(包括时间、时频和统计特征)更有效。此外,集成分类器策略将DT和SVM的准确率分别提高了4.89%和1.64%,这意味着在单一分类器策略中跨多个心音图心搏对MFCCs进行平均会降低检测异常心音所需的重要线索,因此应避免这种做法。