Simon Bolivar University, Caracas, Venezuela.
Physiol Meas. 2017 Jul 31;38(8):1631-1644. doi: 10.1088/1361-6579/aa7982.
Heart sound classification and analysis play an important role in the early diagnosis and prevention of cardiovascular disease. To this end, this paper introduces a novel method for automatic classification of normal and abnormal heart sound recordings.
Signals are first preprocessed to extract a total of 131 features in the time, frequency, wavelet and statistical domains from the entire signal and from the timings of the states. Outlier signals are then detected and separated from those with a standard range using an interquartile range algorithm. After that, feature extreme values are given special consideration, and finally features are reduced to the most significant ones using a feature reduction technique. In the classification stage, the selected features either for standard or outlier signals are fed separately into an ensemble of 20 two-step classifiers for the classification task. The first step of the classifier is represented by a nested set of ensemble algorithms which was cross-validated on the training dataset provided by PhysioNet Challenge 2016, while the second one uses a voting rule of the class label.
The results show that this method is able to recognize heart sound recordings efficiently, achieving an overall score of 96.30% for standard signals and 90.18% for outlier signals on a cross-validated experiment using the available training data.
The approach of our proposed method helped reduce overfitting and improved classification performance, achieving an overall score on the hidden test set of 80.1% (79.6% sensitivity and 80.6% specificity).
心音分类和分析在心脑血管疾病的早期诊断和预防中起着重要作用。为此,本文介绍了一种用于自动分类正常和异常心音记录的新方法。
首先对信号进行预处理,从整个信号和状态的时间中提取总共 131 个时间、频率、小波和统计域的特征,以及特征。然后使用四分位距算法检测并分离出超出标准范围的异常信号。之后,特别考虑特征极值,最后使用特征约简技术将特征减少到最重要的特征。在分类阶段,将选择的特征(无论是标准信号还是异常信号)分别输入到 20 个两步分类器的集成中进行分类任务。分类器的第一步由一组嵌套的集成算法表示,这些算法在 PhysioNet 挑战赛 2016 提供的训练数据集上进行了交叉验证,而第二步则使用类标签的投票规则。
结果表明,该方法能够有效地识别心音记录,在使用可用训练数据进行交叉验证实验时,标准信号的总体得分为 96.30%,异常信号的总体得分为 90.18%。
我们提出的方法的方法有助于减少过拟合并提高分类性能,在隐藏测试集上的总体得分为 80.1%(79.6%的灵敏度和 80.6%的特异性)。