IEEE J Biomed Health Inform. 2019 Mar;23(2):642-649. doi: 10.1109/JBHI.2018.2841197. Epub 2018 May 28.
Heart sounds are difficult to interpret due to events with very short temporal onset between them (tens of milliseconds) and dominant frequencies that are out of the human audible spectrum. Computer-assisted decision systems may help but they require robust signal processing algorithms. In this paper, we propose a new algorithm for heart sound segmentation using a hidden semi-Markov model. The proposed algorithm infers more suitable sojourn time parameters than those currently suggested by the state of the art, through a maximum likelihood approach. We test our approach over three different datasets, including the publicly available PhysioNet and Pascal datasets. We also release a pediatric dataset composed of 29 heart sounds. In contrast with any other dataset available online, the annotations of the heart sounds in the released dataset contain information about the beginning and the ending of each heart sound event. Annotations were made by two cardiopulmonologists. The proposed algorithm is compared with the current state of the art. The results show a significant increase in segmentation performance, regardless the dataset or the methodology presented. For example, when using the PhysioNet dataset to train and to evaluate the HSMMs, our algorithm achieved average an F-score of [Formula: see text] compared to [Formula: see text] achieved by the algorithm described in [D.B. Springer, L. Tarassenko, and G. D. Clifford, "Logistic regressionHSMM-based heart sound segmentation," IEEE Transactions on Biomedical Engineering, vol. 63, no. 4, pp. 822-832, 2016]. In this sense, the proposed approach to adapt sojourn time parameters represents an effective solution for heart sound segmentation problems, even when the training data does not perfectly express the variability of the testing data.
心音由于其在时间上非常短(数十毫秒),且其主频超出人耳可听范围,因此难以解释。计算机辅助决策系统可能会有所帮助,但它们需要稳健的信号处理算法。在本文中,我们提出了一种使用隐半马尔可夫模型进行心音分割的新算法。该算法通过最大似然方法推断出比当前最先进技术更适合的逗留时间参数。我们在三个不同的数据集上测试了我们的方法,包括公开的 PhysioNet 和 Pascal 数据集。我们还发布了一个由 29 个心音组成的儿科数据集。与在线上提供的任何其他数据集不同,发布的数据集中的心音注释包含每个心音事件开始和结束的信息。注释由两名心肺专家完成。我们将提出的算法与当前的最先进技术进行了比较。结果表明,无论数据集或所提出的方法如何,分割性能都有显著提高。例如,当使用 PhysioNet 数据集进行训练和评估 HSMM 时,我们的算法在 F 分数方面平均达到了[Formula: see text],而在[D.B. Springer、L. Tarassenko 和 G. D. Clifford 中描述的算法仅达到了[Formula: see text]。在这种情况下,适应逗留时间参数的方法代表了心音分割问题的有效解决方案,即使训练数据不能完全表达测试数据的可变性。