Qian Kun, Bao Zhihao, Zhao Zhonghao, Koike Tomoya, Dong Fengquan, Schmitt Maximilian, Dong Qunxi, Shen Jian, Jiang Weipeng, Jiang Yajuan, Dong Bo, Dai Zhenyu, Hu Bin, Schuller Björn W, Yamamoto Yoshiharu
Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing 100081, China.
School of Medical Technology, Beijing Institute of Technology, Beijing 100081, China.
Cyborg Bionic Syst. 2024 Mar 4;5:0075. doi: 10.34133/cbsystems.0075. eCollection 2024.
Leveraging the power of artificial intelligence to facilitate an automatic analysis and monitoring of heart sounds has increasingly attracted tremendous efforts in the past decade. Nevertheless, lacking on standard open-access database made it difficult to maintain a sustainable and comparable research before the first release of the PhysioNet CinC Challenge Dataset. However, inconsistent standards on data collection, annotation, and partition are still restraining a fair and efficient comparison between different works. To this line, we introduced and benchmarked a first version of the Heart Sounds Shenzhen (HSS) corpus. Motivated and inspired by the previous works based on HSS, we redefined the tasks and make a comprehensive investigation on shallow and deep models in this study. First, we segmented the heart sound recording into shorter recordings (10 s), which makes it more similar to the human auscultation case. Second, we redefined the classification tasks. Besides using the 3 class categories (normal, moderate, and mild/severe) adopted in HSS, we added a binary classification task in this study, i.e., normal and abnormal. In this work, we provided detailed benchmarks based on both the classic machine learning and the state-of-the-art deep learning technologies, which are reproducible by using open-source toolkits. Last but not least, we analyzed the feature contributions of best performance achieved by the benchmark to make the results more convincing and interpretable.
在过去十年中,利用人工智能的力量来促进心音的自动分析和监测越来越受到广泛关注。然而,在PhysioNet CinC挑战数据集首次发布之前,缺乏标准的开放获取数据库使得难以开展可持续且具有可比性的研究。此外,数据收集、标注和划分方面的不一致标准仍然限制着不同研究成果之间进行公平有效的比较。为此,我们引入并对首个版本的深圳心音(HSS)语料库进行了基准测试。受基于HSS的先前研究启发,我们在本研究中重新定义了任务,并对浅层和深层模型进行了全面研究。首先,我们将心音记录分割成较短的记录(10秒),使其更类似于人工听诊情况。其次,我们重新定义了分类任务。除了使用HSS中采用的3个类别(正常、中度和轻度/重度)之外,我们在本研究中还增加了一个二元分类任务,即正常和异常。在这项工作中,我们基于经典机器学习和最先进的深度学习技术提供了详细的基准测试,这些测试可通过使用开源工具包进行重现。最后但同样重要的是,我们分析了基准测试中取得最佳性能的特征贡献,以使结果更具说服力和可解释性。