IEEE J Biomed Health Inform. 2021 May;25(5):1519-1528. doi: 10.1109/JBHI.2020.3022989. Epub 2021 May 11.
Electrocardiography (ECG) is a very common, non-invasive diagnostic procedure and its interpretation is increasingly supported by algorithms. The progress in the field of automatic ECG analysis has up to now been hampered by a lack of appropriate datasets for training as well as a lack of well-defined evaluation procedures to ensure comparability of different algorithms. To alleviate these issues, we put forward first benchmarking results for the recently published, freely accessible clinical 12-lead ECG dataset PTB-XL, covering a variety of tasks from different ECG statement prediction tasks to age and sex prediction. Among the investigated deep-learning-based timeseries classification algorithms, we find that convolutional neural networks, in particular resnet- and inception-based architectures, show the strongest performance across all tasks. We find consistent results on the ICBEB2018 challenge ECG dataset and discuss prospects of transfer learning using classifiers pretrained on PTB-XL. These benchmarking results are complemented by deeper insights into the classification algorithm in terms of hidden stratification, model uncertainty and an exploratory interpretability analysis, which provide connecting points for future research on the dataset. Our results emphasize the prospects of deep-learning-based algorithms in the field of ECG analysis, not only in terms of quantitative accuracy but also in terms of clinically equally important further quality metrics such as uncertainty quantification and interpretability. With this resource, we aim to establish the PTB-XL dataset as a resource for structured benchmarking of ECG analysis algorithms and encourage other researchers in the field to join these efforts.
心电图(ECG)是一种非常常见的非侵入性诊断程序,其解释越来越依赖于算法。自动心电图分析领域的进展迄今为止一直受到缺乏适当的训练数据集以及缺乏明确定义的评估程序来确保不同算法的可比性的限制。为了解决这些问题,我们提出了最近发布的、免费提供的临床 12 导联 ECG 数据集 PTB-XL 的基准测试结果,涵盖了从不同的心电图声明预测任务到年龄和性别预测的各种任务。在所研究的基于深度学习的时间序列分类算法中,我们发现卷积神经网络,特别是基于 resnet 和 inception 的架构,在所有任务中表现出最强的性能。我们在 ICBEB2018 挑战 ECG 数据集上得到了一致的结果,并讨论了使用在 PTB-XL 上预训练的分类器进行迁移学习的前景。这些基准测试结果通过对分类算法的深入了解得到补充,包括隐藏分层、模型不确定性和探索性可解释性分析,这些为未来对数据集的研究提供了联系点。我们的结果强调了基于深度学习的算法在心电图分析领域的前景,不仅在定量准确性方面,而且在临床同等重要的进一步质量指标方面,如不确定性量化和可解释性。通过这个资源,我们旨在将 PTB-XL 数据集确立为 ECG 分析算法结构化基准测试的资源,并鼓励该领域的其他研究人员加入这些努力。