Singstad Bjørn-Jostein, Muten Eraraya Morenzo
Medical Technology and E-health, Akershus University Hospital, Lørenskog, Norway.
Department of Endocrinology, Obesity and Nutrition, Vestfold Hospital Trust, Tønsberg, Norway.
Cardiovasc Eng Technol. 2025 Sep 17. doi: 10.1007/s13239-025-00797-8.
The electrocardiogram (ECG) is an almost universally accessible diagnostic tool for heart disease. An ECG is measured by using an electrocardiograph, and today's electrocardiographs use built-in software to interpret the ECGs automatically after they are recorded. However, these algorithms exhibit limited performance, and therefore, clinicians usually have to manually interpret the ECG, regardless of whether an algorithm has interpreted it or not. Manual interpretation of the ECG can be time-consuming and requires specific skills. Therefore, better algorithms are clearly needed to make correct ECG interpretations more accessible and time-efficient. Algorithms based on artificial intelligence (AI) have demonstrated promising performance in various fields, including ECG interpretation, over the past few years and may represent an alternative to manual ECG interpretation by doctors.
We trained and validated a convolutional neural network with an Inception architecture on a dataset with 88253 12-lead ECGs, and classified 30 of the most frequent annotated cardiac conditions in the dataset. We assessed two different loss functions and different ECG sampling rates and the best-performing model used double soft F1-loss and ECGs downsampled to 75Hz. This model achieved an F1-score of , accuracy , and an AUROC score of . An aggregated saliency map, showing the global importance of all 12 ECG leads for the 30 cardiac conditions, was generated using Local Interpretable Model-Agnostic Explanations (LIME). The global saliency map showed that the Inception model paid the most attention to the limb leads and the augmented leads and less importance to the precordial leads.
One of the more significant contributions that emerge from this study is the use of aggregated saliency maps to obtain global ECG lead importance for different cardiac conditions. In addition, we emphasized the relevance of evaluating different loss functions, and in this specific case, we found double soft F1-loss to be slightly better than binary cross-entropy (BCE). Finally, we found it somewhat surprising that drastic downsampling of the ECG led to higher performance than higher sampling frequencies, such as 500Hz. These findings contribute in several ways to our understanding of the artificial intelligence-based interpretation of ECGs, but further studies should be carried out to validate these findings in other datasets from other patient cohorts.
心电图(ECG)是一种几乎普遍可用的心脏病诊断工具。心电图通过心电图仪进行测量,如今的心电图仪使用内置软件在记录心电图后自动进行解读。然而,这些算法的性能有限,因此,无论算法是否已进行解读,临床医生通常都必须手动解读心电图。手动解读心电图可能耗时且需要特定技能。因此,显然需要更好的算法,以使正确的心电图解读更容易获得且更高效。在过去几年中,基于人工智能(AI)的算法在包括心电图解读在内的各个领域都展现出了有前景的性能,可能成为医生手动解读心电图的一种替代方法。
我们在一个包含88253份12导联心电图的数据集上训练并验证了一个具有Inception架构的卷积神经网络,并对数据集中30种最常见的标注心脏疾病进行了分类。我们评估了两种不同的损失函数以及不同的心电图采样率,表现最佳的模型使用了双软F1损失函数,并将心电图下采样至75Hz。该模型的F1分数为 ,准确率为 ,曲线下面积(AUROC)分数为 。使用局部可解释模型无关解释(LIME)生成了一个聚合显著性图,展示了所有12个心电图导联对30种心脏疾病的全局重要性。全局显著性图表明,Inception模型对肢体导联和增强导联最为关注,而对胸前导联的重要性较低。
本研究中出现的较为显著的贡献之一是使用聚合显著性图来获取不同心脏疾病的全局心电图导联重要性。此外,我们强调了评估不同损失函数的相关性,在这种特定情况下,我们发现双软F1损失函数略优于二元交叉熵(BCE)。最后,我们发现有点令人惊讶的是,心电图的大幅下采样比更高的采样频率(如500Hz)能带来更高的性能。这些发现从多个方面有助于我们理解基于人工智能的心电图解读,但应开展进一步研究以在来自其他患者队列的其他数据集中验证这些发现。