Yoon Taeyoung, Kang Daesung
School of Bio-Health Convergence, College of Natural Sciences, Sungshin Women's University Woonjung Green Campus, Seoul, Republic of Korea.
School of AI Convergence, Sungshin Women's University, Seoul, Republic of Korea.
Sci Rep. 2025 Jul 8;15(1):24444. doi: 10.1038/s41598-025-10773-w.
Cardiovascular diseases (CVDs) are the leading cause of mortality worldwide, emphasizing the need for accurate and early diagnosis. Electrocardiograms (ECG) provide a non-invasive means of diagnosing various cardiac conditions. However, traditional methods of interpreting ECG signals require substantial expertise and time, motivating the development of automated deep learning models to enhance diagnostic precision. This study proposes a novel approach that leverages masked autoencoders (MAE) to pretrain a model on ECG scalogram images, thereby enhancing the diagnostic accuracy for seven CVDs. Through extensive experimentation, we demonstrated that pretraining with an 85% masking ratio over 500 epochs yields optimal results. The pretrained ViT-S(MAE-scalo) network demonstrated remarkable performance in detecting CVDs, achieving an AUC of 0.986 and 92.43% accuracy in Lead II. Furthermore, the ensemble learning approach applied across 12 ECG leads enhanced the model's diagnostic capabilities, resulting in an AUC of 0.994 and 92.72% accuracy. The MAE-based models outperformed traditional models such as ResNet-34 and ViT-S pretrained on ImageNet or random weights, as well as other SSL models such as MoCo-v2 and BYOL. Notably, the MAE-based models demonstrated superior performance even with a significantly smaller dataset, using only 1/12th the size of the ImageNet dataset. These findings suggest that this efficient pretraining approach for deep learning models holds great potential for clinical application, particularly in resource-limited environments where labeled data is scarce. This method provides a scalable and cost-effective solution for improving CVD diagnosis.
心血管疾病(CVDs)是全球范围内主要的死亡原因,这凸显了准确和早期诊断的必要性。心电图(ECG)提供了一种诊断各种心脏疾病的非侵入性方法。然而,传统的心电图信号解读方法需要大量专业知识和时间,这促使了自动化深度学习模型的发展以提高诊断精度。本研究提出了一种新颖的方法,利用掩码自动编码器(MAE)在心电图频谱图图像上对模型进行预训练,从而提高对七种心血管疾病的诊断准确性。通过广泛的实验,我们证明在500个轮次上使用85%的掩码率进行预训练可产生最佳结果。预训练的ViT-S(MAE-scalo)网络在检测心血管疾病方面表现出色,在II导联中实现了0.986的AUC和92.43%的准确率。此外,跨12个心电图导联应用的集成学习方法增强了模型的诊断能力,产生了0.994的AUC和92.72%的准确率。基于MAE的模型优于传统模型,如在ImageNet上预训练或使用随机权重的ResNet-34和ViT-S,以及其他自监督学习模型,如MoCo-v2和BYOL。值得注意的是,即使使用显著更小的数据集,仅使用ImageNet数据集大小的1/12,基于MAE的模型仍表现出卓越的性能。这些发现表明,这种用于深度学习模型的高效预训练方法在临床应用中具有巨大潜力,特别是在标记数据稀缺的资源有限环境中。该方法为改善心血管疾病诊断提供了一种可扩展且具有成本效益的解决方案。