Yanar Erdem, Hardalaç Fırat, Ayturan Kubilay
Department of Healthcare Systems System Engineering, ASELSAN, 06200 Ankara, Turkey.
Department of Electrical and Electronics Engineering, Gazi University, 06570 Ankara, Turkey.
Diagnostics (Basel). 2025 Jun 25;15(13):1602. doi: 10.3390/diagnostics15131602.
Cardiomegaly-defined as the abnormal enlargement of the heart-is a key radiological indicator of various cardiovascular conditions. Early detection is vital for initiating timely clinical intervention and improving patient outcomes. This study investigates the application of deep learning techniques for the automated diagnosis of cardiomegaly from chest X-ray (CXR) images, utilizing both convolutional neural networks (CNNs) and Vision Transformers (ViTs). We assembled one of the largest and most diverse CXR datasets to date, combining posteroanterior (PA) images from PadChest, NIH CXR, VinDr-CXR, and CheXpert. Multiple pre-trained CNN architectures (VGG16, ResNet50, InceptionV3, DenseNet121, DenseNet201, and AlexNet), as well as Vision Transformer models, were trained and compared. In addition, we introduced a novel stacking-based ensemble model-Combined Ensemble Learning Model (CELM)-that integrates complementary CNN features via a meta-classifier. The CELM achieved the highest diagnostic performance, with a test accuracy of 92%, precision of 99%, recall of 89%, F1-score of 0.94, specificity of 92.0%, and AUC of 0.90. These results highlight the model's high agreement with expert annotations and its potential for reliable clinical use. Notably, Vision Transformers offered competitive performance, suggesting their value as complementary tools alongside CNNs. With further validation, the proposed CELM framework may serve as an efficient and scalable decision-support tool for cardiomegaly screening, particularly in resource-limited settings such as intensive care units (ICUs) and emergency departments (EDs), where rapid and accurate diagnosis is imperative.
心脏肥大(定义为心脏异常增大)是各种心血管疾病的关键影像学指标。早期检测对于及时启动临床干预和改善患者预后至关重要。本研究调查了深度学习技术在利用胸部X光(CXR)图像自动诊断心脏肥大方面的应用,同时使用了卷积神经网络(CNN)和视觉Transformer(ViT)。我们组装了迄今为止最大且最多样化的CXR数据集,结合了来自PadChest、NIH CXR、VinDr-CXR和CheXpert的后前位(PA)图像。对多个预训练的CNN架构(VGG16、ResNet50、InceptionV3、DenseNet121、DenseNet201和AlexNet)以及视觉Transformer模型进行了训练和比较。此外,我们引入了一种新颖的基于堆叠的集成模型——组合集成学习模型(CELM),该模型通过元分类器整合互补的CNN特征。CELM实现了最高的诊断性能,测试准确率为92%,精确率为99%,召回率为89%,F1分数为0.94,特异性为92.0%,AUC为0.90。这些结果突出了该模型与专家标注的高度一致性及其在临床可靠应用的潜力。值得注意的是,视觉Transformer表现出具有竞争力的性能,表明它们作为CNN辅助工具的价值。经过进一步验证,所提出的CELM框架可作为心脏肥大筛查的高效且可扩展的决策支持工具,特别是在重症监护病房(ICU)和急诊科(ED)等资源有限的环境中,快速准确的诊断至关重要。