Sivasubramanian Arrun, Sasidharan Divya, Sowmya V, Ravi Vinayakumar
Amrita School of Artificial Intelligence, Amrita Vishwa Vidyapeetham, Coimbatore, India.
Center for Artificial Intelligence, Prince Mohammed Bin Fahd University, Khobar, Saudi Arabia.
Phys Eng Sci Med. 2025 May 28. doi: 10.1007/s13246-025-01566-6.
Ultrasound fetal imaging is beneficial to support prenatal development because it is affordable and non-intrusive. Nevertheless, fetal plane classification (FPC) remains challenging and time-consuming for obstetricians since it depends on nuanced clinical aspects, which increases the difficulty in identifying relevant features of the fetal anatomy. Thus, to assist with its accurate feature extraction, a lightweight artificial intelligence architecture leveraging convolutional neural networks and attention mechanisms is proposed to classify the largest benchmark ultrasound dataset. The approach fine-tunes from lightweight EfficientNet feature extraction backbones pre-trained on the ImageNet1k. to classify key fetal planes such as the brain, femur, thorax, cervix, and abdomen. Our methodology incorporates the attention mechanism to refine features and 3-layer perceptrons for classification, achieving superior performance with the highest Top-1 accuracy of 96.25%, Top-2 accuracy of 99.80% and F1-Score of 0.9576. Importantly, the model has 40x fewer trainable parameters than existing benchmark ensemble or transformer pipelines, facilitating easy deployment on edge devices to help clinical practitioners with real-time FPC. The findings are also interpreted using GradCAM to carry out clinical correlation to aid doctors with diagnostics and improve treatment plans for expectant mothers.
超声胎儿成像有助于支持产前发育,因为它价格实惠且非侵入性。然而,胎儿平面分类(FPC)对产科医生来说仍然具有挑战性且耗时,因为它依赖于细微的临床特征,这增加了识别胎儿解剖结构相关特征的难度。因此,为了协助进行准确的特征提取,提出了一种利用卷积神经网络和注意力机制的轻量级人工智能架构,用于对最大的基准超声数据集进行分类。该方法从在ImageNet1k上预训练的轻量级EfficientNet特征提取主干进行微调,以对大脑、股骨、胸部、宫颈和腹部等关键胎儿平面进行分类。我们的方法结合了注意力机制来细化特征,并使用3层感知器进行分类,实现了卓越的性能,最高Top-1准确率为96.25%,Top-2准确率为99.80%,F1分数为0.9576。重要的是,该模型的可训练参数比现有的基准集成或变压器管道少40倍,便于在边缘设备上轻松部署,以帮助临床医生进行实时FPC。研究结果还使用GradCAM进行解释,以进行临床关联,帮助医生进行诊断并改善对孕妇的治疗方案。