Alsaleh Aqilah M, Albalawi Eid, Algosaibi Abdulelah, Albakheet Salman S, Khan Surbhi Bhatia
College of Computer Science and Information Technology, King Faisal University, Al Hofuf 400-31982, AlAhsa, Saudi Arabia.
Department of Information Technology, AlAhsa Health Cluster, Al Hofuf 3158-36421, AlAhsa, Saudi Arabia.
Diagnostics (Basel). 2024 Jun 7;14(12):1213. doi: 10.3390/diagnostics14121213.
Deep learning has attained state-of-the-art results in general image segmentation problems; however, it requires a substantial number of annotated images to achieve the desired outcomes. In the medical field, the availability of annotated images is often limited. To address this challenge, few-shot learning techniques have been successfully adapted to rapidly generalize to new tasks with only a few samples, leveraging prior knowledge. In this paper, we employ a gradient-based method known as Model-Agnostic Meta-Learning (MAML) for medical image segmentation. MAML is a meta-learning algorithm that quickly adapts to new tasks by updating a model's parameters based on a limited set of training samples. Additionally, we use an enhanced 3D U-Net as the foundational network for our models. The enhanced 3D U-Net is a convolutional neural network specifically designed for medical image segmentation. We evaluate our approach on the TotalSegmentator dataset, considering a few annotated images for four tasks: liver, spleen, right kidney, and left kidney. The results demonstrate that our approach facilitates rapid adaptation to new tasks using only a few annotated images. In 10-shot settings, our approach achieved mean dice coefficients of 93.70%, 85.98%, 81.20%, and 89.58% for liver, spleen, right kidney, and left kidney segmentation, respectively. In five-shot sittings, the approach attained mean Dice coefficients of 90.27%, 83.89%, 77.53%, and 87.01% for liver, spleen, right kidney, and left kidney segmentation, respectively. Finally, we assess the effectiveness of our proposed approach on a dataset collected from a local hospital. Employing five-shot sittings, we achieve mean Dice coefficients of 90.62%, 79.86%, 79.87%, and 78.21% for liver, spleen, right kidney, and left kidney segmentation, respectively.
深度学习在一般图像分割问题上已取得了最先进的成果;然而,它需要大量带注释的图像才能达到预期效果。在医学领域,带注释图像的可用性往往有限。为应对这一挑战,少样本学习技术已成功得到应用,借助先验知识,仅用少量样本就能快速推广到新任务。在本文中,我们采用一种基于梯度的方法,即模型无关元学习(MAML)来进行医学图像分割。MAML是一种元学习算法,通过基于有限的训练样本集更新模型参数,能快速适应新任务。此外,我们使用增强版的3D U-Net作为模型的基础网络。增强版的3D U-Net是专门为医学图像分割设计的卷积神经网络。我们在TotalSegmentator数据集上评估我们的方法,针对肝脏、脾脏、右肾和左肾这四项任务考虑少量带注释的图像。结果表明,我们的方法仅使用少量带注释的图像就能促进对新任务的快速适应。在10次采样设置下,我们的方法在肝脏、脾脏、右肾和左肾分割任务中分别取得了93.70%、85.98%、81.20%和89.58%的平均骰子系数。在5次采样设置下,该方法在肝脏、脾脏、右肾和左肾分割任务中分别取得了90.27%、83.89%、77.53%和87.01%的平均骰子系数。最后,我们在从当地一家医院收集的数据集上评估我们提出的方法的有效性。采用5次采样设置,我们在肝脏、脾脏、右肾和左肾分割任务中分别取得了90.62%、79.86%、79.87%和78.21%的平均骰子系数。