School of Computer Science, University College Dublin, Dublin, D04 V1W8, Ireland.
Comput Biol Med. 2024 Jun;176:108585. doi: 10.1016/j.compbiomed.2024.108585. Epub 2024 May 12.
Active learning (AL) attempts to select informative samples in a dataset to minimize the number of required labels while maximizing the performance of the model. Current AL in segmentation tasks is limited to the expansion of popular classification-based methods including entropy, MC-dropout, etc. Meanwhile, most applications in the medical field are simply migrations that fail to consider the nature of medical images, such as high class imbalance, high domain difference, and data scarcity. In this study, we address these challenges and propose a novel AL framework for medical image segmentation task. Our approach introduces a pseudo-label-based filter addressing excessive blank patches in medical abnormalities segmentation tasks, e.g., lesions, and tumors, used before the AL selection. This filter helps reduce resource usage and allows the model to focus on selecting more informative samples. For the sample selection, we propose a novel query strategy that combines both model impact and data stability by employing adversarial attack. Furthermore, we harness the adversarial samples generated during the query process to enhance the robustness of the model. The experimental results verify our framework's effectiveness over various state-of-the-art methods. Our proposed method only needs less than 14% annotated patches in 3D brain MRI multiple sclerosis (MS) segmentation tasks and 20% for Low-Grade Glioma (LGG) tumor segmentation to achieve competitive results with full supervision. These promising outcomes not only improve performance but alleviate the time burden associated with expert annotation, thereby facilitating further advancements in the field of medical image segmentation. Our code is available at https://github.com/HelenMa9998/adversarial_active_learning.
主动学习 (AL) 试图在数据集中选择有信息的样本,以在最小化所需标签数量的同时最大限度地提高模型性能。目前,分割任务中的 AL 仅限于对基于分类的流行方法的扩展,包括熵、MC-dropout 等。同时,医学领域的大多数应用程序只是简单的迁移,未能考虑到医学图像的性质,例如高类别不平衡、高域差异和数据稀缺。在这项研究中,我们解决了这些挑战,并提出了一种用于医学图像分割任务的新的 AL 框架。我们的方法引入了一种基于伪标签的滤波器,用于解决医学异常分割任务中的过度空白斑块问题,例如病变和肿瘤,在 AL 选择之前使用该滤波器。这有助于减少资源的使用,并使模型能够专注于选择更有信息的样本。对于样本选择,我们提出了一种新的查询策略,通过使用对抗攻击来结合模型影响和数据稳定性。此外,我们利用查询过程中生成的对抗样本来增强模型的鲁棒性。实验结果验证了我们的框架在各种最先进的方法中的有效性。我们提出的方法在 3D 脑 MRI 多发性硬化症 (MS) 分割任务中仅需要少于 14%的标注补丁,在低级别胶质瘤 (LGG) 肿瘤分割中需要 20%的标注补丁,就可以达到与全监督相当的竞争结果。这些有希望的结果不仅提高了性能,而且减轻了专家标注的时间负担,从而促进了医学图像分割领域的进一步发展。我们的代码可在 https://github.com/HelenMa9998/adversarial_active_learning 上获得。