IEEE Trans Med Imaging. 2019 Nov;38(11):2642-2653. doi: 10.1109/TMI.2019.2907805. Epub 2019 Mar 27.
Deep convolutional neural networks (CNN) have recently achieved superior performance at the task of medical image segmentation compared to classic models. However, training a generalizable CNN requires a large amount of training data, which is difficult, expensive, and time-consuming to obtain in medical settings. Active Learning (AL) algorithms can facilitate training CNN models by proposing a small number of the most informative data samples to be annotated to achieve a rapid increase in performance. We proposed a new active learning method based on Fisher information (FI) for CNNs for the first time. Using efficient backpropagation methods for computing gradients together with a novel low-dimensional approximation of FI enabled us to compute FI for CNNs with a large number of parameters. We evaluated the proposed method for brain extraction with a patch-wise segmentation CNN model in two different learning scenarios: universal active learning and active semi-automatic segmentation. In both scenarios, an initial model was obtained using labeled training subjects of a source data set and the goal was to annotate a small subset of new samples to build a model that performs well on the target subject(s). The target data sets included images that differed from the source data by either age group (e.g. newborns with different image contrast) or underlying pathology that was not available in the source data. In comparison to several recently proposed AL methods and brain extraction baselines, the results showed that FI-based AL outperformed the competing methods in improving the performance of the model after labeling a very small portion of target data set (<0.25%).
深度卷积神经网络(CNN)在医学图像分割任务上的表现优于经典模型。然而,训练一个可泛化的 CNN 需要大量的训练数据,而在医学环境中获取这些数据既困难、昂贵又耗时。主动学习(AL)算法可以通过提出少量最具信息量的数据样本进行标注来加速训练 CNN 模型,从而快速提高性能。我们首次提出了一种基于 Fisher 信息(FI)的 CNN 主动学习方法。我们使用高效的反向传播方法来计算梯度,同时对 FI 进行新颖的低维近似,使我们能够对具有大量参数的 CNN 计算 FI。我们在两种不同的学习场景中,使用基于斑块的分割 CNN 模型来评估该方法对脑提取的性能:通用主动学习和主动半自动分割。在这两种情况下,初始模型都是使用源数据集的有标签训练对象获得的,目标是标注一小部分新样本,以构建一个在目标对象上表现良好的模型。目标数据集包括与源数据集在年龄组(例如具有不同图像对比度的新生儿)或源数据中不存在的潜在病理学方面不同的图像。与最近提出的几种 AL 方法和脑提取基线相比,结果表明,在标注目标数据集的很小一部分(<0.25%)后,基于 FI 的 AL 在提高模型性能方面优于竞争方法。