Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom; Department of Radiology, University of Cambridge, Cambridge, United Kingdom; Department for Diagnostic and Interventional Radiology and Nuclear Medicine, University Hospital Hamburg-Eppendorf, Hamburg, Germany; Jung diagnostics GmbH, Hamburg, Germany.
Department of Radiology, University of Cambridge, Cambridge, United Kingdom; Cancer Research UK Cambridge Centre, University of Cambridge, Cambridge, United Kingdom.
Comput Biol Med. 2023 Sep;163:107096. doi: 10.1016/j.compbiomed.2023.107096. Epub 2023 Jun 1.
Uncertainty quantification in automated image analysis is highly desired in many applications. Typically, machine learning models in classification or segmentation are only developed to provide binary answers; however, quantifying the uncertainty of the models can play a critical role for example in active learning or machine human interaction. Uncertainty quantification is especially difficult when using deep learning-based models, which are the state-of-the-art in many imaging applications. The current uncertainty quantification approaches do not scale well in high-dimensional real-world problems. Scalable solutions often rely on classical techniques, such as dropout, during inference or training ensembles of identical models with different random seeds to obtain a posterior distribution. In this paper, we present the following contributions. First, we show that the classical approaches fail to approximate the classification probability. Second, we propose a scalable and intuitive framework for uncertainty quantification in medical image segmentation that yields measurements that approximate the classification probability. Third, we suggest the usage of k-fold cross-validation to overcome the need for held out calibration data. Lastly, we motivate the adoption of our method in active learning, creating pseudo-labels to learn from unlabeled images and human-machine collaboration.
不确定性量化在许多应用中是非常需要的。通常,分类或分割中的机器学习模型仅被开发用于提供二进制答案;然而,量化模型的不确定性对于主动学习或人机交互等方面可能起着关键作用。当使用基于深度学习的模型时,不确定性量化尤其困难,这些模型是许多成像应用中的最新技术。当前的不确定性量化方法在高维现实问题中不能很好地扩展。可扩展的解决方案通常依赖于经典技术,例如在推理过程中使用随机失活,或者使用不同随机种子的相同模型的训练集合来获得后验分布。在本文中,我们提出了以下贡献。首先,我们表明经典方法无法逼近分类概率。其次,我们提出了一种用于医学图像分割的可扩展和直观的不确定性量化框架,该框架产生的度量结果可以逼近分类概率。第三,我们建议使用 k 折交叉验证来克服对保留校准数据的需求。最后,我们提出了在主动学习中采用我们的方法的动机,通过使用未标记的图像和人机协作来学习创建伪标签。