Saeed Shaheer U, Ramalhinho João, Pinnock Mark, Shen Ziyi, Fu Yunguan, Montaña-Brown Nina, Bonmati Ester, Barratt Dean C, Pereira Stephen P, Davidson Brian, Clarkson Matthew J, Hu Yipeng
Centre for Medical Image Computing, Wellcome/EPSRC Centre for Interventional & Surgical Sciences, and Department of Medical Physics & Biomedical Engineering, University College London, London, UK.
Centre for Medical Image Computing, Wellcome/EPSRC Centre for Interventional & Surgical Sciences, and Department of Medical Physics & Biomedical Engineering, University College London, London, UK.
Med Image Anal. 2024 Jul;95:103181. doi: 10.1016/j.media.2024.103181. Epub 2024 Apr 16.
Supervised machine learning-based medical image computing applications necessitate expert label curation, while unlabelled image data might be relatively abundant. Active learning methods aim to prioritise a subset of available image data for expert annotation, for label-efficient model training. We develop a controller neural network that measures priority of images in a sequence of batches, as in batch-mode active learning, for multi-class segmentation tasks. The controller is optimised by rewarding positive task-specific performance gain, within a Markov decision process (MDP) environment that also optimises the task predictor. In this work, the task predictor is a segmentation network. A meta-reinforcement learning algorithm is proposed with multiple MDPs, such that the pre-trained controller can be adapted to a new MDP that contains data from different institutes and/or requires segmentation of different organs or structures within the abdomen. We present experimental results using multiple CT datasets from more than one thousand patients, with segmentation tasks of nine different abdominal organs, to demonstrate the efficacy of the learnt prioritisation controller function and its cross-institute and cross-organ adaptability. We show that the proposed adaptable prioritisation metric yields converging segmentation accuracy for a new kidney segmentation task, unseen in training, using between approximately 40% to 60% of labels otherwise required with other heuristic or random prioritisation metrics. For clinical datasets of limited size, the proposed adaptable prioritisation offers a performance improvement of 22.6% and 10.2% in Dice score, for tasks of kidney and liver vessel segmentation, respectively, compared to random prioritisation and alternative active sampling strategies.
基于监督式机器学习的医学图像计算应用需要专家进行标签整理,而未标记的图像数据可能相对丰富。主动学习方法旨在对可用图像数据的子集进行优先级排序,以便进行专家标注,从而实现高效的标签模型训练。我们开发了一种控制器神经网络,用于测量批次序列中图像的优先级,就像在批处理模式的主动学习中一样,用于多类分割任务。在一个马尔可夫决策过程(MDP)环境中,通过奖励特定任务的积极性能提升来优化控制器,该环境同时也对任务预测器进行优化。在这项工作中,任务预测器是一个分割网络。提出了一种具有多个MDP的元强化学习算法,使得预训练的控制器能够适应一个新的MDP,该MDP包含来自不同机构的数据和/或需要对腹部内不同器官或结构进行分割。我们使用来自一千多名患者的多个CT数据集进行了实验,这些数据集涉及九个不同腹部器官的分割任务,以证明学习到的优先级控制器功能及其跨机构和跨器官的适应性的有效性。我们表明,对于训练中未见过的新肾脏分割任务,所提出的可适应优先级度量使用大约40%至60%的标签就能产生收敛的分割精度,而使用其他启发式或随机优先级度量则需要更多标签。对于规模有限的临床数据集,与随机优先级划分和其他主动采样策略相比,所提出的可适应优先级划分在肾脏和肝脏血管分割任务的Dice分数上分别提高了22.6%和10.2%。