Department for Translational Surgical Oncology, National Center for Tumor Diseases (NCT), Partner Site Dresden, Dresden, Germany.
Department of General, Visceral and Transplant Surgery, University of Heidelberg, Heidelberg, Germany.
Int J Comput Assist Radiol Surg. 2019 Jun;14(6):1079-1087. doi: 10.1007/s11548-019-01963-9. Epub 2019 Apr 9.
For many applications in the field of computer-assisted surgery, such as providing the position of a tumor, specifying the most probable tool required next by the surgeon or determining the remaining duration of surgery, methods for surgical workflow analysis are a prerequisite. Often machine learning-based approaches serve as basis for analyzing the surgical workflow. In general, machine learning algorithms, such as convolutional neural networks (CNN), require large amounts of labeled data. While data is often available in abundance, many tasks in surgical workflow analysis need annotations by domain experts, making it difficult to obtain a sufficient amount of annotations.
The aim of using active learning to train a machine learning model is to reduce the annotation effort. Active learning methods determine which unlabeled data points would provide the most information according to some metric, such as prediction uncertainty. Experts will then be asked to only annotate these data points. The model is then retrained with the new data and used to select further data for annotation. Recently, active learning has been applied to CNN by means of deep Bayesian networks (DBN). These networks make it possible to assign uncertainties to predictions. In this paper, we present a DBN-based active learning approach adapted for image-based surgical workflow analysis task. Furthermore, by using a recurrent architecture, we extend this network to video-based surgical workflow analysis. To decide which data points should be labeled next, we explore and compare different metrics for expressing uncertainty.
We evaluate these approaches and compare different metrics on the Cholec80 dataset by performing instrument presence detection and surgical phase segmentation. Here we are able to show that using a DBN-based active learning approach for selecting what data points to annotate next can significantly outperform a baseline based on randomly selecting data points. In particular, metrics such as entropy and variation ratio perform consistently on the different tasks.
We show that using DBN-based active learning strategies make it possible to selectively annotate data, thereby reducing the required amount of labeled training in surgical workflow-related tasks.
对于计算机辅助手术领域的许多应用,例如提供肿瘤的位置、指定外科医生下一个最可能需要的工具或确定手术的剩余持续时间,手术流程分析方法是前提。通常,基于机器学习的方法是分析手术流程的基础。一般来说,机器学习算法,如卷积神经网络(CNN),需要大量标记数据。虽然数据通常很丰富,但手术流程分析中的许多任务都需要领域专家进行注释,因此很难获得足够数量的注释。
使用主动学习来训练机器学习模型的目的是减少注释工作。主动学习方法根据某些指标(例如预测不确定性)确定哪些未标记的数据点将提供最多信息。然后,专家将被要求仅注释这些数据点。然后,使用新数据重新训练模型,并使用该模型选择进一步进行注释的数据。最近,主动学习已通过深度贝叶斯网络(DBN)应用于 CNN。这些网络可以为预测分配不确定性。在本文中,我们提出了一种基于 DBN 的主动学习方法,适用于基于图像的手术流程分析任务。此外,通过使用递归架构,我们将该网络扩展到基于视频的手术流程分析。为了决定接下来应标记哪些数据点,我们探索并比较了用于表达不确定性的不同指标。
我们通过执行器械存在检测和手术阶段分割,在 Cholec80 数据集上评估了这些方法并比较了不同的度量标准。在这里,我们能够表明,使用基于 DBN 的主动学习方法选择要注释的下一个数据点可以显著优于基于随机选择数据点的基线。特别是,熵和变异比等指标在不同任务上表现一致。
我们表明,使用基于 DBN 的主动学习策略可以有选择地注释数据,从而减少与手术流程相关任务中所需的标记训练量。