Kolluru Chaitanya, Lee Juhwan, Gharaibeh Yazan, Bezerra Hiram G, Wilson David L
Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH 44106, USA.
Interventional Cardiology Center, Heart and Vascular Institute, The University of South Florida, Tampa, FL 33606, USA.
IEEE Access. 2021;9:37273-37280. doi: 10.1109/access.2021.3058890. Epub 2021 Feb 11.
Deep learning based methods are routinely used to segment various structures of interest in varied medical imaging modalities. Acquiring annotations for a large number of images requires a skilled analyst, and the process is both time consuming and challenging. Our approach to reduce effort is to reduce the number of images needing detailed annotation. For intravascular optical coherence tomography (IVOCT) image pullbacks, we tested 10% to 100% of training images derived from two schemes: equally-spaced image subsampling and deep-learning- based image clustering. The first strategy involves selecting images at equally spaced intervals from the volume, accounting for the high spatial correlation between neighboring images. In clustering, we used an autoencoder to create a deep feature space representation, performed k-medoids clustering, and then used the cluster medians for training. For coronary calcifications, a baseline U-net model was trained on all images from volumes of interest (VOIs) and compared with fewer images from the sub-sampling strategies. For a given sampling ratio, the clustering based method performed better or similar as compared to the equally spaced sampling approach (e.g., with 10% of images, mean F1 score for calcific class increased from 0.52 to 0.63, with equal spacing and with clustering, respectively). Additionally, for a fixed number of training images, sampling images from more VOIs performed better than otherwise. In conclusion, we recommend the clustering based approach to annotate a small fraction of images, creating a baseline model, which potentially can be improved further by annotating images selected using methods described in active learning research.
基于深度学习的方法通常用于分割各种医学成像模态中感兴趣的结构。为大量图像获取标注需要熟练的分析师,并且这个过程既耗时又具有挑战性。我们减少工作量的方法是减少需要详细标注的图像数量。对于血管内光学相干断层扫描(IVOCT)图像回撤,我们测试了从两种方案中获取的10%到100%的训练图像:等间距图像子采样和基于深度学习的图像聚类。第一种策略是从体积中以等间距间隔选择图像,考虑到相邻图像之间的高空间相关性。在聚类中,我们使用自动编码器创建深度特征空间表示,执行k-中心点聚类,然后使用聚类中心进行训练。对于冠状动脉钙化,在感兴趣体积(VOI)的所有图像上训练一个基线U-net模型,并与来自子采样策略的较少图像进行比较。对于给定的采样率,基于聚类的方法与等间距采样方法相比表现更好或相似(例如,对于10%的图像,钙化类别的平均F1分数分别从等间距采样的0.52增加到聚类采样的0.63)。此外,对于固定数量的训练图像,从更多VOI中采样图像比其他方式表现更好。总之,我们建议采用基于聚类的方法来标注一小部分图像,创建一个基线模型,通过标注使用主动学习研究中描述的方法选择的图像,该模型可能会进一步得到改进。