Janowczyk Andrew, Madabhushi Anant
Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH 44106, USA.
J Pathol Inform. 2016 Jul 26;7:29. doi: 10.4103/2153-3539.186902. eCollection 2016.
Deep learning (DL) is a representation learning approach ideally suited for image analysis challenges in digital pathology (DP). The variety of image analysis tasks in the context of DP includes detection and counting (e.g., mitotic events), segmentation (e.g., nuclei), and tissue classification (e.g., cancerous vs. non-cancerous). Unfortunately, issues with slide preparation, variations in staining and scanning across sites, and vendor platforms, as well as biological variance, such as the presentation of different grades of disease, make these image analysis tasks particularly challenging. Traditional approaches, wherein domain-specific cues are manually identified and developed into task-specific "handcrafted" features, can require extensive tuning to accommodate these variances. However, DL takes a more domain agnostic approach combining both feature discovery and implementation to maximally discriminate between the classes of interest. While DL approaches have performed well in a few DP related image analysis tasks, such as detection and tissue classification, the currently available open source tools and tutorials do not provide guidance on challenges such as (a) selecting appropriate magnification, (b) managing errors in annotations in the training (or learning) dataset, and (c) identifying a suitable training set containing information rich exemplars. These foundational concepts, which are needed to successfully translate the DL paradigm to DP tasks, are non-trivial for (i) DL experts with minimal digital histology experience, and (ii) DP and image processing experts with minimal DL experience, to derive on their own, thus meriting a dedicated tutorial.
This paper investigates these concepts through seven unique DP tasks as use cases to elucidate techniques needed to produce comparable, and in many cases, superior to results from the state-of-the-art hand-crafted feature-based classification approaches.
Specifically, in this tutorial on DL for DP image analysis, we show how an open source framework (Caffe), with a singular network architecture, can be used to address: (a) nuclei segmentation (F-score of 0.83 across 12,000 nuclei), (b) epithelium segmentation (F-score of 0.84 across 1735 regions), (c) tubule segmentation (F-score of 0.83 from 795 tubules), (d) lymphocyte detection (F-score of 0.90 across 3064 lymphocytes), (e) mitosis detection (F-score of 0.53 across 550 mitotic events), (f) invasive ductal carcinoma detection (F-score of 0.7648 on 50 k testing patches), and (g) lymphoma classification (classification accuracy of 0.97 across 374 images).
This paper represents the largest comprehensive study of DL approaches in DP to date, with over 1200 DP images used during evaluation. The supplemental online material that accompanies this paper consists of step-by-step instructions for the usage of the supplied source code, trained models, and input data.
深度学习(DL)是一种表示学习方法,非常适合数字病理学(DP)中的图像分析挑战。DP背景下的各种图像分析任务包括检测和计数(如有丝分裂事件)、分割(如细胞核)以及组织分类(如癌组织与非癌组织)。不幸的是,载玻片制备问题、不同地点染色和扫描的差异、供应商平台以及生物学差异(如不同疾病分级的表现)使得这些图像分析任务极具挑战性。传统方法是手动识别特定领域的线索并将其开发为特定任务的“手工制作”特征,可能需要进行大量调整以适应这些差异。然而,DL采用了一种更通用的方法,将特征发现和实现相结合,以最大程度地区分感兴趣的类别。虽然DL方法在一些与DP相关的图像分析任务(如检测和组织分类)中表现良好,但目前可用的开源工具和教程并未就以下挑战提供指导:(a)选择合适的放大倍数;(b)处理训练(或学习)数据集中注释的错误;(c)识别包含信息丰富示例的合适训练集。这些将DL范式成功应用于DP任务所需的基础概念,对于(i)数字组织学经验极少的DL专家,以及(ii)DL经验极少的DP和图像处理专家来说,自行推导并非易事,因此值得专门编写一篇教程。
本文通过七个独特的DP任务作为用例来研究这些概念,以阐明产生可比结果所需的技术,并在许多情况下,优于基于手工制作特征的最新分类方法的结果。
具体而言,在本关于DP图像分析的DL教程中,我们展示了如何使用具有单一网络架构的开源框架(Caffe)来解决:(a)细胞核分割(12000个细胞核的F值为0.83);(b)上皮细胞分割(1735个区域的F值为0.84);(c)肾小管分割(795个肾小管的F值为0.83);(d)淋巴细胞检测(3064个淋巴细胞的F值为0.90);(e)有丝分裂检测(550个有丝分裂事件的F值为0.53);(f)浸润性导管癌检测(50k测试补丁上的F值为0.7648);以及(g)淋巴瘤分类(374张图像的分类准确率为0.97)。
本文是迄今为止对DP中DL方法进行的最大规模综合研究,评估期间使用了超过1200张DP图像。本文附带的补充在线材料包括使用所提供的源代码、训练模型和输入数据的逐步说明。