Medical Image Analysis Group, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.
Department of Pathology, University Medical Center Utrecht, University Utrecht, Utrecht, The Netherlands.
Lab Invest. 2021 Apr;101(4):525-533. doi: 10.1038/s41374-021-00540-6. Epub 2021 Feb 19.
Ductal carcinoma in situ (DCIS) is a non-invasive breast cancer that can progress into invasive ductal carcinoma (IDC). Studies suggest DCIS is often overtreated since a considerable part of DCIS lesions may never progress into IDC. Lower grade lesions have a lower progression speed and risk, possibly allowing treatment de-escalation. However, studies show significant inter-observer variation in DCIS grading. Automated image analysis may provide an objective solution to address high subjectivity of DCIS grading by pathologists. In this study, we developed and evaluated a deep learning-based DCIS grading system. The system was developed using the consensus DCIS grade of three expert observers on a dataset of 1186 DCIS lesions from 59 patients. The inter-observer agreement, measured by quadratic weighted Cohen's kappa, was used to evaluate the system and compare its performance to that of expert observers. We present an analysis of the lesion-level and patient-level inter-observer agreement on an independent test set of 1001 lesions from 50 patients. The deep learning system (dl) achieved on average slightly higher inter-observer agreement to the three observers (o1, o2 and o3) (κ= 0.81, κ= 0.53 and κ= 0.40) than the observers amongst each other (κ= 0.58, κ= 0.50 and κ= 0.42) at the lesion-level. At the patient-level, the deep learning system achieved similar agreement to the observers (κ= 0.77, κ= 0.75 and κ= 0.70) as the observers amongst each other (κ= 0.77, κ= 0.75 and κ= 0.72). The deep learning system better reflected the grading spectrum of DCIS than two of the observers. In conclusion, we developed a deep learning-based DCIS grading system that achieved a performance similar to expert observers. To the best of our knowledge, this is the first automated system for the grading of DCIS that could assist pathologists by providing robust and reproducible second opinions on DCIS grade.
导管原位癌 (DCIS) 是一种非浸润性乳腺癌,可能进展为浸润性导管癌 (IDC)。研究表明,DCIS 常常被过度治疗,因为相当一部分 DCIS 病变可能永远不会进展为 IDC。低级别病变进展速度和风险较低,可能允许治疗降级。然而,研究表明 DCIS 分级存在显著的观察者间差异。自动化图像分析可能为解决病理学家对 DCIS 分级的高度主观性提供客观的解决方案。在这项研究中,我们开发并评估了一种基于深度学习的 DCIS 分级系统。该系统是使用三位专家观察者对来自 59 名患者的 1186 个 DCIS 病变数据集的共识 DCIS 分级开发的。使用二次加权 Cohen's kappa 衡量观察者间一致性,用于评估系统并比较其性能与专家观察者的性能。我们对来自 50 名患者的 1001 个病变的独立测试集进行了病变水平和患者水平的观察者间一致性分析。深度学习系统 (dl) 在病变水平上平均比三位观察者 (o1、o2 和 o3) 获得更高的观察者间一致性 (κ=0.81、κ=0.53 和 κ=0.40),而观察者之间的一致性则较低 (κ=0.58、κ=0.50 和 κ=0.42)。在患者水平上,深度学习系统与观察者的一致性相似 (κ=0.77、κ=0.75 和 κ=0.70),与观察者之间的一致性也相似 (κ=0.77、κ=0.75 和 κ=0.72)。深度学习系统比两位观察者更能反映 DCIS 的分级范围。总之,我们开发了一种基于深度学习的 DCIS 分级系统,其性能与专家观察者相似。据我们所知,这是第一个用于 DCIS 分级的自动化系统,可以通过提供稳健且可重复的 DCIS 分级第二意见来协助病理学家。