Mercan Caner, Balkenhol Maschenka, Salgado Roberto, Sherman Mark, Vielh Philippe, Vreuls Willem, Polónia António, Horlings Hugo M, Weichert Wilko, Carter Jodi M, Bult Peter, Christgen Matthias, Denkert Carsten, van de Vijver Koen, Bokhorst John-Melle, van der Laak Jeroen, Ciompi Francesco
Radboud University Medical Center, Department of Pathology, Nijmegen, The Netherlands.
GZA-ZNA Hospitals, Department of Pathology, Antwerp, Belgium.
NPJ Breast Cancer. 2022 Nov 8;8(1):120. doi: 10.1038/s41523-022-00488-w.
To guide the choice of treatment, every new breast cancer is assessed for aggressiveness (i.e., graded) by an experienced histopathologist. Typically, this tumor grade consists of three components, one of which is the nuclear pleomorphism score (the extent of abnormalities in the overall appearance of tumor nuclei). The degree of nuclear pleomorphism is subjectively classified from 1 to 3, where a score of 1 most closely resembles epithelial cells of normal breast epithelium and 3 shows the greatest abnormalities. Establishing numerical criteria for grading nuclear pleomorphism is challenging, and inter-observer agreement is poor. Therefore, we studied the use of deep learning to develop fully automated nuclear pleomorphism scoring in breast cancer. The reference standard used for training the algorithm consisted of the collective knowledge of an international panel of 10 pathologists on a curated set of regions of interest covering the entire spectrum of tumor morphology in breast cancer. To fully exploit the information provided by the pathologists, a first-of-its-kind deep regression model was trained to yield a continuous scoring rather than limiting the pleomorphism scoring to the standard three-tiered system. Our approach preserves the continuum of nuclear pleomorphism without necessitating a large data set with explicit annotations of tumor nuclei. Once translated to the traditional system, our approach achieves top pathologist-level performance in multiple experiments on regions of interest and whole-slide images, compared to a panel of 10 and 4 pathologists, respectively.
为指导治疗方案的选择,每例新发乳腺癌都由经验丰富的组织病理学家评估其侵袭性(即分级)。通常,这种肿瘤分级由三个部分组成,其中之一是核多形性评分(肿瘤细胞核整体外观的异常程度)。核多形性程度主观上分为1至3级,1分最接近正常乳腺上皮的上皮细胞,3分显示出最大的异常。建立核多形性分级的数值标准具有挑战性,且观察者间的一致性较差。因此,我们研究了使用深度学习来开发乳腺癌中全自动核多形性评分。用于训练算法的参考标准由一个由10名病理学家组成的国际小组对一组精心挑选的感兴趣区域的集体知识构成,这些区域涵盖了乳腺癌肿瘤形态的整个范围。为了充分利用病理学家提供的信息,训练了一种首创的深度回归模型以产生连续评分,而不是将多形性评分限制在标准的三层系统中。我们的方法保留了核多形性的连续性,而无需一个带有肿瘤细胞核明确注释的大数据集。一旦转换为传统系统,与分别由10名和4名病理学家组成的小组相比,我们的方法在对感兴趣区域和全切片图像的多次实验中达到了顶级病理学家级别的性能。