Abel John, Jain Suyog, Rajan Deepta, Padigela Harshith, Leidal Kenneth, Prakash Aaditya, Conway Jake, Nercessian Michael, Kirkup Christian, Javed Syed Ashar, Biju Raymond, Harguindeguy Natalia, Shenker Daniel, Indorf Nicholas, Sanghavi Darpan, Egger Robert, Trotter Benjamin, Gerardin Ylaine, Brosnan-Cashman Jacqueline A, Dhoot Aditya, Montalto Michael C, Parmar Chintan, Wapinski Ilan, Khosla Archit, Drage Michael G, Yu Limin, Taylor-Weiner Amaro
PathAI, Boston, MA, USA.
NPJ Precis Oncol. 2024 Jun 19;8(1):134. doi: 10.1038/s41698-024-00623-9.
While alterations in nucleus size, shape, and color are ubiquitous in cancer, comprehensive quantification of nuclear morphology across a whole-slide histologic image remains a challenge. Here, we describe the development of a pan-tissue, deep learning-based digital pathology pipeline for exhaustive nucleus detection, segmentation, and classification and the utility of this pipeline for nuclear morphologic biomarker discovery. Manually-collected nucleus annotations were used to train an object detection and segmentation model for identifying nuclei, which was deployed to segment nuclei in H&E-stained slides from the BRCA, LUAD, and PRAD TCGA cohorts. Interpretable features describing the shape, size, color, and texture of each nucleus were extracted from segmented nuclei and compared to measurements of genomic instability, gene expression, and prognosis. The nuclear segmentation and classification model trained herein performed comparably to previously reported models. Features extracted from the model revealed differences sufficient to distinguish between BRCA, LUAD, and PRAD. Furthermore, cancer cell nuclear area was associated with increased aneuploidy score and homologous recombination deficiency. In BRCA, increased fibroblast nuclear area was indicative of poor progression-free and overall survival and was associated with gene expression signatures related to extracellular matrix remodeling and anti-tumor immunity. Thus, we developed a powerful pan-tissue approach for nucleus segmentation and featurization, enabling the construction of predictive models and the identification of features linking nuclear morphology with clinically-relevant prognostic biomarkers across multiple cancer types.
虽然细胞核大小、形状和颜色的改变在癌症中普遍存在,但在全切片组织学图像上全面量化核形态仍然是一项挑战。在此,我们描述了一种基于深度学习的全组织数字病理学流程的开发,用于详尽的细胞核检测、分割和分类,以及该流程在核形态生物标志物发现中的应用。人工收集的细胞核注释用于训练一个用于识别细胞核的目标检测和分割模型,该模型被部署用于分割来自BRCA、LUAD和PRAD TCGA队列的苏木精-伊红(H&E)染色切片中的细胞核。从分割后的细胞核中提取描述每个细胞核形状、大小、颜色和纹理的可解释特征,并与基因组不稳定性、基因表达和预后的测量结果进行比较。本文训练的核分割和分类模型的性能与先前报道的模型相当。从该模型中提取的特征显示出足以区分BRCA、LUAD和PRAD的差异。此外,癌细胞核面积与非整倍体评分增加和同源重组缺陷相关。在BRCA中,成纤维细胞核面积增加表明无进展生存期和总生存期较差,并且与细胞外基质重塑和抗肿瘤免疫相关的基因表达特征有关。因此,我们开发了一种强大的全组织细胞核分割和特征提取方法,能够构建预测模型,并识别将核形态与多种癌症类型中临床相关预后生物标志物联系起来的特征。