Singh Shantanu, Janoos Firdaus, Pécot Thierry, Caserta Enrico, Leone Gustavo, Rittscher Jens, Machiraju Raghu
Dept. of Computer Science and Engg., The Ohio State University, USA.
Inf Process Med Imaging. 2011;22:398-410. doi: 10.1007/978-3-642-22092-0_33.
In systems-based approaches for studying processes such as cancer and development, identifying and characterizing individual cells within a tissue is the first step towards understanding the large-scale effects that emerge from the interactions between cells. To this end, nuclear morphology is an important phenotype to characterize the physiological and differentiated state of a cell. This study focuses on using nuclear morphology to identify cellular phenotypes in thick tissue sections imaged using 3D fluorescence microscopy. The limited label information, heterogeneous feature set describing a nucleus, and existence of subpopulations within cell-types makes this a difficult learning problem. To address these issues, a technique is presented to learn a distance metric from labeled data which is locally adaptive to account for heterogeneity in the data. Additionally, a label propagation technique is used to improve the quality of the learned metric by expanding the training set using unlabeled data. Results are presented on images of tumor stroma in breast cancer, where the framework is used to identify fibroblasts, macrophages and endothelial cells--three major stromal cells involved in carcinogenesis.
在基于系统的方法中,用于研究诸如癌症和发育等过程时,识别和表征组织内的单个细胞是理解细胞间相互作用所产生的大规模效应的第一步。为此,核形态是表征细胞生理和分化状态的重要表型。本研究聚焦于利用核形态在使用三维荧光显微镜成像的厚组织切片中识别细胞表型。有限的标记信息、描述细胞核的异质特征集以及细胞类型内亚群的存在使得这成为一个困难的学习问题。为解决这些问题,提出了一种从标记数据中学习距离度量的技术,该度量在局部具有适应性以考虑数据中的异质性。此外,使用标签传播技术通过利用未标记数据扩展训练集来提高所学习度量的质量。给出了乳腺癌肿瘤基质图像的结果,其中该框架用于识别成纤维细胞、巨噬细胞和内皮细胞——参与致癌作用的三种主要基质细胞。