一种基于纹理特征的组织病理学图像细胞核分割质量评估方法。

A Methodology for Texture Feature-based Quality Assessment in Nucleus Segmentation of Histopathology Image.

作者信息

Wen Si, Kurc Tahsin M, Gao Yi, Zhao Tianhao, Saltz Joel H, Zhu Wei

机构信息

Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, Stony Brook, NY, USA.

Department of Biomedical Informatics, State University of New York at Stony Brook, Stony Brook, NY, USA.

出版信息

J Pathol Inform. 2017 Sep 7;8:38. doi: 10.4103/jpi.jpi_43_17. eCollection 2017.

Abstract

CONTEXT

Image segmentation pipelines often are sensitive to algorithm input parameters. Algorithm parameters optimized for a set of images do not necessarily produce good-quality-segmentation results for other images. Even within an image, some regions may not be well segmented due to a number of factors, including multiple pieces of tissue with distinct characteristics, differences in staining of the tissue, normal versus tumor regions, and tumor heterogeneity. Evaluation of quality of segmentation results is an important step in image analysis. It is very labor intensive to do quality assessment manually with large image datasets because a whole-slide tissue image may have hundreds of thousands of nuclei. Semi-automatic mechanisms are needed to assist researchers and application developers to detect image regions with bad segmentations efficiently.

AIMS

Our goal is to develop and evaluate a machine-learning-based semi-automated workflow to assess quality of nucleus segmentation results in a large set of whole-slide tissue images.

METHODS

We propose a quality control methodology, in which machine-learning algorithms are trained with image intensity and texture features to produce a classification model. This model is applied to image patches in a whole-slide tissue image to predict the quality of nucleus segmentation in each patch. The training step of our methodology involves the selection and labeling of regions by a pathologist in a set of images to create the training dataset. The image regions are partitioned into patches. A set of intensity and texture features is computed for each patch. A classifier is trained with the features and the labels assigned by the pathologist. At the end of this process, a classification model is generated. The classification step applies the classification model to unlabeled test images. Each test image is partitioned into patches. The classification model is applied to each patch to predict the patch's label.

RESULTS

The proposed methodology has been evaluated by assessing the segmentation quality of a segmentation method applied to images from two cancer types in The Cancer Genome Atlas; WHO Grade II lower grade glioma (LGG) and lung adenocarcinoma (LUAD). The results show that our method performs well in predicting patches with good-quality segmentations and achieves F1 scores 84.7% for LGG and 75.43% for LUAD.

CONCLUSIONS

As image scanning technologies advance, large volumes of whole-slide tissue images will be available for research and clinical use. Efficient approaches for the assessment of quality and robustness of output from computerized image analysis workflows will become increasingly critical to extracting useful quantitative information from tissue images. Our work demonstrates the feasibility of machine-learning-based semi-automated techniques to assist researchers and algorithm developers in this process.

摘要

背景

图像分割流程通常对算法输入参数很敏感。针对一组图像优化的算法参数不一定能为其他图像产生高质量的分割结果。即使在一幅图像内,由于多种因素,包括具有不同特征的多块组织、组织染色差异、正常区域与肿瘤区域以及肿瘤异质性,某些区域可能也无法得到很好的分割。分割结果质量评估是图像分析中的重要一步。对于大型图像数据集进行人工质量评估非常耗费人力,因为一张全切片组织图像可能有数十万细胞核。需要半自动机制来协助研究人员和应用开发者有效检测分割效果不佳的图像区域。

目的

我们的目标是开发并评估一种基于机器学习的半自动工作流程,以评估大量全切片组织图像中细胞核分割结果的质量。

方法

我们提出一种质量控制方法,其中利用图像强度和纹理特征训练机器学习算法以生成分类模型。该模型应用于全切片组织图像中的图像块,以预测每个图像块中细胞核分割的质量。我们方法的训练步骤包括病理学家在一组图像中选择区域并进行标注以创建训练数据集。将图像区域划分为图像块。为每个图像块计算一组强度和纹理特征。使用这些特征和病理学家分配的标签训练分类器。在此过程结束时,生成一个分类模型。分类步骤将分类模型应用于未标注的测试图像。将每个测试图像划分为图像块。将分类模型应用于每个图像块以预测图像块的标签。

结果

通过评估应用于来自癌症基因组图谱中两种癌症类型(世界卫生组织二级低级别胶质瘤(LGG)和肺腺癌(LUAD))图像的分割方法的分割质量,对所提出的方法进行了评估。结果表明,我们的方法在预测高质量分割的图像块方面表现良好,LGG的F1分数达到84.7%,LUAD的F1分数达到75.43%。

结论

随着图像扫描技术的进步,大量全切片组织图像将可用于研究和临床应用。评估计算机化图像分析工作流程输出的质量和稳健性的有效方法对于从组织图像中提取有用的定量信息将变得越来越关键。我们的工作证明了基于机器学习的半自动技术在此过程中协助研究人员和算法开发者的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/87fa/5609357/b63dbbe06d3c/JPI-8-38-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索