College of Computer and Cyber Security, Fujian Normal University, Fuzhou, Fujian, China.
Digit Fujian Internet-of-Things Laboratory of Environmental Monitoring Fuzhou, Fujian, China.
PLoS One. 2022 Feb 4;17(2):e0263006. doi: 10.1371/journal.pone.0263006. eCollection 2022.
Biomedical research is inseparable from the analysis of various histopathological images, and hematoxylin-eosin (HE)-stained images are one of the most basic and widely used types. However, at present, machine learning based approaches of the analysis of this kind of images are highly relied on manual labeling of images for training. Fully automated processing of HE-stained images remains a challenging task due to the high degree of color intensity, size and shape uncertainty of the stained cells. For this problem, we propose a fully automatic pixel-wise semantic segmentation method based on pseudo-labels, which concerns to significantly reduce the manual cell sketching and labeling work before machine learning, and guarantees the accuracy of segmentation. First, we collect reliable training samples in a unsupervised manner based on K-means clustering results; second, we use full mixup strategy to enhance the training images and to obtain the U-Net model for the nuclei segmentation from the background. The experimental results based on the meningioma pathology image dataset show that the proposed method has good performance and the pathological features obtained statistically based on the segmentation results can be used to assist in the clinical grading of meningiomas. Compared with other machine learning strategies, it can provide a reliable reference for clinical research more effectively.
生物医学研究离不开对各种组织病理学图像的分析,而苏木精-伊红(HE)染色图像是最基本和最广泛使用的类型之一。然而,目前基于机器学习的这种图像分析方法高度依赖于对图像进行手动标记以进行训练。由于染色细胞的颜色强度、大小和形状不确定性很高,因此完全自动化处理 HE 染色图像仍然是一项具有挑战性的任务。针对这个问题,我们提出了一种基于伪标签的全自动像素级语义分割方法,该方法显著减少了机器学习之前的手动细胞描边和标记工作,并保证了分割的准确性。首先,我们基于 K-means 聚类结果以无监督的方式收集可靠的训练样本;其次,我们使用全混洗策略来增强训练图像,并从背景中获得用于核分割的 U-Net 模型。基于脑膜瘤病理图像数据集的实验结果表明,所提出的方法具有良好的性能,并且基于分割结果统计获得的病理特征可用于辅助脑膜瘤的临床分级。与其他机器学习策略相比,它可以更有效地为临床研究提供可靠的参考。