Department of Electrical Engineering, Faculty of Engineering, Chiang Mai University, Chiang Mai 50200, Thailand.
Department of Electrical Engineering, Faculty of Engineering, Chiang Mai University, Chiang Mai 50200, Thailand; Biomedical Engineering Center, Chiang Mai University, Chiang Mai 50200, Thailand.
Comput Methods Programs Biomed. 2014 Feb;113(2):539-56. doi: 10.1016/j.cmpb.2013.12.012. Epub 2014 Jan 2.
Cervical cancer is one of the leading causes of cancer death in females worldwide. The disease can be cured if the patient is diagnosed in the pre-cancerous lesion stage or earlier. A common physical examination technique widely used in the screening is Papanicolaou test or Pap test. In this research, a method for automatic cervical cancer cell segmentation and classification is proposed. A single-cell image is segmented into nucleus, cytoplasm, and background, using the fuzzy C-means (FCM) clustering technique. Four cell classes in the ERUDIT and LCH datasets, i.e., normal, low grade squamous intraepithelial lesion (LSIL), high grade squamous intraepithelial lesion (HSIL), and squamous cell carcinoma (SCC), are considered. The 2-class problem can be achieved by grouping the last 3 classes as one abnormal class. Whereas, the Herlev dataset consists of 7 cell classes, i.e., superficial squamous, intermediate squamous, columnar, mild dysplasia, moderate dysplasia, severe dysplasia, and carcinoma in situ. These 7 classes can also be grouped to form a 2-class problem. These 3 datasets were tested on 5 classifiers including Bayesian classifier, linear discriminant analysis (LDA), K-nearest neighbor (KNN), artificial neural networks (ANN), and support vector machine (SVM). For the ERUDIT dataset, ANN with 5 nucleus-based features yielded the accuracies of 96.20% and 97.83% on the 4-class and 2-class problems, respectively. For the Herlev dataset, ANN with 9 cell-based features yielded the accuracies of 93.78% and 99.27% for the 7-class and 2-class problems, respectively. For the LCH dataset, ANN with 9 cell-based features yielded the accuracies of 95.00% and 97.00% for the 4-class and 2-class problems, respectively. The segmentation and classification performances of the proposed method were compared with that of the hard C-means clustering and watershed technique. The results show that the proposed automatic approach yields very good performance and is better than its counterparts.
宫颈癌是全球女性癌症死亡的主要原因之一。如果患者在癌前病变阶段或更早被诊断出来,该疾病是可以治愈的。巴氏涂片检查是一种广泛用于筛查的常见体格检查技术。在这项研究中,提出了一种用于自动宫颈癌细胞分割和分类的方法。使用模糊 C 均值(FCM)聚类技术将单细胞图像分割为细胞核、细胞质和背景。在 ERUDIT 和 LCH 数据集上考虑了 4 种细胞类,即正常、低级别鳞状上皮内病变(LSIL)、高级别鳞状上皮内病变(HSIL)和鳞状细胞癌(SCC)。可以通过将最后 3 类组合为一个异常类来实现 2 类问题。然而,Herlev 数据集由 7 种细胞类组成,即浅表鳞状、中间鳞状、柱状、轻度发育不良、中度发育不良、重度发育不良和原位癌。这 7 类也可以组合形成 2 类问题。这 3 个数据集在 5 个分类器上进行了测试,包括贝叶斯分类器、线性判别分析(LDA)、K-最近邻(KNN)、人工神经网络(ANN)和支持向量机(SVM)。对于 ERUDIT 数据集,基于 5 个核特征的 ANN 在 4 类和 2 类问题上的准确率分别为 96.20%和 97.83%。对于 Herlev 数据集,基于 9 个细胞特征的 ANN 在 7 类和 2 类问题上的准确率分别为 93.78%和 99.27%。对于 LCH 数据集,基于 9 个细胞特征的 ANN 在 4 类和 2 类问题上的准确率分别为 95.00%和 97.00%。所提出方法的分割和分类性能与硬 C 均值聚类和分水岭技术进行了比较。结果表明,所提出的自动方法具有非常好的性能,优于其同类方法。