Department of Computer & Information Sciences, PIEAS, Pakistan Institute of Engineering and Applied Sciences, P.O. Nilore, Islamabad.
Department of Computer & Information Sciences, PIEAS, Pakistan Institute of Engineering and Applied Sciences, P.O. Nilore, Islamabad.
Comput Biol Med. 2014 Apr;47:76-92. doi: 10.1016/j.compbiomed.2013.12.010. Epub 2014 Jan 8.
In recent years, classification of colon biopsy images has become an active research area. Traditionally, colon cancer is diagnosed using microscopic analysis. However, the process is subjective and leads to considerable inter/intra observer variation. Therefore, reliable computer-aided colon cancer detection techniques are in high demand. In this paper, we propose a colon biopsy image classification system, called CBIC, which benefits from discriminatory capabilities of information rich hybrid feature spaces, and performance enhancement based on ensemble classification methodology. Normal and malignant colon biopsy images differ with each other in terms of the color distribution of different biological constituents. The colors of different constituents are sharp in normal images, whereas the colors diffuse with each other in malignant images. In order to exploit this variation, two feature types, namely color components based statistical moments (CCSM) and Haralick features have been proposed, which are color components based variants of their traditional counterparts. Moreover, in normal colon biopsy images, epithelial cells possess sharp and well-defined edges. Histogram of oriented gradients (HOG) based features have been employed to exploit this information. Different combinations of hybrid features have been constructed from HOG, CCSM, and Haralick features. The minimum Redundancy Maximum Relevance (mRMR) feature selection method has been employed to select meaningful features from individual and hybrid feature sets. Finally, an ensemble classifier based on majority voting has been proposed, which classifies colon biopsy images using the selected features. Linear, RBF, and sigmoid SVM have been employed as base classifiers. The proposed system has been tested on 174 colon biopsy images, and improved performance (=98.85%) has been observed compared to previously reported studies. Additionally, the use of mRMR method has been justified by comparing the performance of CBIC on original and reduced feature sets.
近年来,结肠活检图像分类已成为一个活跃的研究领域。传统上,结肠癌的诊断是通过显微镜分析进行的。然而,这个过程是主观的,会导致相当大的观察者间和观察者内差异。因此,需要可靠的计算机辅助结直肠癌检测技术。在本文中,我们提出了一种结肠活检图像分类系统,称为 CBIC,它受益于信息丰富的混合特征空间的判别能力,以及基于集成分类方法的性能增强。正常和恶性结肠活检图像在不同生物成分的颜色分布上彼此不同。正常图像中不同成分的颜色是鲜明的,而恶性图像中颜色则相互扩散。为了利用这种变化,提出了两种特征类型,即基于颜色分量的统计矩(CCSM)和 Haralick 特征,它们是传统特征的颜色分量变体。此外,在正常的结肠活检图像中,上皮细胞具有鲜明和定义良好的边缘。基于方向梯度直方图(HOG)的特征已被用于利用这些信息。HOG、CCSM 和 Haralick 特征的不同组合已从混合特征中构建。最小冗余最大相关性(mRMR)特征选择方法已被用于从个体和混合特征集中选择有意义的特征。最后,提出了一种基于多数投票的集成分类器,该分类器使用所选特征对结肠活检图像进行分类。线性、RBF 和 sigmoid SVM 被用作基分类器。该系统已在 174 张结肠活检图像上进行了测试,与以前报道的研究相比,观察到性能提高(=98.85%)。此外,通过比较 CBIC 在原始和简化特征集上的性能,证明了使用 mRMR 方法的合理性。