Saraswat Mukesh, Arya K V
ABV-Indian Institute of Information Technology and Management, Gwalior, 474010, India,
Med Biol Eng Comput. 2014 Dec;52(12):1041-52. doi: 10.1007/s11517-014-1200-8. Epub 2014 Oct 5.
In automatic segmentation of leukocytes from the complex morphological background of tissue section images, a vast number of artifacts/noise are also extracted causing large amount of multivariate data generation. This multivariate data degrades the performance of a classifier to discriminate between leukocytes and artifacts/noise. However, the selection of prominent features plays an important role in reducing the computational complexity and increasing the performance of the classifier as compared to a high-dimensional features space. Therefore, this paper introduces a novel Gini importance-based binary random forest feature selection method. Moreover, the random forest classifier is used to classify the extracted objects into artifacts, mononuclear cells, and polymorphonuclear cells. The experimental results establish that the proposed method effectively eliminates the irrelevant features, maintaining the high classification accuracy as compared to other feature reduction methods.
在从组织切片图像的复杂形态背景中自动分割白细胞时,也会提取大量伪像/噪声,从而产生大量多变量数据。这种多变量数据会降低分类器区分白细胞与伪像/噪声的性能。然而,与高维特征空间相比,突出特征的选择在降低计算复杂度和提高分类器性能方面起着重要作用。因此,本文介绍了一种基于基尼重要性的新型二元随机森林特征选择方法。此外,随机森林分类器用于将提取的对象分类为伪像、单核细胞和多形核细胞。实验结果表明,与其他特征约简方法相比,该方法有效地消除了无关特征,保持了较高的分类准确率。