Department of Computer Science and Engineering, Dahan Institute of Technology, Hualien, 970, Taiwan, ROC.
Gene. 2013 Apr 10;518(1):159-63. doi: 10.1016/j.gene.2012.11.046. Epub 2012 Dec 10.
Correct classification and prediction of tumor cells is essential for a successful diagnosis and reliable future treatment. In this study, we aimed at using genetic algorithms for feature selection and proposed silhouette statistics as a discriminant function to distinguish between six subtypes of pediatric acute lymphoblastic leukemia by using microarray with thousands of gene expressions. Our methods have shown a better classification accuracy than previously published methods and obtained a set of genes effective to discriminate subtypes of pediatric acute lymphoblastic leukemia. Furthermore, the use of silhouette statistics, offering the advantages of measuring the classification quality by a graphical display and by an average silhouette width, has also demonstrated feasibility and novelty for more difficult multiclass tumor prediction problems.
正确分类和预测肿瘤细胞对于成功诊断和可靠的未来治疗至关重要。在这项研究中,我们旨在使用遗传算法进行特征选择,并提出了轮廓统计作为判别函数,通过使用数千个基因表达的微阵列来区分六种儿科急性淋巴细胞白血病亚型。我们的方法比以前发表的方法具有更高的分类准确性,并获得了一组有效区分儿科急性淋巴细胞白血病亚型的基因。此外,使用轮廓统计,通过图形显示和平均轮廓宽度来衡量分类质量,也为更困难的多类肿瘤预测问题提供了可行性和新颖性。