Nguyen Thanh, Khosravi Abbas, Creighton Douglas, Nahavandi Saeid
Centre for Intelligent Systems Research (CISR), Deakin University, Geelong Waurn Ponds Campus, Victoria, 3216, Australia.
PLoS One. 2015 Mar 30;10(3):e0120364. doi: 10.1371/journal.pone.0120364. eCollection 2015.
This paper introduces a novel approach to gene selection based on a substantial modification of analytic hierarchy process (AHP). The modified AHP systematically integrates outcomes of individual filter methods to select the most informative genes for microarray classification. Five individual ranking methods including t-test, entropy, receiver operating characteristic (ROC) curve, Wilcoxon and signal to noise ratio are employed to rank genes. These ranked genes are then considered as inputs for the modified AHP. Additionally, a method that uses fuzzy standard additive model (FSAM) for cancer classification based on genes selected by AHP is also proposed in this paper. Traditional FSAM learning is a hybrid process comprising unsupervised structure learning and supervised parameter tuning. Genetic algorithm (GA) is incorporated in-between unsupervised and supervised training to optimize the number of fuzzy rules. The integration of GA enables FSAM to deal with the high-dimensional-low-sample nature of microarray data and thus enhance the efficiency of the classification. Experiments are carried out on numerous microarray datasets. Results demonstrate the performance dominance of the AHP-based gene selection against the single ranking methods. Furthermore, the combination of AHP-FSAM shows a great accuracy in microarray data classification compared to various competing classifiers. The proposed approach therefore is useful for medical practitioners and clinicians as a decision support system that can be implemented in the real medical practice.
本文介绍了一种基于层次分析法(AHP)重大改进的基因选择新方法。改进后的AHP系统地整合了各个过滤方法的结果,以选择用于微阵列分类的最具信息性的基因。采用包括t检验、熵、受试者工作特征(ROC)曲线、威尔科克森法和信噪比在内的五种个体排序方法对基因进行排序。然后将这些排序后的基因作为改进后AHP的输入。此外,本文还提出了一种基于AHP选择的基因,使用模糊标准加法模型(FSAM)进行癌症分类的方法。传统的FSAM学习是一个包括无监督结构学习和有监督参数调整的混合过程。遗传算法(GA)被纳入无监督和有监督训练之间,以优化模糊规则的数量。GA的整合使FSAM能够处理微阵列数据的高维低样本特性,从而提高分类效率。在众多微阵列数据集上进行了实验。结果表明,基于AHP的基因选择相对于单一排序方法具有性能优势。此外,与各种竞争分类器相比,AHP-FSAM的组合在微阵列数据分类中显示出很高的准确性。因此,所提出的方法对于医学从业者和临床医生来说是一个有用的决策支持系统,可以在实际医疗实践中实施。