Aziz Rabia, Verma C K, Srivastava Namita
Department of Mathematics & Computer Application, Maulana Azad National Institute of Technology, Bhopal, M.P., 462003, India.
Department of Mathematics & Computer Application, Maulana Azad National Institute of Technology, Bhopal, M.P., 462003, India.
Comput Biol Chem. 2017 Dec;71:161-169. doi: 10.1016/j.compbiolchem.2017.10.009. Epub 2017 Oct 28.
This paper proposes a new hybrid search technique for feature (gene) selection (FS) using Independent component analysis (ICA) and Artificial Bee Colony (ABC) called ICA+ABC, to select informative genes based on a Naïve Bayes (NB) algorithm. An important trait of this technique is the optimization of ICA feature vector using ABC. ICA+ABC is a hybrid search algorithm that combines the benefits of extraction approach, to reduce the size of data and wrapper approach, to optimize the reduced feature vectors. This hybrid search technique is facilitated by evaluating the performance of ICA+ABC on six standard gene expression datasets of classification. Extensive experiments were conducted to compare the performance of ICA+ABC with the results obtained from recently published Minimum Redundancy Maximum Relevance (mRMR) +ABC algorithm for NB classifier. Also to check the performance that how ICA+ABC works as feature selection with NB classifier, compared the combination of ICA with popular filter techniques and with other similar bio inspired algorithm such as Genetic Algorithm (GA) and Particle Swarm Optimization (PSO). The result shows that ICA+ABC has a significant ability to generate small subsets of genes from the ICA feature vector, that significantly improve the classification accuracy of NB classifier compared to other previously suggested methods.
本文提出了一种新的混合搜索技术,即使用独立成分分析(ICA)和人工蜂群算法(ABC)的特征(基因)选择(FS)方法,称为ICA+ABC,用于基于朴素贝叶斯(NB)算法选择信息基因。该技术的一个重要特点是使用ABC对ICA特征向量进行优化。ICA+ABC是一种混合搜索算法,它结合了提取方法的优点以减少数据量,以及包装方法的优点以优化降维后的特征向量。通过评估ICA+ABC在六个标准基因表达分类数据集上的性能,促进了这种混合搜索技术的发展。进行了大量实验,将ICA+ABC的性能与最近发表的用于NB分类器的最小冗余最大相关性(mRMR)+ABC算法的结果进行比较。此外,为了检验ICA+ABC作为NB分类器的特征选择方法的性能,还将ICA与流行的过滤技术以及其他类似的生物启发算法(如遗传算法(GA)和粒子群优化(PSO))的组合进行了比较。结果表明,ICA+ABC具有从ICA特征向量中生成小基因子集的显著能力,与其他先前提出的方法相比,这显著提高了NB分类器的分类准确率。