Devaraj Senthilkumar, Paulraj S
Department of Computer Science and Engineering, University College of Engineering, Anna University, Tiruchirappalli, Tamil Nadu, India.
Department of Mathematics, College of Engineering, Anna University, Tamil Nadu, India.
ScientificWorldJournal. 2015;2015:821798. doi: 10.1155/2015/821798. Epub 2015 Sep 28.
Multidimensional medical data classification has recently received increased attention by researchers working on machine learning and data mining. In multidimensional dataset (MDD) each instance is associated with multiple class values. Due to its complex nature, feature selection and classifier built from the MDD are typically more expensive or time-consuming. Therefore, we need a robust feature selection technique for selecting the optimum single subset of the features of the MDD for further analysis or to design a classifier. In this paper, an efficient feature selection algorithm is proposed for the classification of MDD. The proposed multidimensional feature subset selection (MFSS) algorithm yields a unique feature subset for further analysis or to build a classifier and there is a computational advantage on MDD compared with the existing feature selection algorithms. The proposed work is applied to benchmark multidimensional datasets. The number of features was reduced to 3% minimum and 30% maximum by using the proposed MFSS. In conclusion, the study results show that MFSS is an efficient feature selection algorithm without affecting the classification accuracy even for the reduced number of features. Also the proposed MFSS algorithm is suitable for both problem transformation and algorithm adaptation and it has great potentials in those applications generating multidimensional datasets.
多维医学数据分类最近受到了从事机器学习和数据挖掘研究人员的更多关注。在多维数据集(MDD)中,每个实例都与多个类值相关联。由于其性质复杂,从MDD构建的特征选择和分类器通常更昂贵或更耗时。因此,我们需要一种强大的特征选择技术来选择MDD特征的最佳单个子集,以便进行进一步分析或设计分类器。本文提出了一种用于MDD分类的高效特征选择算法。所提出的多维特征子集选择(MFSS)算法产生一个唯一的特征子集用于进一步分析或构建分类器,并且与现有的特征选择算法相比,在MDD上具有计算优势。所提出的工作应用于基准多维数据集。使用所提出的MFSS,特征数量减少到最小3%和最大30%。总之,研究结果表明,MFSS是一种高效的特征选择算法,即使特征数量减少也不会影响分类精度。此外,所提出的MFSS算法适用于问题转换和算法适配,并且在生成多维数据集的那些应用中具有很大的潜力。