School of Computer and Information Engineering, Henan University, Kaifeng 475004, China.
School of Artificial Intelligence, Henan University, Zhengzhou 450046, China.
Math Biosci Eng. 2023 Feb 3;20(4):6652-6665. doi: 10.3934/mbe.2023286.
The developing of DNA microarray technology has made it possible to study the cancer in view of the genes. Since the correlation between the genes is unconsidered, current unsupervised feature selection models may select lots of the redundant genes during the feature selecting due to the over focusing on genes with similar attribute. which may deteriorate the clustering performance of the model. To tackle this problem, we propose an adaptive feature selection model here in which reconstructed coefficient matrix with additional constraint is introduced to transform original data of high dimensional space into a low-dimensional space meanwhile to prevent over focusing on genes with similar attribute. Moreover, Alternative Optimization (AO) is also proposed to handle the nonconvex optimization induced by solving the proposed model. The experimental results on four different cancer datasets show that the proposed model is superior to existing models in the aspects such as clustering accuracy and sparsity of selected genes.
DNA 微阵列技术的发展使得从基因的角度研究癌症成为可能。由于当前的无监督特征选择模型不考虑基因之间的相关性,因此在特征选择过程中,由于过度关注具有相似属性的基因,可能会选择许多冗余基因,从而降低模型的聚类性能。为了解决这个问题,我们在这里提出了一个自适应特征选择模型,其中引入了带有附加约束的重构系数矩阵,将原始的高维空间数据转换到低维空间,同时防止过度关注具有相似属性的基因。此外,还提出了交替优化(AO)来处理由解决所提出的模型所引起的非凸优化。在四个不同的癌症数据集上的实验结果表明,所提出的模型在聚类准确性和选择基因的稀疏性方面优于现有模型。