基于增量前向迭代拉普拉斯分数的无监督特征选择

Unsupervised feature selection based on incremental forward iterative Laplacian score.

作者信息

Jiang Jiefang, Zhang Xianyong, Yang Jilin

机构信息

School of Mathematical Sciences, Sichuan Normal University, Chengdu, 610066 China.

Institute of Intelligent Information and Quantum Information, Sichuan Normal University, Chengdu, 610066 China.

出版信息

Artif Intell Rev. 2023;56(5):4077-4112. doi: 10.1007/s10462-022-10274-6. Epub 2022 Sep 19.

DOI:10.1007/s10462-022-10274-6

PMID:36160366

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9484723/

Abstract

Feature selection facilitates intelligent information processing, and the unsupervised learning of feature selection has become important. In terms of unsupervised feature selection, the Laplacian score (LS) provides a powerful measurement and optimization method, and good performance has been achieved using the recent forward iterative Laplacian score (FILS) algorithm. However, there is still room for advancement. The aim of this paper is to improve the FILS algorithm, and thus, feature significance (SIG) is mainly introduced to develop a high-quality selection method, i.e., the incremental forward iterative Laplacian score (IFILS) algorithm. Based on the modified LS, the metric difference in the incremental feature process motivates SIG. Therefore, SIG offers a dynamic characterization by considering initial and terminal states, and it promotes the current FILS measurement on only the terminal state. Then, both the modified LS and integrated SIG acquire granulation nonmonotonicity and uncertainty, especially on incremental feature chains, and the corresponding verification is achieved by completing examples and experiments. Furthermore, a SIG-based incremental criterion of minimum selection is designed to choose optimization features, and thus, the IFILS algorithm is naturally formulated to implement unsupervised feature selection. Finally, an in-depth comparison of the IFILS algorithm with the FILS algorithm is achieved using data experiments on multiple datasets, including a nominal dataset of COVID-19 surveillance. As validated by the experimental results, the IFILS algorithm outperforms the FILS algorithm and achieves better classification performance.

摘要

特征选择有助于智能信息处理，无监督特征选择学习变得至关重要。在无监督特征选择方面，拉普拉斯分数（LS）提供了一种强大的度量和优化方法，并且使用最近的前向迭代拉普拉斯分数（FILS）算法已取得了良好的性能。然而，仍有改进的空间。本文的目的是改进FILS算法，因此，主要引入特征显著性（SIG）来开发一种高质量的选择方法，即增量前向迭代拉普拉斯分数（IFILS）算法。基于改进的LS，增量特征过程中的度量差异激发了SIG。因此，SIG通过考虑初始状态和终端状态提供了一种动态表征，并且它仅促进当前对终端状态的FILS度量。然后，改进的LS和整合的SIG都具有粒度非单调性和不确定性，特别是在增量特征链上，并且通过完整的示例和实验实现了相应的验证。此外，设计了基于SIG的最小选择增量准则来选择优化特征，从而自然地制定了IFILS算法以实现无监督特征选择。最后，使用包括COVID-19监测名义数据集在内的多个数据集上的数据实验，对IFILS算法和FILS算法进行了深入比较。实验结果验证了IFILS算法优于FILS算法，并实现了更好的分类性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7381/9484723/441852e784e3/10462_2022_10274_Fig1_HTML.jpg

相似文献

Unsupervised feature selection based on incremental forward iterative Laplacian score.基于增量前向迭代拉普拉斯分数的无监督特征选择

Artif Intell Rev. 2023;56(5):4077-4112. doi: 10.1007/s10462-022-10274-6. Epub 2022 Sep 19.

Laplacian linear discriminant analysis approach to unsupervised feature selection.拉普拉斯线性判别分析方法在无监督特征选择中的应用。

IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):605-14. doi: 10.1109/TCBB.2007.70257.

Deep unsupervised feature selection by discarding nuisance and correlated features.深度无监督特征选择，通过丢弃无关和相关特征。

Neural Netw. 2022 Aug;152:34-43. doi: 10.1016/j.neunet.2022.04.002. Epub 2022 Apr 12.

Autoweighted Multiview Feature Selection With Graph Optimization.基于图优化的自动加权多视图特征选择

IEEE Trans Cybern. 2022 Dec;52(12):12966-12977. doi: 10.1109/TCYB.2021.3094843. Epub 2022 Nov 18.

Semisupervised Feature Selection Based on Relevance and Redundancy Criteria.基于相关性和冗余性准则的半监督特征选择。

IEEE Trans Neural Netw Learn Syst. 2017 Sep;28(9):1974-1984. doi: 10.1109/TNNLS.2016.2562670. Epub 2016 May 20.

LLE Score: A New Filter-Based Unsupervised Feature Selection Method Based on Nonlinear Manifold Embedding and Its Application to Image Recognition.LLE 得分：一种新的基于非线性流形嵌入的基于过滤的无监督特征选择方法及其在图像识别中的应用。

IEEE Trans Image Process. 2017 Nov;26(11):5257-5269. doi: 10.1109/TIP.2017.2733200. Epub 2017 Jul 28.

A Variance Minimization Criterion to Feature Selection Using Laplacian Regularization.基于拉普拉斯正则化的特征选择的方差最小化准则。

IEEE Trans Pattern Anal Mach Intell. 2011 Oct;33(10):2013-25. doi: 10.1109/TPAMI.2011.44. Epub 2011 Mar 10.

Exploiting Local Coherent Patterns for Unsupervised Feature Ranking.利用局部相干模式进行无监督特征排序

IEEE Trans Syst Man Cybern B Cybern. 2011 Dec;41(6):1471-82. doi: 10.1109/TSMCB.2011.2151256. Epub 2011 Jun 16.

Rough sets and Laplacian score based cost-sensitive feature selection.基于粗糙集和拉普拉斯得分的代价敏感特征选择。

PLoS One. 2018 Jun 18;13(6):e0197564. doi: 10.1371/journal.pone.0197564. eCollection 2018.

Locality preserving score for joint feature weights learning.局部保持评分的联合特征权重学习。

Neural Netw. 2015 Sep;69:126-34. doi: 10.1016/j.neunet.2015.06.001. Epub 2015 Jun 15.

本文引用的文献

Smart microalgae farming with internet-of-things for sustainable agriculture.利用物联网进行智能微藻养殖，实现可持续农业。

Biotechnol Adv. 2022 Jul-Aug;57:107931. doi: 10.1016/j.biotechadv.2022.107931. Epub 2022 Feb 22.

Sustainable smart photobioreactor for continuous cultivation of microalgae embedded with Internet of Things.可持续智能光生物反应器，用于嵌入物联网的微藻连续培养。

Bioresour Technol. 2022 Feb;346:126558. doi: 10.1016/j.biortech.2021.126558. Epub 2021 Dec 11.

Feature Selection Combining Information Theory View and Algebraic View in the Neighborhood Decision System.邻域决策系统中结合信息论视角与代数视角的特征选择

Entropy (Basel). 2021 Jun 2;23(6):704. doi: 10.3390/e23060704.

Valorization of groundnut shell via pyrolysis: Product distribution, thermodynamic analysis, kinetic estimation, and artificial neural network modeling.通过热解实现花生壳的增值利用：产品分布、热力学分析、动力学估算和人工神经网络建模。

Chemosphere. 2021 Nov;283:131162. doi: 10.1016/j.chemosphere.2021.131162. Epub 2021 Jun 15.

Integration of multi-objective PSO based feature selection and node centrality for medical datasets.基于多目标 PSO 的特征选择和节点中心性在医学数据集上的集成。

Genomics. 2020 Nov;112(6):4370-4384. doi: 10.1016/j.ygeno.2020.07.027. Epub 2020 Jul 25.

Unsupervised feature selection algorithm for multiclass cancer classification of gene expression RNA-Seq data.无监督特征选择算法在基因表达 RNA-Seq 数据的多类癌症分类中的应用。

Genomics. 2020 Mar;112(2):1916-1925. doi: 10.1016/j.ygeno.2019.11.004. Epub 2019 Nov 20.

A review of feature selection methods in medical applications.医学应用中的特征选择方法综述。

Comput Biol Med. 2019 Sep;112:103375. doi: 10.1016/j.compbiomed.2019.103375. Epub 2019 Jul 31.

Feature Selection Based on Neighborhood Discrimination Index.基于邻域判别指数的特征选择

IEEE Trans Neural Netw Learn Syst. 2018 Jul;29(7):2986-2999. doi: 10.1109/TNNLS.2017.2710422. Epub 2017 Jun 23.

Adaptive Unsupervised Feature Selection With Structure Regularization.自适应无监督特征选择与结构正则化。

IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):944-956. doi: 10.1109/TNNLS.2017.2650978. Epub 2017 Jan 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于增量前向迭代拉普拉斯分数的无监督特征选择

Unsupervised feature selection based on incremental forward iterative Laplacian score.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献