
Identifying (Quasi) Equally Informative Subsets in Feature Selection Problems for Classification: A Max-Relevance Min-Redundancy Approach.

Publication Information

IEEE Trans Cybern. 2016 Jun;46(6):1424-37. doi: 10.1109/TCYB.2015.2444435. Epub 2015 Jul 6.

Abstract

An emerging trend in feature selection is the development of two-objective algorithms that analyze the tradeoff between the number of features and the classification performance of the model built with these features. Since these two objectives are conflicting, the typical result is a set of Pareto-efficient subsets, each with a different cardinality and a corresponding discriminating power. However, this approach overlooks the fact that, for a given cardinality, there can be several subsets with similar information content. The study reported here addresses this problem and introduces a novel multiobjective feature selection approach conceived to identify: 1) a subset that maximizes the performance of a given classifier and 2) a set of subsets that are quasi equally informative, i.e., have almost the same classification performance as the performance-maximizing subset. The approach consists of a wrapper [Wrapper for Quasi Equally Informative Subset Selection (W-QEISS)] built on the formulation of a four-objective optimization problem, which aims at maximizing the accuracy of a classifier, minimizing the number of features, and optimizing two entropy-based measures of relevance and redundancy. This allows the search to be conducted in a larger space, thus enabling the wrapper to generate a large number of Pareto-efficient solutions. The algorithm is compared against the mRMR algorithm, a two-objective wrapper, and a computationally efficient filter [Filter for Quasi Equally Informative Subset Selection (F-QEISS)] on 24 University of California, Irvine (UCI) datasets covering both binary and multiclass classification. Experimental results show that W-QEISS is capable of evolving a rich and diverse set of Pareto-efficient solutions, and that their availability helps in: 1) studying the tradeoff between multiple measures of classification performance and 2) understanding the relative importance of each feature. The quasi equally informative subsets are identified at the cost of only a marginal increase in computational time, thanks to the adoption of the Borg Multiobjective Evolutionary Algorithm and the Extreme Learning Machine as the global optimization and learning algorithms, respectively.
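To make the four-objective formulation concrete, the sketch below shows how the objectives described in the abstract (classifier accuracy, subset cardinality, entropy-based relevance, and entropy-based redundancy) might be evaluated for a single candidate feature subset. This is a minimal illustration, not the authors' implementation: it assumes mutual information as the entropy-based relevance/redundancy proxy, substitutes a plain scikit-learn logistic regression for the Extreme Learning Machine used in the paper, uses a simple histogram discretization, and omits the Borg Multiobjective Evolutionary Algorithm that drives the actual search.

```python
# Minimal sketch (not the authors' code) of evaluating the four W-QEISS-style
# objectives for one candidate feature subset.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import mutual_info_score
from sklearn.model_selection import cross_val_score


def evaluate_subset(X, y, subset):
    """Return (accuracy, cardinality, relevance, redundancy) for a feature subset."""
    subset = list(subset)

    # Objective 1: classification accuracy of a model trained on the subset
    # (the paper uses an Extreme Learning Machine; logistic regression is a stand-in).
    clf = LogisticRegression(max_iter=1000)
    accuracy = cross_val_score(clf, X[:, subset], y, cv=5).mean()

    # Objective 2: number of selected features (to be minimized).
    cardinality = len(subset)

    # Discretize features so mutual information can be estimated by counting
    # (a simplification; the paper's entropy estimators may differ).
    Xd = np.array([np.digitize(X[:, j], np.histogram_bin_edges(X[:, j], bins=10))
                   for j in subset]).T

    # Objective 3: relevance -- average mutual information between each selected
    # feature and the class label (max-relevance term, to be maximized).
    relevance = np.mean([mutual_info_score(Xd[:, i], y) for i in range(len(subset))])

    # Objective 4: redundancy -- average pairwise mutual information among the
    # selected features (min-redundancy term, to be minimized).
    if len(subset) > 1:
        pairs = [(i, k) for i in range(len(subset)) for k in range(i + 1, len(subset))]
        redundancy = np.mean([mutual_info_score(Xd[:, i], Xd[:, k]) for i, k in pairs])
    else:
        redundancy = 0.0

    return accuracy, cardinality, relevance, redundancy


if __name__ == "__main__":
    X, y = load_breast_cancer(return_X_y=True)
    print(evaluate_subset(X, y, subset=[0, 7, 20, 27]))
```

In the wrapper described by the abstract, a multiobjective evolutionary algorithm would repeatedly call a function of this kind on candidate subsets and retain the Pareto-efficient ones, which is how the quasi equally informative subsets of a given cardinality emerge.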

