Kernel Partial Least Squares Feature Selection Based on Maximum Weight Minimum Redundancy.

Authors

Liu Xiling, Zhou Shuisheng

Affiliations

School of Mathematics and Statistics, Xidian University, Xi'an 710071, China.

Public Education Department, Zhengzhou University of Economics and Business, Zhengzhou 451191, China.

Publication information

Entropy (Basel). 2023 Feb 10;25(2):325. doi: 10.3390/e25020325.

DOI: 10.3390/e25020325
PMID: 36832691
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9955929/
Abstract

Feature selection is a vital task in machine learning and data mining. The maximum weight minimum redundancy feature selection method considers the importance of features while also reducing the redundancy among them. However, datasets differ in their characteristics, so a feature selection method should adapt its feature evaluation criterion to each dataset. In addition, high-dimensional data make it challenging to improve the classification performance of feature selection methods. This study presents a kernel partial least squares (KPLS) feature selection method based on an improved maximum weight minimum redundancy algorithm, with the aim of simplifying the calculation and improving classification accuracy on high-dimensional datasets. A weight factor is introduced to adjust the balance between maximum weight and minimum redundancy in the evaluation criterion, which yields the improved maximum weight minimum redundancy method. The proposed KPLS feature selection method considers both the redundancy among features and the weight of each feature with respect to the class label in different datasets. The method was tested for classification accuracy on noisy data and on several datasets. The experimental results on different datasets demonstrate the feasibility and effectiveness of the proposed method, which selects an optimal feature subset and achieves strong classification performance on three different metrics compared with other feature selection methods.
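
The abstract does not state the evaluation criterion itself, so the following is only a minimal sketch of how a greedy maximum weight minimum redundancy selection with a weight factor might look. Everything specific here is an assumption made for illustration: the function name `mwmr_select`, the use of absolute Pearson correlation as a stand-in for both the feature weight and the redundancy (the paper derives its weights from KPLS, which is not reproduced here), and the linear form `alpha * weight - (1 - alpha) * redundancy` with `alpha` playing the role of the weight factor.

```python
import numpy as np


def mwmr_select(X, y, k, alpha=0.5):
    """Greedy maximum-weight minimum-redundancy selection (illustrative only).

    Assumptions, not taken from the paper:
      - weight of a feature = |Pearson r| between the feature and the label
      - redundancy = mean |Pearson r| with the already selected features
      - alpha in [0, 1] trades weight (alpha) against redundancy (1 - alpha)
    Returns the indices of the selected features in selection order.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n_features = X.shape[1]

    # Absolute correlations among all features and the label (last column).
    corr = np.abs(np.corrcoef(np.column_stack([X, y]), rowvar=False))
    corr = np.nan_to_num(corr)  # constant columns yield NaN; treat as 0
    weight = corr[:n_features, -1]
    redundancy = corr[:n_features, :n_features]

    # Start from the feature with the largest weight.
    selected = [int(np.argmax(weight))]
    while len(selected) < min(k, n_features):
        remaining = [j for j in range(n_features) if j not in selected]
        scores = [
            alpha * weight[j] - (1 - alpha) * redundancy[j, selected].mean()
            for j in remaining
        ]
        selected.append(remaining[int(np.argmax(scores))])
    return selected
```

Under these assumptions, `alpha = 1.0` ranks features purely by weight, so near-duplicate features can be selected together, while smaller values of `alpha` penalise candidates that correlate strongly with features already chosen.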


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5690/9955929/7caccfe4af2f/entropy-25-00325-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5690/9955929/da7352c8651d/entropy-25-00325-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5690/9955929/3c922ca23ce8/entropy-25-00325-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5690/9955929/753a1eee5a1a/entropy-25-00325-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5690/9955929/212938486978/entropy-25-00325-g004a.jpg

Similar articles

1. Kernel Partial Least Squares Feature Selection Based on Maximum Weight Minimum Redundancy.
   Entropy (Basel). 2023 Feb 10;25(2):325. doi: 10.3390/e25020325.
2. Computer-assisted lip diagnosis on Traditional Chinese Medicine using multi-class support vector machines.
   BMC Complement Altern Med. 2012 Aug 16;12:127. doi: 10.1186/1472-6882-12-127.
3. A filter feature selection method based on the Maximal Information Coefficient and Gram-Schmidt Orthogonalization for biomedical data mining.
   Comput Biol Med. 2017 Oct 1;89:264-274. doi: 10.1016/j.compbiomed.2017.08.021. Epub 2017 Aug 24.
4. AlPOs synthetic factor analysis based on maximum weight and minimum redundancy feature selection.
   Int J Mol Sci. 2013 Nov 8;14(11):22132-48. doi: 10.3390/ijms141122132.
5. Minimum redundancy maximum relevance feature selection approach for temporal gene expression data.
   BMC Bioinformatics. 2017 Jan 3;18(1):9. doi: 10.1186/s12859-016-1423-9.
6. A Feature Selection Algorithm Integrating Maximum Classification Information and Minimum Interaction Feature Dependency Information.
   Comput Intell Neurosci. 2021 Dec 28;2021:3569632. doi: 10.1155/2021/3569632. eCollection 2021.
7. Multi-Label Feature Selection with Conditional Mutual Information.
   Comput Intell Neurosci. 2022 Oct 8;2022:9243893. doi: 10.1155/2022/9243893. eCollection 2022.
8. Feature Selection With Maximal Relevance and Minimal Supervised Redundancy.
   IEEE Trans Cybern. 2023 Feb;53(2):707-717. doi: 10.1109/TCYB.2021.3139898. Epub 2023 Jan 13.
9. A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class.
   Comb Chem High Throughput Screen. 2017;20(7):612-621. doi: 10.2174/1386207320666170314103147.
10. The feature selection bias problem in relation to high-dimensional gene data.
    Artif Intell Med. 2016 Jan;66:63-71. doi: 10.1016/j.artmed.2015.11.001. Epub 2015 Nov 14.

Cited by

1. An Industrial Internet Security Assessment Model Based on a Selectable Confidence Rule Base.
   Sensors (Basel). 2024 Nov 27;24(23):7577. doi: 10.3390/s24237577.
2. Toward molecular diagnosis of major depressive disorder by plasma peptides using a deep learning approach.
   Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae554.

References

1. Driver Stress Detection Using Ultra-Short-Term HRV Analysis under Real World Driving Conditions.
   Entropy (Basel). 2023 Jan 19;25(2):194. doi: 10.3390/e25020194.
2. Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation.
   Entropy (Basel). 2022 Dec 21;25(1):14. doi: 10.3390/e25010014.
3. Pruning Decision Rules by Reduct-Based Weighting and Ranking of Features.
   Entropy (Basel). 2022 Nov 3;24(11):1602. doi: 10.3390/e24111602.
4. Using mutual information for selecting features in supervised neural net learning.
   IEEE Trans Neural Netw. 1994;5(4):537-50. doi: 10.1109/72.298224.
5. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.
   IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1226-38. doi: 10.1109/TPAMI.2005.159.
6. Statistical methods in epidemiology. v. Towards an understanding of the kappa coefficient.
   Disabil Rehabil. 2000 May 20;22(8):339-44. doi: 10.1080/096382800296575.