• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于特征子集的新的改进的最大相关性和最小冗余方法。

A new improved maximal relevance and minimal redundancy method based on feature subset.

作者信息

Xie Shanshan, Zhang Yan, Lv Danjv, Chen Xu, Lu Jing, Liu Jiang

机构信息

College of Big Data and Intelligent Engineering, Southwest Forestry University, Kunming, 650224 China.

College of Mathematics and Physics, Southwest Forestry University, Kunming, 650224 China.

出版信息

J Supercomput. 2023;79(3):3157-3180. doi: 10.1007/s11227-022-04763-2. Epub 2022 Aug 30.

DOI:10.1007/s11227-022-04763-2
PMID:36060093
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9424812/
Abstract

Feature selection plays a very significant role for the success of pattern recognition and data mining. Based on the maximal relevance and minimal redundancy (mRMR) method, combined with feature subset, this paper proposes an improved maximal relevance and minimal redundancy (ImRMR) feature selection method based on feature subset. In ImRMR, the Pearson correlation coefficient and mutual information are first used to measure the relevance of a single feature to the sample category, and a factor is introduced to adjust the weights of the two measurement criteria. And an equal grouping method is exploited to generate candidate feature subsets according to the ranking features. Then, the relevance and redundancy of candidate feature subsets are calculated and the ordered sequence of these feature subsets is gained by incremental search method. Finally, the final optimal feature subset is obtained from these feature subsets by combining the sequence forward search method and the classification learning algorithm. Experiments are conducted on seven datasets. The results show that ImRMR can effectively remove irrelevant and redundant features, which can not only reduce the dimension of sample features and time of model training and prediction, but also improve the classification performance.

摘要

特征选择对于模式识别和数据挖掘的成功起着非常重要的作用。基于最大相关性和最小冗余度(mRMR)方法,结合特征子集,本文提出了一种基于特征子集的改进型最大相关性和最小冗余度(ImRMR)特征选择方法。在ImRMR中,首先使用皮尔逊相关系数和互信息来度量单个特征与样本类别的相关性,并引入一个因子来调整这两个度量标准的权重。然后采用等分组方法根据排序后的特征生成候选特征子集。接着,计算候选特征子集的相关性和冗余度,并通过增量搜索方法获得这些特征子集的有序序列。最后,结合序列前向搜索方法和分类学习算法从这些特征子集中获得最终的最优特征子集。在七个数据集上进行了实验。结果表明,ImRMR能够有效地去除不相关和冗余的特征,不仅可以降低样本特征的维度以及模型训练和预测的时间,还能提高分类性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/5c949d692281/11227_2022_4763_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/093c880d7ad7/11227_2022_4763_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/8c7903152297/11227_2022_4763_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/4b07ae262056/11227_2022_4763_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/581b68de3a16/11227_2022_4763_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/241239525281/11227_2022_4763_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/5c949d692281/11227_2022_4763_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/093c880d7ad7/11227_2022_4763_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/8c7903152297/11227_2022_4763_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/4b07ae262056/11227_2022_4763_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/581b68de3a16/11227_2022_4763_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/241239525281/11227_2022_4763_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce96/9424812/5c949d692281/11227_2022_4763_Fig6_HTML.jpg

相似文献

1
A new improved maximal relevance and minimal redundancy method based on feature subset.一种基于特征子集的新的改进的最大相关性和最小冗余方法。
J Supercomput. 2023;79(3):3157-3180. doi: 10.1007/s11227-022-04763-2. Epub 2022 Aug 30.
2
A filter feature selection method based on the Maximal Information Coefficient and Gram-Schmidt Orthogonalization for biomedical data mining.基于最大信息系数和 Gram-Schmidt 正交化的生物医学数据挖掘过滤特征选择方法。
Comput Biol Med. 2017 Oct 1;89:264-274. doi: 10.1016/j.compbiomed.2017.08.021. Epub 2017 Aug 24.
3
Semisupervised Feature Selection Based on Relevance and Redundancy Criteria.基于相关性和冗余性准则的半监督特征选择。
IEEE Trans Neural Netw Learn Syst. 2017 Sep;28(9):1974-1984. doi: 10.1109/TNNLS.2016.2562670. Epub 2016 May 20.
4
A new froth image classification method based on the MRMR-SSGMM hybrid model for recognition of reagent dosage condition in the coal flotation process.一种基于MRMR-SSGMM混合模型的新泡沫图像分类方法,用于识别煤炭浮选过程中的药剂用量条件。
Appl Intell (Dordr). 2022;52(1):732-752. doi: 10.1007/s10489-021-02328-z. Epub 2021 May 10.
5
Feature Selection With Maximal Relevance and Minimal Supervised Redundancy.基于最大相关性和最小监督冗余的特征选择
IEEE Trans Cybern. 2023 Feb;53(2):707-717. doi: 10.1109/TCYB.2021.3139898. Epub 2023 Jan 13.
6
Supervised Relevance-Redundancy assessments for feature selection in omics-based classification scenarios.基于组学的分类场景中特征选择的有监督相关性-冗余评估。
J Biomed Inform. 2023 Aug;144:104457. doi: 10.1016/j.jbi.2023.104457. Epub 2023 Jul 23.
7
FSCME: A Feature Selection Method Combining Copula Correlation and Maximal Information Coefficient by Entropy Weights.FSCME:一种基于熵权的结合 Copula 相关性和最大信息系数的特征选择方法。
IEEE J Biomed Health Inform. 2024 Sep;28(9):5638-5648. doi: 10.1109/JBHI.2024.3409628. Epub 2024 Sep 5.
8
Deep Learning Based Feature Selection Algorithm for Small Targets Based on mRMR.基于mRMR的深度学习小目标特征选择算法
Micromachines (Basel). 2022 Oct 18;13(10):1765. doi: 10.3390/mi13101765.
9
Chi-MIC-share: a new feature selection algorithm for quantitative structure-activity relationship models.Chi-MIC-share:一种用于定量构效关系模型的新特征选择算法。
RSC Adv. 2020 May 27;10(34):19852-19860. doi: 10.1039/d0ra00061b. eCollection 2020 May 26.
10
EEG feature selection method based on maximum information coefficient and quantum particle swarm.基于最大信息系数和量子粒子群的脑电图特征选择方法
Sci Rep. 2023 Sep 4;13(1):14515. doi: 10.1038/s41598-023-41682-5.

引用本文的文献

1
Detection of Thymoma Disease Using mRMR Feature Selection and Transformer Models.基于最小冗余最大相关(mRMR)特征选择和Transformer模型的胸腺瘤疾病检测
Diagnostics (Basel). 2024 Sep 29;14(19):2169. doi: 10.3390/diagnostics14192169.
2
Energy optimization for wireless sensor network using minimum redundancy maximum relevance feature selection and classification techniques.使用最小冗余最大相关特征选择和分类技术的无线传感器网络能量优化
PeerJ Comput Sci. 2024 Apr 30;10:e1997. doi: 10.7717/peerj-cs.1997. eCollection 2024.
3
Hybrid similarity relation based mutual information for feature selection in intuitionistic fuzzy rough framework and its applications.
直觉模糊粗糙框架下基于混合相似关系的互信息特征选择及其应用
Sci Rep. 2024 Mar 12;14(1):5958. doi: 10.1038/s41598-024-55902-z.
4
Hybrid Learning Models for IMU-Based HAR with Feature Analysis and Data Correction.基于 IMU 的 HAR 的混合学习模型,具有特征分析和数据校正功能。
Sensors (Basel). 2023 Sep 11;23(18):7802. doi: 10.3390/s23187802.
5
Detection of Red Pepper Powder Adulteration with Allura Red and Red Pepper Seeds Using Hyperspectral Imaging.利用高光谱成像技术检测辣椒粉末中诱惑红和辣椒籽的掺假情况
Foods. 2023 Sep 18;12(18):3471. doi: 10.3390/foods12183471.