
FSCME: A Feature Selection Method Combining Copula Correlation and Maximal Information Coefficient by Entropy Weights.

Publication information

IEEE J Biomed Health Inform. 2024 Sep;28(9):5638-5648. doi: 10.1109/JBHI.2024.3409628. Epub 2024 Sep 5.

DOI: 10.1109/JBHI.2024.3409628
PMID: 38833405

Abstract

Feature selection is a critical component of data mining and has garnered significant attention in recent years. However, feature selection methods based on information entropy often introduce complex mutual information forms to measure features, leading to increased redundancy and potential errors. To address this issue, we propose FSCME, a feature selection method that combines Copula correlation (Ccor) and the maximal information coefficient (MIC) by entropy weights. FSCME takes into account both the relevance between features and labels and the redundancy between candidate features and already selected features. It uses Ccor to measure the redundancy between features and to estimate the relevance between features and labels, and it employs MIC to enhance the credibility of the correlation between features and labels. The Entropy Weight Method (EWM) is then used to evaluate and assign weights to Ccor and MIC. The experimental results demonstrate that FSCME yields a more effective feature subset for subsequent clustering processes, significantly improving the classification performance compared to the other six feature selection methods.
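
The abstract does not give the formulas FSCME uses, so the following is only a minimal sketch of the textbook entropy weight method applied to two score columns, not the authors' implementation. The function names `entropy_weights` and `combine`, the weighted-sum combination, and the Ccor and MIC values are all hypothetical and chosen purely for illustration.

```python
import math


def entropy_weights(columns):
    """Return one weight per criterion using the standard entropy weight method.

    columns: list of equal-length lists of non-negative scores, one list per
    criterion (here: made-up Ccor and MIC relevance scores per feature).
    """
    n = len(columns[0])
    if n < 2:
        raise ValueError("need at least two features to compute entropy")
    divergences = []
    for col in columns:
        total = sum(col)
        if total <= 0:
            raise ValueError("each criterion needs a positive total score")
        # Share of each feature in this criterion's total score.
        shares = [v / total for v in col]
        # Normalized Shannon entropy in [0, 1]; 0 * log(0) is treated as 0.
        entropy = -sum(p * math.log(p) for p in shares if p > 0) / math.log(n)
        # A criterion whose scores vary more has lower entropy, higher weight.
        divergences.append(1.0 - entropy)
    total_div = sum(divergences)
    if total_div <= 1e-12:
        # Every criterion is (almost) uniform: fall back to equal weights.
        return [1.0 / len(columns)] * len(columns)
    return [d / total_div for d in divergences]


def combine(columns, weights):
    """Weighted sum of the criteria for each feature (an assumed combination)."""
    n = len(columns[0])
    return [sum(w * col[i] for w, col in zip(weights, columns)) for i in range(n)]


# Hypothetical relevance scores for four features (not from the paper).
ccor_scores = [0.90, 0.10, 0.50, 0.30]
mic_scores = [0.60, 0.50, 0.55, 0.45]

weights = entropy_weights([ccor_scores, mic_scores])
combined = combine([ccor_scores, mic_scores], weights)
# Feature indices ordered from highest to lowest combined score.
ranking = sorted(range(len(combined)), key=lambda i: combined[i], reverse=True)
print(weights, ranking)
```

In this toy input the Ccor column is far more dispersed than the MIC column, so it receives most of the weight. The redundancy term between candidate and selected features that the abstract mentions is not modelled here.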


Similar articles

1
FSCME: A Feature Selection Method Combining Copula Correlation and Maximal Information Coefficient by Entropy Weights.
IEEE J Biomed Health Inform. 2024 Sep;28(9):5638-5648. doi: 10.1109/JBHI.2024.3409628. Epub 2024 Sep 5.
2
A filter feature selection method based on the Maximal Information Coefficient and Gram-Schmidt Orthogonalization for biomedical data mining.
Comput Biol Med. 2017 Oct 1;89:264-274. doi: 10.1016/j.compbiomed.2017.08.021. Epub 2017 Aug 24.
3
A neurodynamic optimization approach to supervised feature selection via fractional programming.
Neural Netw. 2021 Apr;136:194-206. doi: 10.1016/j.neunet.2021.01.004. Epub 2021 Jan 14.
4
Multi-Label Feature Selection with Conditional Mutual Information.
Comput Intell Neurosci. 2022 Oct 8;2022:9243893. doi: 10.1155/2022/9243893. eCollection 2022.
5
A new improved maximal relevance and minimal redundancy method based on feature subset.
J Supercomput. 2023;79(3):3157-3180. doi: 10.1007/s11227-022-04763-2. Epub 2022 Aug 30.
6
A Feature Selection Algorithm Integrating Maximum Classification Information and Minimum Interaction Feature Dependency Information.
Comput Intell Neurosci. 2021 Dec 28;2021:3569632. doi: 10.1155/2021/3569632. eCollection 2021.
7
Comparison of five supervised feature selection algorithms leading to top features and gene signatures from multi-omics data in cancer.
BMC Bioinformatics. 2022 Apr 28;23(Suppl 3):153. doi: 10.1186/s12859-022-04678-y.
8
Unsupervised Feature Selection Using an Integrated Strategy of Hierarchical Clustering With Singular Value Decomposition: An Integrative Biomarker Discovery Method With Application to Acute Myeloid Leukemia.
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1354-1364. doi: 10.1109/TCBB.2021.3110989. Epub 2022 Jun 3.
9
Feature Selection With Maximal Relevance and Minimal Supervised Redundancy.
IEEE Trans Cybern. 2023 Feb;53(2):707-717. doi: 10.1109/TCYB.2021.3139898. Epub 2023 Jan 13.
10
An entropy-based gene selection method for cancer classification using microarray data.
BMC Bioinformatics. 2005 Mar 24;6:76. doi: 10.1186/1471-2105-6-76.