A Novel Feature Selection Method for High-Dimensional Mixed Decision Tables.

Authors

Thuy Nguyen Ngoc, Wongthanavasu Sartra

Publication information

IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):3024-3037. doi: 10.1109/TNNLS.2020.3048080. Epub 2022 Jul 6.

DOI: 10.1109/TNNLS.2020.3048080
PMID: 33449885
Abstract

Attribute reduction, also called feature selection, is one of the most important problems in rough set theory and is regarded as a vital preprocessing step in pattern recognition, machine learning, and data mining. High-dimensional, mixed, and incomplete data sets are now very common in real-world applications, and selecting a promising feature subset from such data sets is an interesting but challenging problem. Almost all existing methods generate a cover on the space of objects to determine important features. However, some tolerance classes in the cover are useless for the computational process. This article therefore introduces a new concept, the stripped neighborhood cover, which removes unnecessary tolerance classes from the original cover. Based on the proposed stripped neighborhood cover, we define a new reduct in mixed and incomplete decision tables and then design an efficient heuristic algorithm to find this reduct. In each iteration of the algorithm's main loop, we use an error measure to select an optimal feature and add it to the selected feature subset. In addition, to deal more efficiently with high-dimensional data sets, we determine redundant features after each iteration and remove them from the candidate feature subset. To verify the performance of the proposed algorithm, we carry out experiments on data sets downloaded from public data sources and compare it with existing state-of-the-art algorithms. The experimental results show that our algorithm outperforms the compared algorithms, especially in classification accuracy.
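
The loop described in the abstract (pick the feature with the lowest error, then prune redundant candidates) can be illustrated with a small sketch. This is not the authors' algorithm: the abstract does not define the tolerance relation, the stripping rule, the error measure, or the redundancy test, so everything below is an assumption chosen for illustration. Here two values are assumed tolerant if either is missing (`None`), if they are equal categorical values, or if they are numeric and within a hypothetical radius `delta`. A tolerance class is "stripped" when all of its objects share one decision. The error is the number of classes that remain. A candidate is treated as redundant when it cannot shrink any remaining class.

```python
from typing import Dict, List, Sequence, Set, Union

Value = Union[float, str, None]  # None marks a missing value (assumption)

def tolerant(u: Value, v: Value, delta: float) -> bool:
    """Assumed tolerance test on one attribute of a mixed, incomplete table."""
    if u is None or v is None:                    # missing tolerates anything
        return True
    if isinstance(u, str) or isinstance(v, str):  # categorical: exact match
        return u == v
    return abs(u - v) <= delta                    # numeric: within delta

def attribute_classes(column: Sequence[Value], delta: float) -> List[Set[int]]:
    """Tolerance class of every object with respect to a single attribute."""
    n = len(column)
    return [{j for j in range(n) if tolerant(column[i], column[j], delta)}
            for i in range(n)]

def strip(cover: Dict[int, Set[int]], labels: Sequence) -> Dict[int, Set[int]]:
    """Drop classes with a single decision; they cannot cause errors."""
    return {i: c for i, c in cover.items()
            if len({labels[j] for j in c}) > 1}

def select_features(table: Dict[str, Sequence[Value]], labels: Sequence,
                    delta: float = 0.1) -> List[str]:
    n = len(labels)
    per_attr = {a: attribute_classes(col, delta) for a, col in table.items()}

    # Error reached with all features: the stopping target.
    full = {i: set(range(n)) for i in range(n)}
    for classes in per_attr.values():
        full = {i: c & classes[i] for i, c in full.items()}
    target = len(strip(full, labels))

    cover = strip({i: set(range(n)) for i in range(n)}, labels)
    selected: List[str] = []
    candidates = list(table)
    while len(cover) > target and candidates:
        def refined(a: str) -> Dict[int, Set[int]]:
            # Stripped cover obtained by adding attribute a to the selection.
            return strip({i: c & per_attr[a][i] for i, c in cover.items()},
                         labels)
        best = min(candidates, key=lambda a: len(refined(a)))
        cover = refined(best)
        selected.append(best)
        candidates.remove(best)
        # Redundant: the attribute cannot shrink any remaining class.
        candidates = [a for a in candidates
                      if any(not c <= per_attr[a][i] for i, c in cover.items())]
    return selected

# Toy decision table (made-up data): one constant, one numeric column with a
# missing value, and one categorical column.
table = {
    "noise": ["x", "x", "x", "x"],
    "age": [0.1, 0.15, 0.9, None],
    "sex": ["f", "m", "f", "m"],
}
labels = ["a", "b", "b", "b"]
result = select_features(table, labels)
print(result)  # -> ['sex', 'age']
```

In this toy run, `sex` is chosen first because it leaves the fewest mixed-decision classes. The constant `noise` column is then pruned because it cannot refine any remaining class, and `age` resolves the rest. Because tolerance classes only shrink as features are added, a candidate pruned by this test stays useless in later iterations, which is what makes the pruning safe in this sketch.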


Similar articles

1. A Novel Feature Selection Method for High-Dimensional Mixed Decision Tables.
   IEEE Trans Neural Netw Learn Syst. 2022 Jul;33(7):3024-3037. doi: 10.1109/TNNLS.2020.3048080. Epub 2022 Jul 6.
2. R-Ensembler: A greedy rough set based ensemble attribute selection algorithm with kNN imputation for classification of medical data.
   Comput Methods Programs Biomed. 2020 Feb;184:105122. doi: 10.1016/j.cmpb.2019.105122. Epub 2019 Oct 8.
3. Rough sets and Laplacian score based cost-sensitive feature selection.
   PLoS One. 2018 Jun 18;13(6):e0197564. doi: 10.1371/journal.pone.0197564. eCollection 2018.
4. Fuzzy-Rough Simultaneous Attribute Selection and Feature Extraction Algorithm.
   IEEE Trans Cybern. 2013 Aug;43(4):1166-77. doi: 10.1109/TSMCB.2012.2225832.
5. Feature selection based on neighborhood rough sets and Gini index.
   PeerJ Comput Sci. 2023 Dec 12;9:e1711. doi: 10.7717/peerj-cs.1711. eCollection 2023.
6. An Attribute Reduction Method Using Neighborhood Entropy Measures in Neighborhood Rough Sets.
   Entropy (Basel). 2019 Feb 7;21(2):155. doi: 10.3390/e21020155.
7. Feature Selection Based on Neighborhood Discrimination Index.
   IEEE Trans Neural Netw Learn Syst. 2018 Jul;29(7):2986-2999. doi: 10.1109/TNNLS.2017.2710422. Epub 2017 Jun 23.
8. Optimizing multimodal feature selection using binary reinforced cuckoo search algorithm for improved classification performance.
   PeerJ Comput Sci. 2024 Jan 29;10:e1816. doi: 10.7717/peerj-cs.1816. eCollection 2024.
9. A Neighborhood Rough Sets-Based Attribute Reduction Method Using Lebesgue and Entropy Measures.
   Entropy (Basel). 2019 Feb 1;21(2):138. doi: 10.3390/e21020138.
10. Feature Selection Combining Information Theory View and Algebraic View in the Neighborhood Decision System.
   Entropy (Basel). 2021 Jun 2;23(6):704. doi: 10.3390/e23060704.

Cited by

1. Applications and challenges of biomarker-based predictive models in proactive health management.
   Front Public Health. 2025 Aug 18;13:1633487. doi: 10.3389/fpubh.2025.1633487. eCollection 2025.