• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过优化条件互信息的下限进行特征选择。

Feature selection by optimizing a lower bound of conditional mutual information.

作者信息

Peng Hanyang, Fan Yong

机构信息

College of Computer Science and Software Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, Guangdong, 518060, PR China.

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences, 100190, Beijing, PR China.

出版信息

Inf Sci (N Y). 2017 Dec;418-419:652-667. doi: 10.1016/j.ins.2017.08.036. Epub 2017 Aug 9.

DOI:10.1016/j.ins.2017.08.036
PMID:30283157
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6167022/
Abstract

A unified framework is proposed to select features by optimizing computationally feasible approximations of high-dimensional conditional mutual information (CMI) between features and their associated class label under different assumptions. Under this unified framework, state-of-the-art information theory based feature selection algorithms are rederived, and a new algorithm is proposed to select features by optimizing a lower bound of the CMI with a weaker assumption than those adopted by existing methods. The new feature selection method integrates a plug-in component to distinguish redundant features from irrelevant ones for improving the feature selection robustness. Furthermore, a novel metric is proposed to evaluate feature selection methods based on simulated data. The proposed method has been compared with state-of-the-art feature selection methods based on the new evaluation metric and classification performance of classifiers built upon the selected features. The experiment results have demonstrated that the proposed method could achieve promising performance in a variety of feature selection problems.

摘要

提出了一个统一框架,用于在不同假设下通过优化特征与其相关类标签之间高维条件互信息(CMI)的计算可行近似值来选择特征。在此统一框架下,重新推导了基于信息论的先进特征选择算法,并提出了一种新算法,该算法通过优化CMI的下限来选择特征,其假设比现有方法所采用的假设更弱。新的特征选择方法集成了一个插件组件,以区分冗余特征和无关特征,从而提高特征选择的鲁棒性。此外,还提出了一种新颖的指标,用于基于模拟数据评估特征选择方法。基于新的评估指标以及基于所选特征构建的分类器的分类性能,将所提出的方法与基于信息论的先进特征选择方法进行了比较。实验结果表明,所提出的方法在各种特征选择问题中都能取得良好的性能。

相似文献

1
Feature selection by optimizing a lower bound of conditional mutual information.通过优化条件互信息的下限进行特征选择。
Inf Sci (N Y). 2017 Dec;418-419:652-667. doi: 10.1016/j.ins.2017.08.036. Epub 2017 Aug 9.
2
A Feature Selection Algorithm Integrating Maximum Classification Information and Minimum Interaction Feature Dependency Information.一种集成最大分类信息和最小交互特征依赖信息的特征选择算法。
Comput Intell Neurosci. 2021 Dec 28;2021:3569632. doi: 10.1155/2021/3569632. eCollection 2021.
3
Multi-Label Feature Selection with Conditional Mutual Information.基于条件互信息的多标签特征选择。
Comput Intell Neurosci. 2022 Oct 8;2022:9243893. doi: 10.1155/2022/9243893. eCollection 2022.
4
Classification of high dimensional biomedical data based on feature selection using redundant removal.基于冗余消除的特征选择对高维生物医学数据的分类。
PLoS One. 2019 Apr 9;14(4):e0214406. doi: 10.1371/journal.pone.0214406. eCollection 2019.
5
Feature selection based on fuzzy joint mutual information maximization.基于模糊联合互信息最大化的特征选择。
Math Biosci Eng. 2020 Nov 30;18(1):305-327. doi: 10.3934/mbe.2021016.
6
Simple Stopping Criteria for Information Theoretic Feature Selection.信息论特征选择的简单停止准则。
Entropy (Basel). 2019 Jan 21;21(1):99. doi: 10.3390/e21010099.
7
Feature Selection With Maximal Relevance and Minimal Supervised Redundancy.基于最大相关性和最小监督冗余的特征选择
IEEE Trans Cybern. 2023 Feb;53(2):707-717. doi: 10.1109/TCYB.2021.3139898. Epub 2023 Jan 13.
8
Input feature selection for classification problems.用于分类问题的输入特征选择。
IEEE Trans Neural Netw. 2002;13(1):143-59. doi: 10.1109/72.977291.
9
Study on Mutual Information and Fractal Dimension-Based Unsupervised Feature Parameters Selection: Application in UAVs.基于互信息和分形维数的无监督特征参数选择研究:在无人机中的应用
Entropy (Basel). 2018 Sep 5;20(9):674. doi: 10.3390/e20090674.
10
An evolutionary decomposition-based multi-objective feature selection for multi-label classification.一种基于进化分解的多标签分类多目标特征选择方法。
PeerJ Comput Sci. 2020 Mar 2;6:e261. doi: 10.7717/peerj-cs.261. eCollection 2020.

引用本文的文献

1
Advancing precision rheumatology: applications of machine learning for rheumatoid arthritis management.推进精准风湿病学:机器学习在类风湿关节炎管理中的应用。
Front Immunol. 2024 Jun 10;15:1409555. doi: 10.3389/fimmu.2024.1409555. eCollection 2024.
2
Application of Artificial Intelligence in Diagnosis of Craniopharyngioma.人工智能在颅咽管瘤诊断中的应用
Front Neurol. 2022 Jan 6;12:752119. doi: 10.3389/fneur.2021.752119. eCollection 2021.
3
A Novel Biomarker Identification Approach for Gastric Cancer Using Gene Expression and DNA Methylation Dataset.一种利用基因表达和DNA甲基化数据集鉴定胃癌新型生物标志物的方法。
Front Genet. 2021 Mar 25;12:644378. doi: 10.3389/fgene.2021.644378. eCollection 2021.
4
Prediction of the performance of artificial neural networks in mapping sEMG to finger joint angles via signal pre-investigation techniques.通过信号预研究技术预测人工神经网络在将表面肌电图映射到手指关节角度方面的性能。
Heliyon. 2020 Apr 3;6(4):e03669. doi: 10.1016/j.heliyon.2020.e03669. eCollection 2020 Apr.
5
Adaptive Sparsity Regularization Based Collaborative Clustering for Cancer Prognosis.基于自适应稀疏正则化的协作聚类用于癌症预后分析
Med Image Comput Comput Assist Interv. 2019 Oct;11767:583-592. doi: 10.1007/978-3-030-32251-9_64. Epub 2019 Oct 10.
6
Robust Collaborative Clustering of Subjects and Radiomic Features for Cancer Prognosis.稳健的基于受试者和放射组学特征的协作聚类用于癌症预后。
IEEE Trans Biomed Eng. 2020 Oct;67(10):2735-2744. doi: 10.1109/TBME.2020.2969839. Epub 2020 Jan 27.
7
COLLABORATIVE CLUSTERING OF SUBJECTS AND RADIOMIC FEATURES FOR PREDICTING CLINICAL OUTCOMES OF RECTAL CANCER PATIENTS.用于预测直肠癌患者临床结局的受试者与影像组学特征的协作聚类
Proc IEEE Int Symp Biomed Imaging. 2019 Apr;2019:1303-1306. doi: 10.1109/ISBI.2019.8759512. Epub 2019 Jul 11.
8
FeatureSelect: a software for feature selection based on machine learning approaches.FeatureSelect:一款基于机器学习方法的特征选择软件。
BMC Bioinformatics. 2019 Apr 3;20(1):170. doi: 10.1186/s12859-019-2754-0.
9
Unsupervised machine learning of radiomic features for predicting treatment response and overall survival of early stage non-small cell lung cancer patients treated with stereotactic body radiation therapy.基于放射组学特征的无监督机器学习预测行立体定向体部放疗的早期非小细胞肺癌患者的治疗反应和总生存期。
Radiother Oncol. 2018 Nov;129(2):218-226. doi: 10.1016/j.radonc.2018.06.025. Epub 2018 Jul 4.
10
Parameter-Free Centralized Multi-Task Learning for Characterizing Developmental Sex Differences in Resting State Functional Connectivity.用于表征静息态功能连接中发育性别的差异的无参数集中式多任务学习
Proc AAAI Conf Artif Intell. 2018 Feb;2018:2660-2667. Epub 2018 Apr 26.

本文引用的文献

1
Discriminative least squares regression for multiclass classification and feature selection.用于多类分类和特征选择的判别最小二乘回归。
IEEE Trans Neural Netw Learn Syst. 2012 Nov;23(11):1738-54. doi: 10.1109/TNNLS.2012.2212721.
2
On the feature selection criterion based on an approximation of multidimensional mutual information.基于多维互信息逼近的特征选择准则。
IEEE Trans Pattern Anal Mach Intell. 2010 Jul;32(7):1342-3. doi: 10.1109/TPAMI.2010.62.
3
Using mutual information for selecting features in supervised neural net learning.在监督式神经网络学习中使用互信息来选择特征。
IEEE Trans Neural Netw. 1994;5(4):537-50. doi: 10.1109/72.298224.
4
Input feature selection for classification problems.用于分类问题的输入特征选择。
IEEE Trans Neural Netw. 2002;13(1):143-59. doi: 10.1109/72.977291.
5
Hybrid huberized support vector machines for microarray classification and gene selection.用于微阵列分类和基因选择的混合胡贝尔化支持向量机
Bioinformatics. 2008 Feb 1;24(3):412-9. doi: 10.1093/bioinformatics/btm579. Epub 2008 Jan 5.
6
Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.基于互信息的特征选择:最大依赖、最大相关和最小冗余准则。
IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1226-38. doi: 10.1109/TPAMI.2005.159.
7
Minimum redundancy feature selection from microarray gene expression data.从微阵列基因表达数据中进行最小冗余特征选择。
J Bioinform Comput Biol. 2005 Apr;3(2):185-205. doi: 10.1142/s0219720005001004.