• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Information Theoretic Methods for Variable Selection-A Review.变量选择的信息论方法——综述
Entropy (Basel). 2022 Aug 4;24(8):1079. doi: 10.3390/e24081079.
2
Analysis of Information-Based Nonparametric Variable Selection Criteria.基于信息的非参数变量选择准则分析
Entropy (Basel). 2020 Aug 31;22(9):974. doi: 10.3390/e22090974.
3
Simple Stopping Criteria for Information Theoretic Feature Selection.信息论特征选择的简单停止准则。
Entropy (Basel). 2019 Jan 21;21(1):99. doi: 10.3390/e21010099.
4
Model-Free Conditional Independence Feature Screening For Ultrahigh Dimensional Data.超高维数据的无模型条件独立特征筛选
Sci China Math. 2017 Mar;60(3):551-568. doi: 10.1007/s11425-016-0186-8. Epub 2016 Dec 29.
5
Markov Blanket Feature Selection Using Representative Sets.基于代表性集合的马尔可夫毯特征选择。
IEEE Trans Neural Netw Learn Syst. 2017 Nov;28(11):2775-2788. doi: 10.1109/TNNLS.2016.2602365.
6
A Feature Selection Algorithm Integrating Maximum Classification Information and Minimum Interaction Feature Dependency Information.一种集成最大分类信息和最小交互特征依赖信息的特征选择算法。
Comput Intell Neurosci. 2021 Dec 28;2021:3569632. doi: 10.1155/2021/3569632. eCollection 2021.
7
Feature selection using mutual information based uncertainty measures for tumor classification.基于互信息不确定性度量的特征选择用于肿瘤分类
Biomed Mater Eng. 2014;24(1):763-70. doi: 10.3233/BME-130865.
8
Accelerating Causal Inference and Feature Selection Methods through G-Test Computation Reuse.通过G检验计算复用加速因果推断和特征选择方法
Entropy (Basel). 2021 Nov 12;23(11):1501. doi: 10.3390/e23111501.
9
Learning dependence from samples.从样本中学习依赖性。
Int J Bioinform Res Appl. 2014;10(1):43-58. doi: 10.1504/IJBRA.2014.058777.
10
Structure Learning of Bayesian Network Based on Adaptive Thresholding.基于自适应阈值处理的贝叶斯网络结构学习
Entropy (Basel). 2019 Jul 8;21(7):665. doi: 10.3390/e21070665.

本文引用的文献

1
Fast and powerful conditional randomization testing via distillation.通过蒸馏实现快速且强大的条件随机化测试。
Biometrika. 2022 Jun;109(2):277-293. doi: 10.1093/biomet/asab039. Epub 2021 Jul 8.
2
Analysis of Information-Based Nonparametric Variable Selection Criteria.基于信息的非参数变量选择准则分析
Entropy (Basel). 2020 Aug 31;22(9):974. doi: 10.3390/e22090974.
3
A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited.深入探讨测量基因-基因相互作用的两个概念:重新审视逻辑回归和相互作用信息。
Genet Epidemiol. 2018 Mar;42(2):187-200. doi: 10.1002/gepi.22108. Epub 2017 Dec 18.
4
Efficient Markov Blanket Discovery and Its Application.高效马尔可夫毯发现及其应用。
IEEE Trans Cybern. 2017 May;47(5):1169-1179. doi: 10.1109/TCYB.2016.2539338. Epub 2016 Mar 24.
5
BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies.BOOST:一种在全基因组病例对照研究中快速检测基因-基因相互作用的方法。
Am J Hum Genet. 2010 Sep 10;87(3):325-40. doi: 10.1016/j.ajhg.2010.07.021.
6
Normalized mutual information feature selection.归一化互信息特征选择
IEEE Trans Neural Netw. 2009 Feb;20(2):189-201. doi: 10.1109/TNN.2008.2005601. Epub 2009 Jan 13.
7
AMBIENCE: a novel approach and efficient algorithm for identifying informative genetic and environmental associations with complex phenotypes.AMBIENCE:一种用于识别与复杂表型相关的信息性遗传和环境关联的新颖方法及高效算法。
Genetics. 2008 Oct;180(2):1191-210. doi: 10.1534/genetics.108.088542. Epub 2008 Sep 9.
8
Using mutual information for selecting features in supervised neural net learning.在监督式神经网络学习中使用互信息来选择特征。
IEEE Trans Neural Netw. 1994;5(4):537-50. doi: 10.1109/72.298224.
9
The use of the restricted partition method with case-control data.将受限划分法用于病例对照数据。
Hum Hered. 2007;63(2):93-100. doi: 10.1159/000099181. Epub 2007 Feb 2.
10
Reducing the dimensionality of data with neural networks.使用神经网络降低数据维度。
Science. 2006 Jul 28;313(5786):504-7. doi: 10.1126/science.1127647.

变量选择的信息论方法——综述

Information Theoretic Methods for Variable Selection-A Review.

作者信息

Mielniczuk Jan

机构信息

Institute of Computer Science, Polish Academy of Sciences, Jana Kazimierza 5, 01-248 Warsaw, Poland.

Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa 75, 00-662 Warsaw, Poland.

出版信息

Entropy (Basel). 2022 Aug 4;24(8):1079. doi: 10.3390/e24081079.

DOI:10.3390/e24081079
PMID:36010742
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9407310/
Abstract

We review the principal information theoretic tools and their use for feature selection, with the main emphasis on classification problems with discrete features. Since it is known that empirical versions of conditional mutual information perform poorly for high-dimensional problems, we focus on various ways of constructing its counterparts and the properties and limitations of such methods. We present a unified way of constructing such measures based on truncation, or truncation and weighing, for the Möbius expansion of conditional mutual information. We also discuss the main approaches to feature selection which apply the introduced measures of conditional dependence, together with the ways of assessing the quality of the obtained vector of predictors. This involves discussion of recent results on asymptotic distributions of empirical counterparts of criteria, as well as advances in resampling.

摘要

我们回顾了主要的信息论工具及其在特征选择中的应用,主要侧重于具有离散特征的分类问题。由于已知条件互信息的经验版本在高维问题上表现不佳,我们重点关注构建其对应物的各种方法以及此类方法的性质和局限性。我们提出了一种基于截断或截断与加权的统一方法,用于条件互信息的莫比乌斯展开。我们还讨论了应用引入的条件依赖度量的主要特征选择方法,以及评估所得预测变量向量质量的方法。这涉及对准则经验对应物的渐近分布的最新结果的讨论,以及重采样方面的进展。