Department of Mathematical Informatics, University of Tokyo, Bunkyo-ku, Tokyo 113-8656, Japan.
Neural Comput. 2013 Mar;25(3):725-58. doi: 10.1162/NECO_a_00407. Epub 2012 Dec 28.
The goal of sufficient dimension reduction in supervised learning is to find the low-dimensional subspace of input features that retains all of the information the input features carry about the output values. In this letter, we propose a novel sufficient dimension-reduction method that uses a squared-loss variant of mutual information as a dependency measure. We approximate squared-loss mutual information with a density-ratio estimator, formulated as a minimum contrast estimator over parametric or nonparametric models. Because cross-validation can be used to choose an appropriate model, our method requires no prespecified structure on the underlying distributions. We elucidate the asymptotic bias of our estimator on parametric models and its asymptotic convergence rate on nonparametric models. The convergence analysis relies on a uniform tail bound for U-processes, and the convergence rate is characterized by the bracketing entropy of the model. We then develop a natural gradient algorithm on the Grassmann manifold for the sufficient-subspace search. The analytic form of our estimator allows the gradient to be computed efficiently. Numerical experiments show that the proposed method compares favorably with existing dimension-reduction approaches on artificial and benchmark data sets.
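For concreteness, here is a minimal sketch of the least-squares SMI estimator the abstract refers to, assuming a linear-in-parameters density-ratio model over Gaussian product kernels with ridge regularization; the function name lsmi, the kernel width sigma, the regularization strength lam, and the number of kernel centers are illustrative choices, not the letter's exact setup. The idea is to fit w(x, y) ≈ p(x, y) / (p(x) p(y)) by least squares and plug it into SMI = E_{p(x,y)}[w]/2 − 1/2, which admits the analytic solution the abstract mentions.

```python
import numpy as np

def gauss_kernel(a, b, sigma):
    """Gaussian kernel matrix: K[i, l] = exp(-||a_i - b_l||^2 / (2 sigma^2))."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def lsmi(x, y, sigma=1.0, lam=1e-3, n_centers=100, rng=None):
    """Least-squares estimate of squared-loss mutual information (SMI).

    Fits the density ratio w(x, y) = p(x, y) / (p(x) p(y)) with a linear
    model over product kernels centered at sampled pairs, then evaluates
    SMI_hat = h' alpha / 2 - 1/2 using the analytic ridge solution
    alpha = (H + lam I)^{-1} h.
    """
    rng = np.random.default_rng(rng)
    n = len(x)
    c = rng.choice(n, size=min(n_centers, n), replace=False)
    K = gauss_kernel(x, x[c], sigma)       # input-side kernel values, n x b
    L = gauss_kernel(y, y[c], sigma)       # output-side kernel values, n x b
    # H factorizes because each basis function is a product kernel:
    # H[l, l'] = mean_i(K_il K_il') * mean_j(L_jl L_jl')
    H = (K.T @ K / n) * (L.T @ L / n)
    h = (K * L).mean(axis=0)               # averages over the paired samples
    alpha = np.linalg.solve(H + lam * np.eye(len(c)), h)
    return 0.5 * h @ alpha - 0.5

# Quick check: dependent pairs give a clearly positive estimate,
# independent pairs an estimate near zero.
rng = np.random.default_rng(0)
x = rng.normal(size=(500, 2))
y = x[:, :1] + 0.1 * rng.normal(size=(500, 1))   # y depends only on x's first coordinate
print(lsmi(x, y, sigma=0.5, rng=0))              # substantially > 0
print(lsmi(x, rng.normal(size=(500, 1)), sigma=0.5, rng=0))  # approximately 0
```

The factorization of H into a Hadamard product of two Gram matrices is what keeps the estimator cheap: the double sum over all n² sample pairs never has to be formed explicitly.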
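The sufficient-subspace search can be sketched on top of this estimator. The letter uses a natural-gradient (geodesic) update on the Grassmann manifold with the gradient derived from the analytic LSMI formula; the substitute below uses a generic tangent-space projection with a QR retraction and a finite-difference gradient purely for illustration, reusing x, y, and lsmi from the sketch above. grassmann_step, the step sizes, and the iteration count are hypothetical choices.

```python
import numpy as np

def grassmann_step(W, grad, step=0.1):
    """One projected-gradient step on the Grassmann manifold.

    W: (d, m) with orthonormal columns spanning the candidate subspace;
    grad: Euclidean gradient of the objective with respect to W.
    The gradient is projected onto the tangent space and the update is
    retracted to the manifold by a QR decomposition (a cheap stand-in
    for the geodesic update used in the letter).
    """
    delta = grad - W @ (W.T @ grad)          # drop the component inside span(W)
    Q, R = np.linalg.qr(W + step * delta)    # retract to orthonormal columns
    return Q * np.sign(np.diag(R))           # fix column signs so steps stay continuous

# Subspace search: maximize lsmi(x @ W, y) over W.  A finite-difference
# gradient keeps the sketch short; the letter instead differentiates the
# analytic LSMI solution, which is far cheaper.
rng = np.random.default_rng(1)
d, m, eps = x.shape[1], 1, 1e-4
W = np.linalg.qr(rng.normal(size=(d, m)))[0]
for _ in range(30):
    base = lsmi(x @ W, y, sigma=0.5, rng=0)  # fixed centers -> stable differences
    grad = np.zeros_like(W)
    for i in range(d):
        for j in range(m):
            Wp = W.copy()
            Wp[i, j] += eps
            grad[i, j] = (lsmi(x @ Wp, y, sigma=0.5, rng=0) - base) / eps
    W = grassmann_step(W, grad, step=0.5)
print(W.ravel())  # should align with the first coordinate axis
```

The QR retraction keeps every iterate exactly on the manifold, the same invariant the geodesic update preserves; the finite-difference loop is precisely the cost that the letter's analytic gradient formula eliminates.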