Minimax Rate-optimal Estimation of KL Divergence between Discrete Distributions.

Authors

Han Yanjun, Jiao Jiantao, Weissman Tsachy

Affiliation

Stanford University.

Publication

Int Symp Inf Theory Appl. 2016;2016:256-260.

PMID:29457152

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5812299/

Abstract

We refine the general methodology in [1] for the construction and analysis of essentially minimax estimators for a wide class of functionals of finite dimensional parameters, and elaborate on the case of discrete distributions with support size S comparable with the number of observations n. Specifically, we determine the "smooth" and "non-smooth" regimes based on the confidence set and the smoothness of the functional. In the "non-smooth" regime, we apply an unbiased estimator for a "suitable" polynomial approximation of the functional. In the "smooth" regime, we construct a bias-corrected version of the Maximum Likelihood Estimator (MLE) based on Taylor expansion. We apply the general methodology to the problem of estimating the KL divergence between two discrete distributions from empirical data. We construct a minimax rate-optimal estimator which is adaptive in the sense that it does not require the knowledge of the support size nor the upper bound on the likelihood ratio. Moreover, the performance of the optimal estimator with n samples is essentially that of the MLE with n ln n samples, i.e., the effective sample size enlargement phenomenon holds.
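For orientation, the MLE that the abstract uses as its point of comparison can be sketched as a plug-in estimate: replace each distribution by its empirical frequencies and evaluate D(P||Q) = sum_i p_i ln(p_i / q_i). The sketch below is only this baseline, not the minimax rate-optimal estimator constructed in the paper; the function name `plugin_kl` and its arguments are hypothetical, chosen for illustration.

```python
import math
from collections import Counter


def plugin_kl(x_samples, y_samples):
    """Plug-in (MLE) estimate of D(P||Q) in nats.

    x_samples are assumed drawn i.i.d. from P, y_samples from Q.
    This is the baseline estimator only, not the paper's construction.
    """
    m, n = len(x_samples), len(y_samples)
    if m == 0 or n == 0:
        raise ValueError("both samples must be non-empty")

    p_counts = Counter(x_samples)
    q_counts = Counter(y_samples)

    total = 0.0
    for symbol, count in p_counts.items():
        q_count = q_counts.get(symbol, 0)
        if q_count == 0:
            # Symbol seen under P but never under Q: the empirical
            # likelihood ratio is unbounded, so the estimate is infinite.
            return math.inf
        p_hat = count / m
        q_hat = q_count / n
        total += p_hat * math.log(p_hat / q_hat)
    return total


if __name__ == "__main__":
    print(plugin_kl("aab", "abb"))
```

The early return of `math.inf` shows why the plug-in approach is fragile when a symbol observed under P is missing from the Q sample, which is one reason an upper bound on the likelihood ratio enters the problem at all.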

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f205/5812299/cb99704021bf/nihms910323f1.jpg

Similar articles

Minimax Estimation of Functionals of Discrete Distributions.

IEEE Trans Inf Theory. 2015 May;61(5):2835-2885. doi: 10.1109/tit.2015.2412945. Epub 2015 Mar 13.

Optimal Estimation of Wasserstein Distance on A Tree with An Application to Microbiome Studies.

J Am Stat Assoc. 2021;116(535):1237-1253. doi: 10.1080/01621459.2019.1699422. Epub 2020 Jan 23.

Adversarial meta-learning of Gamma-minimax estimators that leverage prior knowledge.

Electron J Stat. 2023;17(2):1996-2043. doi: 10.1214/23-ejs2151. Epub 2023 Sep 3.

Limit Distribution Theory for Maximum Likelihood Estimation of a Log-Concave Density.

Ann Stat. 2009 Jun 1;37(3):1299-1331. doi: 10.1214/08-AOS609.

APPROXIMATION AND ESTIMATION OF s-CONCAVE DENSITIES VIA RÉNYI DIVERGENCES.

Ann Stat. 2016;44(3):1332-1359. doi: 10.1214/15-AOS1408. Epub 2016 Apr 11.

A Comparative Analysis of Discrete Entropy Estimators for Large-Alphabet Problems.

Entropy (Basel). 2024 Apr 28;26(5):369. doi: 10.3390/e26050369.

ESTIMATION OF FUNCTIONALS OF SPARSE COVARIANCE MATRICES.

Ann Stat. 2015;43(6):2706-2737. doi: 10.1214/15-AOS1357.

Estimation of the entropy based on its polynomial representation.

Phys Rev E Stat Nonlin Soft Matter Phys. 2012 May;85(5 Pt 1):051139. doi: 10.1103/PhysRevE.85.051139. Epub 2012 May 29.

Estimation After a Group Sequential Trial.

Stat Biosci. 2015 Oct;7(2):187-205. doi: 10.1007/s12561-014-9112-6. Epub 2014 Feb 22.

Cited by

Optimal Estimation of Wasserstein Distance on A Tree with An Application to Microbiome Studies.

J Am Stat Assoc. 2021;116(535):1237-1253. doi: 10.1080/01621459.2019.1699422. Epub 2020 Jan 23.

Empirical Estimation of Information Measures: A Literature Guide.

Entropy (Basel). 2019 Jul 24;21(8):720. doi: 10.3390/e21080720.

References

Minimax Estimation of Functionals of Discrete Distributions.

IEEE Trans Inf Theory. 2015 May;61(5):2835-2885. doi: 10.1109/tit.2015.2412945. Epub 2015 Mar 13.

Nonparametric estimation of Küllback-Leibler divergence.

Neural Comput. 2014 Nov;26(11):2570-93. doi: 10.1162/NECO_a_00646. Epub 2014 Jul 24.
