Doss Hani, Linero Antonio
Department of Statistics, University of Florida, Gainesville, Florida 32603, USA.
Department of Statistics and Data Science, University of Texas at Austin, Austin, Texas 78705, USA.
Stat Sci. 2024 Nov;39(4):601-622. doi: 10.1214/24-sts936. Epub 2024 Oct 30.
Consider a Bayesian setup in which we observe , whose distribution depends on a parameter , that is, . The parameter is unknown and treated as random, and a prior distribution chosen from some parametric family , is to be placed on it. For the subjective Bayesian there is a single prior in the family which represents his or her beliefs about , but determination of this prior is very often extremely difficult. In the empirical Bayes approach, the latent distribution on is estimated from the data. This is usually done by choosing the value of the hyperparameter that maximizes some criterion. Arguably the most common way of doing this is to let be the marginal likelihood of , that is, , and choose the value of that maximizes . Unfortunately, except for a handful of textbook examples, analytic evaluation of is not feasible. The purpose of this paper is two-fold. First, we review the literature on estimating it and find that the most commonly used procedures are either potentially highly inaccurate or don't scale well with the dimension of , the dimension of , or both. Second, we present a method for estimating , based on Markov chain Monte Carlo, that applies very generally and scales well with dimension. Let be a real-valued function of , and let be the posterior expectation of when the prior is . As a byproduct of our approach, we show how to obtain point estimates and globally-valid confidence bands for the family , . To illustrate the scope of our methodology we provide three detailed examples, having different characters.
考虑一种贝叶斯框架,其中我们观察到(X),其分布依赖于参数(\theta),即(X\sim p(x|\theta))。参数(\theta)是未知的且被视为随机变量,并且要为其设定一个从某个参数族(\Pi)中选取的先验分布。对于主观贝叶斯而言,参数族中有一个单一的先验分布代表他或她对(\theta)的信念,但确定这个先验分布通常极其困难。在经验贝叶斯方法中,(\theta)上的潜在分布是从数据中估计出来的。这通常是通过选择超参数(\lambda)的值来实现,该值能使某个准则最大化。可以说最常见的做法是让(\lambda)为(X)的边际似然,即(p(x|\lambda)=\int p(x|\theta)p(\theta|\lambda)d\theta),并选择使(p(x|\lambda))最大化的(\lambda)值。不幸的是,除了少数几个教科书示例外,对(p(x|\lambda))进行解析评估是不可行的。本文的目的有两个。首先,我们回顾关于估计它的文献,发现最常用的方法要么可能非常不准确,要么在(\theta)的维度、(X)的维度或两者上都扩展性不佳。其次,我们提出一种基于马尔可夫链蒙特卡罗的估计(p(x|\lambda))的方法,该方法具有非常广泛的适用性且在维度上扩展性良好。设(g(\theta))是(\theta)的实值函数,并且当先验为(p(\theta|\lambda))时,设(E_{\lambda}[g(\theta)])是(g(\theta))的后验期望。作为我们方法的一个副产品,我们展示了如何获得关于族({E_{\lambda}[g(\theta)]:\lambda\in\Lambda})的点估计和全局有效的置信区间。为了说明我们方法的适用范围,我们提供了三个具有不同特点的详细示例。