Trubey Peter, Sansó Bruno
Department of Statistics, University of California, Santa Cruz, CA 95064, USA.
Entropy (Basel). 2024 Apr 14;26(4):335. doi: 10.3390/e26040335.
We consider a constructive definition of the multivariate Pareto that factorizes the random vector into a radial component and an independent angular component. The former follows a univariate Pareto distribution, and the latter is defined on the surface of the positive orthant of the infinity norm unit hypercube. We propose a method for inferring the distribution of the angular component by identifying its support as the limit of the positive orthant of the unit -norm spheres and introduce a projected gamma family of distributions defined through the normalization of a vector of independent random gammas to the space. This serves to construct a flexible family of distributions obtained as a Dirichlet process mixture of projected gammas. For model assessment, we discuss scoring methods appropriate to distributions on the unit hypercube. In particular, working with the energy score criterion, we develop a kernel metric that produces a proper scoring rule and presents a simulation study to compare different modeling choices using the proposed metric. Using our approach, we describe the dependence structure of extreme values in the integrated vapor transport (IVT), data describing the flow of atmospheric moisture along the coast of California. We find clear but heterogeneous geographical dependence.
我们考虑多元帕累托分布的一种构造性定义,该定义将随机向量分解为一个径向分量和一个独立的角度分量。前者遵循单变量帕累托分布,后者定义在无穷范数单位超立方体正卦限的表面上。我们提出了一种通过将角度分量的支撑集识别为单位范数球体正卦限的极限来推断其分布的方法,并引入了一个通过将独立随机伽马向量归一化到该空间而定义的投影伽马分布族。这有助于构建一个灵活的分布族,该分布族是通过投影伽马分布的狄利克雷过程混合得到的。对于模型评估,我们讨论了适用于单位超立方体上分布的评分方法。特别是,在能量得分准则的基础上,我们开发了一种核度量,它产生一个适当的评分规则,并进行了一项模拟研究,以使用所提出的度量来比较不同的建模选择。使用我们的方法,我们描述了综合水汽输送(IVT)中极值的数据的依赖结构,这些数据描述了加利福尼亚海岸沿线大气水汽的流动。我们发现了明显但异质的地理依赖性。