You Kisung, Shung Dennis, Giuffrè Mauro
Department of Mathematics, Baruch College.
Department of Internal Medicine, Yale School of Medicine.
J Comput Graph Stat. 2025;34(1):253-266. doi: 10.1080/10618600.2024.2374580. Epub 2024 Aug 29.
The primary choice to summarize a finite collection of random objects is by using measures of central tendency, such as mean and median. In the field of optimal transport, the Wasserstein barycenter corresponds to the Fréchet or geometric mean of a set of probability measures, which is defined as a minimizer of the sum of squared distances to each element in a given set with respect to the Wasserstein distance of order 2. We introduce the Wasserstein median as a robust alternative to the Wasserstein barycenter. The Wasserstein median corresponds to the Fréchet median under the 2 -Wasserstein metric. The existence and consistency of the Wasserstein median are first established, along with its robustness property. In addition, we present a general computational pipeline that employs any recognized algorithms for the Wasserstein barycenter in an iterative fashion and demonstrate its convergence. The utility of the Wasserstein median as a robust measure of central tendency is demonstrated using real and simulated data.
总结有限随机对象集合的主要方法是使用集中趋势度量,如均值和中位数。在最优传输领域,瓦瑟斯坦重心对应于一组概率测度的弗雷歇均值或几何均值,它被定义为在二阶瓦瑟斯坦距离下到给定集合中每个元素的平方距离之和的最小值。我们引入瓦瑟斯坦中位数作为瓦瑟斯坦重心的稳健替代。在2 -瓦瑟斯坦度量下,瓦瑟斯坦中位数对应于弗雷歇中位数。首先确立了瓦瑟斯坦中位数的存在性和一致性及其稳健性。此外,我们提出了一种通用的计算流程,该流程以迭代方式使用任何公认的瓦瑟斯坦重心算法,并证明其收敛性。使用真实数据和模拟数据展示了瓦瑟斯坦中位数作为稳健集中趋势度量的效用。