Gasparin Matteo, Wang Ruodu, Ramdas Aaditya
Department of Statistical Sciences, University of Padova, Padova 35121, Italy.
Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada.
Proc Natl Acad Sci U S A. 2025 Mar 18;122(11):e2410849122. doi: 10.1073/pnas.2410849122. Epub 2025 Mar 14.
The problem of combining -values is an old and fundamental one, and the classic assumption of independence is often violated or unverifiable in many applications. There are many well-known rules that can combine a set of arbitrarily dependent -values (for the same hypothesis) into a single -value. We show that essentially all these existing rules can be strictly improved when the -values are exchangeable, or when external randomization is allowed (or both). For example, we derive randomized and/or exchangeable improvements of well-known rules like "twice the median" and "twice the average," as well as geometric and harmonic means. Exchangeable -values are often produced one at a time (for example, under repeated tests involving data splitting), and our rules can combine them sequentially as they are produced, stopping when the combined -values stabilize. Our work also improves rules for combining arbitrarily dependent -values, since the latter becomes exchangeable if they are presented to the analyst in a random order. The main technical advance is to show that all existing combination rules can be obtained by calibrating the -values to e-values (using an [Formula: see text]-dependent calibrator), averaging those e-values, converting to a level-[Formula: see text] test using Markov's inequality, and finally obtaining -values by combining this family of tests; the improvements are delivered via recent randomized and exchangeable variants of Markov's inequality.
组合p值的问题是一个古老而基本的问题,在许多应用中,经典的独立性假设常常被违反或无法验证。有许多著名的规则可以将一组任意相关的(针对同一假设的)p值组合成一个单一的p值。我们表明,当p值是可交换的,或者允许外部随机化(或两者都允许)时,基本上所有这些现有规则都可以得到严格改进。例如,我们推导了诸如“中位数的两倍”和“平均值的两倍”等著名规则以及几何平均值和谐波平均值的随机化和/或可交换改进。可交换的p值通常是一次产生一个(例如,在涉及数据分割的重复测试中),我们的规则可以在它们产生时依次组合它们,当组合后的p值稳定时停止。我们的工作还改进了组合任意相关p值的规则,因为如果将后者以随机顺序呈现给分析师,它们就会变得可交换。主要的技术进步是表明,所有现有的组合规则都可以通过将p值校准为e值(使用依赖于α的校准器)、对这些e值求平均、使用马尔可夫不等式转换为水平-α检验,最后通过组合这一族检验来获得p值;改进是通过马尔可夫不等式的最新随机化和可交换变体实现的。