Swanson David
University of Texas MD Anderson Cancer Center, Houston, TX, USA.
Comput Stat Data Anal. 2025 Nov;211. doi: 10.1016/j.csda.2025.108197. Epub 2025 Apr 23.
A method is demonstrated for localizing where two spline terms, or smooths, differ using a true discovery proportion (TDP)-based interpretation. The procedure yields a statement on the proportion of some region where true differences exist between two smooths. The methodology avoids ad hoc approaches to making such statements, like subsetting the data and performing hypothesis tests on the truncated spline terms. TDP estimates are confidence-bounded simultaneously, which means that a region's TDP estimate is a lower bound on the proportion of actual differences, or true discoveries, in that region, with high confidence regardless of the number of estimates made. The procedure is based on closed-testing using Simes local test. This local test requires that the multivariate test statistics of generalized Wishart type underlying the method be positive regression dependent on subsets (PRDS), a result for which evidence is presented suggesting that the condition holds. Consistency of the procedure is demonstrated for generalized additive models with the tuning parameter chosen by REML or GCV, and the achievement of confidence-bounded TDP is shown in simulation as is an analysis of walking gait.
展示了一种方法,该方法使用基于真实发现比例(TDP)的解释来定位两个样条项或平滑项的差异所在位置。该过程得出关于两个平滑项之间存在真实差异的某个区域比例的陈述。该方法避免了诸如对数据进行子集划分以及对截断的样条项进行假设检验等用于做出此类陈述的临时方法。TDP估计同时具有置信区间,这意味着一个区域的TDP估计是该区域实际差异或真实发现比例的下限,无论进行估计的数量多少,都具有高置信度。该过程基于使用西姆斯局部检验的封闭检验。这种局部检验要求该方法所基于的广义威沙特类型的多元检验统计量是正回归依赖于子集(PRDS),文中给出了证据表明该条件成立。对于通过REML或GCV选择调优参数的广义相加模型,证明了该过程的一致性,并且在模拟中展示了实现有置信区间的TDP,同时还对步行步态进行了分析。