International Drug Development Institute, Louvain-la-Neuve, Belgium.
Data Science Institute, Hasselt University, Diepenbeek, Belgium.
Pharm Stat. 2022 Jan;21(1):122-132. doi: 10.1002/pst.2156. Epub 2021 Aug 3.
The method of generalized pairwise comparisons (GPC) is a multivariate extension of the well-known non-parametric Wilcoxon-Mann-Whitney test. It allows comparing two groups of observations based on multiple hierarchically ordered endpoints, regardless of the number or type of the latter. The summary measure, "net benefit," quantifies the difference between the probabilities that a random observation from one group is doing better than an observation from the opposite group. The method takes into account the correlations between the endpoints. We have performed a simulation study for the case of two hierarchical endpoints to evaluate the impact of their correlation on the type-I error probability and power of the test based on GPC. The simulations show that the power of the GPC test for the primary endpoint is modified if the secondary endpoint is included in the hierarchical GPC analysis. The change in power depends on the correlation between the endpoints. Interestingly, a decrease in power can occur, regardless of whether there is any marginal treatment effect on the secondary endpoint. It appears that the overall power of the hierarchical GPC procedure depends, in a complex manner, on the entire variance-covariance structure of the set of outcomes. Any additional factors (such as thresholds of clinical relevance, drop out, or censoring scheme) will also affect the power and will have to be taken into account when designing a trial based on the hierarchical GPC procedure.
广义成对比较(GPC)方法是广为人知的非参数 Wilcoxon-Mann-Whitney 检验的多变量扩展。它允许根据多个层次有序的终点比较两组观察结果,而无需考虑后者的数量或类型。综合衡量标准“净效益”量化了来自一组的随机观察结果比来自相反组的观察结果更好的概率之间的差异。该方法考虑了终点之间的相关性。我们对两种层次终点的情况进行了模拟研究,以评估其相关性对基于 GPC 的检验的 I 型错误概率和功效的影响。模拟表明,如果将次要终点纳入层次 GPC 分析,则 GPC 检验对主要终点的功效会发生变化。功效的变化取决于终点之间的相关性。有趣的是,无论次要终点是否存在任何边际治疗效果,都可能会出现功效降低的情况。似乎层次 GPC 程序的整体功效以复杂的方式取决于结果集的整个方差协方差结构。任何其他因素(如临床相关阈值、脱落或删失方案)也会影响功效,并且在基于层次 GPC 程序设计试验时必须加以考虑。