Doostfatemeh Marziyeh, Taghi Ayatollah Seyyed Mohammad, Jafari Peyman
Department of Biostatistics, Shiraz University of Medical Sciences, Shiraz, Iran.
Department of Biostatistics, Shiraz University of Medical Sciences, Shiraz, Iran.
Value Health. 2016 Jul-Aug;19(5):639-47. doi: 10.1016/j.jval.2016.03.1857. Epub 2016 Jul 29.
To provide a valid sample size strategy based on simulation and to evaluate the statistical power in clinical trials with patient-reported outcomes (PROs) based on a polytomous item response theory model-the graded response model (GRM)-and to compare this framework with the classical test theory (CTT) approach.
One thousand randomized clinical trials were simulated using PRO based on the GRM and under various combinations of the number of patients in each arm, the group allocation ratio, the number of items and categories, and group effects. The power and sample size estimated in the simulations were then compared with those computed using the CTT framework.
The results indicated that the impact of the most influential factors, including the number of patients, group allocation ratio, group effects, and the number of categories, on the power and sample size of the GRM-based and CTT-based approaches was similar. Nevertheless, the strong impact of the number of items on these issues distinguished the two approaches.
It is crucial to use an adapted sample size formula in a GRM-based analysis because the classical formula designed for the CTT-based approach does not consider the impact of the number of items, which could result in an inadequately sized study and a decrease in power. Thus, when clinicians design a randomized clinical trial with polytomous PRO endpoints using classical sample size formula as the base, they should be aware of the possibility of making an incorrect clinical decision.
基于模拟提供有效的样本量策略,基于多分类项目反应理论模型——等级反应模型(GRM)评估具有患者报告结局(PROs)的临床试验的统计效能,并将此框架与经典测试理论(CTT)方法进行比较。
基于GRM并在每组患者数量、组分配比例、项目和类别数量以及组效应的各种组合下,使用PRO模拟了1000项随机临床试验。然后将模拟中估计的效能和样本量与使用CTT框架计算的结果进行比较。
结果表明,包括患者数量、组分配比例、组效应和类别数量在内的最有影响因素对基于GRM和基于CTT方法的效能和样本量的影响相似。然而,项目数量对这些问题的强烈影响区分了这两种方法。
在基于GRM的分析中使用适配的样本量公式至关重要,因为为基于CTT的方法设计的经典公式未考虑项目数量的影响,这可能导致研究规模不足和效能降低。因此,当临床医生以经典样本量公式为基础设计具有多分类PRO终点的随机临床试验时,应意识到做出错误临床决策的可能性。