Department of Statistical Science, University College London, Gower St., London, WC1E 6BT, UK. a.o'
Department of Statistical Science, University College London, Gower St., London, WC1E 6BT, UK.
BMC Med Res Methodol. 2017 Dec 2;17(1):157. doi: 10.1186/s12874-017-0426-1.
In healthcare research, outcomes with skewed probability distributions are common. Sample size calculations for such outcomes are typically based on estimates on a transformed scale (e.g. log) which may sometimes be difficult to obtain. In contrast, estimates of median and variance on the untransformed scale are generally easier to pre-specify. The aim of this paper is to describe how to calculate a sample size for a two group comparison of interest based on median and untransformed variance estimates for log-normal outcome data.
A log-normal distribution for outcome data is assumed and a sample size calculation approach for a two-sample t-test that compares log-transformed outcome data is demonstrated where the change of interest is specified as difference in median values on the untransformed scale. A simulation study is used to compare the method with a non-parametric alternative (Mann-Whitney U test) in a variety of scenarios and the method is applied to a real example in neurosurgery.
The method attained a nominal power value in simulation studies and was favourable in comparison to a Mann-Whitney U test and a two-sample t-test of untransformed outcomes. In addition, the method can be adjusted and used in some situations where the outcome distribution is not strictly log-normal.
We recommend the use of this sample size calculation approach for outcome data that are expected to be positively skewed and where a two group comparison on a log-transformed scale is planned. An advantage of this method over usual calculations based on estimates on the log-transformed scale is that it allows clinical efficacy to be specified as a difference in medians and requires a variance estimate on the untransformed scale. Such estimates are often easier to obtain and more interpretable than those for log-transformed outcomes.
在医疗保健研究中,具有偏态概率分布的结果很常见。此类结果的样本量计算通常基于转换后的尺度(例如对数)的估计值,而这些估计值有时可能难以获得。相比之下,未转换尺度上的中位数和方差的估计值通常更容易预先指定。本文的目的是描述如何根据对数正态分布结果数据的中位数和未转换方差估计值来计算两组比较的样本量。
假设结果数据服从对数正态分布,并演示了一种用于比较对数转换后结果数据的两样本 t 检验的样本量计算方法,其中感兴趣的变化指定为未转换尺度上中位数值的差异。通过模拟研究,在各种情况下将该方法与非参数替代方法(Mann-Whitney U 检验)进行比较,并将该方法应用于神经外科的实际示例。
该方法在模拟研究中达到了名义功效值,并且与 Mann-Whitney U 检验和未转换结果的两样本 t 检验相比具有优势。此外,该方法可以在某些情况下进行调整和使用,其中结果分布不严格服从对数正态分布。
我们建议在预期呈正偏态分布且计划在对数转换尺度上进行两组比较的情况下,使用此样本量计算方法来处理结果数据。与基于对数转换尺度的估计值进行通常计算相比,该方法的优势在于它允许将临床疗效指定为中位数差异,并需要在未转换尺度上估计方差。这些估计值通常比对数转换结果的估计值更容易获得且更具可解释性。