Orthodontic Department, Eastman Dental Institute, London, UK.
Eur J Orthod. 2012 Apr;34(2):158-63. doi: 10.1093/ejo/cjr010. Epub 2011 Mar 29.
This study examined the effects of different sample sizes and different levels of bias (systematic error) between replicated measurements on the accuracy of estimates of random error calculated using two common formulae: Dahlberg's and the 'method of moments' estimator (MME). Computer-based numerical simulations were used to generate clinically realistic measurements involving random errors with a known distribution. For each simulation, two sets of 'measured values' were generated to provide the replicated data necessary for the estimation of the random error. Dahlberg's and the MME formula were applied to these paired data sets and the resulting estimates of error compared with the 'true' error. Nine different sample sizes (n = 2, 5, 10, 15, 20, 25, 30, 50, and 100) and two different types of bias (additive and multiplicative) were examined for their effect on the estimated error. In each case, the estimates of the random error were based on the distribution of 5000 separate simulations. The results indicate that with a sample of less than 25-30 replicated measurements, the resulting estimates of error are potentially unreliable and may under or overestimate the true error, irrespective of the formula used in the calculation. Where, however, a bias exists between the replicate measurements, Dahlberg's formula can be expected to overestimate the true value of the random error even where the biases are small and difficult to detect by standard statistical tests. No such distorting effect was found for the MME formula, which provided estimates of error that were not meaningfully different from the true value even where relatively large biases existed between the replicates. These results suggest the following: 1. A sample of at least 25-30 cases should be replicated to provide an estimate of the random error. 2. Where the original study contains fewer than 20 cases, the estimate of error will be unreliable. In these circumstances, it would be helpful if a confidence interval for the true error was also quoted. 3. Unless one can be absolutely sure that no bias exists between the replicate measurements, Dahlberg's formula should be avoided and the MME formula used instead.
本研究考察了不同样本量和在重复测量之间不同程度的偏倚(系统误差)对使用两种常用公式(达贝尔格公式和矩估计法(MME))计算随机误差估计值准确性的影响。基于计算机的数值模拟用于生成涉及具有已知分布的随机误差的临床真实测量。对于每个模拟,生成两组“测量值”以提供用于估计随机误差的重复数据。将达贝尔格公式和 MME 公式应用于这两组配对数据集,并将得到的误差估计值与“真实”误差进行比较。研究了九种不同的样本量(n=2、5、10、15、20、25、30、50 和 100)和两种不同类型的偏倚(加性和乘性)对估计误差的影响。在每种情况下,随机误差的估计值都是基于 5000 次单独模拟的分布。结果表明,对于少于 25-30 个重复测量的样本,所得误差估计值可能不可靠,并且可能低估或高估真实误差,而与计算中使用的公式无关。然而,在重复测量之间存在偏倚的情况下,即使偏差较小且难以通过标准统计检验检测到,达贝尔格公式也可能高估随机误差的真实值。对于 MME 公式,没有发现这种扭曲效应,即使在重复测量之间存在相对较大的偏差,它也提供了与真实值没有明显差异的误差估计值。这些结果表明:1. 应该复制至少 25-30 个案例以提供随机误差的估计值。2. 原始研究中包含的案例少于 20 个时,误差估计值将不可靠。在这种情况下,如果还能引用真实误差的置信区间,将会有所帮助。3. 除非可以绝对确定重复测量之间不存在偏倚,否则应避免使用达贝尔格公式并改用 MME 公式。