Weissbach R, Herzog M
Institut für Wirtschafts- und Sozialstatistik, FB Statistik, Universität Dortmund.
Gesundheitswesen. 2009 Mar;71(3):121-6. doi: 10.1055/s-0028-1086008. Epub 2009 Feb 16.
Surveys of the dental status of preschool children in selected regions or at specific ages are often carried out in randomly selected nursery centres (cluster sample). The mean dmft value (decayed, missing and filled primary teeth) in the population serves as an indicator of the average caries burden. It is estimated from the children's individual dmft values using statistical procedures. This study assesses the impact of the cluster structure in the data on the estimated dmft mean and its variance.
We defined 170 nursery centres with 7,578 children aged 3 to 5 years as the population and drew 100 cluster samples of 10% each (with replacement), thereby simulating 100 surveys. In each simulated survey, the population mean and the variance of its estimator were estimated both with and without accounting for the cluster structure.
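The resampling design described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' procedure: the population is synthetic (centre sizes and dmft values are invented), the function names `make_population`, `survey` and `simulate` are hypothetical, and the cluster-aware variance uses the standard ratio-estimator linearisation for clusters drawn with replacement, which the abstract does not confirm as the method used.

```python
import random
import statistics


def make_population(n_centres=170, seed=0):
    """Build a synthetic population: one list of dmft values per centre.

    Assumption: centre sizes and dmft distributions are invented for
    illustration; they do not reproduce the study data.
    """
    rng = random.Random(seed)
    centres = []
    for _ in range(n_centres):
        size = rng.randint(20, 70)
        centre_level = rng.uniform(0.2, 3.0)  # hypothetical centre-specific level
        centres.append(
            [min(20, int(rng.expovariate(1.0 / centre_level))) for _ in range(size)]
        )
    return centres


def survey(centres, rng, fraction=0.10):
    """Draw one cluster sample with replacement and estimate the dmft mean.

    Returns (mean, naive_variance, cluster_variance).
    """
    k = round(fraction * len(centres))
    sample = [rng.choice(centres) for _ in range(k)]  # clusters, with replacement
    values = [v for centre in sample for v in centre]
    n = len(values)
    mean = sum(values) / n

    # Ignoring the clusters: treat all children as a simple random sample.
    naive_var = statistics.variance(values) / n

    # Respecting the clusters: ratio-estimator linearisation over centre totals.
    m_bar = n / k
    residuals = [sum(centre) - mean * len(centre) for centre in sample]
    cluster_var = sum(r * r for r in residuals) / (k * (k - 1) * m_bar ** 2)
    return mean, naive_var, cluster_var


def simulate(n_surveys=100, seed=1):
    """Repeat the survey n_surveys times on one fixed synthetic population."""
    centres = make_population()
    rng = random.Random(seed)
    return [survey(centres, rng) for _ in range(n_surveys)]
```

Calling `simulate()` returns 100 triples. Comparing the second and third entries of each triple shows how often, and by how much, the cluster-aware variance exceeds the naive one in this synthetic setting.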
If the cluster structure is ignored in the calculation, the resulting variance of the dmft mean estimator is too small; hence confidence intervals are too narrow and tests exceed their nominal type I error rate. If, in contrast, the cluster structure is taken into account, the variance is larger in all 100 cases (variance inflation), although the magnitude of the increase varies randomly from sample to sample. For surveys based on cluster samples, this effect can influence sample size calculations. The dmft mean itself may be biased in samples of moderate size when the mean number of children per nursery centre in the sample deviates markedly from that in the population.
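To illustrate how variance inflation feeds into sample size planning, the following sketch uses the common textbook design effect, 1 + (m - 1) * ICC, where m is the mean cluster size and ICC the intracluster correlation. This is an assumption for illustration: the formula presumes roughly equal cluster sizes, the abstract does not state which adjustment the authors applied, and the function names and example numbers are hypothetical.

```python
import math


def design_effect(mean_cluster_size, icc):
    """Textbook approximation of variance inflation for equal-sized clusters."""
    return 1.0 + (mean_cluster_size - 1.0) * icc


def inflated_sample_size(n_simple_random, deff):
    """Number of children needed once the design effect is applied."""
    return math.ceil(n_simple_random * deff)


def clusters_needed(n_children, mean_cluster_size):
    """Number of nursery centres needed to reach n_children on average."""
    return math.ceil(n_children / mean_cluster_size)


# Hypothetical planning values, not taken from the study.
deff = design_effect(mean_cluster_size=11, icc=0.1)
n_children = inflated_sample_size(n_simple_random=400, deff=deff)
n_centres = clusters_needed(n_children, mean_cluster_size=11)
```

With an ICC of zero the design effect is 1 and the simple random sample size is unchanged; any positive ICC raises the required number of children and therefore of centres.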
If a cluster design is chosen for a caries epidemiology survey in nursery centres, both the sample size calculation and the analysis should take the design into account in order to avoid misleading results.