Korn E L, Graubard B I
Biometric Research Branch, National Cancer Institute, Bethesda, MD 20892.
Am J Public Health. 1991 Sep;81(9):1166-73. doi: 10.2105/ajph.81.9.1166.
Since large-scale health surveys usually have complicated sampling schemes, there is often a question as to whether the sampling design must be considered in the analysis of the data. A recent disagreement concerning the analysis of a body iron stores-cancer association found in the first National Health and Nutrition Examination Survey and its follow-up is used to highlight the issues.
We explain and illustrate the importance of two aspects of the sampling design: clustering and weighting of observations. The body iron stores-cancer data are reanalyzed by utilizing or ignoring various aspects of the sampling design. Simple formulas are given to describe how using the sampling design of a survey in the analysis will affect the conclusions of that analysis.
The different analyses of the body iron stores-cancer data lead to very different conclusions. Application of the simple formulas suggests that utilization of the sample clustering in the analysis is appropriate, but that a standard utilization of the sample weights leads to an uninformative analysis. The recommended analysis incorporates the sampling weights in a nonstandard way and the sample clustering in the standard way.
Which particular aspects of the sampling design to use in the analysis of complex survey data and how to use them depend on certain features of the design. We give some guidelines for when to use the sample clustering and sample weights in the analysis.
由于大规模健康调查通常采用复杂的抽样方案,因此在数据分析中是否必须考虑抽样设计常常成为一个问题。最近关于在第一次全国健康和营养检查调查及其随访中发现的体内铁储存与癌症关联分析的分歧被用来突出这些问题。
我们解释并说明了抽样设计两个方面的重要性:观察值的聚类和加权。通过利用或忽略抽样设计的各个方面,对体内铁储存与癌症数据进行重新分析。给出了简单公式来描述在分析中使用调查的抽样设计将如何影响该分析的结论。
对体内铁储存与癌症数据的不同分析得出了截然不同的结论。简单公式的应用表明,在分析中利用样本聚类是合适的,但标准地使用样本权重会导致无信息的分析。推荐的分析以非标准方式纳入抽样权重,并以标准方式纳入样本聚类。
在复杂调查数据分析中使用抽样设计的哪些特定方面以及如何使用它们取决于设计的某些特征。我们给出了一些在分析中何时使用样本聚类和样本权重的指导原则。