在患病率估计中是否应该对协变量进行调整？

Li Wenjun, Stanek Edward J, Bertone-Johnson Elizabeth R

Division of Preventive and Behavioral Medicine, University of Massachusetts Medical School, Worcester, MA 01655, USA.

Epidemiol Perspect Innov. 2008 Jan 25;5:2. doi: 10.1186/1742-5573-5-2.

BACKGROUND

Adjustment for covariates (also called auxiliary variables in survey sampling literature) is commonly applied in health surveys to reduce the variances of the prevalence estimators. In theory, adjusted prevalence estimators are more accurate when variance components are known. In practice, variance components needed to achieve the adjustment are unknown and their sample estimators are used instead. The uncertainty introduced by estimating variance components may overshadow the reduction in the variance of the prevalence estimators due to adjustment. We present empirical guidelines indicating when adjusted prevalence estimators should be considered, using gender adjusted and unadjusted smoking prevalence as an illustration.

METHODS

We compare the accuracy of adjusted and unadjusted prevalence estimators via simulation. We simulate simple random samples from hypothetical populations with the proportion of males ranging from 30% to 70%, the smoking prevalence ranging from 15% to 35%, and the ratio of male to female smoking prevalence ranging from 1 to 4. The ranges of gender proportions and smoking prevalences reflect the conditions in 1999-2003 Behavioral Risk Factors Surveillance System (BRFSS) data for Massachusetts. From each population, 10,000 samples are selected and the ratios of the variance of the adjusted prevalence estimators to the variance of the unadjusted (crude) ones are computed and plotted against the proportion of males by population prevalence, as well as by population and sample sizes. The prevalence ratio thresholds, above which adjusted prevalence estimators have smaller variances, are determined graphically.

RESULTS

In many practical settings, gender adjustment results in less accuracy. Whether or not there is better accuracy with adjustment depends on sample sizes, gender proportions and ratios between male and female prevalences. In populations with equal number of males and females and smoking prevalence of 20%, the adjusted prevalence estimators are more accurate when the ratios of male to female prevalences are above 2.4, 1.8, 1.6, 1.4 and 1.3 for sample sizes of 25, 50, 100, 150 and 200, respectively.

CONCLUSION

Adjustment for covariates will not result in more accurate prevalence estimator when ratio of male to female prevalences is close to one, sample size is small and risk factor prevalence is low. For example, when reporting smoking prevalence based on simple random sampling, gender adjustment is recommended only when sample size is greater than 200.

背景

在健康调查中，通常会对协变量（在抽样调查文献中也称为辅助变量）进行调整，以降低患病率估计值的方差。理论上，当方差分量已知时，调整后的患病率估计值更准确。在实际应用中，进行调整所需的方差分量是未知的，因此使用其样本估计值来代替。由于估计方差分量而引入的不确定性可能会掩盖因调整而导致的患病率估计值方差的减小。我们通过以性别调整和未调整的吸烟患病率为例，给出了何时应考虑使用调整后的患病率估计值的经验准则。

方法

我们通过模拟比较调整后的患病率估计值和未调整的患病率估计值的准确性。我们从假设总体中模拟简单随机样本，其中男性比例范围为30%至70%，吸烟患病率范围为15%至35%，男性与女性吸烟患病率之比范围为1至4。性别比例和吸烟患病率的范围反映了1999 - 2003年马萨诸塞州行为危险因素监测系统（BRFSS）数据中的情况。从每个总体中选取10,000个样本，并计算调整后的患病率估计值的方差与未调整（粗）估计值的方差之比，并根据总体患病率、总体规模和样本规模绘制该比值与男性比例的关系图。通过图形确定调整后的患病率估计值方差较小的患病率比阈值。

结果

在许多实际情况下，性别调整会导致准确性降低。调整后是否具有更高的准确性取决于样本规模、性别比例以及男性和女性患病率之间的比率。在男性和女性数量相等且吸烟患病率为20%的总体中，当样本规模分别为25、50、100、150和200时，男性与女性患病率之比分别高于2.4、1.8、1.6、1.4和1.3时，调整后的患病率估计值更准确。

结论

当男性与女性患病率之比接近1、样本规模较小且危险因素患病率较低时，对协变量进行调整不会导致更准确的患病率估计值。例如，在基于简单随机抽样报告吸烟患病率时，仅当样本规模大于200时才建议进行性别调整。

相似文献

Should adjustment for covariates be used in prevalence estimations?

Epidemiol Perspect Innov. 2008 Jan 25;5:2. doi: 10.1186/1742-5573-5-2.

Precision of systematic and random sampling in clustered populations: habitat patches and aggregating organisms.

Ecol Appl. 2016 Jan;26(1):233-48. doi: 10.1890/14-1973.

Sampling distributions, biases, variances, and confidence intervals for genetic correlations.

Theor Appl Genet. 1997 Jan;94(1):8-19. doi: 10.1007/s001220050375.

A simple hybrid variance estimator for the Kaplan-Meier survival function.

Stat Med. 2005 Mar 30;24(6):827-51. doi: 10.1002/sim.1960.

Tobacco smoking surveillance: is quota sampling an efficient tool for monitoring national trends? A comparison with a random cross-sectional survey.

PLoS One. 2013 Oct 23;8(10):e78372. doi: 10.1371/journal.pone.0078372. eCollection 2013.

Hybrid class of robust type estimators for variance estimation using mean and variance of auxiliary variable.

Heliyon. 2024 May 11;10(10):e31039. doi: 10.1016/j.heliyon.2024.e31039. eCollection 2024 May 30.

Evaluating the performance of species richness estimators: sensitivity to sample grain size.

J Anim Ecol. 2006 Jan;75(1):274-87. doi: 10.1111/j.1365-2656.2006.01048.x.

Some Shrinkage estimators based on median ranked set sampling.

J Appl Stat. 2021 Mar 16;48(13-15):2473-2498. doi: 10.1080/02664763.2021.1895088. eCollection 2021.

A new modified estimator of population variance in calibrated survey sampling.

Sci Rep. 2024 Oct 17;14(1):24385. doi: 10.1038/s41598-024-74424-2.

Comparing the small sample performance of several variance estimators under competing risks.

Stat Med. 2007 Feb 28;26(5):1170-80. doi: 10.1002/sim.2661.

引用本文的文献

Understanding the Challenges and Uncertainties of Seroprevalence Studies for SARS-CoV-2.

Int J Environ Res Public Health. 2021 Apr 27;18(9):4640. doi: 10.3390/ijerph18094640.

Prevalence of trachoma in the Afar Region of Ethiopia: results of seven population-based surveys from the Global Trachoma Mapping Project.

Ophthalmic Epidemiol. 2018 Dec;25(sup1):3-10. doi: 10.1080/09286586.2017.1362008.

本文引用的文献

Design-based random permutation models with auxiliary information.

Statistics (Ber). 2012 Jan 1;46(5):663-671. doi: 10.1080/02331888.2010.545408.

The relationship between self-reported alcohol intake and the morbidities managed by GPs in Australia.

BMC Fam Pract. 2006 Mar 14;7:17. doi: 10.1186/1471-2296-7-17.

Associations between health-related quality of life and demographics and health risks. Results from Rhode Island's 2002 behavioral risk factor survey.

Health Qual Life Outcomes. 2006 Mar 3;4:14. doi: 10.1186/1477-7525-4-14.

Causal analysis of case-control data.

Epidemiol Perspect Innov. 2006 Jan 27;3:2. doi: 10.1186/1742-5573-3-2.

A population-based study of asthma, quality of life, and occupation among elderly Hispanic and non-Hispanic whites: a cross-sectional investigation.

BMC Public Health. 2005 Sep 21;5:97. doi: 10.1186/1471-2458-5-97.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Should adjustment for covariates be used in prevalence estimations?

Epidemiol Perspect Innov. 2008 Jan 25;5:2. doi: 10.1186/1742-5573-5-2.

Precision of systematic and random sampling in clustered populations: habitat patches and aggregating organisms.

Ecol Appl. 2016 Jan;26(1):233-48. doi: 10.1890/14-1973.

Sampling distributions, biases, variances, and confidence intervals for genetic correlations.

Theor Appl Genet. 1997 Jan;94(1):8-19. doi: 10.1007/s001220050375.

A simple hybrid variance estimator for the Kaplan-Meier survival function.

Stat Med. 2005 Mar 30;24(6):827-51. doi: 10.1002/sim.1960.

Tobacco smoking surveillance: is quota sampling an efficient tool for monitoring national trends? A comparison with a random cross-sectional survey.

PLoS One. 2013 Oct 23;8(10):e78372. doi: 10.1371/journal.pone.0078372. eCollection 2013.

Hybrid class of robust type estimators for variance estimation using mean and variance of auxiliary variable.

Heliyon. 2024 May 11;10(10):e31039. doi: 10.1016/j.heliyon.2024.e31039. eCollection 2024 May 30.

Evaluating the performance of species richness estimators: sensitivity to sample grain size.

J Anim Ecol. 2006 Jan;75(1):274-87. doi: 10.1111/j.1365-2656.2006.01048.x.

Some Shrinkage estimators based on median ranked set sampling.

J Appl Stat. 2021 Mar 16;48(13-15):2473-2498. doi: 10.1080/02664763.2021.1895088. eCollection 2021.

A new modified estimator of population variance in calibrated survey sampling.

Sci Rep. 2024 Oct 17;14(1):24385. doi: 10.1038/s41598-024-74424-2.

Comparing the small sample performance of several variance estimators under competing risks.

Stat Med. 2007 Feb 28;26(5):1170-80. doi: 10.1002/sim.2661.

引用本文的文献

Understanding the Challenges and Uncertainties of Seroprevalence Studies for SARS-CoV-2.

Int J Environ Res Public Health. 2021 Apr 27;18(9):4640. doi: 10.3390/ijerph18094640.

Prevalence of trachoma in the Afar Region of Ethiopia: results of seven population-based surveys from the Global Trachoma Mapping Project.

Ophthalmic Epidemiol. 2018 Dec;25(sup1):3-10. doi: 10.1080/09286586.2017.1362008.

本文引用的文献

Design-based random permutation models with auxiliary information.

Statistics (Ber). 2012 Jan 1;46(5):663-671. doi: 10.1080/02331888.2010.545408.

The relationship between self-reported alcohol intake and the morbidities managed by GPs in Australia.

BMC Fam Pract. 2006 Mar 14;7:17. doi: 10.1186/1471-2296-7-17.

Associations between health-related quality of life and demographics and health risks. Results from Rhode Island's 2002 behavioral risk factor survey.

Health Qual Life Outcomes. 2006 Mar 3;4:14. doi: 10.1186/1477-7525-4-14.

Causal analysis of case-control data.

Epidemiol Perspect Innov. 2006 Jan 27;3:2. doi: 10.1186/1742-5573-3-2.

A population-based study of asthma, quality of life, and occupation among elderly Hispanic and non-Hispanic whites: a cross-sectional investigation.

BMC Public Health. 2005 Sep 21;5:97. doi: 10.1186/1471-2458-5-97.

Should adjustment for covariates be used in prevalence estimations?

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献