一种用于验证和应用标准化小区域测量策略的新框架。

A novel framework for validating and applying standardized small area measurement strategies.

机构信息

Institute for Health Metrics and Evaluation, University of Washington, 2301 5th Ave, Suite 600, Seattle, WA 98121, USA.

出版信息

Popul Health Metr. 2010 Sep 29;8:26. doi: 10.1186/1478-7954-8-26.

DOI:10.1186/1478-7954-8-26

PMID:20920214

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2958154/

Abstract

BACKGROUND

Local measurements of health behaviors, diseases, and use of health services are critical inputs into local, state, and national decision-making. Small area measurement methods can deliver more precise and accurate local-level information than direct estimates from surveys or administrative records, where sample sizes are often too small to yield acceptable standard errors. However, small area measurement requires careful validation using approaches other than conventional statistical methods such as in-sample or cross-validation methods because they do not solve the problem of validating estimates in data-sparse domains.

METHODS

A new general framework for small area estimation and validation is developed and applied to estimate Type 2 diabetes prevalence in US counties using data from the Behavioral Risk Factor Surveillance System (BRFSS). The framework combines the three conventional approaches to small area measurement: (1) pooling data across time by combining multiple survey years; (2) exploiting spatial correlation by including a spatial component; and (3) utilizing structured relationships between the outcome variable and domain-specific covariates to define four increasingly complex model types - coined the Naive, Geospatial, Covariate, and Full models. The validation framework uses direct estimates of prevalence in large domains as the gold standard and compares model estimates against it using (i) all available observations for the large domains and (ii) systematically reduced sample sizes obtained through random sampling with replacement. At each sampling level, the model is rerun repeatedly, and the validity of the model estimates from the four model types is then determined by calculating the (average) concordance correlation coefficient (CCC) and (average) root mean squared error (RMSE) against the gold standard. The CCC is closely related to the intraclass correlation coefficient and can be used when the units are organized in groups and when it is of interest to measure the agreement between units in the same group (e.g., counties). The RMSE is often used to measure the differences between values predicted by a model or an estimator and the actually observed values. It is a useful measure to capture the precision of the model or estimator.

RESULTS

All model types have substantially higher CCC and lower RMSE than the direct, single-year BRFSS estimates. In addition, the inclusion of relevant domain-specific covariates generally improves predictive validity, especially at small sample sizes, and their leverage can be equivalent to a five- to tenfold increase in sample size.

CONCLUSIONS

Small area estimation of important health outcomes and risk factors can be improved using a systematic modeling and validation framework, which consistently outperformed single-year direct survey estimates and demonstrated the potential leverage of including relevant domain-specific covariates compared to pure measurement models. The proposed validation strategy can be applied to other disease outcomes and risk factors in the US as well as to resource-scarce situations, including low-income countries. These estimates are needed by public health officials to identify at-risk groups, to design targeted prevention and intervention programs, and to monitor and evaluate results over time.

摘要

背景

对健康行为、疾病和卫生服务使用情况进行局部测量，是地方、州和国家决策的关键投入。小区域测量方法可以提供更精确和准确的局部信息，而不是直接从调查或行政记录中进行估计，因为调查或行政记录的样本量通常太小，无法产生可接受的标准误差。然而，小区域测量需要使用传统统计方法以外的方法进行仔细验证，例如样本内或交叉验证方法，因为这些方法并不能解决在数据稀疏领域验证估计的问题。

方法

开发了一个新的小区域估计和验证的通用框架，并应用于使用来自行为风险因素监测系统（BRFSS）的数据估计美国县的 2 型糖尿病患病率。该框架结合了小区域测量的三种传统方法：（1）通过结合多个调查年份，跨时间汇集数据；（2）通过包含空间分量来利用空间相关性；（3）利用结果变量与特定领域协变量之间的结构关系，定义四个越来越复杂的模型类型-被称为朴素、地理空间、协变量和完全模型。验证框架使用大型领域的直接患病率估计作为金标准，并使用（i）大型领域的所有可用观测值和（ii）通过有放回的随机抽样获得的系统减少的样本量来比较模型估计值。在每个抽样水平上，都会重复运行模型，并通过计算与金标准的（平均）一致性相关系数（CCC）和（平均）均方根误差（RMSE）来确定来自四种模型类型的模型估计值的有效性。CCC 与组内相关系数密切相关，当单位分组且有兴趣测量同一组（例如县）内单位之间的一致性时，可以使用该系数。RMSE 通常用于测量模型或估计值预测值与实际观测值之间的差异。它是捕获模型或估计值精度的有用度量。

结果

与直接的、单一年份的 BRFSS 估计相比，所有模型类型的 CCC 都显著提高，RMSE 都显著降低。此外，纳入相关的特定领域协变量通常可以提高预测的有效性，尤其是在小样本量的情况下，其影响力相当于样本量增加五到十倍。

结论

使用系统的建模和验证框架可以改进重要健康结果和风险因素的小区域估计，该框架始终优于单一年份的直接调查估计，并证明了与纯测量模型相比，纳入相关特定领域协变量的潜在影响力。所提出的验证策略可应用于美国的其他疾病结果和风险因素，以及资源匮乏的情况，包括低收入国家。公共卫生官员需要这些估计来确定高危人群，设计有针对性的预防和干预计划，并随着时间的推移监测和评估结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1909/2958154/ec6da5124651/1478-7954-8-26-1.jpg

相似文献

A novel framework for validating and applying standardized small area measurement strategies.

Popul Health Metr. 2010 Sep 29;8:26. doi: 10.1186/1478-7954-8-26.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Small class sizes for improving student achievement in primary and secondary schools: a systematic review.

Campbell Syst Rev. 2018 Oct 11;14(1):1-107. doi: 10.4073/csr.2018.10. eCollection 2018.

Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.

Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.

The future of Cochrane Neonatal.

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

Mortality and Morbidity Effects of Long-Term Exposure to Low-Level PM, BC, NO, and O: An Analysis of European Cohorts in the ELAPSE Project.

Res Rep Health Eff Inst. 2021 Sep;2021(208):1-127.

Mortality-Air Pollution Associations in Low Exposure Environments (MAPLE): Phase 2.

Res Rep Health Eff Inst. 2022 Jul;2022(212):1-91.

Small-area estimation for public health surveillance using electronic health record data: reducing the impact of underrepresentation.

BMC Public Health. 2022 Aug 9;22(1):1515. doi: 10.1186/s12889-022-13809-2.

Effectiveness and cost-effectiveness of four different strategies for SARS-CoV-2 surveillance in the general population (CoV-Surv Study): a structured summary of a study protocol for a cluster-randomised, two-factorial controlled trial.

Trials. 2021 Jan 8;22(1):39. doi: 10.1186/s13063-020-04982-z.

引用本文的文献

Prevalence and regional distribution of obstructive sleep apnea in Canada: Analysis from the Canadian Longitudinal Study on Aging.

Can J Public Health. 2024 Dec;115(6):970-979. doi: 10.17269/s41997-024-00911-8. Epub 2024 Jul 22.

Impact of the use of small-area models on estimation of attributable mortality at a regional level.

Eur J Public Health. 2024 Dec 1;34(6):1218-1224. doi: 10.1093/eurpub/ckae104.

Adverse Childhood Experiences Among U.S. Adults: National and State Estimates by Adversity Type, 2019-2020.

Am J Prev Med. 2024 Jul;67(1):55-66. doi: 10.1016/j.amepre.2024.02.010. Epub 2024 Feb 17.

Bioinformatics reveals the pathophysiological relationship between diabetic nephropathy and periodontitis in the context of aging.

Heliyon. 2024 Jan 18;10(2):e24872. doi: 10.1016/j.heliyon.2024.e24872. eCollection 2024 Jan 30.

Validation of a small-area model for estimation of smoking prevalence at a subnational level.

Tob Induc Dis. 2023 Sep 1;21:112. doi: 10.18332/tid/169683. eCollection 2023.

Small-area models to assess the geographical distribution of tobacco consumption by sex and age in Spain.

Tob Induc Dis. 2023 May 18;21:63. doi: 10.18332/tid/162379. eCollection 2023.

Evaluation of the long-term variability of macular OCT/OCTA and visual field parameters.

Br J Ophthalmol. 2024 Jan 29;108(2):211-216. doi: 10.1136/bjo-2022-322470.

Life expectancy by county, race, and ethnicity in the USA, 2000-19: a systematic analysis of health disparities.

Lancet. 2022 Jul 2;400(10345):25-38. doi: 10.1016/S0140-6736(22)00876-5. Epub 2022 Jun 16.

Mapping HIV prevalence in Nigeria using small area estimates to develop a targeted HIV intervention strategy.

PLoS One. 2022 Jun 8;17(6):e0268892. doi: 10.1371/journal.pone.0268892. eCollection 2022.

US county-level estimation for maternal and infant health-related behavior indicators using pregnancy risk assessment monitoring system data, 2016-2018.

Popul Health Metr. 2022 May 21;20(1):14. doi: 10.1186/s12963-022-00291-6.

本文引用的文献

Bayesian Small Area Estimates of Diabetes Incidence by United States County, 2009.

J Data Sci. 2013 Apr;11(1):269-280.

The Behavioral Risk Factors Surveillance System: past, present, and future.

Annu Rev Public Health. 2009;30:43-54. doi: 10.1146/annurev.publhealth.031308.100226.

Small-area estimation and prioritizing communities for tobacco control efforts in Massachusetts.

Am J Public Health. 2009 Mar;99(3):470-9. doi: 10.2105/AJPH.2007.130112. Epub 2009 Jan 15.

Small-area estimation and prioritizing communities for obesity control in Massachusetts.

Am J Public Health. 2009 Mar;99(3):511-9. doi: 10.2105/AJPH.2008.137364. Epub 2009 Jan 15.

Tracking chronic disease and risk behavior prevalence as survey participation declines: statistics from the behavioral risk factor surveillance system and other national surveys.

Prev Chronic Dis. 2008 Jul;5(3):A80. Epub 2008 Jun 15.

Monitoring county-level vaccination coverage during the 2004-2005 influenza season.

Am J Prev Med. 2006 Oct;31(4):275-280. doi: 10.1016/j.amepre.2006.06.005. Epub 2006 Aug 28.

Prevalence of adult binge drinking: a comparison of two national surveys.

Am J Prev Med. 2004 Oct;27(3):197-204. doi: 10.1016/j.amepre.2004.05.004.

Diabetes trends in the USA.

Diabetes Metab Res Rev. 2002 Sep-Oct;18 Suppl 3:S21-6. doi: 10.1002/dmrr.289.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于验证和应用标准化小区域测量策略的新框架。

A novel framework for validating and applying standardized small area measurement strategies.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献