Institute for Health Metrics and Evaluation, University of Washington, 2301 5th Ave, Suite 600, Seattle, WA 98121, USA.
Popul Health Metr. 2010 Sep 29;8:26. doi: 10.1186/1478-7954-8-26.
Local measurements of health behaviors, diseases, and use of health services are critical inputs into local, state, and national decision-making. Small area measurement methods can deliver more precise and accurate local-level information than direct estimates from surveys or administrative records, where sample sizes are often too small to yield acceptable standard errors. However, small area measurement requires careful validation using approaches other than conventional statistical methods such as in-sample or cross-validation methods because they do not solve the problem of validating estimates in data-sparse domains.
A new general framework for small area estimation and validation is developed and applied to estimate Type 2 diabetes prevalence in US counties using data from the Behavioral Risk Factor Surveillance System (BRFSS). The framework combines the three conventional approaches to small area measurement: (1) pooling data across time by combining multiple survey years; (2) exploiting spatial correlation by including a spatial component; and (3) utilizing structured relationships between the outcome variable and domain-specific covariates to define four increasingly complex model types - coined the Naive, Geospatial, Covariate, and Full models. The validation framework uses direct estimates of prevalence in large domains as the gold standard and compares model estimates against it using (i) all available observations for the large domains and (ii) systematically reduced sample sizes obtained through random sampling with replacement. At each sampling level, the model is rerun repeatedly, and the validity of the model estimates from the four model types is then determined by calculating the (average) concordance correlation coefficient (CCC) and (average) root mean squared error (RMSE) against the gold standard. The CCC is closely related to the intraclass correlation coefficient and can be used when the units are organized in groups and when it is of interest to measure the agreement between units in the same group (e.g., counties). The RMSE is often used to measure the differences between values predicted by a model or an estimator and the actually observed values. It is a useful measure to capture the precision of the model or estimator.
All model types have substantially higher CCC and lower RMSE than the direct, single-year BRFSS estimates. In addition, the inclusion of relevant domain-specific covariates generally improves predictive validity, especially at small sample sizes, and their leverage can be equivalent to a five- to tenfold increase in sample size.
Small area estimation of important health outcomes and risk factors can be improved using a systematic modeling and validation framework, which consistently outperformed single-year direct survey estimates and demonstrated the potential leverage of including relevant domain-specific covariates compared to pure measurement models. The proposed validation strategy can be applied to other disease outcomes and risk factors in the US as well as to resource-scarce situations, including low-income countries. These estimates are needed by public health officials to identify at-risk groups, to design targeted prevention and intervention programs, and to monitor and evaluate results over time.
对健康行为、疾病和卫生服务使用情况进行局部测量,是地方、州和国家决策的关键投入。小区域测量方法可以提供更精确和准确的局部信息,而不是直接从调查或行政记录中进行估计,因为调查或行政记录的样本量通常太小,无法产生可接受的标准误差。然而,小区域测量需要使用传统统计方法以外的方法进行仔细验证,例如样本内或交叉验证方法,因为这些方法并不能解决在数据稀疏领域验证估计的问题。
开发了一个新的小区域估计和验证的通用框架,并应用于使用来自行为风险因素监测系统(BRFSS)的数据估计美国县的 2 型糖尿病患病率。该框架结合了小区域测量的三种传统方法:(1)通过结合多个调查年份,跨时间汇集数据;(2)通过包含空间分量来利用空间相关性;(3)利用结果变量与特定领域协变量之间的结构关系,定义四个越来越复杂的模型类型-被称为朴素、地理空间、协变量和完全模型。验证框架使用大型领域的直接患病率估计作为金标准,并使用(i)大型领域的所有可用观测值和(ii)通过有放回的随机抽样获得的系统减少的样本量来比较模型估计值。在每个抽样水平上,都会重复运行模型,并通过计算与金标准的(平均)一致性相关系数(CCC)和(平均)均方根误差(RMSE)来确定来自四种模型类型的模型估计值的有效性。CCC 与组内相关系数密切相关,当单位分组且有兴趣测量同一组(例如县)内单位之间的一致性时,可以使用该系数。RMSE 通常用于测量模型或估计值预测值与实际观测值之间的差异。它是捕获模型或估计值精度的有用度量。
与直接的、单一年份的 BRFSS 估计相比,所有模型类型的 CCC 都显著提高,RMSE 都显著降低。此外,纳入相关的特定领域协变量通常可以提高预测的有效性,尤其是在小样本量的情况下,其影响力相当于样本量增加五到十倍。
使用系统的建模和验证框架可以改进重要健康结果和风险因素的小区域估计,该框架始终优于单一年份的直接调查估计,并证明了与纯测量模型相比,纳入相关特定领域协变量的潜在影响力。所提出的验证策略可应用于美国的其他疾病结果和风险因素,以及资源匮乏的情况,包括低收入国家。公共卫生官员需要这些估计来确定高危人群,设计有针对性的预防和干预计划,并随着时间的推移监测和评估结果。