Horiuchi Shiro, Wilmoth John R, Pletcher Scott D
Program in Urban Public Health, Hunter College, 425 East 25th Street, Box 816, New York, NY 10010-2590, USA.
Demography. 2008 Nov;45(4):785-801. doi: 10.1353/dem.0.0033.
A demographic measure is often expressed as a deterministic or stochastic function of multiple variables (covariates), and a general problem (the decomposition problem) is to assess contributions of individual covariates to a difference in the demographic measure (dependent variable) between two populations. We propose a method of decomposition analysis based on an assumption that covariates change continuously along an actual or hypothetical dimension. This assumption leads to a general model that logically justifies the additivity of covariate effects and the elimination of interaction terms, even if the dependent variable itself is a nonadditive function. A comparison with earlier methods illustrates other practical advantages of the method: in addition to an absence of residuals or interaction terms, the method can easily handle a large number of covariates and does not require a logically meaningful ordering of covariates. Two empirical examples show that the method can be applied flexibly to a wide variety of decomposition problems. This study also suggests that when data are available at multiple time points over a long interval, it is more accurate to compute an aggregated decomposition based on multiple subintervals than to compute a single decomposition for the entire study period.
人口统计学指标通常表示为多个变量(协变量)的确定性或随机函数,一个普遍问题(分解问题)是评估各个协变量对两个人口群体在人口统计学指标(因变量)上差异的贡献。我们基于协变量沿实际或假设维度连续变化的假设提出了一种分解分析方法。这一假设导致了一个通用模型,该模型从逻辑上证明了协变量效应的可加性以及交互项的消除是合理的,即使因变量本身是一个非加性函数。与早期方法的比较说明了该方法的其他实际优势:除了不存在残差或交互项外,该方法可以轻松处理大量协变量,并且不需要协变量具有逻辑上有意义的排序。两个实证例子表明,该方法可以灵活地应用于各种各样的分解问题。本研究还表明,当在很长一段时间内有多个时间点的数据时,基于多个子区间计算汇总分解比为整个研究期计算单个分解更准确。