Burstyn Igor, Lavoué Jérôme, Van Tongeren Martie
Department of Environmental and Occupational Health, Drexel University, Philadelphia, PA, USA.
Ann Occup Hyg. 2012 Nov;56(9):1038-50. doi: 10.1093/annhyg/mes031. Epub 2012 Sep 17.
Job-exposure matrices (JEMs) are often used in occupational epidemiological studies to provide an exposure estimate for a typical person in a 'job' during a particular time period. A JEM can produce exposure estimates on a variety of scales, such as (but not limited to) binary assessments of presence or absence of exposure, ordinal ranking of exposure level and frequency, and quantitative exposure estimates of exposure intensity and frequency. Specifically, one popular approach to construct a JEM, engendered in a Finnish job exposure matrix (FINJEM), provides a probability that a worker within an occupational group is exposed and an estimate of intensity of exposure among the exposed workers within this occupation. Often the product of the probability and intensity (aka level) is used to obtain the estimate of exposure for the epidemiological analyses. This procedure aggregates exposure across exposed and non-exposed individuals and the effect of this particular procedure on epidemiological analyses has never been studied. We developed a theoretical framework for understanding how these aggregate exposure estimates relate to true exposure (either unexposed or log-normally distributed for 'exposed'), assuming that there is no uncertainty about estimates of level and probability of exposure. Theoretical derivations show that multiplying occupation-specific exposure level and probability of non-zero exposure results in both systematic and differential measurement errors. Simulations demonstrated that under certain conditions bias in odds ratios in a cohort study away from the null are possible and that this bias is smaller when (a) arithmetic rather than geometric mean is used to assess exposure level and (b) exposure level and prevalence are positively correlated. We illustrate the potential impact of using the specified JEM in a simulation based on a case-control study of non-Hodgkin lymphoma and exposure to ionizing and non-ionizing radiation. Inflation of standard errors in the log-odds was observed as well as bias away from null for two out of three specific exposures/data structures. Overall, it is clear that influence of the phenomenon we studied on epidemiological results is complex and difficult to predict, being influenced a great deal by the structure of data. We recommend exploring the influence of JEMs that use the product of exposure level and probability in epidemiological analyses through simulations during planning of such studies to assess both the expected extent of the potential bias in risk estimates and impact on power. The SAS and R code required to implement such simulations are provided. All our calculations are either theoretical or based on simulated data.
工作暴露矩阵(JEMs)常用于职业流行病学研究,以提供特定时间段内“工作”中典型人员的暴露估计值。JEM可以在多种尺度上产生暴露估计值,例如(但不限于)暴露存在与否的二元评估、暴露水平和频率的序数排名,以及暴露强度和频率的定量暴露估计值。具体而言,一种受芬兰工作暴露矩阵(FINJEM)启发的构建JEM的流行方法,提供了职业群体中工人暴露的概率以及该职业中暴露工人的暴露强度估计值。通常,概率和强度(即水平)的乘积用于获得流行病学分析的暴露估计值。此过程汇总了暴露个体和未暴露个体的暴露情况,而这一特定过程对流行病学分析的影响从未被研究过。我们建立了一个理论框架,用于理解这些汇总暴露估计值与真实暴露(对于“暴露”情况,要么未暴露,要么呈对数正态分布)之间的关系,假设暴露水平和暴露概率的估计不存在不确定性。理论推导表明,将特定职业的暴露水平与非零暴露概率相乘会导致系统误差和差异测量误差。模拟结果表明,在某些条件下,队列研究中优势比可能会偏离无效值,并且当(a)使用算术平均值而非几何平均值来评估暴露水平,以及(b)暴露水平和患病率呈正相关时,这种偏差会更小。我们在基于非霍奇金淋巴瘤与电离和非电离辐射暴露的病例对照研究的模拟中说明了使用特定JEM的潜在影响。观察到对数优势比的标准误差膨胀,并且在三种特定暴露/数据结构中的两种中出现了偏离无效值的偏差。总体而言,很明显我们所研究的现象对流行病学结果的影响是复杂且难以预测的,很大程度上受到数据结构的影响。我们建议在这类研究的规划阶段通过模拟来探索在流行病学分析中使用暴露水平和概率乘积的JEM的影响,以评估风险估计中潜在偏差的预期程度以及对检验效能的影响。提供了实施此类模拟所需的SAS和R代码。我们所有的计算要么是理论性的,要么基于模拟数据。