Reich Brian J, Guan Yawen, Fourches Denis, Warren Joshua L, Sarnat Stefanie E, Chang Howard H
Department of Statistics, North Carolina State University.
Department of Statistics, University of Nebraska.
Ann Appl Stat. 2020 Dec;14(4):1945-1963. doi: 10.1214/20-AOAS1364. Epub 2020 Dec 19.
Humans are concurrently exposed to chemically, structurally and toxicologically diverse chemicals. A critical challenge for environmental epidemiology is to quantify the risk of adverse health outcomes resulting from exposures to such chemical mixtures and to identify which mixture constituents may be driving etiologic associations. A variety of statistical methods have been proposed to address these critical research questions. However, they generally rely solely on measured exposure and health data available within a specific study. Advancements in understanding of the role of mixtures on human health impacts may be better achieved through the utilization of external data and knowledge from multiple disciplines with innovative statistical tools. In this paper we develop new methods for health analyses that incorporate auxiliary information about the chemicals in a mixture, such as physicochemical, structural and/or toxicological data. We expect that the constituents identified using auxiliary information will be more biologically meaningful than those identified by methods that solely utilize observed correlations between measured exposure. We develop flexible Bayesian models by specifying prior distributions for the exposures and their effects that include auxiliary information and examine this idea over a spectrum of analyses from regression to factor analysis. The methods are applied to study the effects of volatile organic compounds on emergency room visits in Atlanta. We find that including cheminformatic information about the exposure variables improves prediction and provides a more interpretable model for emergency room visits for respiratory diseases.
人类同时接触化学、结构和毒理学性质各异的化学物质。环境流行病学面临的一项关键挑战是量化接触此类化学混合物导致不良健康后果的风险,并确定哪些混合物成分可能是病因关联的驱动因素。人们已经提出了多种统计方法来解决这些关键研究问题。然而,它们通常仅依赖于特定研究中可用的测量暴露数据和健康数据。通过利用来自多个学科的外部数据和知识以及创新的统计工具,可能会更好地推进对混合物在人类健康影响方面作用的理解。在本文中,我们开发了用于健康分析的新方法,这些方法纳入了有关混合物中化学物质的辅助信息,例如物理化学、结构和/或毒理学数据。我们预计,使用辅助信息确定的成分比仅利用测量暴露之间观察到的相关性的方法确定的成分在生物学上更有意义。我们通过为暴露及其影响指定包含辅助信息的先验分布来开发灵活的贝叶斯模型,并在从回归到因子分析的一系列分析中检验这一想法。这些方法被应用于研究挥发性有机化合物对亚特兰大急诊室就诊的影响。我们发现,纳入有关暴露变量的化学信息学信息可以改善预测,并为呼吸系统疾病的急诊室就诊提供一个更具解释性的模型。