Joubert Bonnie R, Palmer Glenn, Dunson David, Kioumourtzoglou Marianthi-Anna, Coull Brent A
National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, USA.
Department of Statistical Science, Duke University, Durham, NC, USA.
medRxiv. 2024 Dec 22:2024.12.20.24318087. doi: 10.1101/2024.12.20.24318087.
Human exposure to complex, changing, and variably correlated mixtures of environmental chemicals has presented analytical challenges to epidemiologists and human health researchers. There have been a wide variety of recent advances in statistical methods for analyzing mixtures data, with most of these methods having open-source software for implementation. However, there is no one-size-fits-all method for analyzing mixtures data given the considerable heterogeneity in scientific focus and study design. For example, some methods focus on predicting the overall health effect of a mixture and others seek to disentangle main effects and pairwise interactions. Some methods are only appropriate for cross-sectional designs, while other methods can accommodate longitudinally measured exposures or outcomes. This article focuses on greatly simplifying the daunting task of identifying which methods are most suitable for a particular study design, data type, and scientific focus. With this goal in mind, we present an organized workflow for statistical analysis considerations in environmental mixtures data. This systematic strategy builds on epidemiological and statistical principles, considering specific nuances for the mixtures' context. We also describe an accompanying online methods repository in development to increase awareness of and inform application of existing methods and new methods as they are developed and identify gaps in existing methods warranting further development.
人类接触环境化学物质的复杂、不断变化且具有不同相关性的混合物,给流行病学家和人类健康研究人员带来了分析方面的挑战。近期在分析混合物数据的统计方法上有了各种各样的进展,其中大多数方法都有开源软件可供实施。然而,鉴于科学重点和研究设计存在相当大的异质性,不存在一种适用于所有情况的分析混合物数据的方法。例如,一些方法侧重于预测混合物的总体健康影响,而另一些方法则试图区分主要影响和成对相互作用。一些方法仅适用于横断面设计,而其他方法可以处理纵向测量的暴露或结果。本文着重极大地简化确定哪些方法最适合特定研究设计、数据类型和科学重点这一艰巨任务。出于这一目标,我们提出了一个用于环境混合物数据统计分析考量的有条理的工作流程。这种系统策略基于流行病学和统计原则,考虑了混合物背景的特定细微差别。我们还描述了一个正在开发的配套在线方法库,以提高对现有方法和新开发方法的认识并为其应用提供信息,并识别现有方法中需要进一步开发的差距。