Oniris, INRAE, LABERCA, 44300, Nantes, France.
Oniris, INRAE, LABERCA, 44300, Nantes, France.
Environ Pollut. 2023 Aug 1;330:121741. doi: 10.1016/j.envpol.2023.121741. Epub 2023 Apr 29.
Humans are exposed to a growing list of synthetic chemicals, some of them becoming a major public health concern due to their capacity to impact multiple biological endpoints and contribute to a range of chronic diseases. The integration of endogenous (omic) biomarkers of effect in environmental health studies has been growing during the last decade, aiming to gain insight into potential mechanisms linking the exposures and the clinical conditions. The emergence of high-throughput omic platforms has raised a list of statistical challenges posed by the large dimension and complexity of data generated. Thus, the aim of the present study was to critically review the current state-of-the-science about statistical approaches used to integrate endogenous biomarkers in environmental-health studies linking chemical exposures with health outcomes. The present review specifically focused on internal exposure to environmental chemical pollutants, involving both persistent organic pollutants (POPs) and non-persistent pollutants like phthalates or bisphenols, and metals. We identified 42 eligible articles published since 2016, reporting 48 different statistical workflows, mostly focused on POPs and using metabolomic profiling in the intermediate layer. The outcomes were mainly binary and focused on metabolic disorders. A large diversity of statistical strategies were reported to integrate chemical mixtures and endogenous biomarkers to characterize their associations with health conditions. Multivariate regression models were the most predominant statistical method reported in the published workflows, however some studies applied latent based methods or multipollutant models to overcome the specific constraints of omic or exposure data. A minority of studies used formal mediation analysis to characterize the indirect effects mediated by the endogenous biomarkers. The principles of each specific statistical method and overall workflow set-up are summarized in the light of highlighting their applicability, strengths and weaknesses or interpretability to gain insight into the causal structures underlying the triad: exposure, effect-biomarker and outcome.
人类不断接触到越来越多的合成化学物质,其中一些由于能够影响多种生物终点并导致一系列慢性疾病而成为主要的公共卫生关注点。在过去十年中,环境健康研究中整合内源性(组学)效应生物标志物的方法不断发展,旨在深入了解将暴露与临床状况联系起来的潜在机制。高通量组学平台的出现带来了一系列统计挑战,这些挑战源于生成的数据的高维性和复杂性。因此,本研究旨在批判性地回顾目前用于将化学暴露与健康结果联系起来的环境健康研究中整合内源性生物标志物的统计方法的科学现状。本综述特别关注内源性暴露于环境化学污染物,包括持久性有机污染物 (POPs) 和非持久性污染物,如邻苯二甲酸酯或双酚 A,以及金属。我们确定了自 2016 年以来发表的 42 篇符合条件的文章,报告了 48 种不同的统计工作流程,主要集中在 POPs 上,并在中间层使用代谢组学分析。结果主要是二进制的,重点是代谢紊乱。报道了大量不同的统计策略来整合化学混合物和内源性生物标志物,以描述它们与健康状况的关联。多元回归模型是已发表工作流程中报告的最主要的统计方法,但一些研究应用基于潜在变量的方法或多污染物模型来克服组学或暴露数据的特定限制。少数研究使用正式的中介分析来描述内源性生物标志物介导的间接效应。根据其适用性、优势和劣势或可解释性,总结了每种特定统计方法和整体工作流程设置的原则,以深入了解暴露、效应生物标志物和结果这三者之间的因果结构。