基于排列的相关化学混合物加权和回归推断方法。

A permutation-based approach to inference for weighted sum regression with correlated chemical mixtures.

机构信息

Division of Biostatistics, University of Minnesota School of Public Health, Minneapolis, MN, USA.

Department of Biostatistics and Epidemiology, Rutgers School of Public Health, Environmental and Occupational Health Sciences Institute (EOHSI), Rutgers University, Piscataway, NJ, USA.

出版信息

Stat Methods Med Res. 2022 Apr;31(4):579-593. doi: 10.1177/09622802211013578. Epub 2022 Feb 6.

DOI:10.1177/09622802211013578

PMID:35128995

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9883011/

Abstract

There is a growing demand for methods to determine the effects that chemical mixtures have on human health. One statistical challenge is identifying true "bad actors" from a mixture of highly correlated predictors, a setting in which standard approaches such as linear regression become highly variable. Weighted Quantile Sum regression has been proposed to address this problem, through a two-step process where mixture component weights are estimated using bootstrap aggregation in a training dataset and inference on the overall mixture effect occurs in a held-out test set. Weighted Quantile Sum regression is popular in applied papers, but the reliance on data splitting is suboptimal, and analysts who use the same data for both steps risk inflating the Type I error rate. We therefore propose a modification of Weighted Quantile Sum regression that uses a permutation test for inference, which allows for weight estimation using the entire dataset and preserves Type I error. To minimize computational burden, we propose replacing the bootstrap with L1 or L2 penalization and describe how to choose the appropriate penalty given expert knowledge about a mixture of interest. We apply our method to a national pregnancy cohort study of prenatal phthalate exposure and child health outcomes.

摘要

人们越来越需要方法来确定化学混合物对人类健康的影响。其中一个统计学挑战是从高度相关的预测因子混合物中识别真正的“不良因素”，在这种情况下，标准方法（如线性回归）变得高度可变。加权分位数和回归已被提议用于解决这个问题，通过两步过程，在训练数据集中使用引导聚合来估计混合物成分权重，然后在保留的测试集中进行整体混合物效应的推断。加权分位数和回归在应用论文中很流行，但对数据分割的依赖是次优的，并且在两个步骤中使用相同数据的分析师有夸大Ⅰ型错误率的风险。因此，我们提出了一种加权分位数和回归的修改方法，该方法使用置换检验进行推断，这允许使用整个数据集进行权重估计，并保持Ⅰ型错误率。为了最小化计算负担，我们建议用 L1 或 L2 惩罚来代替引导，并且描述了如何根据对感兴趣的混合物的专业知识来选择适当的惩罚。我们将我们的方法应用于一项全国性的妊娠队列研究，该研究调查了产前邻苯二甲酸酯暴露与儿童健康结果之间的关系。

相似文献

A permutation-based approach to inference for weighted sum regression with correlated chemical mixtures.基于排列的相关化学混合物加权和回归推断方法。

Stat Methods Med Res. 2022 Apr;31(4):579-593. doi: 10.1177/09622802211013578. Epub 2022 Feb 6.

A Permutation Test-Based Approach to Strengthening Inference on the Effects of Environmental Mixtures: Comparison between Single-Index Analytic Methods.基于排列检验的方法增强环境混合物效应推断的研究：单指标分析方法的比较。

Environ Health Perspect. 2022 Aug;130(8):87010. doi: 10.1289/EHP10570. Epub 2022 Aug 30.

Part 1. Statistical Learning Methods for the Effects of Multiple Air Pollution Constituents.第1部分. 多种空气污染成分影响的统计学习方法

Res Rep Health Eff Inst. 2015 Jun(183 Pt 1-2):5-50.

Perinatal phthalates exposure decreases fine-motor functions in 11-year-old girls: Results from weighted Quantile sum regression.围产期邻苯二甲酸酯暴露降低 11 岁女孩的精细运动功能：加权分位数和回归的结果。

Environ Int. 2020 Mar;136:105424. doi: 10.1016/j.envint.2019.105424. Epub 2019 Dec 24.

Phthalate mixtures in pregnancy, autistic traits, and adverse childhood behavioral outcomes.孕期邻苯二甲酸酯混合物与自闭症特征和儿童期不良行为结局的关系

Environ Int. 2021 Feb;147:106330. doi: 10.1016/j.envint.2020.106330. Epub 2021 Jan 5.

Association between exposure to a mixture of phenols, pesticides, and phthalates and obesity: Comparison of three statistical models.酚类、农药和邻苯二甲酸酯混合物暴露与肥胖的关联：三种统计模型的比较。

Environ Int. 2019 Feb;123:325-336. doi: 10.1016/j.envint.2018.11.076. Epub 2018 Dec 14.

A Quantile-Based g-Computation Approach to Addressing the Effects of Exposure Mixtures.基于分位数的 g 计算方法在解决暴露混合物影响中的应用。

Environ Health Perspect. 2020 Apr;128(4):47004. doi: 10.1289/EHP5838. Epub 2020 Apr 7.

A weighted quantile sum regression with penalized weights and two indices.带惩罚权重和两个指标的加权分位数和回归。

Front Public Health. 2023 Jul 18;11:1151821. doi: 10.3389/fpubh.2023.1151821. eCollection 2023.

A cohort study evaluation of maternal PCB exposure related to time to pregnancy in daughters.队列研究评价母亲 PCB 暴露与女儿妊娠时间的关系。

Environ Health. 2013 Aug 20;12(1):66. doi: 10.1186/1476-069X-12-66.

Bayesian Group Index Regression for Modeling Chemical Mixtures and Cancer Risk.用于化学混合物建模和癌症风险评估的贝叶斯组指数回归

Int J Environ Res Public Health. 2021 Mar 27;18(7):3486. doi: 10.3390/ijerph18073486.

引用本文的文献

Evaluating Chemical Mixtures in Epidemiological Studies to Inform Regulatory Decisions.评估流行病学研究中的化学混合物，为监管决策提供信息。

Environ Health Perspect. 2023 Apr;131(4):45001. doi: 10.1289/EHP11899. Epub 2023 Apr 6.

Environ Health Perspect. 2022 Aug;130(8):87010. doi: 10.1289/EHP10570. Epub 2022 Aug 30.

本文引用的文献

In utero metal exposures measured in deciduous teeth and birth outcomes in a racially-diverse urban cohort.在一个种族多样化的城市队列中，从乳牙中测量的宫内金属暴露与出生结局。

Environ Res. 2019 Apr;171:444-451. doi: 10.1016/j.envres.2019.01.054. Epub 2019 Jan 31.

Characterization of Weighted Quantile Sum Regression for Highly Correlated Data in a Risk Analysis Setting.风险分析环境中高度相关数据的加权分位数和回归的特征描述

J Agric Biol Environ Stat. 2015 Mar;20(1):100-120. doi: 10.1007/s13253-014-0180-3. Epub 2014 Dec 24.

Statistical Approaches to Address Multi-Pollutant Mixtures and Multiple Exposures: the State of the Science.统计方法在解决多污染物混合物和多种暴露问题上的应用：科学现状。

Curr Environ Health Rep. 2017 Dec;4(4):481-490. doi: 10.1007/s40572-017-0162-z.

Early Prenatal Phthalate Exposure, Sex Steroid Hormones, and Birth Outcomes.孕期早期邻苯二甲酸盐暴露、性类固醇激素与出生结局

J Clin Endocrinol Metab. 2017 Jun 1;102(6):1870-1878. doi: 10.1210/jc.2016-3837.

Early-life exposure to EDCs: role in childhood obesity and neurodevelopment.早年接触环境内分泌干扰物：在儿童肥胖和神经发育中的作用。

Nat Rev Endocrinol. 2017 Mar;13(3):161-173. doi: 10.1038/nrendo.2016.186. Epub 2016 Nov 18.

Association Between Dietary Intake and Function in Amyotrophic Lateral Sclerosis.饮食摄入与肌萎缩侧索硬化症功能的关系。

JAMA Neurol. 2016 Dec 1;73(12):1425-1432. doi: 10.1001/jamaneurol.2016.3401.

CO-occurring exposure to perchlorate, nitrate and thiocyanate alters thyroid function in healthy pregnant women.同时接触高氯酸盐、硝酸盐和硫氰酸盐会改变健康孕妇的甲状腺功能。

Environ Res. 2015 Nov;143(Pt A):1-9. doi: 10.1016/j.envres.2015.09.013. Epub 2015 Sep 25.

The association between maternal urinary phthalate concentrations and blood pressure in pregnancy: The HOME Study.孕妇尿中邻苯二甲酸盐浓度与孕期血压之间的关联：家庭研究

Environ Health. 2015 Sep 17;14:75. doi: 10.1186/s12940-015-0062-3.

Analysis of Environmental Chemical Mixtures and Non-Hodgkin Lymphoma Risk in the NCI-SEER NHL Study.国立癌症研究所监测、流行病学和最终结果计划（NCI-SEER）非霍奇金淋巴瘤（NHL）研究中环境化学混合物与非霍奇金淋巴瘤风险分析

Environ Health Perspect. 2015 Oct;123(10):965-70. doi: 10.1289/ehp.1408630. Epub 2015 Mar 6.

First trimester phthalate exposure and anogenital distance in newborns.孕早期邻苯二甲酸盐暴露与新生儿肛门生殖器距离

Hum Reprod. 2015 Apr;30(4):963-72. doi: 10.1093/humrep/deu363. Epub 2015 Feb 18.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验