Liu Jeremiah Zhe, Deng Wenying, Lee Jane, Lin Pi-I Debby, Valeri Linda, Christiani David C, Bellinger David C, Wright Robert O, Mazumdar Maitreyi M, Coull Brent A
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
J Am Stat Assoc. 2022;117(538):561-573. doi: 10.1080/01621459.2021.1962889. Epub 2021 Sep 20.
Gene-environment and nutrition-environment studies often involve testing of high-dimensional interactions between two sets of variables, each having potentially complex nonlinear main effects on an outcome. Construction of a valid and powerful hypothesis test for such an interaction is challenging, due to the difficulty in constructing an efficient and unbiased estimator for the complex, nonlinear main effects. In this work we address this problem by proposing a Cross-validated Ensemble of Kernels (CVEK) that learns the space of appropriate functions for the main effects using a cross-validated ensemble approach. With a carefully chosen library of base kernels, CVEK flexibly estimates the form of the main-effect functions from the data, and encourages test power by guarding against over-fitting under the alternative. The method is motivated by a study on the interaction between metal exposures and maternal nutrition on children's neurodevelopment in rural Bangladesh. The proposed tests identified evidence of an interaction between minerals and vitamins intake and arsenic and manganese exposures. Results suggest that the detrimental effects of these metals are most pronounced at low intake levels of the nutrients, suggesting nutritional interventions in pregnant women could mitigate the adverse impacts of metal exposures on children's neurodevelopment.
基因-环境和营养-环境研究通常涉及对两组变量之间的高维相互作用进行测试,每组变量对一个结果都可能具有潜在复杂的非线性主效应。由于难以构建针对复杂非线性主效应的高效且无偏估计量,因此为这种相互作用构建有效且强大的假设检验具有挑战性。在这项工作中,我们通过提出一种交叉验证核集成(CVEK)方法来解决这个问题,该方法使用交叉验证集成方法来学习主效应的合适函数空间。通过精心选择的基核库,CVEK可以灵活地从数据中估计主效应函数的形式,并通过防止在备择假设下的过度拟合来提高检验效能。该方法的灵感来自于对孟加拉国农村地区金属暴露与孕产妇营养对儿童神经发育相互作用的一项研究。所提出的检验方法发现了矿物质和维生素摄入量与砷和锰暴露之间存在相互作用的证据。结果表明,这些金属的有害影响在营养素摄入量较低时最为明显,这表明对孕妇进行营养干预可以减轻金属暴露对儿童神经发育的不利影响。