Ohanyan Haykanush, Portengen Lützen, Kaplani Oriana, Huss Anke, Hoek Gerard, Beulens Joline W J, Lakerveld Jeroen, Vermeulen Roel
Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Utrecht, the Netherlands; Department of Epidemiology and Data Science, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, Noord-Holland, the Netherlands; Amsterdam Public Health, Health Behaviours and Chronic Diseases, Amsterdam, Noord-Holland, the Netherlands; Upstream Team, www.upstreamteam.nl. Amsterdam UMC, VU University Amsterdam, Amsterdam, Noord-Holland, the Netherlands.
Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Utrecht, the Netherlands.
Environ Int. 2022 Dec;170:107592. doi: 10.1016/j.envint.2022.107592. Epub 2022 Oct 18.
Type 2 diabetes (T2D) is thought to be influenced by environmental stressors such as air pollution and noise. Although environmental factors are interrelated, studies considering the exposome are lacking. We simultaneously assessed a variety of exposures in their association with prevalent T2D by applying penalised regression Least Absolute Shrinkage and Selection Operator (LASSO), Random Forest (RF), and Artificial Neural Networks (ANN) approaches. We contrasted the findings with single-exposure models including consistently associated risk factors reported by previous studies.
Baseline data (n = 14,829) of the Occupational and Environmental Health Cohort study (AMIGO) were enriched with 85 exposome factors (air pollution, noise, built environment, neighbourhood socio-economic factors etc.) using the home addresses of participants. Questionnaires were used to identify participants with T2D (n = 676(4.6 %)). Models in all applied statistical approaches were adjusted for individual-level socio-demographic variables.
Lower average home values, higher share of non-Western immigrants and higher surface temperatures were related to higher risk of T2D in the multivariable models (LASSO, RF). Selected variables differed between the two multi-variable approaches, especially for weaker predictors. Some established risk factors (air pollutants) appeared in univariate analysis but were not among the most important factors in multivariable analysis. Other established factors (green space) did not appear in univariate, but appeared in multivariable analysis (RF). Average estimates of the prediction error (logLoss) from nested cross-validation showed that the LASSO outperformed both RF and ANN approaches.
Neighbourhood socio-economic and socio-demographic characteristics and surface temperature were consistently associated with the risk of T2D. For other physical-chemical factors associations differed per analytical approach.
2型糖尿病(T2D)被认为受空气污染和噪音等环境应激源的影响。尽管环境因素相互关联,但考虑暴露组的研究却很缺乏。我们通过应用惩罚回归最小绝对收缩和选择算子(LASSO)、随机森林(RF)和人工神经网络(ANN)方法,同时评估了多种暴露与T2D患病率之间的关联。我们将这些结果与单暴露模型进行了对比,单暴露模型纳入了先前研究报告的始终相关的危险因素。
利用参与者的家庭住址,为职业与环境健康队列研究(AMIGO)的基线数据(n = 14829)补充了85个暴露组因素(空气污染、噪音、建筑环境、邻里社会经济因素等)。通过问卷调查确定患有T2D的参与者(n = 676(4.6%))。所有应用统计方法的模型均针对个体层面的社会人口学变量进行了调整。
在多变量模型(LASSO、RF)中,较低的平均房屋价值、较高比例的非西方移民和较高的地表温度与T2D风险较高相关。两种多变量方法所选变量不同,尤其是对于较弱的预测因素。一些已确定的危险因素(空气污染物)在单变量分析中出现,但并非多变量分析中最重要的因素。其他已确定的因素(绿地)在单变量分析中未出现,但在多变量分析(RF)中出现。嵌套交叉验证的预测误差(对数损失)平均估计表明,LASSO方法优于RF和ANN方法。
邻里社会经济和社会人口学特征以及地表温度与T2D风险始终相关。对于其他物理化学因素,不同分析方法得出的关联有所不同。