Suppr超能文献

优化空气污染暴露评估及其在认知功能中的应用

Optimizing Air Pollution Exposure Assessment with Application to Cognitive Function.

作者信息

Sheppard L, Blanco M N, Kim S-Y, Doubleday A, Cheng S, Zuidema C, Bi J, Gassett A, Shojaie A, Szpiro A A

机构信息

University of Washington, Seattle, Washington, USA.

National Cancer Center Graduate School of Cancer Science and Policy, Gyeonggi-do, South Korea.

出版信息

Res Rep Health Eff Inst. 2025 Aug;2025(228):1-117.

Abstract

INTRODUCTION

Epidemiological studies often make use of exposure data that is collected in opportunistic and logistically convenient ways. And, while exposure assessment is fundamental to environmental epidemiology, little is known about what exposure assessment study designs are optimal for health inference. The objective of this project was to advance our understanding of the design of exposure assessment measurement campaigns and evaluate their impact on estimating the associations between long-term average air pollution exposure and cognitive function. This feeds into the broader goal of advancing understanding of air pollution exposure assessment design for application to epidemiological inference.

METHODS

We leveraged data from the Adult Changes in Thought (ACT) Air Pollution study (ACT-AP) to characterize exposures for over 5,000 participants from the ongoing ACT cohort. This is a population-based cohort of urban and suburban elderly individuals in the greater Puget Sound region drawn from Group Health Cooperative, now Kaiser Permanente, starting in 1994. Participants were routinely followed with routine biennial visits until dementia incidence, drop-out, or death. Extensive health, lifestyle, biological, and demographic data were also collected. The outcome measure used in this report is cognitive function at baseline based on the Cognitive Abilities Screening Instrument derived using Item Response Theory (CASI-IRT). The IRT transformation of the CASI score improves score accuracy, measures cognitive change with less bias, and accounts for missing test items. Health association analyses were based on 5,409 participants with both a valid CASI score and who had lived in the mobile monitoring region during at least 95% of the 5 years prior to baseline. We used 5-year average exposures that accounted for residential history.

UNLABELLED

Exposure data came from two distinct exposure assessment campaigns carried out by the ACT-AP study: a campaign using low-cost sensors (2017+) that supplemented existing regulatory monitoring data for fine particles (PM, 1978+) and nitrogen dioxide (NO, 1996+), and a year-long multipollutant mobile monitoring campaign (2019-2020). The evaluation of the added value of low-cost sensor data relied on a combination of regulatory monitoring data and other high-quality data from research studies, calibrated 2-week low-cost sensor measurements from over 100 locations, which were mostly ACT cohort residences, and a snapshot campaign that measured NO using Ogawa samplers. Predictions were at a 2-week average time scale, used a suite of ~200 geographic covariates, and were obtained from a spatiotemporal model developed at the University of Washington. The Seattle mobile monitoring campaign collected a combination of stationary roadside and on-road measurements of ultrafine particles (UFPs, four instruments), black carbon (BC), NO, carbon dioxide (CO), and PM. Visits were temporally balanced over 288 drive days such that all sites were visited during all seasons, days of the week, and most hours of the day (5 a.m. to 11 p.m.) approximately 29 times each. For the on-road measurements, we divided the driving route into 100-meter segments and assigned all measurements to the segment midpoint. Predictions used the same suite of geographic covariates in a spatial model fit using partial least squares (PLS) dimension reduction with universal kriging (UK-PLS) to capture the remaining spatial structure. We reported model performance metrics for both the spatial and spatiotemporal models as root mean squared error (RMSE) and mean squared error (MSE)-based R. The reference observations for the spatiotemporal model were low-cost sensor measurements at home locations (with performance metrics averaged over their entire measurement period to approximate spatial contrasts), and for the spatial model, the reference observations were the all data long-term averages at stationary roadside locations.

UNLABELLED

Using various approaches to sample data from these two exposure monitoring campaigns, we determined the impact on exposure prediction and estimates of health associations using two confounder models and 5-year average exposure predictions for cohort members at baseline developed from the alternative campaigns. For the low-cost sensor data, we evaluated temporally or spatially reduced subsets of low-cost sensors, as well as a comparison of the low-cost sensor versus snapshot campaigns for NO. For the mobile monitoring data, we considered designs focused on the stationary roadside and on-road data separately. We reduced the stationary roadside data temporally by restricting seasons, times of day, or days of week for the campaign, while also considering a reduced number of visits using balanced sampling, as well as a set of unbalanced visit designs. We also reduced the on-road data spatially and temporally to assess the importance of spatially or temporally balanced data collection. In addition, we considered the impact of incorporating temporal adjustment to account for temporally unbalanced sampling, as well as plume adjustment to account for on-road sources. For each design, we evaluated prediction model performance using the all data stationary roadside observations (mobile campaign) or the measurements at homes (low-cost sensor campaign) as reference observations to ensure consistency in reported performance metrics. We also used long-term average exposures estimated from these alternative campaigns in health association analyses under two different confounder models that were adjusted by potentially confounding variables: Model 1 adjusted for age, calendar year, sex, and educational attainment; Model 2 included all Model 1 variables with the addition of race and socioeconomic status. Furthermore, using the stationary roadside data, we applied parametric and nonparametric bootstrap methods to account for Berkson-like and classical-like exposure measurement error for the UFP exposure in confounder model 1.

UNLABELLED

In a separate methods-focused aim, we developed and applied advanced statistical methods using the stationary roadside mobile monitoring data. To evaluate possible improvements in exposure model performance, we applied tree-based machine learning algorithms that also account for residual spatial structure, and compared these to UK-PLS. This led to the development of a variable importance metric that uses a leave-one-out approach to evaluate the change in predictions across various user-specified quantiles. The variable importance metric produces covariate-specific averages that reflect how the predictions, on average, vary across different quantiles of each covariate. This serves as an intuitive measure of the contribution of this covariate to the predicted outcome. A key idea in this variable importance approach is to reuse the trained mean model across all locations and to refit the covariance model in a leave-one-out manner. In separate work to address dimension reduction for multipollutant prediction, we extended classical principal component analysis (PCA) and a recently developed predictive PCA approach to optimize performance by balancing the representativeness in classical PCA with the predictive ability of predictive PCA. We called the new method representative and predictive PCA, or RapPCA.

UNLABELLED

Finally, we characterized the various exposure assessment campaigns in terms of the value of their information as quantified by cost. We calculated costs, focused predominantly on staff days of effort, for various exposure assessment designs and compared these to exposure model performance statistics.

RESULTS

We found that air pollution exposure assessment design is critical for exposure prediction, and also impacts health inference. We showed that a mobile monitoring study with stationary roadside sampling that has at least 12 visits per location in a balanced and temporally unrestricted design optimizes exposure model performance while also limiting costs. Relative to weaker alternatives, a balanced and temporally unrestricted design has improved accuracy and reduced variability of health inferences, particularly for confounder model 1. To address temporal balance, it is important that the exposure sampling in mobile monitoring campaigns cover all days of the week, most hours of the day, and at least two seasons. The popular temporally restricted business-hours sampling design had the poorest performance, which was not improved by adjusting for the temporally unbalanced sampling approach. We found similar patterns using on-road data, though the findings were weaker overall.

UNLABELLED

For the alternative exposure campaign that supplemented regulatory monitoring data with low-cost sensor data, while the exposure prediction model performances improved with the inclusion of the low-cost sensors, there was little notable impact on the health inferences, and the costs were steep. Given that the supplementary exposure assessment data were sparse relative to the existing regulatory monitoring data, and that the low-cost sensor data collection used a rotating approach due to the limited number of sensors (i.e., low-cost sensor measurements were not collected using a balanced design), it was much more challenging to develop deep insights from this exposure assessment approach.

UNLABELLED

Finally, we found that leveraging spatial ensemble-learning methods for prediction did not improve exposure prediction model performances or alter health inferences. The new multipollutant dimension-reduction we developed, RapPCA, had the best predictive performance and also minimized the prediction error in comparison with both classical and predictive PCA.

CONCLUSIONS

This project has shown that there should be greater attention to the design of the exposure data collection campaigns used in epidemiological inference. Based on the multiple investigations conducted, many of which focused on UFPs, we found that exposure predictions with better performance statistics resulted in health association estimates that were generally more consistent with those obtained using the "best" exposure model predictions (the model with all data included), although the pattern of health estimates was often less conclusive than the pattern of prediction model performances. Furthermore, we found that it is possible to design air pollution exposure assessment studies that achieve good exposure prediction model performance while controlling their relative cost.

UNLABELLED

We developed strong recommendations for mobile monitoring campaign design, thanks to the well-designed and comprehensive Seattle mobile monitoring campaign. Insights from supplementing regulatory monitoring data with low-cost sensor data were less compelling, driven predominantly by a data structure with sparse and temporally unbalanced supplementary data that may not have been sufficiently comprehensive to demonstrate the impacts of alternative designs. Broadly speaking, better exposure assessment design leads to better exposure prediction model performance, which in turn can benefit estimates of health associations.

UNLABELLED

We did not find that leveraging advanced statistical methods (specifically, spatial ensemble-learning methods for prediction) improved exposure prediction model performances. This finding is not consistent with the conclusions reached by other investigators, and may have been due to the already sophisticated UK-PLS approach we used by default, and in particular its application in conjunction with the large number of covariates that we considered in the PLS model, such that the contribution of any single covariate was approximately linear. In other words, it is reasonable to believe that in the presence of the large set of covariates we considered, each can contribute an approximately linear association with the pollutant being modeled, such that the potential added value of the spatial Random Forest approach is not observed in the model fit. Other settings with a smaller number of possible covariates available may lead to different conclusions and suggest greater added value of the application of a spatial Random Forest approach.

UNLABELLED

We based our approach on leveraging the extensive air pollution exposure assessment and outcome data available from the ACT-AP study. Thus, we sampled from the existing air pollution data to evaluate exposure assessment designs that were subsets of those data. Then, conditional on each of these designs, we evaluated subsequent health inferences, which focused on cognitive function at baseline using the CASI-IRT outcome. The magnitude and uncertainty of these health association estimates were dependent upon the associations evident in the ACT cohort, and the insights we were able to develop are conditional on the strengths and weaknesses of these data. Specifically, while we observed some larger impacts on health association estimates of more poorly performing exposure models relative to the complete all data exposure model, such as the business-hours design from a mobile monitoring campaign, many of the differences were small and did not deviate meaningfully from the health association estimate obtained from the "best" exposure model. The degree of impact on the epidemiological inference depended on the magnitude of the health association estimate from the "best" exposure model and the width of its confidence interval. Future investigations should replicate and expand upon these findings in other settings, including application to new cohorts and exposure assessment data, as well as in simulation studies, which provide an alternative approach to using real-world data to evaluate a constellation of exposure models. However, while knowledge of the assumed underlying truth is an important strength of simulation studies, it is challenging to capture real-world complexity meaningfully in simulation studies.

UNLABELLED

Our foray into applying advanced machine-learning methods to improve exposure predictions produced the surprising result that our default UK-PLS approach for spatial prediction produced similar performance metrics to spatial ensemble-learning methods. Future evaluations that assess smaller subsets of exposure covariates will allow determination of the relative exposure model performance benefits of UK-PLS versus spatial ensemble-learning methods, and provide insights into the possible reason that our conclusions differ from others in the literature.

摘要

引言:流行病学研究常常使用以机会主义方式且在后勤方面便于收集的暴露数据。并且,虽然暴露评估是环境流行病学的基础,但对于何种暴露评估研究设计最适合健康推断,我们却知之甚少。本项目的目标是增进我们对暴露评估测量活动设计的理解,并评估其对估计长期平均空气污染暴露与认知功能之间关联的影响。这有助于实现更广泛的目标,即增进对空气污染暴露评估设计的理解,以便应用于流行病学推断。

方法:我们利用了“成人思维变化(ACT)空气污染研究(ACT - AP)”的数据,来描述来自正在进行的ACT队列中5000多名参与者的暴露情况。这是一个基于人群的队列,研究对象是大普吉特海湾地区城市和郊区的老年人,从1994年开始,来自健康合作组织(现为凯撒医疗集团)。参与者每两年进行一次常规随访,直至发生痴呆、退出研究或死亡。同时还收集了广泛的健康、生活方式、生物学和人口统计学数据。本报告中使用的结果指标是基于使用项目反应理论推导的认知能力筛查工具(CASI - IRT)得出的基线认知功能。CASI分数的IRT转换提高了分数准确性,以较小偏差测量认知变化,并考虑了缺失的测试项目。健康关联分析基于5409名参与者,他们既有有效的CASI分数,并且在基线前5年中至少95%的时间居住在移动监测区域。我们使用了考虑居住历史的5年平均暴露数据。

未标注:暴露数据来自ACT - AP研究开展的两项不同的暴露评估活动:一项使用低成本传感器的活动(2017年及以后),该活动补充了现有的细颗粒物(PM,1978年及以后)和二氧化氮(NO,1996年及以后)监管监测数据;以及一项为期一年的多污染物移动监测活动(2019 - 2020年)。对低成本传感器数据附加值的评估依赖于监管监测数据与来自研究的其他高质量数据的组合,校准了来自100多个地点(大多是ACT队列住所)的为期2周的低成本传感器测量数据,以及一项使用小川采样器测量NO的快照活动。预测是在两周平均时间尺度上进行的,使用了约200个地理协变量,并从华盛顿大学开发的时空模型中获得。西雅图移动监测活动收集了固定路边和道路上的超细颗粒物(UFPs,四种仪器)、黑碳(BC)、NO、二氧化碳(CO)和PM的测量数据组合。在288个驾车日里,访问在时间上分布均衡,使得所有地点在所有季节、一周中的各天以及一天中的大部分时间(上午5点至晚上11点)都被访问了大约29次。对于道路上的测量,我们将行驶路线划分为100米的路段,并将所有测量数据分配到路段中点。预测在使用偏最小二乘法(PLS)降维和通用克里金法(UK - PLS)拟合的空间模型中使用相同的地理协变量组,以捕捉剩余的空间结构。我们报告了空间和时空模型的模型性能指标,分别为基于均方根误差(RMSE)和均方误差(MSE)的R值。时空模型的参考观测值是家庭位置的低成本传感器测量值(性能指标在其整个测量期内平均以近似空间对比),对于空间模型,参考观测值是固定路边位置的所有数据长期平均值。

未标注:通过使用各种方法从这两项暴露监测活动中采样数据,我们使用两种混杂因素模型以及从替代活动得出的基线队列成员的5年平均暴露预测,确定了对暴露预测和健康关联估计的影响。对于低成本传感器数据,我们评估了低成本传感器在时间或空间上减少的子集,以及低成本传感器与NO快照活动的比较。对于移动监测数据,我们分别考虑了侧重于固定路边和道路上数据的设计。我们通过限制活动的季节、一天中的时间或一周中的天数在时间上减少固定路边数据,同时还考虑了使用均衡采样减少访问次数,以及一组不均衡的访问设计。我们还在空间和时间上减少道路上的数据,以评估空间或时间上均衡数据收集的重要性。此外,我们考虑了纳入时间调整以考虑时间上不均衡采样的影响,以及羽流调整以考虑道路源的影响。对于每种设计,我们使用所有数据固定路边观测值(移动活动)或家庭测量值(低成本传感器活动)作为参考观测值来评估预测模型性能,以确保报告的性能指标具有一致性。我们还在两种不同的混杂因素模型下,使用从这些替代活动估计的长期平均暴露进行健康关联分析,这两种模型通过潜在的混杂变量进行调整:模型1调整了年龄、日历年份、性别和教育程度;模型2包括模型1的所有变量,并增加了种族和社会经济地位。此外,使用固定路边数据,我们应用参数和非参数自助法来考虑混杂因素模型1中UFP暴露的类似伯克森误差和经典误差的暴露测量误差。

未标注:在一个单独的以方法为重点的目标中,我们使用固定路边移动监测数据开发并应用了先进的统计方法。为了评估暴露模型性能可能的改进,我们应用了基于树的机器学习算法,该算法也考虑了剩余的空间结构,并将其与UK - PLS进行比较。这导致开发了一种变量重要性度量,该度量使用留一法来评估跨各种用户指定分位数的预测变化。变量重要性度量产生特定于协变量的平均值,反映预测在每个协变量的不同分位数上平均如何变化。这作为该协变量对预测结果贡献的直观度量。这种变量重要性方法的一个关键思想是在所有位置重用训练好的均值模型,并以留一法重新拟合协方差模型。在单独的工作中,为了解决多污染物预测的降维问题,我们扩展了经典主成分分析(PCA)和最近开发的预测性PCA方法,通过平衡经典PCA中的代表性与预测性PCA的预测能力来优化性能。我们将新方法称为代表性和预测性PCA,即RapPCA。

未标注:最后,我们根据成本量化的信息价值对各种暴露评估活动进行了描述。我们计算了各种暴露评估设计的成本,主要集中在工作人员的工作日,并将这些成本与暴露模型性能统计数据进行了比较。

结果:我们发现空气污染暴露评估设计对于暴露预测至关重要,并且也会影响健康推断。我们表明,在平衡且时间不受限制的设计中,每个位置至少有12次访问的固定路边采样移动监测研究既能优化暴露模型性能,又能限制成本。相对于较弱的替代方案,平衡且时间不受限制的设计提高了健康推断的准确性并降低了变异性,特别是对于混杂因素模型1。为了实现时间平衡,移动监测活动中的暴露采样覆盖一周中的所有日子、一天中的大部分时间以及至少两个季节非常重要。流行的时间受限的工作日采样设计性能最差,通过调整时间不均衡的采样方法也没有得到改善。我们在道路数据中发现了类似的模式,尽管总体结果较弱。

未标注:对于用低成本传感器数据补充监管监测数据的替代暴露活动,虽然包含低成本传感器后暴露预测模型性能有所提高,但对健康推断几乎没有显著影响,并且成本很高。鉴于补充暴露评估数据相对于现有的监管监测数据较为稀疏,并且由于传感器数量有限,低成本传感器数据收集采用了轮换方法(即低成本传感器测量不是使用平衡设计收集的),从这种暴露评估方法中获得深入见解更具挑战性。

未标注:最后,我们发现利用空间集成学习方法进行预测并没有提高暴露预测模型性能,也没有改变健康推断。我们开发的新的多污染物降维方法RapPCA具有最佳的预测性能,并且与经典PCA和预测性PCA相比,还最小化了预测误差。

结论:本项目表明,在流行病学推断中使用的暴露数据收集活动的设计应得到更多关注。基于进行的多项调查,其中许多聚焦于超细颗粒物,我们发现具有更好性能统计的暴露预测会产生与使用“最佳”暴露模型预测(包含所有数据的模型)得到的健康关联估计通常更一致的结果,尽管健康估计模式往往不如预测模型性能模式那样具有决定性。此外,我们发现可以设计出在控制相对成本的同时实现良好暴露预测模型性能的空气污染暴露评估研究。

未标注:由于精心设计且全面的西雅图移动监测活动,我们为移动监测活动设计提出了强有力的建议。用低成本传感器数据补充监管监测数据的见解不太有说服力,主要是由于数据结构中补充数据稀疏且时间不均衡,可能不够全面,无法证明替代设计的影响。一般来说,更好的暴露评估设计会带来更好的暴露预测模型性能,这反过来又有利于健康关联估计。

未标注:我们没有发现利用先进统计方法(具体而言,用于预测的空间集成学习方法)能提高暴露预测模型性能。这一发现与其他研究者得出的结论不一致,可能是由于我们默认使用的已经很复杂的UK - PLS方法,特别是它与我们在PLS模型中考虑的大量协变量结合使用,使得任何单个协变量的贡献近似线性。换句话说,有理由相信在我们考虑的大量协变量存在的情况下,每个协变量都可以与被建模的污染物形成近似线性关联,以至于在模型拟合中未观察到空间随机森林方法的潜在附加值。其他协变量数量较少的情况可能会导致不同的结论,并表明应用空间随机森林方法具有更大的附加值。

未标注:我们的方法基于利用ACT - AP研究中广泛的空气污染暴露评估和结果数据。因此,我们从现有的空气污染数据中采样,以评估作为这些数据子集的暴露评估设计。然后,在每种设计的条件下,我们评估后续的健康推断,这些推断使用CASI - IRT结果聚焦于基线时的认知功能。这些健康关联估计的大小和不确定性取决于ACT队列中明显的关联,并且我们能够得出的见解取决于这些数据的优缺点。具体而言,虽然我们观察到相对于完整的所有数据暴露模型,表现较差的暴露模型对健康关联估计有一些较大影响,例如移动监测活动中的工作日设计,但许多差异很小,并且与从“最佳”暴露模型获得的健康关联估计没有显著偏差。对流行病学推断的影响程度取决于“最佳”暴露模型的健康关联估计大小及其置信区间的宽度。未来的研究应在其他环境中复制和扩展这些发现,包括应用于新的队列和暴露评估数据,以及在模拟研究中,模拟研究提供了一种使用真实世界数据评估一系列暴露模型的替代方法。然而,虽然了解假设的潜在真相是模拟研究的一个重要优势,但在模拟研究中有意义地捕捉现实世界的复杂性具有挑战性。

未标注:我们尝试应用先进的机器学习方法来改善暴露预测,结果令人惊讶,我们用于空间预测的默认UK - PLS方法产生的性能指标与空间集成学习方法相似。未来评估较小的暴露协变量子集将有助于确定UK - PLS与空间集成学习方法在暴露模型性能方面的相对优势,并深入了解我们的结论与文献中其他结论不同的可能原因。

本文引用的文献

2
识别美国空气质量监测空白区。
Proc Natl Acad Sci U S A. 2025 Apr 29;122(17):e2425310122. doi: 10.1073/pnas.2425310122. Epub 2025 Apr 21.
3
用于流行病学的超细颗粒物移动监测研究设计:成本与性能比较
Environ Health Perspect. 2025 Apr;133(3-4):47010. doi: 10.1289/EHP15100. Epub 2025 Apr 23.
5
评估长期暴露于低水平环境空气污染对健康的不良影响:HEI 的经验及未来方向?
Environ Sci Technol. 2024 Jul 23;58(29):12767-12783. doi: 10.1021/acs.est.3c09745. Epub 2024 Jul 11.
6
利用低成本传感器预测二氧化氮以进行流行病学暴露评估。
J Expo Sci Environ Epidemiol. 2025 Apr;35(2):169-179. doi: 10.1038/s41370-024-00667-w. Epub 2024 Apr 9.
7
8
交通相关的空气污染与思维变化研究中的痴呆发病率。
Environ Int. 2024 Jan;183:108418. doi: 10.1016/j.envint.2024.108418. Epub 2024 Jan 3.
9
采用时空建模方法评估用于细颗粒物(PM)暴露评估的低成本监测设计。
Environ Pollut. 2024 Feb 15;343:123227. doi: 10.1016/j.envpol.2023.123227. Epub 2023 Dec 24.
10
空气污染物的移动监测;与驾驶天数相关的混合模型土地利用回归框架的性能评估。
Environ Res. 2024 Jan 1;240(Pt 2):117457. doi: 10.1016/j.envres.2023.117457. Epub 2023 Oct 19.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验