Englert Jacob R, Ebelt Stefanie T, Chang Howard H
Department of Biostatistics and Bioinformatics, Emory University.
Gangarosa Department of Environmental Health, Emory University.
J Am Stat Assoc. 2025 Apr 4. doi: 10.1080/01621459.2025.2460231.
Epidemiological approaches for examining human health responses to environmental exposures in observational studies often control for confounding by implementing clever matching schemes and using statistical methods based on conditional likelihood. Nonparametric regression models have surged in popularity in recent years as a tool for estimating individual-level heterogeneous effects, which provide a more detailed picture of the exposure-response relationship but can also be aggregated to obtain improved marginal estimates at the population level. In this work we incorporate Bayesian additive regression trees (BART) into the conditional logistic regression model to identify heterogeneous exposure effects in a case-crossover design. Conditional logistic BART (CL-BART) utilizes reversible jump Markov chain Monte Carlo to bypass the conditional conjugacy requirement of the original BART algorithm. Our work is motivated by the growing interest in identifying subpopulations more vulnerable to environmental exposures. We apply CL-BART to a study of the impact of heat waves on people with Alzheimer's disease in California and effect modification by other chronic conditions. Through this application, we also describe strategies to examine heterogeneous odds ratios through variable importance, partial dependence, and lower-dimensional summaries.
在观察性研究中,用于检验人类健康对环境暴露反应的流行病学方法通常通过实施巧妙的匹配方案和使用基于条件似然性的统计方法来控制混杂因素。近年来,非参数回归模型作为一种估计个体水平异质性效应的工具而广受欢迎,它能更详细地描绘暴露-反应关系,但也可以进行汇总以在人群水平上获得改进的边际估计。在这项工作中,我们将贝叶斯加法回归树(BART)纳入条件逻辑回归模型,以在病例交叉设计中识别异质性暴露效应。条件逻辑BART(CL-BART)利用可逆跳跃马尔可夫链蒙特卡罗方法绕过了原始BART算法的条件共轭性要求。我们的工作是受日益增长的识别更易受环境暴露影响的亚人群的兴趣所驱动。我们将CL-BART应用于一项关于热浪对加利福尼亚州阿尔茨海默病患者影响以及其他慢性病的效应修正的研究。通过这个应用,我们还描述了通过变量重要性、偏倚依赖性和低维汇总来检验异质性优势比的策略。