Department of Biostatistics and Bioinformatics, Emory University, Atlanta, USA.
Department of Biostatistics, Yale University, New Haven, USA.
BMC Med Res Methodol. 2021 Apr 26;21(1):87. doi: 10.1186/s12874-021-01278-x.
Short-term associations between extreme heat events and adverse health outcomes are well-established in epidemiologic studies. However, the use of different exposure definitions across studies has limited our understanding of extreme heat characteristics that are most important for specific health outcomes or subpopulations.
Logic regression is a statistical learning method for constructing decision trees based on Boolean combinations of binary predictors. We describe how logic regression can be utilized as a data-driven approach to identify extreme heat exposure definitions using health outcome data. We evaluated the performance of the proposed algorithm in a simulation study, as well as in a 20-year time-series analysis of extreme heat and emergency department visits for 12 outcomes in the Atlanta metropolitan area.
For the Atlanta case study, our novel application of logic regression identified extreme heat exposure definitions that were associated with several heat-sensitive disease outcomes (e.g., fluid and electrolyte imbalance, renal diseases, ischemic stroke, and hypertension). Exposures were often characterized by extreme apparent minimum temperature or maximum temperature over multiple days. The simulation study also demonstrated that logic regression can successfully identify exposures of different lags and duration structures when statistical power is sufficient.
Logic regression is a useful tool for identifying important characteristics of extreme heat exposures for adverse health outcomes, which may help improve future heat warning systems and response plans.
流行病学研究已经充分证实,短期极端高温事件与不良健康结果之间存在关联。然而,由于研究中采用了不同的暴露定义,我们对与特定健康结果或特定人群相关的极端高温特征的理解仍存在局限性。
逻辑回归是一种基于二进制预测器的布尔组合构建决策树的统计学习方法。我们描述了如何利用逻辑回归作为一种数据驱动的方法,使用健康结果数据来确定极端高温暴露定义。我们在模拟研究中评估了所提出算法的性能,以及在亚特兰大都会区 12 种健康结果的 20 年极端高温和急诊就诊时间序列分析中评估了该算法的性能。
对于亚特兰大案例研究,我们对逻辑回归的新应用确定了与几种对高温敏感的疾病结果(例如,液体和电解质失衡、肾脏疾病、缺血性中风和高血压)相关的极端高温暴露定义。暴露通常表现为多天内的极端明显最低温度或最高温度。模拟研究还表明,当统计功效足够时,逻辑回归可以成功地识别不同滞后和持续时间结构的暴露。
逻辑回归是一种识别与不良健康结果相关的极端高温暴露重要特征的有用工具,这可能有助于改善未来的高温预警系统和应对计划。