Suppr超能文献

可解释的深度学习预测中国南京精神障碍疾病风险

Explainable deep learning predictions for illness risk of mental disorders in Nanjing, China.

机构信息

School of Energy and Environment, Southeast University, Nanjing, 210096, PR China; State Key Laboratory of Environmental Medicine Engineering, Ministry of Education, Southeast University, Nanjing, 210096, PR China.

National-Provincial Joint Engineering Research Center of Electromechanical Product Packaging, College of Civil Engineering, Nanjing Forestry University, Nanjing, 210037, PR China.

出版信息

Environ Res. 2021 Nov;202:111740. doi: 10.1016/j.envres.2021.111740. Epub 2021 Jul 28.

Abstract

Epidemiological studies have revealed the associations of air pollutants and meteorological factors with a range of mental health conditions. However, little is known about local explanations and global understanding on the importance and effect of input features in the complex system of environmental stressors - mental disorders (MDs), especially for exposure to air pollution mixture. In this study, we combined deep learning neural networks (DLNNs) with SHapley Additive exPlanation (SHAP) to predict the illness risk of MDs on the population level, and then provided explanations for risk factors. The modeling system, which was trained on day-by-day hospital outpatient visits of two major hospitals in Nanjing, China from 2013/07/01 through 2019/02/28, visualized the time-varying prediction, contributing factors, and interaction effects of informative features. Our results suggested that NO, SO, and CO made outstanding contributions in magnitude of feature attributions under circumstances of mixed air pollutants. In particular, NO at high concentration level was associated with an increase in illness risk of MDs, and the maximum and mean absolute SHAP value were approximated to 10 and 2 as a local and global measure of feature importance, respectively. It presented a marginally antagonistic effect for two pairs of gaseous pollutants, i.e., NO vs. SO and CO vs. NO. In contrast, CO and SO displayed the opposite direction of feature effects to the rise of observed concentrations, but an apparent synergistic effect was obviously captured. The primary risk factors driving a sharp increase in acute attack or exacerbation of MDs were also identified by depicting prediction paths of time-series samples. We believe that the significance of coupling accurate predictions from DLNNs with interpretable explanations of why a prediction is completed has broad applicability throughout the field of environmental health.

摘要

流行病学研究揭示了空气污染物和气象因素与一系列心理健康状况的关联。然而,对于环境应激源-精神障碍(MDs)复杂系统中输入特征的重要性和影响,包括对空气污染混合物的暴露,人们知之甚少。在这项研究中,我们将深度学习神经网络(DLNNs)与 SHapley Additive exPlanation(SHAP)相结合,预测 MDs 在人群水平上的发病风险,并为风险因素提供解释。该模型系统基于中国南京两家主要医院的门诊就诊记录进行训练,时间范围为 2013 年 7 月 1 日至 2019 年 2 月 28 日。该系统可视化了时变预测、主要因素和信息特征的交互作用。我们的研究结果表明,NO、SO 和 CO 在混合空气污染物的情况下,在特征归因的幅度上做出了杰出贡献。特别是,NO 在高浓度水平下与 MDs 的发病风险增加有关,最大和平均绝对 SHAP 值分别近似为 10 和 2,作为特征重要性的局部和全局度量。NO 与 SO 之间以及 CO 与 NO 之间存在边际拮抗效应。相比之下,CO 和 SO 对观察到的浓度升高表现出相反的特征效应,但明显捕捉到了协同效应。通过描绘时间序列样本的预测路径,还确定了导致 MDs 急性发作或恶化急剧增加的主要风险因素。我们相信,将 DLNNs 的准确预测与解释预测完成的原因相结合的意义在整个环境卫生领域具有广泛的适用性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验