Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti Universita' degli Studi di Bari Aldo Moro, Bari, Italy.
Istituto Nazionale di Fisica Nucleare Sezione di Bari, Bari, Italy.
Front Public Health. 2024 May 7;12:1344865. doi: 10.3389/fpubh.2024.1344865. eCollection 2024.
Respiratory system cancer, encompassing lung, trachea and bronchus cancer, constitute a substantial and evolving public health challenge. Since pollution plays a prominent cause in the development of this disease, identifying which substances are most harmful is fundamental for implementing policies aimed at reducing exposure to these substances. We propose an approach based on explainable artificial intelligence (XAI) based on remote sensing data to identify the factors that most influence the prediction of the standard mortality ratio (SMR) for respiratory system cancer in the Italian provinces using environment and socio-economic data. First of all, we identified 10 clusters of provinces through the study of the SMR variogram. Then, a Random Forest regressor is used for learning a compact representation of data. Finally, we used XAI to identify which features were most important in predicting SMR values. Our machine learning analysis shows that NO, income and O3 are the first three relevant features for the mortality of this type of cancer, and provides a guideline on intervention priorities in reducing risk factors.
呼吸系统癌症,包括肺癌、气管和支气管癌,是一个重大且不断演变的公共卫生挑战。由于污染是导致这种疾病的一个主要原因,因此确定哪些物质最具危害性对于实施旨在减少接触这些物质的政策至关重要。我们提出了一种基于遥感数据的可解释人工智能(XAI)方法,用于根据环境和社会经济数据,识别影响预测意大利各省呼吸系统癌症标准化死亡率(SMR)的主要因素。首先,我们通过研究 SMR 变差函数,确定了 10 个省份聚类。然后,使用随机森林回归器学习数据的紧凑表示。最后,我们使用 XAI 来确定哪些特征对预测 SMR 值最重要。我们的机器学习分析表明,NO、收入和 O3 是导致这种癌症死亡率的前三个相关因素,为减少风险因素的干预重点提供了指导。