Obster Fabian, Heumann Christian, Bohle Heidi, Pechan Paul
Department of Business Administration, University of the Bundeswehr Munich, 85577, Neubiberg, Germany.
Department of Statistics, LMU Munich, 80539, Munich, Germany.
Sci Rep. 2023 Aug 7;13(1):12767. doi: 10.1038/s41598-023-39918-5.
We describe how interpretable boosting algorithms based on ridge-regularized generalized linear models can be used to analyze high-dimensional environmental data. We illustrate this by using environmental, social, human and biophysical data to predict the financial vulnerability of farmers in Chile and Tunisia against climate hazards. We show how group structures can be considered and how interactions can be found in high-dimensional datasets using a novel 2-step boosting approach. The advantages and efficacy of the proposed method are shown and discussed. Results indicate that the presence of interaction effects only improves predictive power when included in two-step boosting. The most important variable in predicting all types of vulnerabilities are natural assets. Other important variables are the type of irrigation, economic assets and the presence of crop damage of near farms.
我们描述了基于岭正则化广义线性模型的可解释性增强算法如何用于分析高维环境数据。我们通过使用环境、社会、人类和生物物理数据来预测智利和突尼斯农民面对气候灾害时的金融脆弱性,对此进行了说明。我们展示了如何考虑组结构,以及如何使用一种新颖的两步增强方法在高维数据集中找到相互作用。展示并讨论了所提出方法的优点和功效。结果表明,只有在两步增强中包含交互效应时,其才会提高预测能力。预测所有类型脆弱性的最重要变量是自然资产。其他重要变量包括灌溉类型、经济资产以及农场附近作物受损情况。