Enhancing transparency and fairness in automated credit decisions: an explainable novel hybrid machine learning approach.

Authors

Nwafor Chioma Ngozi, Nwafor Obumneme, Brahma Sanjukta

Affiliations

Glasgow School for Business and Society, Department of Finance, Accountancy and Risk, Glasgow Caledonian University, Glasgow, Scotland.

School of Computing, Engineering and Built Environment, Glasgow Caledonian University, Glasgow, Scotland.

Publication information

Sci Rep. 2024 Oct 24;14(1):25174. doi: 10.1038/s41598-024-75026-8.

DOI:10.1038/s41598-024-75026-8

PMID:39448646

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11502870/

Abstract

This paper uses a generalised stacking method to introduce a novel hybrid model that combines a one-dimensional convolutional neural network (1DCNN) with extreme gradient boosting (XGBoost). We compared the predictive accuracy of the proposed hybrid architecture with that of three conventional algorithms (1DCNN, XGBoost and logistic regression (LR)) using a dataset of over twenty thousand peer-to-peer (P2P) consumer credit observations. Using the SHAP algorithm, the research analyses in detail how much each feature contributes to the model's predictions, offering insight into both the overall and the individual significance of different features. The findings demonstrate that the hybrid model outperforms the LR, XGBoost and 1DCNN models in terms of classification accuracy. Furthermore, the research addresses concerns regarding fairness and bias by showing that removing potentially discriminatory features, such as age and gender, does not significantly impact the hybrid model's classification capabilities. This suggests that credit scoring models can be fair and unbiased without compromising accuracy. The paper contributes to academic research and practical applications in credit risk management by presenting a hybrid model that offers superior classification accuracy and promotes interpretability through the model-agnostic SHAP framework.
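The feature attributions that the abstract refers to rest on Shapley values. As a rough illustration only, the sketch below computes exact Shapley values by brute force for a tiny hand-written scoring function. Everything in it is an assumption made for illustration: the function `toy_default_score`, its three features and coefficients, and the baseline applicant are hypothetical and do not come from the paper, and the SHAP library itself relies on faster approximations rather than this exponential enumeration.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict at x, relative to a baseline input.

    Features outside a coalition are replaced by their baseline value.
    Cost grows exponentially with the number of features, so this is
    only practical for a handful of features.
    """
    n = len(x)

    def value(coalition):
        # Evaluate the model with only the coalition's features "switched on".
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Standard Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_default_score(z):
    # Hypothetical scoring function, NOT the paper's hybrid model.
    income, debt_ratio, late_payments = z
    return 0.2 - 0.1 * income + 0.5 * debt_ratio + 0.3 * late_payments * debt_ratio


applicant = [1.5, 0.6, 2.0]  # hypothetical applicant
baseline = [1.0, 0.3, 0.0]   # hypothetical reference applicant

phi = shapley_values(toy_default_score, applicant, baseline)
gap = toy_default_score(applicant) - toy_default_score(baseline)

for name, contribution in zip(["income", "debt_ratio", "late_payments"], phi):
    print(f"{name}: {contribution:+.3f}")
print(f"sum of contributions: {sum(phi):+.3f}, score gap: {gap:+.3f}")
```

The contributions add up to the gap between the applicant's score and the baseline score, which is the property that lets per-feature attributions be read as a decomposition of a single prediction.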

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eed5/11502870/222ecbf72c12/41598_2024_75026_Fig1_HTML.jpg

Similar articles

Investigation on explainable machine learning models to predict chronic kidney diseases.

Sci Rep. 2024 Feb 14;14(1):3687. doi: 10.1038/s41598-024-54375-4.

Explainable artificial intelligence model for identifying COVID-19 gene biomarkers.

Comput Biol Med. 2023 Mar;154:106619. doi: 10.1016/j.compbiomed.2023.106619. Epub 2023 Feb 1.

A novel framework for enhancing transparency in credit scoring: Leveraging Shapley values for interpretable credit scorecards.

PLoS One. 2024 Aug 12;19(8):e0308718. doi: 10.1371/journal.pone.0308718. eCollection 2024.

Responsible AI for cardiovascular disease detection: Towards a privacy-preserving and interpretable model.

Comput Methods Programs Biomed. 2024 Sep;254:108289. doi: 10.1016/j.cmpb.2024.108289. Epub 2024 Jun 17.

A novel approach of brain-computer interfacing (BCI) and Grad-CAM based explainable artificial intelligence: Use case scenario for smart healthcare.

J Neurosci Methods. 2024 Aug;408:110159. doi: 10.1016/j.jneumeth.2024.110159. Epub 2024 May 7.

A hybrid approach for modeling bicycle crash frequencies: Integrating random forest based SHAP model with random parameter negative binomial regression model.

Accid Anal Prev. 2024 Dec;208:107778. doi: 10.1016/j.aap.2024.107778. Epub 2024 Sep 16.

Manifold-based Shapley explanations for high dimensional correlated features.

Neural Netw. 2024 Dec;180:106634. doi: 10.1016/j.neunet.2024.106634. Epub 2024 Aug 14.

DeepXplainer: An interpretable deep learning based approach for lung cancer detection using explainable artificial intelligence.

Comput Methods Programs Biomed. 2024 Jan;243:107879. doi: 10.1016/j.cmpb.2023.107879. Epub 2023 Oct 24.

Concrete Crack Detection and Segregation: A Feature Fusion, Crack Isolation, and Explainable AI-Based Approach.

J Imaging. 2024 Aug 31;10(9):215. doi: 10.3390/jimaging10090215.

Cited by

Medical laboratory data-based models: opportunities, obstacles, and solutions.

J Transl Med. 2025 Jul 24;23(1):823. doi: 10.1186/s12967-025-06802-x.

References

A Guide to Cross-Validation for Artificial Intelligence in Medical Imaging.

Radiol Artif Intell. 2023 May 24;5(4):e220232. doi: 10.1148/ryai.220232. eCollection 2023 Jul.

Optimized XGBoost Model with Small Dataset for Predicting Relative Density of Ti-6Al-4V Parts Manufactured by Selective Laser Melting.

Materials (Basel). 2022 Aug 1;15(15):5298. doi: 10.3390/ma15155298.

A logistic regression model for consumer default risk.

J Appl Stat. 2020 May 5;47(13-15):2879-2894. doi: 10.1080/02664763.2020.1759030. eCollection 2020.

SHAP and LIME: An Evaluation of Discriminative Power in Credit Risk.

Front Artif Intell. 2021 Sep 17;4:752558. doi: 10.3389/frai.2021.752558. eCollection 2021.

Opening the Black Box: The Promise and Limitations of Explainable Machine Learning in Cardiology.

Can J Cardiol. 2022 Feb;38(2):204-213. doi: 10.1016/j.cjca.2021.09.004. Epub 2021 Sep 14.

Factorial Network Models to Improve P2P Credit Risk Management.

Front Artif Intell. 2019 Jun 4;2:8. doi: 10.3389/frai.2019.00008. eCollection 2019.
