Suppr超能文献

用于双峰分布建模的加权分位数回归森林:违约损失案例

Weighted Quantile Regression Forests for Bimodal Distribution Modeling: A Loss Given Default Case.

作者信息

Gostkowski Michał, Gajowniczek Krzysztof

机构信息

Department of Econometrics and Statistics, Institute of Economics and Finance, Warsaw University of Life Sciences-SGGW, 02-776 Warsaw, Poland.

Department of Artificial Intelligence, Institute of Information Technology, Warsaw University of Life Sciences-SGGW, 02-776 Warsaw, Poland.

出版信息

Entropy (Basel). 2020 May 13;22(5):545. doi: 10.3390/e22050545.

Abstract

Due to various regulations (e.g., the Basel III Accord), banks need to keep a specified amount of capital to reduce the impact of their insolvency. This equity can be calculated using, e.g., the Internal Rating Approach, enabling institutions to develop their own statistical models. In this regard, one of the most important parameters is the loss given default, whose correct estimation may lead to a healthier and riskless allocation of the capital. Unfortunately, since the loss given default distribution is a bimodal application of the modeling methods (e.g., ordinary least squares or regression trees), aiming at predicting the mean value is not enough. Bimodality means that a distribution has two modes and has a large proportion of observations with large distances from the middle of the distribution; therefore, to overcome this fact, more advanced methods are required. To this end, to model the entire loss given default distribution, in this article we present the weighted quantile Regression Forest algorithm, which is an ensemble technique. We evaluate our methodology over a dataset collected by one of the biggest Polish banks. Through our research, we show that weighted quantile Regression Forests outperform "single" state-of-the-art models in terms of their accuracy and the stability.

摘要

由于各种法规(如《巴塞尔协议III》),银行需要持有一定数量的资本以降低其破产的影响。这种权益资本可以使用例如内部评级法来计算,这使得各机构能够开发自己的统计模型。在这方面,最重要的参数之一是违约损失率,其正确估计可能会带来更健康且无风险的资本配置。不幸的是,由于违约损失率分布是建模方法(如普通最小二乘法或回归树)的双峰应用,仅旨在预测均值是不够的。双峰意味着分布有两个峰值,并且有很大比例的观测值与分布中心的距离很远;因此,为了克服这一情况,需要更先进的方法。为此,为了对整个违约损失率分布进行建模,在本文中我们提出了加权分位数回归森林算法,这是一种集成技术。我们在一家波兰最大银行收集的数据集上评估我们的方法。通过我们的研究,我们表明加权分位数回归森林在准确性和稳定性方面优于“单一”的最先进模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1de/7517045/31396103b664/entropy-22-00545-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验