Gostkowski Michał, Gajowniczek Krzysztof
Department of Econometrics and Statistics, Institute of Economics and Finance, Warsaw University of Life Sciences-SGGW, 02-776 Warsaw, Poland.
Department of Artificial Intelligence, Institute of Information Technology, Warsaw University of Life Sciences-SGGW, 02-776 Warsaw, Poland.
Entropy (Basel). 2020 May 13;22(5):545. doi: 10.3390/e22050545.
Due to various regulations (e.g., the Basel III Accord), banks need to keep a specified amount of capital to reduce the impact of their insolvency. This equity can be calculated using, e.g., the Internal Rating Approach, enabling institutions to develop their own statistical models. In this regard, one of the most important parameters is the loss given default, whose correct estimation may lead to a healthier and riskless allocation of the capital. Unfortunately, since the loss given default distribution is a bimodal application of the modeling methods (e.g., ordinary least squares or regression trees), aiming at predicting the mean value is not enough. Bimodality means that a distribution has two modes and has a large proportion of observations with large distances from the middle of the distribution; therefore, to overcome this fact, more advanced methods are required. To this end, to model the entire loss given default distribution, in this article we present the weighted quantile Regression Forest algorithm, which is an ensemble technique. We evaluate our methodology over a dataset collected by one of the biggest Polish banks. Through our research, we show that weighted quantile Regression Forests outperform "single" state-of-the-art models in terms of their accuracy and the stability.
由于各种法规(如《巴塞尔协议III》),银行需要持有一定数量的资本以降低其破产的影响。这种权益资本可以使用例如内部评级法来计算,这使得各机构能够开发自己的统计模型。在这方面,最重要的参数之一是违约损失率,其正确估计可能会带来更健康且无风险的资本配置。不幸的是,由于违约损失率分布是建模方法(如普通最小二乘法或回归树)的双峰应用,仅旨在预测均值是不够的。双峰意味着分布有两个峰值,并且有很大比例的观测值与分布中心的距离很远;因此,为了克服这一情况,需要更先进的方法。为此,为了对整个违约损失率分布进行建模,在本文中我们提出了加权分位数回归森林算法,这是一种集成技术。我们在一家波兰最大银行收集的数据集上评估我们的方法。通过我们的研究,我们表明加权分位数回归森林在准确性和稳定性方面优于“单一”的最先进模型。