Cerqueti Roy, Iovanella Antonio, Mattera Raffaele, Storani Saverio
Department of Social and Economic Sciences, Sapienza University of Rome, P.le Aldo Moro 5, 00185, Rome, Italy.
University of Angers, GRANEM, SFR CONFLUENCES, 49000, Angers, France.
Sci Rep. 2024 Aug 23;14(1):19622. doi: 10.1038/s41598-024-70342-5.
Autoencoders are dimension reduction models in the field of machine learning which can be thought of as a neural network counterpart of principal components analysis (PCA). Due to their flexibility and good performance, autoencoders have been recently used for estimating nonlinear factor models in finance. The main weakness of autoencoders is that the results are less explainable than those obtained with the PCA. In this paper, we propose the adoption of the Shapley value to improve the explainability of autoencoders in the context of nonlinear factor models. In particular, we measure the relevance of nonlinear latent factors using a forecast-based Shapley value approach that measures each latent factor's contributions in determining the out-of-sample accuracy in factor-augmented models. Considering the interesting empirical instance of the commodity market, we identify the most relevant latent factors for each commodity based on their out-of-sample forecasting ability.
自动编码器是机器学习领域中的降维模型,可以被视为主成分分析(PCA)的神经网络对应物。由于其灵活性和良好的性能,自动编码器最近已被用于估计金融领域的非线性因子模型。自动编码器的主要缺点是其结果比PCA得到的结果更难以解释。在本文中,我们建议采用夏普利值来提高自动编码器在非线性因子模型背景下的可解释性。具体而言,我们使用基于预测的夏普利值方法来衡量非线性潜在因子的相关性,该方法衡量每个潜在因子在确定因子增强模型的样本外准确性方面的贡献。考虑到商品市场这个有趣的实证案例,我们根据其样本外预测能力确定每种商品最相关的潜在因子。