Yoshikawa Yuya, Iwata Tomoharu
IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5789-5803. doi: 10.1109/TNNLS.2021.3131234. Epub 2023 Sep 1.
Gaussian process regression (GPR) is a fundamental model used in machine learning (ML). Due to its accurate prediction with uncertainty and versatility in handling various data structures via kernels, GPR has been successfully used in various applications. However, in GPR, how the features of an input contribute to its prediction cannot be interpreted. Here, we propose GPR with local explanation, which reveals the feature contributions to the prediction of each sample while maintaining the predictive performance of GPR. In the proposed model, both the prediction and explanation for each sample are performed using an easy-to-interpret locally linear model. The weight vector of the locally linear model is assumed to be generated from multivariate Gaussian process priors. The hyperparameters of the proposed models are estimated by maximizing the marginal likelihood. For a new test sample, the proposed model can predict the values of its target variable and weight vector, as well as their uncertainties, in a closed form. Experimental results on various benchmark datasets verify that the proposed model can achieve predictive performance comparable to those of GPR and superior to that of existing interpretable models and can achieve higher interpretability than them, both quantitatively and qualitatively.
高斯过程回归(GPR)是机器学习(ML)中使用的一种基本模型。由于其在预测时能准确给出不确定性,且通过核函数在处理各种数据结构方面具有通用性,GPR已成功应用于各种领域。然而,在GPR中,输入的特征如何对其预测产生影响却无法得到解释。在此,我们提出了具有局部解释的GPR,它在保持GPR预测性能的同时,揭示了每个样本预测中的特征贡献。在所提出的模型中,对每个样本的预测和解释均使用易于解释的局部线性模型来进行。局部线性模型的权重向量假定是从多元高斯过程先验中生成的。所提模型的超参数通过最大化边际似然来估计。对于一个新的测试样本,所提模型能够以封闭形式预测其目标变量和权重向量的值以及它们的不确定性。在各种基准数据集上的实验结果证实,所提模型能够实现与GPR相当的预测性能,且优于现有可解释模型,并且在定量和定性方面都能实现比它们更高的可解释性。