Seedorff Jacob, Cavanaugh Joseph E
Department of Biostatistics, College of Public Health, University of Iowa, 145 N. Riverside Dr., Iowa City, IA 52242, USA.
Entropy (Basel). 2024 Sep 19;26(9):801. doi: 10.3390/e26090801.
One of the primary issues that arises in statistical modeling pertains to the assessment of the relative importance of each variable in the model. A variety of techniques have been proposed to quantify variable importance for regression models. However, in the context of best subset selection, fewer satisfactory methods are available. With this motivation, we here develop a variable importance measure expressly for this setting. We investigate and illustrate the properties of this measure, introduce algorithms for the efficient computation of its values, and propose a procedure for calculating -values based on its sampling distributions. We present multiple simulation studies to examine the properties of the proposed methods, along with an application to demonstrate their practical utility.
统计建模中出现的一个主要问题涉及对模型中每个变量相对重要性的评估。已经提出了多种技术来量化回归模型的变量重要性。然而,在最佳子集选择的背景下,可用的令人满意的方法较少。出于这个动机,我们在此专门为这种情况开发了一种变量重要性度量。我们研究并阐述了该度量的性质,介绍了有效计算其值的算法,并基于其抽样分布提出了一种计算p值的程序。我们进行了多项模拟研究来检验所提出方法的性质,并通过一个应用来展示它们的实际效用。