Jiang Jinzhu, Shang Junfeng
Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA.
Entropy (Basel). 2023 May 26;25(6):851. doi: 10.3390/e25060851.
The two-stage feature screening method for linear models applies dimension reduction at first stage to screen out nuisance features and dramatically reduce the dimension to a moderate size; at the second stage, penalized methods such as LASSO and SCAD could be applied for feature selection. A majority of subsequent works on the sure independent screening methods have focused mainly on the linear model. This motivates us to extend the independence screening method to generalized linear models, and particularly with binary response by using the point-biserial correlation. We develop a two-stage feature screening method called point-biserial sure independence screening (PB-SIS) for high-dimensional generalized linear models, aiming for high selection accuracy and low computational cost. We demonstrate that PB-SIS is a feature screening method with high efficiency. The PB-SIS method possesses the sure independence property under certain regularity conditions. A set of simulation studies are conducted and confirm the sure independence property and the accuracy and efficiency of PB-SIS. Finally we apply PB-SIS to one real data example to show its effectiveness.
线性模型的两阶段特征筛选方法在第一阶段进行降维,以筛选出干扰特征并将维度大幅缩减至适中大小;在第二阶段,可应用诸如LASSO和SCAD等惩罚方法进行特征选择。后续大多数关于确定独立筛选方法的工作主要集中在线性模型上。这促使我们将独立筛选方法扩展到广义线性模型,特别是对于二元响应,通过使用点二列相关来实现。我们针对高维广义线性模型开发了一种名为点二列确定独立筛选(PB-SIS)的两阶段特征筛选方法,旨在实现高选择准确性和低计算成本。我们证明了PB-SIS是一种高效的特征筛选方法。PB-SIS方法在某些正则条件下具有确定独立性属性。进行了一组模拟研究,证实了PB-SIS的确定独立性属性以及准确性和效率。最后,我们将PB-SIS应用于一个实际数据示例以展示其有效性。