• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

广义线性模型中高维变量选择的特征筛选

Feature Screening for High-Dimensional Variable Selection in Generalized Linear Models.

作者信息

Jiang Jinzhu, Shang Junfeng

机构信息

Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA.

出版信息

Entropy (Basel). 2023 May 26;25(6):851. doi: 10.3390/e25060851.

DOI:10.3390/e25060851
PMID:37372195
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10296932/
Abstract

The two-stage feature screening method for linear models applies dimension reduction at first stage to screen out nuisance features and dramatically reduce the dimension to a moderate size; at the second stage, penalized methods such as LASSO and SCAD could be applied for feature selection. A majority of subsequent works on the sure independent screening methods have focused mainly on the linear model. This motivates us to extend the independence screening method to generalized linear models, and particularly with binary response by using the point-biserial correlation. We develop a two-stage feature screening method called point-biserial sure independence screening (PB-SIS) for high-dimensional generalized linear models, aiming for high selection accuracy and low computational cost. We demonstrate that PB-SIS is a feature screening method with high efficiency. The PB-SIS method possesses the sure independence property under certain regularity conditions. A set of simulation studies are conducted and confirm the sure independence property and the accuracy and efficiency of PB-SIS. Finally we apply PB-SIS to one real data example to show its effectiveness.

摘要

线性模型的两阶段特征筛选方法在第一阶段进行降维,以筛选出干扰特征并将维度大幅缩减至适中大小;在第二阶段,可应用诸如LASSO和SCAD等惩罚方法进行特征选择。后续大多数关于确定独立筛选方法的工作主要集中在线性模型上。这促使我们将独立筛选方法扩展到广义线性模型,特别是对于二元响应,通过使用点二列相关来实现。我们针对高维广义线性模型开发了一种名为点二列确定独立筛选(PB-SIS)的两阶段特征筛选方法,旨在实现高选择准确性和低计算成本。我们证明了PB-SIS是一种高效的特征筛选方法。PB-SIS方法在某些正则条件下具有确定独立性属性。进行了一组模拟研究,证实了PB-SIS的确定独立性属性以及准确性和效率。最后,我们将PB-SIS应用于一个实际数据示例以展示其有效性。

相似文献

1
Feature Screening for High-Dimensional Variable Selection in Generalized Linear Models.广义线性模型中高维变量选择的特征筛选
Entropy (Basel). 2023 May 26;25(6):851. doi: 10.3390/e25060851.
2
Feature Screening via Distance Correlation Learning.通过距离相关学习进行特征筛选
J Am Stat Assoc. 2012 Jul 1;107(499):1129-1139. doi: 10.1080/01621459.2012.695654.
3
Ultrahigh dimensional feature selection: beyond the linear model.超高维特征选择:超越线性模型
J Mach Learn Res. 2009;10:2013-2038.
4
The Sparse MLE for Ultra-High-Dimensional Feature Screening.超高维特征筛选的稀疏极大似然估计
J Am Stat Assoc. 2014;109(507):1257-1269. doi: 10.1080/01621459.2013.879531.
5
Nonparametric Independence Screening in Sparse Ultra-High Dimensional Additive Models.稀疏超高维加法模型中的非参数独立性筛选
J Am Stat Assoc. 2011 Jun;106(494):544-557. doi: 10.1198/jasa.2011.tm09779.
6
Combined Performance of Screening and Variable Selection Methods in Ultra-High Dimensional Data in Predicting Time-To-Event Outcomes.超高维数据中筛选和变量选择方法在预测事件发生时间结局方面的综合性能
Diagn Progn Res. 2018;2. doi: 10.1186/s41512-018-0043-4. Epub 2018 Sep 26.
7
PREDICTION OF TREATMENT OUTCOME FOR AUTISM FROM STRUCTURE OF THE BRAIN BASED ON SURE INDEPENDENCE SCREENING.基于确信独立筛选法从大脑结构预测自闭症的治疗结果
Proc IEEE Int Symp Biomed Imaging. 2019 Apr;2019:404-408. doi: 10.1109/ISBI.2019.8759156. Epub 2019 Jul 11.
8
A selective overview of feature screening for ultrahigh-dimensional data.超高维数据特征筛选的选择性概述。
Sci China Math. 2015 Oct;58(10):2033-2054. doi: 10.1007/s11425-015-5062-9. Epub 2015 Aug 22.
9
A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data.一种用于超高维数据的稳健无模型特征筛选方法。
J Comput Graph Stat. 2017;26(4):803-813. doi: 10.1080/10618600.2017.1328364. Epub 2017 Oct 9.
10
Feature Screening in Ultrahigh Dimensional Cox's Model.超高维Cox模型中的特征筛选
Stat Sin. 2016;26:881-901. doi: 10.5705/ss.2014.171.

引用本文的文献

1
Serum-biomarker-based population screening model for hepatocellular carcinoma.基于血清生物标志物的肝细胞癌人群筛查模型
iScience. 2025 Feb 8;28(3):111981. doi: 10.1016/j.isci.2025.111981. eCollection 2025 Mar 21.

本文引用的文献

1
MODEL-FREE FORWARD SCREENING VIA CUMULATIVE DIVERGENCE.基于累积散度的无模型前向筛选
J Am Stat Assoc. 2020;115(531):1393-1405. doi: 10.1080/01621459.2019.1632078. Epub 2019 Jul 22.
2
Minimum redundancy maximum relevance feature selection approach for temporal gene expression data.用于时间基因表达数据的最小冗余最大相关特征选择方法
BMC Bioinformatics. 2017 Jan 3;18(1):9. doi: 10.1186/s12859-016-1423-9.
3
Model-Free Feature Screening for Ultrahigh Dimensional Discriminant Analysis.超高维判别分析的无模型特征筛选
J Am Stat Assoc. 2015 Jun 1;110(510):630-641. doi: 10.1080/01621459.2014.920256.
4
Majorization Minimization by Coordinate Descent for Concave Penalized Generalized Linear Models.基于坐标下降法的凹惩罚广义线性模型的优化最小化
Stat Comput. 2014 Sep;24(5):871-883. doi: 10.1007/s11222-013-9407-3.
5
Peripheral blood mononuclear cell gene expression in chronic obstructive pulmonary disease.慢性阻塞性肺疾病患者外周血单个核细胞基因表达。
Am J Respir Cell Mol Biol. 2013 Aug;49(2):316-23. doi: 10.1165/rcmb.2012-0230OC.
6
Ultrahigh dimensional feature selection: beyond the linear model.超高维特征选择:超越线性模型
J Mach Learn Res. 2009;10:2013-2038.
7
Regularization Paths for Generalized Linear Models via Coordinate Descent.基于坐标下降法的广义线性模型正则化路径
J Stat Softw. 2010;33(1):1-22.
8
One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.非凹惩罚似然模型中的一步稀疏估计
Ann Stat. 2008 Aug 1;36(4):1509-1533. doi: 10.1214/009053607000000802.
9
Variable Selection using MM Algorithms.使用MM算法进行变量选择
Ann Stat. 2005;33(4):1617-1642. doi: 10.1214/009053605000000200.
10
High Dimensional Classification Using Features Annealed Independence Rules.使用特征退火独立规则的高维分类
Ann Stat. 2008;36(6):2605-2637. doi: 10.1214/07-AOS504.