• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超高维监督问题中用于特征筛选的协变量信息数

Covariate Information Number for Feature Screening in Ultrahigh-Dimensional Supervised Problems.

作者信息

Nandy Debmalya, Chiaromonte Francesca, Li Runze

机构信息

Department of Biostatistics & Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA.

Department of Statistics, Penn State University, University Park, PA 16802, USA.

出版信息

J Am Stat Assoc. 2022;117(539):1516-1529. doi: 10.1080/01621459.2020.1864380. Epub 2021 Feb 10.

DOI:10.1080/01621459.2020.1864380
PMID:36172297
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9512254/
Abstract

Contemporary high-throughput experimental and surveying techniques give rise to ultrahigh-dimensional supervised problems with sparse signals; that is, a limited number of observations (), each with a very large number of covariates ( >> ), only a small share of which is truly associated with the response. In these settings, major concerns on computational burden, algorithmic stability, and statistical accuracy call for substantially reducing the feature space by eliminating redundant covariates before the use of any sophisticated statistical analysis. Along the lines of (Fan and Lv, 2008) and other model- and correlation-based feature screening methods, we propose a model-free procedure called (CIS). CIS uses a marginal utility connected to the notion of the traditional Fisher Information, possesses the sure screening property, and is applicable to any type of response (features) with continuous features (response). Simulations and an application to transcriptomic data on rats reveal the comparative strengths of CIS over some popular feature screening methods.

摘要

当代高通量实验和测量技术引发了具有稀疏信号的超高维监督问题;也就是说,观测值数量有限(),每个观测值都有大量协变量(>>),其中只有一小部分与响应真正相关。在这些情况下,由于对计算负担、算法稳定性和统计准确性的主要担忧,需要在使用任何复杂的统计分析之前,通过消除冗余协变量来大幅减少特征空间。沿着(Fan和Lv,2008)以及其他基于模型和相关性的特征筛选方法的思路,我们提出了一种名为(CIS)的无模型程序。CIS使用与传统Fisher信息概念相关的边际效用,具有确定筛选属性,适用于具有连续特征(响应)的任何类型的响应(特征)。对大鼠转录组数据的模拟和应用揭示了CIS相对于一些流行特征筛选方法的比较优势。

相似文献

1
Covariate Information Number for Feature Screening in Ultrahigh-Dimensional Supervised Problems.超高维监督问题中用于特征筛选的协变量信息数
J Am Stat Assoc. 2022;117(539):1516-1529. doi: 10.1080/01621459.2020.1864380. Epub 2021 Feb 10.
2
Group Feature Screening via the F Statistic.通过F统计量进行组特征筛选。
Commun Stat Simul Comput. 2022;51(4):1921-1931. doi: 10.1080/03610918.2019.1691223. Epub 2019 Nov 26.
3
Feature Screening via Distance Correlation Learning.通过距离相关学习进行特征筛选
J Am Stat Assoc. 2012 Jul 1;107(499):1129-1139. doi: 10.1080/01621459.2012.695654.
4
Regularized Quantile Regression and Robust Feature Screening for Single Index Models.单指标模型的正则化分位数回归与稳健特征筛选
Stat Sin. 2016 Jan;26(1):69-95. doi: 10.5705/ss.2014.049.
5
A selective overview of feature screening for ultrahigh-dimensional data.超高维数据特征筛选的选择性概述。
Sci China Math. 2015 Oct;58(10):2033-2054. doi: 10.1007/s11425-015-5062-9. Epub 2015 Aug 22.
6
Ultrahigh dimensional feature selection: beyond the linear model.超高维特征选择:超越线性模型
J Mach Learn Res. 2009;10:2013-2038.
7
Feature Screening in Ultrahigh Dimensional Cox's Model.超高维Cox模型中的特征筛选
Stat Sin. 2016;26:881-901. doi: 10.5705/ss.2014.171.
8
Model-Free Feature Screening for Ultrahigh Dimensional Discriminant Analysis.超高维判别分析的无模型特征筛选
J Am Stat Assoc. 2015 Jun 1;110(510):630-641. doi: 10.1080/01621459.2014.920256.
9
Ultrahigh-Dimensional Multiclass Linear Discriminant Analysis by Pairwise Sure Independence Screening.基于成对确定独立筛选的超高维多类线性判别分析
J Am Stat Assoc. 2016;111(513):169-179. doi: 10.1080/01621459.2014.998760. Epub 2016 May 5.
10
Conditional screening for ultrahigh-dimensional survival data in case-cohort studies.病例-队列研究中超高维生存数据的条件筛选。
Lifetime Data Anal. 2021 Oct;27(4):632-661. doi: 10.1007/s10985-021-09531-7. Epub 2021 Aug 20.

本文引用的文献

1
Error Variance Estimation in Ultrahigh-Dimensional Additive Models.超高维加法模型中的误差方差估计
J Am Stat Assoc. 2018;113(521):315-327. doi: 10.1080/01621459.2016.1251440. Epub 2017 Sep 26.
2
A selective overview of feature screening for ultrahigh-dimensional data.超高维数据特征筛选的选择性概述。
Sci China Math. 2015 Oct;58(10):2033-2054. doi: 10.1007/s11425-015-5062-9. Epub 2015 Aug 22.
3
Model-Free Feature Screening for Ultrahigh Dimensional Discriminant Analysis.超高维判别分析的无模型特征筛选
J Am Stat Assoc. 2015 Jun 1;110(510):630-641. doi: 10.1080/01621459.2014.920256.
4
COVARIANCE ASSISTED SCREENING AND ESTIMATION.协方差辅助筛选与估计
Ann Stat. 2014 Nov 1;42(6):2202-2242. doi: 10.1214/14-AOS1243.
5
Feature Screening for Ultrahigh Dimensional Categorical Data with Applications.超高维分类数据的特征筛选及其应用
J Bus Econ Stat. 2014;32(2):237-244. doi: 10.1080/07350015.2013.863158.
6
Feature Screening via Distance Correlation Learning.通过距离相关学习进行特征筛选
J Am Stat Assoc. 2012 Jul 1;107(499):1129-1139. doi: 10.1080/01621459.2012.695654.
7
Feature Selection for Varying Coefficient Models With Ultrahigh Dimensional Covariates.具有超高维协变量的变系数模型的特征选择
J Am Stat Assoc. 2014 Jan 1;109(505):266-274. doi: 10.1080/01621459.2013.850086.
8
Quantile Regression for Analyzing Heterogeneity in Ultra-high Dimension.用于分析超高维异质性的分位数回归
J Am Stat Assoc. 2012 Mar 1;107(497):214-222. doi: 10.1080/01621459.2012.656014. Epub 2012 Jun 11.
9
Model-Free Feature Screening for Ultrahigh Dimensional Data.超高维数据的无模型特征筛选
J Am Stat Assoc. 2011 Jan 1;106(496):1464-1475. doi: 10.1198/jasa.2011.tm10563. Epub 2012 Jan 24.
10
Nonparametric Independence Screening in Sparse Ultra-High Dimensional Additive Models.稀疏超高维加法模型中的非参数独立性筛选
J Am Stat Assoc. 2011 Jun;106(494):544-557. doi: 10.1198/jasa.2011.tm09779.