Suppr超能文献

超高维特征的广义线性模型构建:一种序贯条件方法。

Building generalized linear models with ultrahigh dimensional features: A sequentially conditional approach.

机构信息

Department of Bioinformatics and Biostatistics, University of Louisville, Louisville, Kentucky.

Department of Statistics and Probability, Michigan State University, East Lansing, Michigan.

出版信息

Biometrics. 2020 Mar;76(1):47-60. doi: 10.1111/biom.13122. Epub 2019 Nov 6.

Abstract

Conditional screening approaches have emerged as a powerful alternative to the commonly used marginal screening, as they can identify marginally weak but conditionally important variables. However, most existing conditional screening methods need to fix the initial conditioning set, which may determine the ultimately selected variables. If the conditioning set is not properly chosen, the methods may produce false negatives and positives. Moreover, screening approaches typically need to involve tuning parameters and extra modeling steps in order to reach a final model. We propose a sequential conditioning approach by dynamically updating the conditioning set with an iterative selection process. We provide its theoretical properties under the framework of generalized linear models. Powered by an extended Bayesian information criterion as the stopping rule, the method will lead to a final model without the need to choose tuning parameters or threshold parameters. The practical utility of the proposed method is examined via extensive simulations and analysis of a real clinical study on predicting multiple myeloma patients' response to treatment based on their genomic profiles.

摘要

条件筛选方法已经成为一种替代常用边际筛选的有力方法,因为它们可以识别边际上较弱但条件上重要的变量。然而,大多数现有的条件筛选方法需要固定初始的条件集,这可能会决定最终选择的变量。如果条件集选择不当,这些方法可能会产生假阴性和假阳性。此外,筛选方法通常需要涉及调整参数和额外的建模步骤,以达到最终的模型。我们提出了一种通过迭代选择过程动态更新条件集的序贯条件筛选方法。我们在广义线性模型框架下提供了它的理论性质。该方法以扩展的贝叶斯信息准则作为停止规则,不需要选择调整参数或阈值参数即可得到最终的模型。通过广泛的模拟和基于基因组谱预测多发性骨髓瘤患者对治疗反应的真实临床研究的分析,检验了所提出方法的实际效用。

相似文献

4
Ultrahigh dimensional time course feature selection.超高维时间序列特征选择
Biometrics. 2014 Jun;70(2):356-65. doi: 10.1111/biom.12137. Epub 2014 Jan 19.
5
Penalized likelihood and multiple testing.惩罚似然与多重检验。
Biom J. 2019 Jan;61(1):62-72. doi: 10.1002/bimj.201700196. Epub 2018 Nov 26.
10
Conditional Sure Independence Screening.条件确定独立性筛选
J Am Stat Assoc. 2016;111(515):1266-1277. doi: 10.1080/01621459.2015.1092974. Epub 2016 Oct 18.

本文引用的文献

1
Weak signals in high-dimension regression: detection, estimation and prediction.高维回归中的弱信号:检测、估计与预测。
Appl Stoch Models Bus Ind. 2019 Mar-Apr;35(2):283-298. doi: 10.1002/asmb.2340. Epub 2018 May 25.
2
Conditional Sure Independence Screening.条件确定独立性筛选
J Am Stat Assoc. 2016;111(515):1266-1277. doi: 10.1080/01621459.2015.1092974. Epub 2016 Oct 18.
7
The Sparse MLE for Ultra-High-Dimensional Feature Screening.超高维特征筛选的稀疏极大似然估计
J Am Stat Assoc. 2014;109(507):1257-1269. doi: 10.1080/01621459.2013.879531.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验