• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于稳定性选择的 Cox 模型变量集成选择器。

Ensembling Variable Selectors by Stability Selection for the Cox Model.

机构信息

School of Science, Xi'an University of Architecture and Technology, Xi'an, Shaanxi 710055, China.

School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.

出版信息

Comput Intell Neurosci. 2017;2017:2747431. doi: 10.1155/2017/2747431. Epub 2017 Nov 15.

DOI:10.1155/2017/2747431
PMID:29270195
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5706076/
Abstract

As a pivotal tool to build interpretive models, variable selection plays an increasingly important role in high-dimensional data analysis. In recent years, variable selection ensembles (VSEs) have gained much interest due to their many advantages. Stability selection (Meinshausen and Bühlmann, 2010), a VSE technique based on subsampling in combination with a base algorithm like lasso, is an effective method to control false discovery rate (FDR) and to improve selection accuracy in linear regression models. By adopting lasso as a base learner, we attempt to extend stability selection to handle variable selection problems in a Cox model. According to our experience, it is crucial to set the regularization region Λ in lasso and the parameter properly so that stability selection can work well. To the best of our knowledge, however, there is no literature addressing this problem in an explicit way. Therefore, we first provide a detailed procedure to specify Λ and . Then, some simulated and real-world data with various censoring rates are used to examine how well stability selection performs. It is also compared with several other variable selection approaches. Experimental results demonstrate that it achieves better or competitive performance in comparison with several other popular techniques.

摘要

作为构建解释模型的关键工具,变量选择在高维数据分析中发挥着越来越重要的作用。近年来,基于抽样的变量选择集成(Variable Selection Ensembles,VSE)技术由于其诸多优势而备受关注。稳定性选择(Stability Selection)是一种基于抽样的 VSE 技术,结合了诸如 lasso 之类的基础算法,是控制错误发现率(False Discovery Rate,FDR)和提高线性回归模型选择准确性的有效方法。通过采用 lasso 作为基础学习器,我们尝试将稳定性选择扩展到 Cox 模型中的变量选择问题。根据我们的经验,正确设置 lasso 中的正则化区域Λ和参数 至关重要,以便稳定性选择能够良好地工作。然而,据我们所知,目前还没有文献以明确的方式解决这个问题。因此,我们首先提供了一个详细的过程来指定 Λ 和 。然后,使用各种截尾率的模拟和真实数据来检查稳定性选择的性能如何。还将其与其他几种变量选择方法进行了比较。实验结果表明,与其他几种流行技术相比,它具有更好或相当的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac4a/5706076/5636e8866b79/CIN2017-2747431.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac4a/5706076/b57b2bc61fa5/CIN2017-2747431.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac4a/5706076/c6ab431d9527/CIN2017-2747431.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac4a/5706076/5636e8866b79/CIN2017-2747431.alg.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac4a/5706076/b57b2bc61fa5/CIN2017-2747431.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac4a/5706076/c6ab431d9527/CIN2017-2747431.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac4a/5706076/5636e8866b79/CIN2017-2747431.alg.001.jpg

相似文献

1
Ensembling Variable Selectors by Stability Selection for the Cox Model.基于稳定性选择的 Cox 模型变量集成选择器。
Comput Intell Neurosci. 2017;2017:2747431. doi: 10.1155/2017/2747431. Epub 2017 Nov 15.
2
Empirical extensions of the lasso penalty to reduce the false discovery rate in high-dimensional Cox regression models.套索惩罚的经验性扩展以降低高维Cox回归模型中的错误发现率
Stat Med. 2016 Jul 10;35(15):2561-73. doi: 10.1002/sim.6927. Epub 2016 Mar 10.
3
Class-imbalanced subsampling lasso algorithm for discovering adverse drug reactions.用于发现药物不良反应的类不平衡子采样套索算法
Stat Methods Med Res. 2018 Mar;27(3):785-797. doi: 10.1177/0962280216643116. Epub 2016 Apr 25.
4
High-dimensional Cox models: the choice of penalty as part of the model building process.高维Cox模型:作为模型构建过程一部分的惩罚项选择
Biom J. 2010 Feb;52(1):50-69. doi: 10.1002/bimj.200900064.
5
L1 penalized estimation in the Cox proportional hazards model.Cox比例风险模型中的L1惩罚估计
Biom J. 2010 Feb;52(1):70-84. doi: 10.1002/bimj.200900028.
6
Controlling false discoveries in high-dimensional situations: boosting with stability selection.在高维情形下控制错误发现:基于稳定性选择的增强方法
BMC Bioinformatics. 2015 May 6;16:144. doi: 10.1186/s12859-015-0575-3.
7
Stable feature selection for clinical prediction: exploiting ICD tree structure using Tree-Lasso.用于临床预测的稳定特征选择:利用树套索法挖掘国际疾病分类树结构
J Biomed Inform. 2015 Feb;53:277-90. doi: 10.1016/j.jbi.2014.11.013. Epub 2014 Dec 9.
8
Part 1. Statistical Learning Methods for the Effects of Multiple Air Pollution Constituents.第1部分. 多种空气污染成分影响的统计学习方法
Res Rep Health Eff Inst. 2015 Jun(183 Pt 1-2):5-50.
9
Comparison of Variable Selection Methods for Time-to-Event Data in High-Dimensional Settings.高维环境下生存数据分析中变量选择方法的比较。
Comput Math Methods Med. 2020 Jul 1;2020:6795392. doi: 10.1155/2020/6795392. eCollection 2020.
10
Stability selection for lasso, ridge and elastic net implemented with AFT models.使用加速失效时间(AFT)模型实现套索、岭回归和弹性网络的稳定性选择。
Stat Appl Genet Mol Biol. 2019 Oct 7;18(5):/j/sagmb.2019.18.issue-5/sagmb-2017-0001/sagmb-2017-0001.xml. doi: 10.1515/sagmb-2017-0001.

引用本文的文献

1
Machine learning-based disulfidptosis-related lncRNA signature predicts prognosis, immune infiltration and drug sensitivity in hepatocellular carcinoma.基于机器学习的二硫键相关 lncRNA 特征可预测肝细胞癌的预后、免疫浸润和药物敏感性。
Sci Rep. 2024 Feb 22;14(1):4354. doi: 10.1038/s41598-024-54115-8.
2
Stability and volatility shape the gut bacteriome and Kazachstania slooffiae dynamics in preweaning, nursery and adult pigs.稳定性和不稳定性塑造了断奶前、保育期和成年猪的肠道细菌组和 Kazachstania slooffiae 的动态变化。
Sci Rep. 2022 Sep 5;12(1):15080. doi: 10.1038/s41598-022-19093-9.
3
Speech categorization is better described by induced rather than evoked neural activity.

本文引用的文献

1
Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent.通过坐标下降法求解Cox比例风险模型的正则化路径
J Stat Softw. 2011 Mar;39(5):1-13. doi: 10.18637/jss.v039.i05.
2
Component-wise gradient boosting and false discovery control in survival analysis with high-dimensional covariates.高维协变量生存分析中的逐分量梯度提升与错误发现率控制
Bioinformatics. 2016 Jan 1;32(1):50-7. doi: 10.1093/bioinformatics/btv517. Epub 2015 Sep 17.
3
Controlling false discoveries in high-dimensional situations: boosting with stability selection.
诱发神经活动而非感知神经活动更能准确描述言语分类。
J Acoust Soc Am. 2021 Mar;149(3):1644. doi: 10.1121/10.0003572.
4
Data-driven machine learning models for decoding speech categorization from evoked brain responses.基于数据驱动的机器学习模型,用于解码大脑反应中的语音分类。
J Neural Eng. 2021 Mar 23;18(4). doi: 10.1088/1741-2552/abecf0.
5
Decoding Hearing-Related Changes in Older Adults' Spatiotemporal Neural Processing of Speech Using Machine Learning.使用机器学习解码老年人言语时空神经处理中与听力相关的变化
Front Neurosci. 2020 Jul 16;14:748. doi: 10.3389/fnins.2020.00748. eCollection 2020.
在高维情形下控制错误发现:基于稳定性选择的增强方法
BMC Bioinformatics. 2015 May 6;16:144. doi: 10.1186/s12859-015-0575-3.
4
RANDOM LASSO.随机套索算法
Ann Appl Stat. 2011 Mar 1;5(1):468-485. doi: 10.1214/10-AOAS377.
5
On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data.基于删失生存数据评估风险预测方法整体充分性的 C 统计量。
Stat Med. 2011 May 10;30(10):1105-17. doi: 10.1002/sim.4154. Epub 2011 Jan 13.
6
PENALIZED VARIABLE SELECTION PROCEDURE FOR COX MODELS WITH SEMIPARAMETRIC RELATIVE RISK.具有半参数相对风险的Cox模型的惩罚变量选择程序
Ann Stat. 2010 Aug 1;38(4):2092-2117. doi: 10.1214/09-AOS780.
7
The lasso method for variable selection in the Cox model.Cox模型中用于变量选择的套索方法。
Stat Med. 1997 Feb 28;16(4):385-95. doi: 10.1002/(sici)1097-0258(19970228)16:4<385::aid-sim380>3.0.co;2-3.
8
Prospective evaluation of prognostic variables from patient-completed questionnaires. North Central Cancer Treatment Group.通过患者填写的问卷对预后变量进行前瞻性评估。中北部癌症治疗组。
J Clin Oncol. 1994 Mar;12(3):601-7. doi: 10.1200/JCO.1994.12.3.601.
9
Mantel-Haenszel analyses of litter-matched time-to-response data, with modifications for recovery of interlitter information.对窝匹配的反应时间数据进行Mantel-Haenszel分析,并对窝间信息的恢复进行了修改。
Cancer Res. 1977 Nov;37(11):3863-8.