• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于岭回归的高效近似k折交叉验证和留一法交叉验证

Efficient approximate k-fold and leave-one-out cross-validation for ridge regression.

作者信息

Meijer Rosa J, Goeman Jelle J

机构信息

Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Postzone S5-P, P.O. Box 9604, 2300 RC Leiden, The Netherlands.

出版信息

Biom J. 2013 Mar;55(2):141-55. doi: 10.1002/bimj.201200088. Epub 2013 Jan 24.

DOI:10.1002/bimj.201200088
PMID:23348970
Abstract

In model building and model evaluation, cross-validation is a frequently used resampling method. Unfortunately, this method can be quite time consuming. In this article, we discuss an approximation method that is much faster and can be used in generalized linear models and Cox' proportional hazards model with a ridge penalty term. Our approximation method is based on a Taylor expansion around the estimate of the full model. In this way, all cross-validated estimates are approximated without refitting the model. The tuning parameter can now be chosen based on these approximations and can be optimized in less time. The method is most accurate when approximating leave-one-out cross-validation results for large data sets which is originally the most computationally demanding situation. In order to demonstrate the method's performance, it will be applied to several microarray data sets. An R package penalized, which implements the method, is available on CRAN.

摘要

在模型构建和模型评估中,交叉验证是一种常用的重采样方法。不幸的是,这种方法可能相当耗时。在本文中,我们讨论了一种近似方法,它速度更快,可用于具有岭惩罚项的广义线性模型和考克斯比例风险模型。我们的近似方法基于围绕完整模型估计值的泰勒展开。通过这种方式,无需重新拟合模型即可近似所有交叉验证估计值。现在可以基于这些近似值选择调整参数,并能在更短时间内进行优化。当近似大数据集的留一法交叉验证结果时,该方法最为准确,而留一法交叉验证原本是计算量最大的情况。为了证明该方法的性能,将其应用于几个微阵列数据集。实现该方法的R包“penalized”可在CRAN上获取。

相似文献

1
Efficient approximate k-fold and leave-one-out cross-validation for ridge regression.用于岭回归的高效近似k折交叉验证和留一法交叉验证
Biom J. 2013 Mar;55(2):141-55. doi: 10.1002/bimj.201200088. Epub 2013 Jan 24.
2
Cox models with dynamic ridge penalties on time-varying effects of the covariates.
Stat Med. 2014 Jan 15;33(1):170-80. doi: 10.1002/sim.5921. Epub 2013 Aug 2.
3
L1 penalized estimation in the Cox proportional hazards model.Cox比例风险模型中的L1惩罚估计
Biom J. 2010 Feb;52(1):70-84. doi: 10.1002/bimj.200900028.
4
Assessment of survival prediction models based on microarray data.基于微阵列数据的生存预测模型评估。
Bioinformatics. 2007 Jul 15;23(14):1768-74. doi: 10.1093/bioinformatics/btm232. Epub 2007 May 7.
5
High-dimensional Cox models: the choice of penalty as part of the model building process.高维Cox模型:作为模型构建过程一部分的惩罚项选择
Biom J. 2010 Feb;52(1):50-69. doi: 10.1002/bimj.200900064.
6
A penalized likelihood approach for arbitrarily censored and truncated data: application to age-specific incidence of dementia.一种针对任意删失和截断数据的惩罚似然方法:应用于特定年龄的痴呆发病率
Biometrics. 1998 Mar;54(1):185-94.
7
Parameter estimation in the magnitude-only and complex-valued fMRI data models.仅幅度值和复数值功能磁共振成像数据模型中的参数估计
Neuroimage. 2005 May 1;25(4):1124-32. doi: 10.1016/j.neuroimage.2004.12.048.
8
Efficient parameter estimation in longitudinal data analysis using a hybrid GEE method.使用混合广义估计方程(GEE)方法进行纵向数据分析中的有效参数估计。
Biostatistics. 2009 Jul;10(3):436-45. doi: 10.1093/biostatistics/kxp002. Epub 2009 Apr 4.
9
Fridge: Focused fine-tuning of ridge regression for personalized predictions.冰箱:用于个性化预测的岭回归的聚焦微调。
Stat Med. 2018 Apr 15;37(8):1290-1303. doi: 10.1002/sim.7576. Epub 2018 Jan 3.
10
Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data.高维小样本情况下的惩罚Cox回归分析及其在微阵列基因表达数据中的应用
Bioinformatics. 2005 Jul 1;21(13):3001-8. doi: 10.1093/bioinformatics/bti422. Epub 2005 Apr 6.

引用本文的文献

1
Development of Multicenter Deep Learning Models for Predicting Surgical Complexity and Surgical Site Infection in Abdominal Wall Reconstruction, a Pilot Study.用于预测腹壁重建手术复杂性和手术部位感染的多中心深度学习模型的开发:一项试点研究
J Abdom Wall Surg. 2025 Apr 14;4:14371. doi: 10.3389/jaws.2025.14371. eCollection 2025.
2
Epidemiology of prehospital emergency calls according to patient transport decision in a middle eastern emergency care environment: Retrospective cohort-based.中东急救环境下基于患者转运决策的院前急救呼叫流行病学:基于回顾性队列研究
Health Sci Rep. 2024 Apr 23;7(4):e2056. doi: 10.1002/hsr2.2056. eCollection 2024 Apr.
3
Application of fourier transform infrared photoacoustic spectroscopy for quantification of nutrient contents and their plant availability in manure and digestate.
傅里叶变换红外光声光谱法在定量分析粪便和沼渣中养分含量及其对植物有效性方面的应用
Heliyon. 2024 Mar 31;10(7):e28487. doi: 10.1016/j.heliyon.2024.e28487. eCollection 2024 Apr 15.
4
Robust data integration from multiple external sources for generalized linear models with binary outcomes.稳健地整合来自多个外部数据源的信息,以用于二元结局的广义线性模型。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujad005.
5
Ridge Penalization in High-Dimensional Testing With Applications to Imaging Genetics.高维检验中的岭惩罚及其在影像遗传学中的应用
Front Neurosci. 2022 Mar 24;16:836100. doi: 10.3389/fnins.2022.836100. eCollection 2022.
6
Brain-trait-associated variants impact cell-type-specific gene regulation during neurogenesis.脑-特征相关变体在神经发生过程中影响细胞类型特异性基因调控。
Am J Hum Genet. 2021 Sep 2;108(9):1647-1668. doi: 10.1016/j.ajhg.2021.07.011. Epub 2021 Aug 19.
7
Differentiating the learning styles of college students in different disciplines in a college English blended learning setting.在大学英语混合学习环境中区分不同学科大学生的学习风格。
PLoS One. 2021 May 20;16(5):e0251545. doi: 10.1371/journal.pone.0251545. eCollection 2021.
8
Validation of Fitbit Charge 2 and Fitbit Alta HR Against Polysomnography for Assessing Sleep in Adults With Obstructive Sleep Apnea.验证 Fitbit Charge 2 和 Fitbit Alta HR 对阻塞性睡眠呼吸暂停成年人睡眠评估的多导睡眠图的准确性。
J Clin Sleep Med. 2019 Nov 15;15(11):1645-1653. doi: 10.5664/jcsm.8032.
9
Identifying Hepatocellular Carcinoma Driver Genes by Integrative Pathway Crosstalk and Protein Interaction Network.通过整合通路串扰和蛋白质相互作用网络鉴定肝细胞癌驱动基因。
DNA Cell Biol. 2019 Oct;38(10):1112-1124. doi: 10.1089/dna.2019.4869. Epub 2019 Aug 29.
10
Machine Vision Methods, Natural Language Processing, and Machine Learning Algorithms for Automated Dispersion Plot Analysis and Chemical Identification from Complex Mixtures.机器视觉方法、自然语言处理和机器学习算法在复杂混合物的自动散布图分析和化学识别中的应用。
Anal Chem. 2019 Aug 20;91(16):10509-10517. doi: 10.1021/acs.analchem.9b01428. Epub 2019 Jul 29.