Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models.

Authors

Liu Han, Roeder Kathryn, Wasserman Larry

Affiliation

Carnegie Mellon University, Pittsburgh, PA 15213.

Publication information

Adv Neural Inf Process Syst. 2010 Dec 31;24(2):1432-1440.

PMID: 25152607
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4138724/

Abstract

A challenging problem in estimating high-dimensional graphical models is to choose the regularization parameter in a data-dependent way. The standard techniques include K-fold cross-validation (K-CV), Akaike information criterion (AIC), and Bayesian information criterion (BIC). Though these methods work well for low-dimensional problems, they are not suitable in high-dimensional settings. In this paper, we present StARS: a new stability-based method for choosing the regularization parameter in high-dimensional inference for undirected graphs. The method has a clear interpretation: we use the least amount of regularization that simultaneously makes a graph sparse and replicable under random sampling. This interpretation requires essentially no conditions. Under mild conditions, we show that StARS is partially sparsistent in terms of graph estimation: i.e., with high probability, all the true edges will be included in the selected model even when the graph size diverges with the sample size. Empirically, the performance of StARS is compared with the state-of-the-art model selection procedures, including K-CV, AIC, and BIC, on both synthetic data and a real microarray dataset. StARS outperforms all these competing procedures.
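
The selection rule described in the abstract (the least regularization that still gives a sparse graph that is replicable under random subsampling) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The graph estimator here is a toy thresholded-correlation graph standing in for a sparse graphical-model fit such as the graphical lasso. The function names (`correlation_graph`, `stars_select`, `make_demo_data`) are hypothetical. The instability measure 2θ(1 − θ), the cutoff `beta=0.05`, and the subsample size of about 10√n follow the full paper as best understood and are not stated in the abstract.

```python
import math
import random


def correlation_graph(rows, lam):
    """Toy estimator (assumption): edge (s, t) is present when |corr(s, t)| > lam.
    A larger lam means more regularization and a sparser graph."""
    n, p = len(rows), len(rows[0])
    cols = [[r[j] for r in rows] for j in range(p)]
    cent = [[v - sum(c) / n for v in c] for c in cols]
    norms = [math.sqrt(sum(v * v for v in c)) for c in cent]
    edges = set()
    for s in range(p):
        for t in range(s + 1, p):
            denom = norms[s] * norms[t]
            if denom == 0:
                continue
            corr = sum(a * b for a, b in zip(cent[s], cent[t])) / denom
            if abs(corr) > lam:
                edges.add((s, t))
    return edges


def stars_select(rows, lambdas, beta=0.05, n_subsamples=20,
                 subsample_size=None, seed=0):
    """Return (chosen_lambda, path), where path lists (lambda, monotonized
    instability) from the most to the least regularization."""
    rng = random.Random(seed)
    n, p = len(rows), len(rows[0])
    b = subsample_size or min(n - 1, int(10 * math.sqrt(n)))
    n_pairs = p * (p - 1) // 2
    # Draw the subsamples once (without replacement) and reuse them per lambda.
    subsamples = [rng.sample(rows, b) for _ in range(n_subsamples)]
    chosen, running_max, path = None, 0.0, []
    for lam in sorted(lambdas, reverse=True):
        counts = {}
        for sub in subsamples:
            for edge in correlation_graph(sub, lam):
                counts[edge] = counts.get(edge, 0) + 1
        # Edge instability 2*theta*(1 - theta), averaged over all node pairs.
        total = 0.0
        for c in counts.values():
            theta = c / n_subsamples
            total += 2 * theta * (1 - theta)
        # Monotonize: an almost complete graph is also "stable", so keep the
        # largest instability seen so far along the path.
        running_max = max(running_max, total / n_pairs)
        path.append((lam, running_max))
        if running_max <= beta:
            chosen = lam  # least regularization still within the cutoff
    return chosen, path


def make_demo_data(n=400, seed=1):
    """Synthetic data: variables 0 and 1 are strongly dependent, 2-4 are noise."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x0 = rng.gauss(0, 1)
        rows.append([x0, x0 + 0.1 * rng.gauss(0, 1),
                     rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)])
    return rows
```

Calling `stars_select(make_demo_data(), [0.9, 0.7, 0.5, 0.3, 0.2, 0.1, 0.05])` returns the selected value together with the instability path. In practice the toy estimator would be replaced by a proper sparse graphical-model fit, with the subsampling and cutoff logic left unchanged.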


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/417e/4138724/ea15af2fd87e/nihms274543f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/417e/4138724/11b6e7d7f995/nihms274543f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/417e/4138724/4dd188cf57ac/nihms274543f2.jpg

