Suppr超能文献

监督同质性融合:一种组合方法。

SUPERVISED HOMOGENEITY FUSION: A COMBINATORIAL APPROACH.

作者信息

Wang Wen, Wu Shihao, Zhu Ziwei, Zhou Ling, Song Peter X-K

机构信息

Department of Biostatistics, University of Michigan, Ann Arbor.

Department of Statistics, University of Michigan, Ann Arbor.

出版信息

Ann Stat. 2024 Feb;52(1):285-310. doi: 10.1214/23-aos2347. Epub 2024 Mar 7.

Abstract

Fusing regression coefficients into homogeneous groups can unveil those coefficients that share a common value within each group. Such groupwise homogeneity reduces the intrinsic dimension of the parameter space and unleashes sharper statistical accuracy. We propose and investigate a new combinatorial grouping approach called -Fusion that is amenable to mixed integer optimization (MIO). On the statistical aspect, we identify a fundamental quantity called that underpins the difficulty of recovering the true groups. We show that -Fusion achieves grouping consistency under the weakest possible requirement of the grouping sensitivity: if this requirement is violated, then the minimax risk of group misspecification will fail to converge to zero. Moreover, we show that in the high-dimensional regime, one can apply -Fusion with a sure screening set of features without any essential loss of statistical efficiency, while reducing the computational cost substantially. On the algorithmic aspect, we provide an MIO formulation for -Fusion along with a warm start strategy. Simulation and real data analysis demonstrate that -Fusion exhibits superiority over its competitors in terms of grouping accuracy.

摘要

将回归系数融合到同质子组中可以揭示每个组内具有共同值的那些系数。这种组内同质性降低了参数空间的内在维度,并释放出更高的统计精度。我们提出并研究了一种名为-Fusion的新组合分组方法,该方法适用于混合整数优化(MIO)。在统计方面,我们确定了一个名为的基本量,它是恢复真实组难度的基础。我们表明,-Fusion在分组敏感性的最弱可能要求下实现分组一致性:如果违反此要求,则组错误指定的极小极大风险将无法收敛到零。此外,我们表明,在高维情况下,可以将-Fusion应用于具有确定筛选特征集的情况,而不会有任何统计效率的实质性损失,同时大幅降低计算成本。在算法方面,我们为-Fusion提供了一个MIO公式以及一个热启动策略。模拟和实际数据分析表明,-Fusion在分组准确性方面优于其竞争对手。

相似文献

1
SUPERVISED HOMOGENEITY FUSION: A COMBINATORIAL APPROACH.
Ann Stat. 2024 Feb;52(1):285-310. doi: 10.1214/23-aos2347. Epub 2024 Mar 7.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
Home treatment for mental health problems: a systematic review.
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
5
123I-MIBG scintigraphy and 18F-FDG-PET imaging for diagnosing neuroblastoma.
Cochrane Database Syst Rev. 2015 Sep 29;2015(9):CD009263. doi: 10.1002/14651858.CD009263.pub2.
8
Is It Possible to Develop a Patient-reported Experience Measure With Lower Ceiling Effect?
Clin Orthop Relat Res. 2025 Apr 1;483(4):693-703. doi: 10.1097/CORR.0000000000003262. Epub 2024 Oct 25.
9
Psychological therapies for panic disorder with or without agoraphobia in adults: a network meta-analysis.
Cochrane Database Syst Rev. 2016 Apr 13;4(4):CD011004. doi: 10.1002/14651858.CD011004.pub2.

本文引用的文献

1
Obstetrical outcomes and biomarkers to assess exposure to phthalates: A review.
Environ Int. 2015 Oct;83:116-36. doi: 10.1016/j.envint.2015.06.003. Epub 2015 Jun 26.
2
Homogeneity Pursuit.
J Am Stat Assoc. 2015;110(509):175-194. doi: 10.1080/01621459.2014.892882.
3
On constrained and regularized high-dimensional regression.
Ann Inst Stat Math. 2013 Oct;65(5):807-832. doi: 10.1007/s10463-012-0396-3.
4
Simultaneous grouping pursuit and feature selection over an undirected graph.
J Am Stat Assoc. 2013 Jan 1;108(502):713-725. doi: 10.1080/01621459.2013.770704.
5
Assessing windows of susceptibility to lead-induced cognitive deficits in Mexican children.
Neurotoxicology. 2012 Oct;33(5):1040-7. doi: 10.1016/j.neuro.2012.04.022. Epub 2012 May 10.
6
Socioeconomic factors and phthalate metabolite concentrations among United States women of reproductive age.
Environ Res. 2012 May;115:11-7. doi: 10.1016/j.envres.2012.03.008. Epub 2012 Apr 1.
7
Grouping pursuit through a regularization solution surface.
J Am Stat Assoc. 2010 Jun 1;105(490):727-739. doi: 10.1198/jasa.2010.tm09380.
8
Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.
J R Stat Soc Series B Stat Methodol. 2008 Nov;70(5):903. doi: 10.1111/j.1467-9868.2008.00674.x.
9
Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR.
Biometrics. 2008 Mar;64(1):115-23. doi: 10.1111/j.1541-0420.2007.00843.x. Epub 2007 Jun 30.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验