Simultaneous supervised clustering and feature selection over a graph.

Authors

Shen Xiaotong, Huang Hsin-Cheng, Pan Wei

Affiliation

School of Statistics, University of Minnesota, Minneapolis, Minnesota 55455, U.S.A.

Publication

Biometrika. 2012 Dec;99(4):899-914. doi: 10.1093/biomet/ass038. Epub 2012 Oct 18.

DOI: 10.1093/biomet/ass038
PMID: 23843673
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3629856/
Abstract

In this article, we propose a regression method for simultaneous supervised clustering and feature selection over a given undirected graph, where homogeneous groups or clusters are estimated as well as informative predictors, with each predictor corresponding to one node in the graph and a connecting path indicating a priori possible grouping among the corresponding predictors. The method seeks a parsimonious model with high predictive power through identifying and collapsing homogeneous groups of regression coefficients. To address computational challenges, we present an efficient algorithm integrating the augmented Lagrange multipliers, coordinate descent and difference convex methods. We prove that the proposed method not only identifies the true homogeneous groups and informative features consistently but also leads to accurate parameter estimation. A gene network dataset is analysed to demonstrate that the method can make a difference by exploring dependency structures among the genes.
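The abstract does not spell out the penalty, so the following is only an illustrative sketch of how grouping and selection over a graph could be scored, not the authors' formulation. It assumes a truncated L1 penalty min(|x|/tau, 1), applied to each coefficient for selection and to coefficient differences along graph edges for grouping. The names `truncated_l1`, `dc_parts`, `objective`, `lam1`, `lam2` and `tau` are hypothetical. The difference-convex idea shows up in the identity min(|x|/tau, 1) = |x|/tau - max(|x|/tau - 1, 0), which writes the nonconvex penalty as a difference of two convex functions.

```python
# Illustrative sketch only: the penalty form and every name below are
# assumptions made for this example, not details given in the abstract.

def truncated_l1(x, tau):
    """Truncated L1 penalty min(|x| / tau, 1)."""
    return min(abs(x) / tau, 1.0)

def dc_parts(x, tau):
    """Return two convex pieces whose difference is the truncated L1 penalty."""
    convex_a = abs(x) / tau
    convex_b = max(abs(x) / tau - 1.0, 0.0)
    return convex_a, convex_b

def objective(beta, X, y, edges, lam1, lam2, tau):
    """Least squares loss plus selection and graph-grouping penalties."""
    loss = 0.0
    for row, target in zip(X, y):
        fitted = sum(b * v for b, v in zip(beta, row))
        loss += 0.5 * (target - fitted) ** 2
    # Selection term: penalizes each nonzero coefficient.
    selection = sum(truncated_l1(b, tau) for b in beta)
    # Grouping term: penalizes unequal coefficients on connected nodes.
    grouping = sum(truncated_l1(beta[j] - beta[k], tau) for j, k in edges)
    return loss + lam1 * selection + lam2 * grouping

# Toy data: predictors 0 and 1 share an edge, predictor 2 is isolated.
X = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
y = [2.0, 2.0, 0.0]
edges = [(0, 1)]
grouped = objective([2.0, 2.0, 0.0], X, y, edges, lam1=1.0, lam2=1.0, tau=0.5)
ungrouped = objective([2.0, 1.0, 0.5], X, y, edges, lam1=1.0, lam2=1.0, tau=0.5)
```

On this toy input, the coefficient vector that collapses the two connected predictors to a common value and zeroes out the isolated one scores lower than the vector that does neither, which is the kind of parsimonious solution the abstract describes.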
