• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

有监督凸聚类。

Supervised convex clustering.

机构信息

School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA.

Department of Statistics, Rice University, Houston, Texas, USA.

出版信息

Biometrics. 2023 Dec;79(4):3846-3858. doi: 10.1111/biom.13860. Epub 2023 Apr 12.

DOI:10.1111/biom.13860
PMID:36950906
Abstract

Clustering has long been a popular unsupervised learning approach to identify groups of similar objects and discover patterns from unlabeled data in many applications. Yet, coming up with meaningful interpretations of the estimated clusters has often been challenging precisely due to their unsupervised nature. Meanwhile, in many real-world scenarios, there are some noisy supervising auxiliary variables, for instance, subjective diagnostic opinions, that are related to the observed heterogeneity of the unlabeled data. By leveraging information from both supervising auxiliary variables and unlabeled data, we seek to uncover more scientifically interpretable group structures that may be hidden by completely unsupervised analyses. In this work, we propose and develop a new statistical pattern discovery method named supervised convex clustering (SCC) that borrows strength from both information sources and guides towards finding more interpretable patterns via a joint convex fusion penalty. We develop several extensions of SCC to integrate different types of supervising auxiliary variables, to adjust for additional covariates, and to find biclusters. We demonstrate the practical advantages of SCC through simulations and a case study on Alzheimer's disease genomics. Specifically, we discover new candidate genes as well as new subtypes of Alzheimer's disease that can potentially lead to better understanding of the underlying genetic mechanisms responsible for the observed heterogeneity of cognitive decline in older adults.

摘要

聚类长期以来一直是一种流行的无监督学习方法,用于在许多应用中识别相似对象的组,并从无标签数据中发现模式。然而,由于其无监督性质,通常很难对估计的聚类进行有意义的解释。同时,在许多实际场景中,存在一些有噪声的监督辅助变量,例如主观诊断意见,它们与未标记数据的观察到的异质性有关。通过利用监督辅助变量和未标记数据的信息,我们旨在发现更具科学解释性的组结构,这些结构可能被完全无监督的分析所隐藏。在这项工作中,我们提出并开发了一种名为监督凸聚类(SCC)的新统计模式发现方法,该方法从两个信息源中汲取力量,并通过联合凸融合惩罚来指导寻找更具可解释性的模式。我们开发了 SCC 的几个扩展版本,以整合不同类型的监督辅助变量,调整额外的协变量,并找到双聚类。我们通过模拟和阿尔茨海默病基因组学的案例研究展示了 SCC 的实际优势。具体来说,我们发现了新的候选基因和阿尔茨海默病的新亚型,这可能有助于更好地理解导致老年人认知能力下降的观察到的异质性的潜在遗传机制。

相似文献

1
Supervised convex clustering.有监督凸聚类。
Biometrics. 2023 Dec;79(4):3846-3858. doi: 10.1111/biom.13860. Epub 2023 Apr 12.
2
Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data.混合多视图数据的集成广义凸聚类优化与特征选择
J Mach Learn Res. 2021 Jan;22.
3
Multi-scale semi-supervised clustering of brain images: Deriving disease subtypes.脑影像的多尺度半监督聚类:疾病亚型的提取。
Med Image Anal. 2022 Jan;75:102304. doi: 10.1016/j.media.2021.102304. Epub 2021 Nov 11.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Interpretable deep clustering survival machines for Alzheimer's disease subtype discovery.用于阿尔茨海默病亚型发现的可解释深度聚类生存机器。
Med Image Anal. 2024 Oct;97:103231. doi: 10.1016/j.media.2024.103231. Epub 2024 Jun 14.
6
Genomic region detection via Spatial Convex Clustering.通过空间凸包聚类进行基因组区域检测。
PLoS One. 2018 Sep 11;13(9):e0203007. doi: 10.1371/journal.pone.0203007. eCollection 2018.
7
HYDRA: Revealing heterogeneity of imaging and genetic patterns through a multiple max-margin discriminative analysis framework.HYDRA:通过多最大间隔判别分析框架揭示成像和遗传模式的异质性。
Neuroimage. 2017 Jan 15;145(Pt B):346-364. doi: 10.1016/j.neuroimage.2016.02.041. Epub 2016 Feb 23.
8
Cluster Analysis: Unsupervised Learning via Supervised Learning with a Non-convex Penalty.聚类分析:通过带有非凸惩罚项的监督学习实现无监督学习
J Mach Learn Res. 2013 Jul 1;14(7):1865.
9
Information-incorporated sparse convex clustering for disease subtyping.基于信息融合的稀疏凸聚类疾病亚分类方法。
Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad417.
10
The Application of Unsupervised Clustering Methods to Alzheimer's Disease.无监督聚类方法在阿尔茨海默病中的应用
Front Comput Neurosci. 2019 May 24;13:31. doi: 10.3389/fncom.2019.00031. eCollection 2019.

引用本文的文献

1
Simple and Scalable Algorithms for Cluster-Aware Precision Medicine.用于集群感知精准医疗的简单且可扩展算法
Proc Mach Learn Res. 2024 May;238:136-144.