• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于具有可变相关性确定的聚类的分层贝叶斯非参数混合模型。

Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination.

作者信息

Yau Christopher, Holmes Chris

机构信息

Department of Statistics, University of Oxford, Oxford, U.K.,

出版信息

Bayesian Anal. 2011 Jul 1;6(2):329-352. doi: 10.1214/11-BA612.

DOI:10.1214/11-BA612
PMID:21709771
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3121559/
Abstract

We propose a hierarchical Bayesian nonparametric mixture model for clustering when some of the covariates are assumed to be of varying relevance to the clustering problem. This can be thought of as an issue in variable selection for unsupervised learning. We demonstrate that by defining a hierarchical population based nonparametric prior on the cluster locations scaled by the inverse covariance matrices of the likelihood we arrive at a 'sparsity prior' representation which admits a conditionally conjugate prior. This allows us to perform full Gibbs sampling to obtain posterior distributions over parameters of interest including an explicit measure of each covariate's relevance and a distribution over the number of potential clusters present in the data. This also allows for individual cluster specific variable selection. We demonstrate improved inference on a number of canonical problems.

摘要

当假设某些协变量与聚类问题的相关性不同时,我们提出一种用于聚类的分层贝叶斯非参数混合模型。这可以被视为无监督学习中变量选择的一个问题。我们证明,通过在由似然的逆协方差矩阵缩放的聚类位置上定义基于分层总体的非参数先验,我们得到了一种“稀疏先验”表示,该表示允许条件共轭先验。这使我们能够执行完全吉布斯采样,以获得感兴趣参数的后验分布,包括每个协变量相关性的显式度量以及数据中潜在聚类数量的分布。这也允许进行单个聚类特定的变量选择。我们在一些典型问题上展示了改进的推断。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/fbb0cd5e7d93/ukmss-35760-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/878deb82a417/ukmss-35760-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/2076a0ab54eb/ukmss-35760-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/4f19586228d0/ukmss-35760-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/c9e6e7646587/ukmss-35760-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/f717eff8b2f9/ukmss-35760-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/db3a817d091d/ukmss-35760-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/9d44611bef23/ukmss-35760-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/fbb0cd5e7d93/ukmss-35760-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/878deb82a417/ukmss-35760-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/2076a0ab54eb/ukmss-35760-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/4f19586228d0/ukmss-35760-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/c9e6e7646587/ukmss-35760-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/f717eff8b2f9/ukmss-35760-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/db3a817d091d/ukmss-35760-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/9d44611bef23/ukmss-35760-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93b8/3121559/fbb0cd5e7d93/ukmss-35760-f0008.jpg

相似文献

1
Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination.用于具有可变相关性确定的聚类的分层贝叶斯非参数混合模型。
Bayesian Anal. 2011 Jul 1;6(2):329-352. doi: 10.1214/11-BA612.
2
Unsupervised Grouped Axial Data Modeling via Hierarchical Bayesian Nonparametric Models With Watson Distributions.基于 Watson 分布的分层贝叶斯非参数模型的无监督分组轴向数据建模。
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9654-9668. doi: 10.1109/TPAMI.2021.3128271. Epub 2022 Nov 7.
3
Identifying Mixtures of Mixtures Using Bayesian Estimation.使用贝叶斯估计识别混合混合物。
J Comput Graph Stat. 2017 Apr 3;26(2):285-295. doi: 10.1080/10618600.2016.1200472. Epub 2017 Apr 24.
4
Spiked Dirichlet Process Priors for Gaussian Process Models.高斯过程模型的尖峰狄利克雷过程先验
J Probab Stat. 2010;2010:201489. doi: 10.1155/2010/201489.
5
Bayesian Nonparametric Clustering for Positive Definite Matrices.基于贝叶斯非参数的正定矩阵聚类。
IEEE Trans Pattern Anal Mach Intell. 2016 May;38(5):862-74. doi: 10.1109/TPAMI.2015.2456903.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Modeling and Clustering Positive Vectors via Nonparametric Mixture Models of Liouville Distributions.通过刘维尔分布的非参数混合模型对正向量进行建模和聚类
IEEE Trans Neural Netw Learn Syst. 2020 Sep;31(9):3193-3203. doi: 10.1109/TNNLS.2019.2938830. Epub 2019 Sep 25.
8
Adaptive Bayesian variable clustering via structural learning of breast cancer data.通过乳腺癌数据的结构学习实现自适应贝叶斯变量聚类
Genet Epidemiol. 2023 Feb;47(1):95-104. doi: 10.1002/gepi.22507. Epub 2022 Nov 15.
9
Fast Bayesian Inference in Dirichlet Process Mixture Models.狄利克雷过程混合模型中的快速贝叶斯推理
J Comput Graph Stat. 2011 Jan 1;20(1). doi: 10.1198/jcgs.2010.07081.
10
A nonparametric Bayesian approach for clustering bisulfate-based DNA methylation profiles.基于双硫酸盐的 DNA 甲基化谱聚类的非参数贝叶斯方法。
BMC Genomics. 2012;13 Suppl 6(Suppl 6):S20. doi: 10.1186/1471-2164-13-S6-S20. Epub 2012 Oct 26.

引用本文的文献

1
Product Centred Dirichlet Processes for Bayesian Multiview Clustering.用于贝叶斯多视图聚类的以产品为中心的狄利克雷过程
J R Stat Soc Series B Stat Methodol. 2025 Apr 30. doi: 10.1093/jrsssb/qkaf021.
2
Bayesian cluster analysis.贝叶斯聚类分析。
Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220149. doi: 10.1098/rsta.2022.0149. Epub 2023 Mar 27.
3
Digital phenotyping of sleep patterns among heterogenous samples of Latinx adults using unsupervised learning.使用无监督学习对拉丁裔成年人的异质样本进行睡眠模式的数字表型分析。
Sleep Med. 2021 Sep;85:211-220. doi: 10.1016/j.sleep.2021.07.023. Epub 2021 Jul 19.
4
Model-based clustering based on sparse finite Gaussian mixtures.基于稀疏有限高斯混合模型的聚类分析
Stat Comput. 2016;26(1):303-324. doi: 10.1007/s11222-014-9500-2. Epub 2014 Aug 26.

本文引用的文献

1
Variable selection for clustering with Gaussian mixture models.用于高斯混合模型聚类的变量选择
Biometrics. 2009 Sep;65(3):701-9. doi: 10.1111/j.1541-0420.2008.01160.x. Epub 2009 Feb 4.
2
Simultaneous feature selection and clustering using mixture models.使用混合模型进行同步特征选择和聚类
IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1154-66. doi: 10.1109/TPAMI.2004.71.
3
Genomic aberrations and survival in chronic lymphocytic leukemia.慢性淋巴细胞白血病中的基因组畸变与生存情况
N Engl J Med. 2000 Dec 28;343(26):1910-6. doi: 10.1056/NEJM200012283432602.