Suppr超能文献

使用共享锚点的随机子集的共识蒙特卡罗方法。

Consensus Monte Carlo for Random Subsets using Shared Anchors.

作者信息

Ni Yang, Ji Yuan, Müller Peter

机构信息

Department of Statistics, Texas A&M University.

Department of Public Health Sciences, The University of Chicago.

出版信息

J Comput Graph Stat. 2020;29(4):703-714. doi: 10.1080/10618600.2020.1737085. Epub 2020 Apr 15.

Abstract

We present a consensus Monte Carlo algorithm that scales existing Bayesian nonparametric models for clustering and feature allocation to big data. The algorithm is valid for any prior on random subsets such as partitions and latent feature allocation, under essentially any sampling model. Motivated by three case studies, we focus on clustering induced by a Dirichlet process mixture sampling model, inference under an Indian buffet process prior with a binomial sampling model, and with a categorical sampling model. We assess the proposed algorithm with simulation studies and show results for inference with three datasets: an MNIST image dataset, a dataset of pancreatic cancer mutations, and a large set of electronic health records (EHR). Supplementary materials for this article are available online.

摘要

我们提出了一种共识蒙特卡罗算法,该算法将现有的用于聚类和特征分配的贝叶斯非参数模型扩展到大数据。在本质上任何抽样模型下,该算法对于随机子集(如划分和潜在特征分配)的任何先验都是有效的。受三个案例研究的启发,我们专注于由狄利克雷过程混合抽样模型诱导的聚类、在具有二项式抽样模型的印度自助餐过程先验下的推断以及具有分类抽样模型的推断。我们通过模拟研究评估了所提出的算法,并展示了对三个数据集进行推断的结果:一个MNIST图像数据集、一个胰腺癌突变数据集以及一大组电子健康记录(EHR)。本文的补充材料可在线获取。

相似文献

1
Consensus Monte Carlo for Random Subsets using Shared Anchors.使用共享锚点的随机子集的共识蒙特卡罗方法。
J Comput Graph Stat. 2020;29(4):703-714. doi: 10.1080/10618600.2020.1737085. Epub 2020 Apr 15.
2
Scalable Bayesian Nonparametric Clustering and Classification.可扩展的贝叶斯非参数聚类与分类
J Comput Graph Stat. 2020;29(1):53-65. doi: 10.1080/10618600.2019.1624366. Epub 2019 Jul 19.
6
Generalized species sampling priors with latent Beta reinforcements.具有潜在贝塔增强的广义物种抽样先验。
J Am Stat Assoc. 2014 Dec 1;109(508):1466-1480. doi: 10.1080/01621459.2014.950735.
8
A Nonparametric Bayesian Model for Nested Clustering.用于嵌套聚类的非参数贝叶斯模型
Methods Mol Biol. 2016;1362:129-41. doi: 10.1007/978-1-4939-3106-4_8.
9
Consensus clustering for Bayesian mixture models.贝叶斯混合模型的一致性聚类。
BMC Bioinformatics. 2022 Jul 21;23(1):290. doi: 10.1186/s12859-022-04830-8.

本文引用的文献

2
Scalable Bayesian Nonparametric Clustering and Classification.可扩展的贝叶斯非参数聚类与分类
J Comput Graph Stat. 2020;29(1):53-65. doi: 10.1080/10618600.2019.1624366. Epub 2019 Jul 19.
3
Sparse covariance estimation in heterogeneous samples.异质样本中的稀疏协方差估计
Electron J Stat. 2011;5:981-1014. doi: 10.1214/11-EJS634. Epub 2011 Sep 15.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验